Samman

Someman

AI & ML interests

Text Classification, Chatbots, NLP

Recent Activity

upvoted an article 11 days ago

Merge Large Language Models with mergekit

liked a Space 3 months ago

nanotron/ultrascale-playbook

liked a Space 5 months ago

HuggingFaceH4/blogpost-scaling-test-time-compute

Someman's activity

upvoted an article 11 days ago

Merge Large Language Models with mergekit

Article • Jan 9, 2024 • 119

liked a Space 3 months ago

2.62k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLMs on large GPU clusters

liked a Space 5 months ago

564

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

liked 3 models 8 months ago

sentence-transformers/all-MiniLM-L6-v2

Snowflake/snowflake-arctic-embed-s

NovaSearch/stella_en_1.5B_v5

New activity in Someman/hindi-summarization 10 months ago

[bot] Conversion to Parquet

#1 opened 11 months ago by

parquet-converter

upvoted a paper about 1 year ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 113

reacted to merve's post with 🔥 about 1 year ago

Post

3848

just landed at Hugging Face Hub: community-led computer vision course 📖🤍
learn from fundamentals to details of the bleeding edge vision transformers!

1 reply

upvoted 3 papers about 1 year ago

liked a model about 1 year ago

NousResearch/OLMo-Bitnet-1B

Text Generation • Updated Apr 11, 2024 • 65 • 118

upvoted 2 papers about 1 year ago

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26, 2024 • 81

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25, 2024 • 69

New activity in Someman/alpaca-nepali about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

New activity in Someman/Indic-gemma-2b-finetuned-sft-Navarasa-adapters-ne-v1.0 about 1 year ago

This looks interesting for a model fine-tuned with just 16 MB of data and 700 steps. Is there a way you could share the code?

#1 opened about 1 year ago by

Aananda-giri

upvoted a collection about 1 year ago

Similarity search

Collection

2 items • Updated Jun 4, 2024 • 2

liked 2 datasets about 1 year ago

Telugu-LLM-Labs/nepali_alpaca_yahma_cleaned_filtered

Viewer • Updated Mar 14, 2024 • 28.9k • 11 • 5

rahular/itihasa

Viewer • Updated Oct 24, 2022 • 93k • 355 • 19