Amir Hossein Kargaran's picture

Amir Hossein Kargaran

kargaranamir

·

https://kargaranamir.github.io

AI & ML interests

#NLP, checkout https://huggingface.co/cis-lmu

Recent Activity

liked a Space about 20 hours ago

lvwerra/jupyter-agent-2

upvoted a changelog 4 days ago

Connect Your MCP Client to the Hugging Face Hub

liked a Space 10 days ago

amd/gpt-oss-120b-chatbot

View all activity

Organizations

upvoted a changelog 4 days ago

Changelog

Connect Your MCP Client to the Hugging Face Hub

Jun 6

• 106

upvoted a collection 25 days ago

llm-urls-neurips

57 items • Updated May 15 • 2

upvoted an article about 2 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

By

and 1 other •

Jul 9

• 657

upvoted a collection about 2 months ago

🥂 FineWeb2

3 items • Updated Jun 27 • 20

upvoted a paper about 2 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 69

upvoted 2 articles 2 months ago

Article

Transformers backend integration in SGLang

By

and 4 others •

Jun 23

• 53

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

By

•

Apr 25

• 295

upvoted 2 papers 3 months ago

How Programming Concepts and Neurons Are Shared in Code Language Models

Paper • 2506.01074 • Published Jun 1 • 3

Tracing Multilingual Factual Knowledge Acquisition in Pretraining

Paper • 2505.14824 • Published May 20 • 4

upvoted a paper 4 months ago

Multilingual k-Nearest-Neighbor Machine Translation

Paper • 2310.14644 • Published Oct 23, 2023 • 2

upvoted a collection 5 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 633

upvoted 2 papers 5 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 242

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 302

upvoted 2 collections 5 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 611

— UI is a good thing 💅 —

cool spaces with a cool UI, what could be better? • 5 items • Updated May 5 • 23

upvoted a paper 5 months ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 55

upvoted a paper 6 months ago

On Relation-Specific Neurons in Large Language Models

Paper • 2502.17355 • Published Feb 24 • 9

upvoted a collection 6 months ago

MMTEB

Our contribution to the Massive Multilingual Text Embedding Benchmark (MMTEB). Retrieval and reranking benchmarks in 16 languages. • 4 items • Updated Jun 6, 2024 • 3

upvoted a paper 6 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 38

upvoted a collection 6 months ago

CommonCrawl

Large web-mined general corpus based on CommonCrawl. • 8 items • Updated Apr 13 • 3