Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG Paper • arXiv:2410.05983 • Published Oct 8, 2024
Search-R1-v0.2 Collection Exploration of a more stable RL pipeline with an outcome-only reward and scaled-up LLMs. https://arxiv.org/abs/2503.09516 • 25 items
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful open-source reasoning model - available in GGUF, original, and 4-bit formats. Includes Llama and Qwen distilled models. • 30 items
Cohere Labs Aya 23 Collection Aya 23 is an open-weight research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated Apr 15
Cohere Labs Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 4 items • Updated Apr 15
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding Paper • arXiv:2501.07888 • Published Jan 14, 2025
Molmo Collection Artifacts for open multimodal language models. • 5 items
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models, including 1B, 3B, 11B, and 90B. Includes GGUF, 4-bit bnb, and original versions. • 27 items
Llama 3.2 Collection This collection hosts the Transformers and original repos of the Llama 3.2 and Llama Guard 3 models. • 15 items • Updated Dec 6, 2024
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M, and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos. • 12 items
SmolLM - blazingly fast and remarkably powerful Article • By loubnabnl and 2 others • Jul 16, 2024
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models in five sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items