Quentin Tardif

ntnq

AI & ML interests

None yet

Recent Activity

liked a model about 20 hours ago

Qwen/Qwen3-Embedding-0.6B-GGUF

upvoted a paper 18 days ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

upvoted a paper 18 days ago

System Prompt Optimization with Meta-Learning

ntnq's activity

liked a model about 20 hours ago

Qwen/Qwen3-Embedding-0.6B-GGUF

Updated about 16 hours ago • 163

upvoted 5 papers 18 days ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 23 days ago • 63

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published 23 days ago • 69

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 22 days ago • 78

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 22 days ago • 118

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 182

upvoted an article 24 days ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others • 25 days ago • 415

upvoted a paper about 1 month ago

Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family

Paper • 2504.18225 • Published Apr 25 • 12

upvoted a collection about 1 month ago

Qwen3

Collection

40 items • Updated 16 days ago • 737

liked a dataset about 1 month ago

AIffl/french_trivia_qa_with_wikicontext

Viewer • Updated May 26, 2024 • 4.12k • 29 • 3

upvoted 2 papers about 1 month ago

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Paper • 2504.17789 • Published Apr 24 • 23

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 74

upvoted 2 papers 2 months ago

AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation

Paper • 2503.19693 • Published Mar 25 • 75

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28 • 44

liked a model 2 months ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated Apr 30 • 234k • 1.64k

upvoted a collection 2 months ago

Llama Nemotron

Collection

Open, Production-ready Enterprise Models • 8 items • Updated about 9 hours ago • 59

liked a dataset 2 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated 29 days ago • 3.91M • 12.6k • 497

upvoted a paper 2 months ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 51

liked a dataset 3 months ago

HuggingFaceTB/stack-edu

Viewer • Updated Mar 20 • 167M • 1.04k • 38

liked a model 3 months ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

Image-Text-to-Text • Updated 28 days ago • 131k • 1.26k