There is no such thing as a tokenizer-free lunch
Upvotes: 65

Model Quality: Hugging Face Is All You Need
Upvotes: 16

RexBERT: Encoders for a brave new world of E-Commerce
Upvotes: 46

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI
Upvotes: 22

When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance
Upvotes: 10

Preserving Agency: Why AI Safety Needs Community, Not Corporate Control
Upvotes: 9

Code a simple RAG from scratch
Upvotes: 206

Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips
Upvotes: 8

Uncensor any LLM with abliteration
Upvotes: 684

Nemotron-Personas-Japan: Synthetic Dataset for Sovereign AI (in Japanese)
Upvotes: 7

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons
Upvotes: 7

Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi
Upvotes: 6

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face
Upvotes: 72

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR
Upvotes: 102

Practical arXiv Tips: How to Get More Attention for Your Paper (in Chinese)
Upvotes: 14

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
Upvotes: 225

Small Language Models (SLM): A Comprehensive Overview
Upvotes: 75

PrediBench: Testing AI models on prediction markets
Upvotes: 4

Introduction to State Space Models (SSM)
Upvotes: 175

Mastering Tensor Dimensions in Transformers
Upvotes: 98