Article Transformers backend integration in SGLang By marcsun13 and 4 others • Jun 23 • 53
StarVector SVG Datasets (SVG-Bench) Collection Datasets for training and evaluating SVG generation models • 11 items • Updated Jan 12 • 21
GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning Paper • 2505.20355 • Published May 26 • 36
Portuguese LLM Leaderboard best models ❤️‍🔥 Collection A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: • 19 items • Updated 42 minutes ago • 37
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 139
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12, 2024 • 68
Article Welcome FalconMamba: The first strong attention-free 7B model By JingweiZuo and 5 others • Aug 12, 2024 • 113
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 343
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 81