view article Article SmolVLM2: Bringing Video Understanding to Every Device By orrzohar and 6 others • Feb 20 • 296
view article Article VideoMamba: State Space Model for Efficient Video Understanding By vladbogo • Mar 16, 2024 • 1
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 514
view article Article Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization By TuringsSolutions • Feb 8, 2024 • 14
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6 • 184
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • Jan 30 • 117
view article Article Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU By edbeeching and 5 others • Mar 9, 2023 • 62
view article Article The N Implementation Details of RLHF with PPO By vwxyzjn and 2 others • Oct 24, 2023 • 67
ds4sd/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated about 12 hours ago • 30.5k • 1.55k
rmayormartins/speech-accent-pt-br-classifier Audio Classification • 0.1B • Updated Jun 23, 2024 • 2 • 2
audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim Audio Classification • 0.2B • Updated Sep 19, 2024 • 143k • 124