Article DeepSearch Using Visual RAG in Agentic Frameworks 🔎 By paultltc and 1 other • Mar 21 • 32
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published Mar 6 • 70
Article SmolVLM2: Bringing Video Understanding to Every Device By orrzohar and 6 others • Feb 20 • 252
Article Selective fine-tuning of Language Models with Spectrum By anakin87 • Sep 3, 2024 • 35
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform Paper • 2405.03003 • Published May 5, 2024 • 8
Improving Text-to-Image Consistency via Automatic Prompt Optimization Paper • 2403.17804 • Published Mar 26, 2024 • 19
Proactive Detection of Voice Cloning with Localized Watermarking Paper • 2401.17264 • Published Jan 30, 2024 • 19
Interpretability Collection Select papers on language model interpretability with notes • 6 items • Updated Nov 14, 2024 • 4
DeBERTinha Collection Models from the DeBERTinha Portuguese model family • 5 items • Updated Oct 2, 2023 • 1
Contrastive Feature Masking Open-Vocabulary Vision Transformer Paper • 2309.00775 • Published Sep 2, 2023 • 10