A ton of impactful models and datasets in open AI this past week, let's summarize the best 🤩 merve/releases-apr-21-and-may-2-6819dcc84da4190620f448a3

💬 Qwen made it rain! They released Qwen3: new dense and MoE models ranging from 0.6B to 235B 🤯 as well as Qwen2.5-Omni, an any-to-any model in 3B and 7B!
> Microsoft AI released Phi4 reasoning models (that also come in mini and plus sizes)
> NVIDIA released new CoT reasoning datasets

🖼️
> ByteDance released UI-TARS-1.5, a native multimodal UI parsing agentic model
> Meta released EdgeTAM, an on-device object tracking model (SAM2 variant)

🗣️ NVIDIA released parakeet-tdt-0.6b-v2, a smol 600M automatic speech recognition model
> Nari released Dia, a 1.6B text-to-speech model
> Moonshot AI released Kimi Audio, a new audio understanding, generation, and conversation model

👩🏻‍💻 JetBrains released Mellum models in base and SFT for coding
> Tesslate released UIGEN-T2-7B, a new text-to-frontend-code model 🤩
SentientAGI/Dobby-Unhinged-Llama-3.3-70B Text Generation • 71B • Updated Feb 12 • 1.12k • 40
DavidAU/DeepSeek-Grand-Horror-SMB-R1-Distill-Llama-3.1-16B Text Generation • 16B • Updated Feb 9 • 12 • 1
huihui-ai/Llama-3.2-11B-Vision-Instruct-abliterated Image-Text-to-Text • 11B • Updated Oct 22, 2024 • 1.3k • 29