-
Jamba: A Hybrid Transformer-Mamba Language Model
Paper • 2403.19887 • Published • 104 -
sDPO: Don't Use Your Data All at Once
Paper • 2403.19270 • Published • 39 -
ViTAR: Vision Transformer with Any Resolution
Paper • 2403.18361 • Published • 52 -
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Paper • 2403.18814 • Published • 44
Phuong Pham
mp1704
AI & ML interests
None yet
Organizations
Collections
1
models
15
mp1704/tora_7b_sft_ckpt_200
Text Generation
•
Updated
mp1704/tora_7b_pt
Text Generation
•
Updated
•
2
mp1704/gpt-neo-sft-v2.1
Text Generation
•
Updated
•
6
mp1704/gpt-neo-sft-v2
Text Generation
•
Updated
•
5
mp1704/gpt-neo-sft
Text Generation
•
Updated
•
9
mp1704/gpt-neo-pt
Text Generation
•
Updated
•
7
mp1704/gemma_2b_sft
Text Generation
•
Updated
•
1
mp1704/gemma_2b_pt
Text Generation
•
Updated
•
4
mp1704/qwen_1.8b_sft_full_3
Text Generation
•
Updated
•
4
mp1704/qwen_1.8b_sft_full_2
Feature Extraction
•
Updated
•
2