Justin J
justinj92
AI & ML interests
Learning Language & Vision Models
Recent Activity
new activity
14 days ago
unsloth/Phi-4-mini-instruct:Phi-4 mini does not work inside of unsloth.
liked
a model
14 days ago
microsoft/Phi-4-multimodal-instruct
reacted
to
alvarobartt's
post
with ๐ฅ
16 days ago
๐ฅ Agents can do anything! @microsoft Research just announced the release of Magma 8B!
Magma is a new Visual Language Model (VLM) with 8B parameters for multi-modal agents designed to handle complex interactions across virtual and real environments; and it's MIT licensed!
Magma comes with exciting new features such as:
- Introduces the Set-of-Mark and Trace-of-Mark techniques for fine-tuning
- Leverages a large amount of unlabeled video data to learn the spatial-temporal grounding and planning
- A strong generalization and ability to be fine-tuned for other agentic tasks
- SOTA in different multi-modal benchmarks spanning across UI navigation, robotics manipulation, image / video understanding and spatial understanding and reasoning
- Generates goal-driven visual plans and actions for agentic use cases
Model: https://huggingface.co/microsoft/Magma-8B
Technical Report: https://huggingface.co/papers/2502.13130
Organizations
justinj92's activity
Phi-4 mini does not work inside of unsloth.
5
#1 opened 14 days ago
by
Pinkstack

Adding `safetensors` variant of this model
#1 opened about 1 year ago
by
SFconvertbot

When will gradient checkpointing be implemented?
7
#68 opened about 1 year ago
by
rishiraj
