Michael Hale's picture

154 13

Michael Hale

mhale

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Magma: A Foundation Model for Multimodal AI Agents

upvoted a paper 10 days ago

Large Language Diffusion Models

upvoted a paper 10 days ago

Qwen2.5-VL Technical Report

View all activity

Organizations

mhale's activity

upvoted a paper 9 days ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published 19 days ago • 55

upvoted 3 papers 10 days ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 24 days ago • 99

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 18 days ago • 157

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 18 days ago • 83

upvoted 2 papers 19 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 25 days ago • 143

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 63

upvoted a paper 20 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 186

upvoted 2 papers 23 days ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published about 1 month ago • 96

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published about 1 month ago • 122

upvoted 3 papers about 1 month ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published Jan 21 • 35

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 341

upvoted 6 papers about 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 260

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 86

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 80

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 55

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90

upvoted 2 papers 4 months ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 68

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 73