21 48 29

Pavlo Molchanov PRO

pmolchanov

https://www.pmolchanov.com

AI & ML interests

Efficiency in Multi-Modal LLMs

Recent Activity

published a model 10 days ago

nvidia/VILA-HD-8B-PS3-4K-SigLIP

published a model 10 days ago

nvidia/VILA-HD-8B-PS3-1.5K-SigLIP

published a model 10 days ago

nvidia/PS3-4K-SigLIP

View all activity

Organizations

pmolchanov's activity

upvoted a paper 23 days ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published 25 days ago • 30

upvoted a collection 23 days ago

Llama Nemotron

Collection

Open, Production-ready Enterprise Models • 8 items • Updated 4 days ago • 59

upvoted a paper about 2 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 92

upvoted 4 papers 2 months ago

upvoted a paper 3 months ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25 • 42

upvoted 2 articles 4 months ago

Article

Bamba: Inference-Efficient Hybrid Mamba2 Model

and 28 others •

Dec 18, 2024

• 55

Article

Finally, a Replacement for BERT: Introducing ModernBERT

and 14 others •

Dec 19, 2024

• 652

upvoted a paper 4 months ago

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published Jan 30 • 20

upvoted a collection 5 months ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated 4 days ago • 291

upvoted 2 papers 6 months ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60

Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published Nov 25, 2024 • 21

upvoted a collection 7 months ago

Hymba

Collection

A series of Hybrid Small Language Models. • 3 items • Updated 4 days ago • 30

upvoted 2 papers 7 months ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 55

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 46

upvoted 2 papers 8 months ago

ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Paper • 2410.21465 • Published Oct 28, 2024 • 11

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Paper • 2410.19313 • Published Oct 25, 2024 • 19

upvoted a paper 9 months ago

PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation

Paper • 2410.01680 • Published Oct 2, 2024 • 36