15 520 229

Taufiq Dwi Purnomo

taufiqdp

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

upvoted a paper 1 day ago

OpenAI o1 System Card

liked a model 1 day ago

Qwen/QVQ-72B-Preview

liked a Space 3 days ago

burtenshaw/recap

View all activity

Organizations

taufiqdp's activity

upvoted a paper 1 day ago

OpenAI o1 System Card

Paper • 2412.16720 • Published 4 days ago • 19

upvoted 2 papers 3 days ago

MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design

Paper • 2412.14590 • Published 7 days ago • 11

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published 6 days ago • 46

upvoted a paper 5 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 8 days ago • 103

upvoted a paper 6 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 6 days ago • 328

upvoted a paper 9 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 12 days ago • 131

upvoted a paper 13 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 14 days ago • 92

upvoted a paper 14 days ago

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Paper • 2412.08443 • Published 15 days ago • 38

upvoted a paper 15 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 17 days ago • 68

upvoted 2 papers 16 days ago

CompCap: Improving Multimodal Large Language Models with Composite Captions

Paper • 2412.05243 • Published 19 days ago • 18

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published 19 days ago • 38

upvoted 2 papers 19 days ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published 20 days ago • 54

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published 20 days ago • 55

upvoted a paper 20 days ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published 21 days ago • 118

upvoted a collection 20 days ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 12 days ago • 119

upvoted a paper 24 days ago

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Paper • 2411.19146 • Published 28 days ago • 13

upvoted an article 27 days ago

Article

Use Models from the Hugging Face Hub in LM Studio

•

27 days ago

• 127

upvoted 2 papers 29 days ago

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published 30 days ago • 76

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published 30 days ago • 47

upvoted a collection 29 days ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 3 days ago • 30