3 39 55

NAN

nan1248

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

POINTS-GUI-G: GUI-Grounding Journey

upvoted a paper 3 months ago

Sliding Window Attention Adaptation

upvoted a collection 3 months ago

AndesVL

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

POINTS-GUI-G: GUI-Grounding Journey

Paper • 2602.06391 • Published Feb 6 • 17

upvoted a paper 3 months ago

Sliding Window Attention Adaptation

Paper • 2512.10411 • Published Dec 11, 2025 • 21

upvoted 2 collections 3 months ago

AndesVL

Collection

AndesVL is a suite of mobile-optimized Multimodal Large Language Models (MLLMs) with 0.6B to 4B parameters. • 8 items • Updated Feb 1 • 15

Molmo2 Data

Collection

Artifacts for the Molmo2 data release • 13 items • Updated 24 days ago • 39

liked a model 4 months ago

tencent/HunyuanOCR

Image-Text-to-Text • Updated Jan 13 • 382k • 556

upvoted 2 papers 5 months ago

Can Visual Input Be Compressed? A Visual Token Compression Benchmark for Large Multimodal Models

Paper • 2511.02650 • Published Nov 4, 2025 • 10

DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents

Paper • 2510.19336 • Published Oct 22, 2025 • 17

liked 8 models 5 months ago

upvoted a paper 5 months ago

AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model

Paper • 2510.11496 • Published Oct 13, 2025 • 5

liked a dataset 6 months ago

Agent-Ark/Toucan-1.5M

Viewer • Updated Oct 4, 2025 • 1.65M • 5.72k • 203

liked a model 6 months ago

zai-org/GLM-4.5V-FP8

Image-Text-to-Text • Updated Oct 25, 2025 • 17.6k • • 42

liked a model 8 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 262k • 1.28k

liked a dataset 8 months ago

MegaScience/MegaScience

Viewer • Updated Jul 24, 2025 • 1.25M • 4.92k • 128

NAN

AI & ML interests

Recent Activity

Organizations

nan1248's activity