5 406

Literate Goggles

literate-goggles

AI & ML interests

None yet

Recent Activity

upvoted a paper about 17 hours ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

upvoted a paper 8 days ago

Gemini Robotics: Bringing AI into the Physical World

upvoted a paper 10 days ago

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

View all activity

Organizations

None yet

literate-goggles's activity

upvoted a paper about 17 hours ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published 2 days ago • 56

upvoted a paper 8 days ago

Gemini Robotics: Bringing AI into the Physical World

Paper • 2503.20020 • Published 9 days ago • 21

upvoted a paper 10 days ago

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Paper • 2503.16430 • Published 14 days ago • 34

upvoted a paper 14 days ago

Scaling Rich Style-Prompted Text-to-Speech Datasets

Paper • 2503.04713 • Published 28 days ago • 1

upvoted 2 papers 18 days ago

Transformers without Normalization

Paper • 2503.10622 • Published 21 days ago • 147

WildIFEval: Instruction Following in the Wild

Paper • 2503.06573 • Published 26 days ago • 11

upvoted a paper 24 days ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 28 days ago • 112

upvoted a paper 29 days ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 71

upvoted an article about 1 month ago

Article

SigLIP 2: A better multilingual vision language encoder

Feb 21

• 148

upvoted 6 papers about 1 month ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 188

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

Paper • 2502.05139 • Published Feb 7 • 1

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 150

upvoted 5 papers about 2 months ago

Region-Adaptive Sampling for Diffusion Transformers

Paper • 2502.10389 • Published Feb 14 • 52

Language Models Use Trigonometry to Do Addition

Paper • 2502.00873 • Published Feb 2 • 1

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 147

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published Feb 13 • 22

XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model

Paper • 2406.04904 • Published Jun 7, 2024 • 9