12 449 45

Vlad Bogolin

vladbogo

https://vladbogo.com

AI & ML interests

LLMs, Computer Vision

Recent Activity

updated a collection 29 minutes ago

AI Paper of the Day

upvoted a paper 30 minutes ago

Humanity's Last Exam

upvoted a collection 2 days ago

Llama 3.3

View all activity

Articles

Organizations

vladbogo's activity

upvoted a paper 30 minutes ago

Humanity's Last Exam

Paper • 2501.14249 • Published 6 days ago • 44

upvoted a collection 2 days ago

Llama 3.3

Collection

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 121

upvoted a paper 3 days ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published 6 days ago • 28

upvoted 2 papers 4 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 8 days ago • 73

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published 6 days ago • 21

upvoted 3 papers 6 days ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published 7 days ago • 73

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 7 days ago • 260

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published 8 days ago • 79

upvoted a paper 9 days ago

SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces

Paper • 2501.09756 • Published 13 days ago • 19

upvoted a paper 11 days ago

Do generative video models learn physical principles from watching videos?

Paper • 2501.09038 • Published 15 days ago • 31

upvoted a paper 12 days ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published 13 days ago • 33

upvoted 2 papers 13 days ago

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Paper • 2501.08828 • Published 14 days ago • 30

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 15 days ago • 270

upvoted a paper 14 days ago

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published 19 days ago • 59

upvoted a paper 16 days ago

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published 19 days ago • 66

upvoted a paper 17 days ago

The FACTS Grounding Leaderboard: Benchmarking LLMs' Ability to Ground Responses to Long-Form Input

Paper • 2501.03200 • Published 23 days ago • 1

upvoted a paper 18 days ago

InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

Paper • 2501.04575 • Published 21 days ago • 23

upvoted a paper 19 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 22 days ago • 84

upvoted a paper 21 days ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published 23 days ago • 67

upvoted a paper 22 days ago

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published 23 days ago • 52

Vlad Bogolin

AI & ML interests

Recent Activity

Articles

Many-shot jailbreaking

Gecko: Versatile Text Embeddings Distilled from Large Language Models

VideoMamba: State Space Model for Efficient Video Understanding

Genie: Generative Interactive Environments

Rephrasing the Web A Recipe for Compute and Data-Efficient Language Modeling

Reformatted Alignment

Organizations

vladbogo's activity