Alexey G

grib0ed0v

AI & ML interests

LLM / RLHF / AI4Everything.

Recent Activity

liked a model about 18 hours ago

answerdotai/ModernBERT-base

updated a collection 10 days ago

LLM4Russian

grib0ed0v's activity

upvoted a collection 16 days ago

SigLIP

Collection

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 12 days ago • 47

upvoted a collection 28 days ago

Cultura-Ru-Edu

Collection

Our dataset for enhancing LLM training with educational content in the Russian language. • 2 items • Updated 30 days ago • 5

upvoted 2 papers 28 days ago

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 16

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 49

upvoted an article 29 days ago

Let’s make a generation of amazing image generation models

Article • 30 days ago • 33

upvoted 2 papers about 1 month ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17 • 50

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12 • 62

upvoted 3 papers about 2 months ago

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2 • 12

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25 • 17

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24 • 19

upvoted 10 papers 2 months ago

LOGO -- Long cOntext aliGnment via efficient preference Optimization

Paper • 2410.18533 • Published Oct 24 • 42

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23 • 34

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 144

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Paper • 2410.05229 • Published Oct 7 • 21

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17 • 89

Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts

Paper • 2410.10626 • Published Oct 14 • 37

Textbooks Are All You Need II: phi-1.5 technical report

Paper • 2309.05463 • Published Sep 11, 2023 • 87

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10 • 28

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Paper • 2410.10783 • Published Oct 14 • 26

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14 • 51