-
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM
Paper • 2501.01904 • Published • 28 -
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models
Paper • 2501.00874 • Published • 11 -
BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery
Paper • 2501.01540 • Published • 6
Collections
Discover the best community collections!
Collections including paper arxiv:2501.00874
-
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Paper • 2501.00958 • Published • 92 -
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
Paper • 2501.01257 • Published • 45 -
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Paper • 2501.01423 • Published • 34 -
REDUCIO! Generating 1024times1024 Video within 16 Seconds using Extremely Compressed Motion Latents
Paper • 2411.13552 • Published
-
UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages
Paper • 2411.14343 • Published • 7 -
SPRING Lab IITM's submission to Low Resource Indic Language Translation Shared Task
Paper • 2411.00727 • Published -
Cross-lingual transfer of multilingual models on low resource African Languages
Paper • 2409.10965 • Published -
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models
Paper • 2501.00874 • Published • 11
-
FLAME: Factuality-Aware Alignment for Large Language Models
Paper • 2405.01525 • Published • 25 -
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
Paper • 2405.14333 • Published • 37 -
Transformers Can Do Arithmetic with the Right Embeddings
Paper • 2405.17399 • Published • 52 -
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture
Paper • 2405.18991 • Published • 12