Collections
Discover the best community collections!
Collections including paper arxiv:2403.09029
-
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper • 2403.09611 • Published • 124 -
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 54 -
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Paper • 2403.09394 • Published • 25
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 54 -
Cleaner Pretraining Corpus Curation with Neural Web Scraping
Paper • 2402.14652 • Published -
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Paper • 2403.11703 • Published • 16
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 54 -
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Paper • 2403.12968 • Published • 24 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 67 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper • 2403.09629 • Published • 72
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 54 -
HuggingFaceM4/WebSight
Viewer • Updated • 2.75M • 10.5k • 329 -
HuggingFaceM4/VLM_WebSight_finetuned
Text Generation • Updated • 324 • 172 -
laion/filtered-wit
Viewer • Updated • 2.8M • 1.48k • 9
-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 76 -
bigcode/starcoder2-15b
Text Generation • Updated • 23.7k • • 568 -
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 121 -
mixedbread-ai/mxbai-rerank-large-v1
Text Classification • Updated • 25.1k • 105