AI2 WildBench Leaderboard (V2)
Display and explore model leaderboards and chat history
Display LMArena Leaderboard
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
Determine GPU requirements for large language models
Identify key entities in text
Browse and filter leaderboard of language models
Generate text from document images
Analyze document layout from images
Extract text from documents using images or PDFs
Answer questions about images by chatting
Efficient quantized retrieval over Wikipedia
Display and filter model evaluation results
Identify objects in images based on text descriptions
Analyze images to detect and label objects
VLMEvalKit Evaluation Results Collection
Run a Streamlit web app
Visualize Open vs. Proprietary LLM Progress
Upload a PDF and ask questions to get insights
Submit and evaluate models on GAIA benchmark
Identify and highlight key entities in text
Explore and analyze code evaluation data
Create a Hugging Face dataset from text files
Generate speech from text in multiple languages
Analyze images to generate captions, detect objects, or perform OCR
Generate React TypeScript App
Video captioning/tracking
Explore visual document retrieval benchmark results
In-browser speech recognition with word-level timestamps
Generate insights from charts using text prompts
Need to analyze data? Let a Llama-3.1 agent do it for you!
Display a text analysis tool
View and submit language model evaluations
Detect objects in images using text prompts
VLMEvalKit Eval Results in video understanding benchmark
Extract text from images using various OCR modes
Generate a leaderboard for evaluating language models
Remove background from any image
Vote on AI responses to rank models
What happened in open-source AI this year, and what's next?
Generate interactive React app data visualizations
Detect and estimate human poses in images and videos
Run interactive Jupyter notebooks with user input
Ranking of LLMs for agentic tasks
OmniParser: turn your LLM into a GUI agent
Enhance low-light images to improve clarity
PDF to Structured Data powered by Google DeepMind Gemini 2.0
Handwritten Signature Detection
Convert images and text into structured documents
Generate text and speech responses from various inputs
Detect faces in uploaded images
Convert PDFs to Markdown with open-source parsers
Remove background from images
A Unified Framework for Image Customization
Dolphin Demo
Create and enrich datasets using AI
Display OCR model leaderboard and evaluation data
Hand-controlled arpeggiator, drum machine, and visualizer
olmocr / nanonets ocr / rolmocr / qwen2vl ocr / aya vision
Display OCRBench leaderboard for text recognition models
camel doc ocr / core ocr / docscope ocr / monkey ocr
monkey ocr / nanonets ocr / smoldocling / typhoon ocr
Run GGUF directly on your browser!
Convert images to text using OCR models