Letting Large Models Debate: The First Multilingual LLM Debate Competition
•
30
None defined yet.
FlagEval VLM Leaderboard
Display and filter LLM benchmark results
Leaderboard for MVRB (Massive Visualized IR Benchmark)
Segment and caption objects in images
NOVA Text-to-Video