30 1

Christopher

TNTOutburst

AI & ML interests

None yet

Organizations

None yet

TNTOutburst's activity

New activity in open-llm-leaderboard/open_llm_leaderboard about 1 year ago

Qwen1.5 add to leaderboard

#597 opened about 1 year ago by

TNTOutburst

152334H/miqu-1-70b-sf marked as private or deleted

#587 opened about 1 year ago by

TNTOutburst

New activity in open-llm-leaderboard/open_llm_leaderboard over 1 year ago

Brainstorming: Call for a Time-Sensitive, Rolling-Update Benchmark Crowdsourced by the Community

#481 opened over 1 year ago by

JosephusCheung

Brainstorming: Suggestions for improving the leaderboard

#477 opened over 1 year ago by

xxyyy123

[FLAG] fblgit/una-xaberius-34b-v1beta

125

#444 opened over 1 year ago by

XXXGGGNEt

Black Box Benchmarks over Contamination Scanning

#470 opened over 1 year ago by

TNTOutburst

[FLAG] TigerResearch/tigerbot-70b-chat-v4-4k

#438 opened over 1 year ago by

fblgit

Feature request: Run 100B + models automatically

#434 opened over 1 year ago by

ChuckMcSneed

model was not found on hub!

#433 opened over 1 year ago by

liuda1

[FLAG?] Tigerbot-70b-chat-v2 scores are too high.

#414 opened over 1 year ago by

TNTOutburst

Add Orca-2 7b and 13b to queue

#397 opened over 1 year ago by

TNTOutburst

Improve speed leaderboard front end

#249 opened over 1 year ago by

Ostixe360

Why are there no OpenAI models here? we need GPT-3.5 and GPT4 to compare!

#169 opened over 1 year ago by

FarisHijazi

FreeWilly2 by Stability AI is about to beat GPT3.5

#120 opened over 1 year ago by

gsaivinay

New activity in open-llm-leaderboard/open_llm_leaderboard almost 2 years ago

Add a column: average score per billion parameters

#88 opened almost 2 years ago by

rfernand

How long does it take to run these tests?

#90 opened almost 2 years ago by

Goldenblood56

why isn't truthfulQA shown in the leaderboards?

#81 opened almost 2 years ago by

wfzimmerman

Models for Human/GPT4 Eval

#65 opened almost 2 years ago by

natolambert

[feature request] prioritize the queue (by user voting?)

#46 opened almost 2 years ago by

zed9h