Christopher
TNTOutburst
AI & ML interests
None yet
Organizations
None yet
TNTOutburst's activity
Qwen1.5 add to leaderboard
7
#597 opened about 1 year ago
by
TNTOutburst
152334H/miqu-1-70b-sf marked as private or deleted
3
#587 opened about 1 year ago
by
TNTOutburst
Brainstorming: Call for a Time-Sensitive, Rolling-Update Benchmark Crowdsourced by the Community
4
24
#481 opened over 1 year ago
by
JosephusCheung
Brainstorming: Suggestions for improving the leaderboard
9
25
#477 opened over 1 year ago
by
xxyyy123
[FLAG] fblgit/una-xaberius-34b-v1beta
125
#444 opened over 1 year ago
by
XXXGGGNEt
Black Box Benchmarks over Contamination Scanning
2
6
#470 opened over 1 year ago
by
TNTOutburst
[FLAG] TigerResearch/tigerbot-70b-chat-v4-4k
15
23
#438 opened over 1 year ago
by
fblgit

Feature request: Run 100B + models automatically
1
15
#434 opened over 1 year ago
by
ChuckMcSneed

model was not found on hub!
3
#433 opened over 1 year ago
by
liuda1
[FLAG?] Tigerbot-70b-chat-v2 scores are too high.
9
#414 opened over 1 year ago
by
TNTOutburst
Add Orca-2 7b and 13b to queue
2
#397 opened over 1 year ago
by
TNTOutburst
Improve speed leaderboard front end
7
#249 opened over 1 year ago
by
Ostixe360

Why are there no OpenAI models here? we need GPT-3.5 and GPT4 to compare!
2
#169 opened over 1 year ago
by
FarisHijazi

FreeWilly2 by Stability AI is about to beat GPT3.5
3
#120 opened over 1 year ago
by
gsaivinay
Add a column: average score per billion parameters
2
2
#88 opened almost 2 years ago
by
rfernand

How long does it take to run these tests?
7
#90 opened almost 2 years ago
by
Goldenblood56
why isn't truthfulQA shown in the leaderboards?
1
#81 opened almost 2 years ago
by
wfzimmerman
Models for Human/GPT4 Eval
2
25
#65 opened almost 2 years ago
by
natolambert

[feature request] prioritize the queue (by user voting?)
9
3
#46 opened almost 2 years ago
by
zed9h
