sugatoray
's Collections
Papers-LLMEval
updated
Latxa: An Open Language Model and Evaluation Suite for Basque
Paper
•
2403.20266
•
Published
•
3
TrustLLM: Trustworthiness in Large Language Models
Paper
•
2401.05561
•
Published
•
70
Prometheus 2: An Open Source Language Model Specialized in Evaluating
Other Language Models
Paper
•
2405.01535
•
Published
•
123
Beyond Scaling Laws: Understanding Transformer Performance with
Associative Memory
Paper
•
2405.08707
•
Published
•
33
tinyBenchmarks: evaluating LLMs with fewer examples
Paper
•
2402.14992
•
Published
•
13
meta-llama/Llama-3.3-70B-Instruct-evals
Viewer
•
Updated
•
41.3k
•
420
•
36
RUC-NLPIR/OmniEval-HallucinationEvaluator
Text Generation
•
Updated
•
1
Viewer
•
Updated
•
92
•
1.86k
•
21
Viewer
•
Updated
•
17.6k
•
477k
•
700
Preview
•
Updated
•
59
•
3
KRLabsOrg/lettucedect-base-modernbert-en-v1
Token Classification
•
Updated
•
4.43k
•
16
Viewer
•
Updated
•
269
•
1.29k
•
45