sugatoray's Collections
Papers-LLMEval
Latxa: An Open Language Model and Evaluation Suite for Basque
Paper • 2403.20266 • 3
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • 70
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Paper • 2405.01535 • 124
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory
Paper • 2405.08707 • 33
tinyBenchmarks: evaluating LLMs with fewer examples
Paper • 2402.14992 • 16
meta-llama/Llama-3.3-70B-Instruct-evals
Viewer • 41.3k • 250 • 40
RUC-NLPIR/OmniEval-HallucinationEvaluator
Text Generation • 1
Viewer • 92 • 633 • 23
Viewer • 17.6k • 340k • 819
Preview • 31 • 3
KRLabsOrg/lettucedect-base-modernbert-en-v1
Token Classification • 0.1B • 3.46k • 16
Viewer • 269 • 1.08k • 47