
aaesha aldahmani

aaeshaaldahmani

AI & ML interests

None yet

Recent Activity

upvoted an article 13 days ago
Integrating benchmarks into LM Evaluation Harness
authored a paper 2 months ago
DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response
upvoted a paper 2 months ago
DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response

Organizations

Random BINSEC AI

upvoted an article 13 days ago
Integrating benchmarks into LM Evaluation Harness
Article • By Neo111x • 13 days ago • 3

authored a paper 2 months ago
DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response
Paper • 2505.19973 • Published May 26 • 3

upvoted a paper 2 months ago
DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response
Paper • 2505.19973 • Published May 26 • 3