aaesha aldahmani's picture

2

aaesha aldahmani

aaeshaaldahmani

·

AI & ML interests

None yet

Recent Activity

upvoted an article 13 days ago

Integrating benchmarks into LM Evaluation Harness

authored a paper 2 months ago

DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response

upvoted a paper 2 months ago

DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response

View all activity

Organizations

upvoted an article 13 days ago

Article

Integrating benchmarks into LM Evaluation Harness

By

•

13 days ago

• 3

authored a paper 2 months ago

DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response

Paper • 2505.19973 • Published May 26 • 3

upvoted a paper 2 months ago

DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response

Paper • 2505.19973 • Published May 26 • 3