Weijie Xu
xwjzds
AI & ML interests
LLM Evaluation @Amazon
Recent Activity
liked
a dataset
23 days ago
sata-bench/sata-bench
authored
a paper
3 months ago
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded
Dialogue Generation
authored
a paper
3 months ago
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large
Language Models