Yitong Li
Lytttttt
AI & ML interests
None yet
Recent Activity
new activity 22 days ago
xlangai/osworld_v2_tasks:Fix evaluators: tasks 077 (L0 false positive) + 079/087/096 (L2 robustness)
updated a dataset about 2 months ago
xlangai/osworld2.0_human_crosscheck
new activity about 2 months ago
xlangai/osworld2.0_human_crosscheck:Add task_004 eval.log (manual cross-check)