Lawrence Jang

ljang0

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

authored a paper about 2 months ago

The BrowserGym Ecosystem for Web Agent Research

authored a paper about 2 months ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Organizations

None yet

ljang0's activity

authored 3 papers about 2 months ago

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

Paper • 2410.19100 • Published Oct 24, 2024 • 6

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 19

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 50

upvoted 2 papers about 2 months ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 50

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 19

upvoted a paper 3 months ago

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

Paper • 2410.19100 • Published Oct 24, 2024 • 6

commented on a paper 3 months ago

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

Paper • 2410.19100 • Published Oct 24, 2024 • 6

authored 3 papers 5 months ago

VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks

Paper • 2401.13649 • Published Jan 24, 2024 • 1

ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights

Paper • 2406.14596 • Published Jun 20, 2024 • 5

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12, 2024 • 44

upvoted a paper 5 months ago

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12, 2024 • 44

updated 2 datasets 5 months ago

ljang0/code_ppo_java

Viewer • Updated Sep 7, 2024 • 966 • 33

ljang0/code_ppo

Viewer • Updated Sep 7, 2024 • 974 • 46