Robin Dehde

Shannonigan

robindehde

AI & ML interests

None yet

Organizations

upvoted a paper about 1 year ago

ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities

Paper • 2408.04682 • Published Aug 8, 2024 • 18

upvoted 2 collections about 1 year ago

UI Agent

a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics • 482 items • Updated 7 days ago • 68

LLMs

468 items • Updated Feb 2 • 43