721 319

Kai Zuberbühler

kaizuberbuehler · k-zubi

AI & ML interests

language models, agents, image generation, music generation

Organizations

None yet

upvoted a collection 26 days ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated 25 days ago • 144

updated 2 collections 2 months ago

Benchmarks

104 items • Updated May 8 • 3

Code Generation

32 items • Updated May 8 • 2

upvoted a paper 2 months ago

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Paper • 2310.06770 • Published Oct 10, 2023 • 9

updated 2 collections 2 months ago

Code Generation

32 items • Updated May 8 • 2

Agents

123 items • Updated May 8 • 3

upvoted a paper 2 months ago

SWE-smith: Scaling Data for Software Engineering Agents

Paper • 2504.21798 • Published Apr 30 • 10

liked a model 2 months ago

ACE-Step/ACE-Step-v1-3.5B

Text-to-Audio • Updated May 22 • 531

liked a Space 2 months ago

ACE Step

A Step Towards Music Generation Foundation Model

updated a Space 2 months ago

Ai Progress Charts

Generate AI benchmarking plots

updated 2 collections 2 months ago

Vision Language Models

102 items • Updated Apr 26 • 6

Benchmarks

104 items • Updated May 8 • 3

upvoted a paper 2 months ago

VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge

Paper • 2504.10342 • Published Apr 14 • 11

updated 2 collections 2 months ago

Vision Language Models

102 items • Updated Apr 26 • 6

Benchmarks

104 items • Updated May 8 • 3

upvoted a paper 2 months ago

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples

Paper • 2410.14669 • Published Oct 18, 2024 • 40

updated 2 collections 2 months ago

Vision Language Models

102 items • Updated Apr 26 • 6

Agents

123 items • Updated May 8 • 3

upvoted a paper 2 months ago

Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru

Paper • 2503.07587 • Published Mar 10 • 11

updated a collection 2 months ago

Agents

123 items • Updated May 8 • 3