DynaMath Team

university

https://github.com/DynaMath

DynaMath

AI & ML interests

None defined yet.

DynaMath's activity

Ray2333

authored a paper 4 days ago

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published 4 days ago • 38

Ray2333

authored a paper 5 days ago

MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

Paper • 2505.24846 • Published 8 days ago • 15

jyzhang1208

authored a paper 6 days ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published 8 days ago • 89

huanzhang12

authored a paper 4 months ago

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Paper • 2502.13131 • Published Feb 18 • 38

Ray2333

authored a paper 4 months ago

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Paper • 2502.13131 • Published Feb 18 • 38

jyzhang1208

authored 3 papers 4 months ago

Ray2333

authored a paper 4 months ago

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Paper • 2502.09560 • Published Feb 13 • 36

jyzhang1208

authored a paper 4 months ago

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Paper • 2502.09560 • Published Feb 13 • 36

huanzhang12

authored a paper 4 months ago

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Paper • 2502.09560 • Published Feb 13 • 36

optizer

authored 3 papers 7 months ago

COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

Paper • 2402.08679 • Published Feb 13, 2024 • 1

Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra

Paper • 2404.03647 • Published Apr 4, 2024

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

huanzhang12

authored a paper 7 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

jyzhang1208

authored a paper 7 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

OwenZou

authored a paper 7 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

Ray2333

authored a paper 7 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

Ray2333

updated a dataset 7 months ago

DynaMath/DynaMath_Sample

Viewer • Updated Nov 5, 2024 • 5.01k • 362 • 6

OwenZou

updated a dataset 7 months ago

DynaMath/DynaMath_Sample

Viewer • Updated Nov 5, 2024 • 5.01k • 362 • 6

Team members 5