1 13 6

Shi Weikang PRO

swk20

shiwk20

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents

upvoted a paper 2 months ago

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

upvoted a paper 2 months ago

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents

Paper • 2505.21496 • Published May 27 • 39

upvoted 2 papers 2 months ago

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

Paper • 2505.10557 • Published May 15 • 47

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

Paper • 2505.03733 • Published May 6 • 17

upvoted a paper 4 months ago

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 52

liked a dataset 4 months ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10 • 40.3k • 4.83k • 143

upvoted a paper 6 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 280

liked a Space 7 months ago

574

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

reacted to AdinaY's post with 👍 7 months ago

Post

1665

🌊 The wave of reasoning models from the Chinese community has arrived!

🚀 Marco-o1 by AIDC, Alibaba
👉 AIDC-AI/Marco-o1

✨ QwQ by Qwen, Alibaba
👉 Qwen/qwq-674762b79b75eac01735070a

🌟 Skywork-o1 by Kunlun Tech
👉 Skywork/skywork-o1-open-67453df58e12f6c3934738d0

🔥 Xkev/Llama-3.2V-11B-cot by PKU Yuan group
👉 Xkev/Llama-3.2V-11B-cot

💡 DeepSeek-R1-Lite-Preview by DeepSeek AI
👉 https://chat.deepseek.com/

🔍 InternThinker Preview by Shanghai AI Lab
👉 https://sso.openxlab.org.cn/login?redirect=https://internlm-chat.intern-ai.org.cn/&clientId=ebmrvod6yo0nlzaek1yp

📘 k0-math by Moonshot AI
🚀 https://kimi.moonshot.cn/ ( coming soon! )

Who's next? 👀
zh-ai-community/reasoning-models-67409fb3aa1ed78f10087cd7

upvoted a paper 8 months ago

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published Nov 16, 2024 • 47

upvoted 2 papers 9 months ago

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 57

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Paper • 2410.08196 • Published Oct 10, 2024 • 48

commented a paper 10 months ago

Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning

Paper • 2406.10834 • Published Jun 16, 2024 •

liked a model 11 months ago

Qwen/Qwen2-Math-72B-Instruct

Text Generation • 73B • Updated Sep 13, 2024 • 1.14k • • 88

liked a Space 11 months ago

MathGenerator

🏆

liked a dataset 12 months ago

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 3.08k • 464

upvoted a collection 12 months ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 78

upvoted 2 papers over 1 year ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 125

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 160

liked a Space almost 2 years ago

174

Mistral-7B-OpenOrca

🌊

upvoted a paper almost 2 years ago

MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning

Paper • 2310.03731 • Published Oct 5, 2023 • 29