-
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Paper • 2501.04519 • Published • 258 -
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics
Paper • 2501.04686 • Published • 50 -
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though
Paper • 2501.04682 • Published • 91 -
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 86
Isaac Kargar
kargarisaac
·
AI & ML interests
Interested in computer vision, nlp, and reinforcement learning
Recent Activity
reacted
to
burtenshaw's
post
with 🚀
9 days ago
AGENTS + FINETUNING! This week Hugging Face learn has a whole pathway on finetuning for agentic applications. You can follow these two courses to get knowledge on levelling up your agent game beyond prompts:
1️⃣ New Supervised Fine-tuning unit in the NLP Course https://huggingface.co/learn/nlp-course/en/chapter11/1
2️⃣New Finetuning for agents bonus module in the Agents Course https://huggingface.co/learn/agents-course/bonus-unit1/introduction
Fine-tuning will squeeze everything out of your model for how you’re using it, more than any prompt.
upvoted
an
article
14 days ago
The Large Language Model Course
upvoted
an
article
17 days ago
Open R1: Update #2
Organizations
Collections
1
models
None public yet
datasets
None public yet