upd org name
README.md CHANGED
@@ -10,7 +10,7 @@ tags:
 - thinking
 base_model: Qwen/Qwen3-4B-Thinking-2507
 datasets:
-- 
+- jupyter-agent/jupyter-agent-dataset
 language:
 - en
 - code
@@ -50,7 +50,7 @@ On the [DABStep benchmark](https://huggingface.co/spaces/adyen/DABstep) for data
 ## Model Sources
 
 - **Repository:** [jupyter-agent](https://github.com/huggingface/jupyter-agent)
-- **Dataset:** [jupyter-agent-dataset](https://huggingface.co/datasets/
+- **Dataset:** [jupyter-agent-dataset](https://huggingface.co/datasets/jupyter-agent/jupyter-agent-dataset)
 - **Blog post:** [Jupyter Agents: training LLMs to reason with notebooks](https://huggingface.co/blog/jupyter-agent-2)
 - **Demo:** [Jupyter Agent 2](https://huggingface.co/spaces/lvwerra/jupyter-agent-2)
 
@@ -61,7 +61,7 @@ On the [DABStep benchmark](https://huggingface.co/spaces/adyen/DABstep) for data
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 
-model_name = "
+model_name = "jupyter-agent/jupyter-agent-qwen3-4b-thinking"
 
 # Load model and tokenizer
 tokenizer = AutoTokenizer.from_pretrained(model_name)
@@ -166,7 +166,7 @@ messages = [
 
 ### Training Data
 
-The model was fine-tuned on the [Jupyter Agent Dataset](https://huggingface.co/datasets/
+The model was fine-tuned on the [Jupyter Agent Dataset](https://huggingface.co/datasets/jupyter-agent/jupyter-agent-dataset), which contains:
 
 - **51,389 synthetic notebooks** (~0.2B tokens, total 1B tokens)
 - **Dataset-grounded QA pairs** from real Kaggle notebooks
@@ -247,14 +247,14 @@ We can also see, that the hard score can increase too even though our dataset is
   author={Baptiste Colle and Hanna Yukhymenko and Leandro von Werra},
   year={2025},
   publisher={Hugging Face},
-  url={https://huggingface.co/
+  url={https://huggingface.co/jupyter-agent/jupyter-agent-qwen3-4b-thinking}
 }
 ```
 
 ## Related Work
 
-- **Dataset:** [jupyter-agent-dataset](https://huggingface.co/datasets/
+- **Dataset:** [jupyter-agent-dataset](https://huggingface.co/datasets/jupyter-agent/jupyter-agent-dataset)
-- **Non-thinking version:** [jupyter-agent-qwen3-4b-instruct](https://huggingface.co/
+- **Non-thinking version:** [jupyter-agent-qwen3-4b-instruct](https://huggingface.co/jupyter-agent/jupyter-agent-qwen3-4b-instruct)
 - **Base model:** [Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507)
 - **Benchmark:** [DABStep](https://huggingface.co/spaces/adyen/DABstep)
 
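For reference, the usage snippet touched by this commit can be assembled into a runnable end-to-end sketch with the renamed checkpoint id. The example prompt, generation settings, and decoding step below are illustrative assumptions, not taken from the card or this diff:

```python
# Minimal usage sketch for the renamed checkpoint. Everything past the
# tokenizer/model loading (prompt, generation settings, decoding) is an
# illustrative assumption, not part of the commit above.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "jupyter-agent/jupyter-agent-qwen3-4b-thinking"

# Load model and tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

# A data-analysis request, matching the model's intended notebook-agent use
messages = [
    {"role": "user", "content": "Load data.csv and report the mean of the 'price' column."},
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Generate and print only the newly produced tokens
output_ids = model.generate(input_ids, max_new_tokens=512)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```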