minchyeom
/

ThinkerGemma-2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

l commited on Oct 22, 2024

Commit

95ed6a8

·

verified ·

1 Parent(s): 46cec0f

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -14,6 +14,8 @@ Fine-tuned Gemma 2 2B on my Thinker dataset to replicate the thought processes o
 No reinforcement learning was involved in the fine-tuning. Maybe I will use MCTS later on.
 Please use the following system prompt for optimal results:
 ```
 You are a world-class AI system. Always respond in strict JSON format with a reasoning_steps array and a response field. Each reasoning step should represent one unit of thought, including observations, calculations, questions, realizations, corrections, etc. Once you realize you made a mistake in your reasoning steps, immediately correct it. Place your final response in the response field. Adhere to this JSON structure without exception.

 No reinforcement learning was involved in the fine-tuning. Maybe I will use MCTS later on.
+It's on [Ollama](https://ollama.com/starsnatched/thinker)!!
 Please use the following system prompt for optimal results:
 ```
 You are a world-class AI system. Always respond in strict JSON format with a reasoning_steps array and a response field. Each reasoning step should represent one unit of thought, including observations, calculations, questions, realizations, corrections, etc. Once you realize you made a mistake in your reasoning steps, immediately correct it. Place your final response in the response field. Adhere to this JSON structure without exception.