Replicated R1 Strategy on 8*H100 GPUs - For Qwen-2.5-1.5b

#9
by bhaviktheslider - opened

Hello

The model can be useful for community for unstructured text to structured json creation. We replicated R1 strategy for Qwen 2.5 1.5b. Here is the link: MasterControlAIML/DeepSeek-R1-Strategy-Qwen-2.5-1.5b-Unstructured-To-Structured (https://huggingface.co/MasterControlAIML/DeepSeek-R1-Strategy-Qwen-2.5-1.5b-Unstructured-To-Structured)

Sign up or log in to comment