shreyasmeher
/

Qwen-GLOCON-Reasoning

Text Classification

Model card Files Files and versions Community

shreyasmeher commited on Mar 26

Commit

0df242e

·

verified ·

1 Parent(s): 556ea25

Update README.md

Files changed (1) hide show

README.md +24 -0

README.md CHANGED Viewed

@@ -15,6 +15,30 @@ pipeline_tag: text-classification
 [![Model](https://img.shields.io/badge/Base_Model-Qwen2.5--3B--Instruct-purple)](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct)
 [![License](https://img.shields.io/badge/License-Apache_2.0-red)](https://www.apache.org/licenses/LICENSE-2.0)
 ## Reinforcement Learning Highlights
 Unlike traditional supervised fine-tuning (used in ConflLlama), this model uses GRPO to:
 1. **Optimize multiple reward signals** simultaneously

 [![Model](https://img.shields.io/badge/Base_Model-Qwen2.5--3B--Instruct-purple)](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct)
 [![License](https://img.shields.io/badge/License-Apache_2.0-red)](https://www.apache.org/licenses/LICENSE-2.0)
+## Important Usage Note
+**Essential:** When using this model, you **must** set the prompt as described below to ensure the model follows the required structured reasoning format. Without explicitly setting the prompt, the model's outputs may not adhere to the expected XML structure and reasoning guidelines.
+For instance, include the following prompt in your inference code:
+```python
+prompt = """
+Respond in the following format:
+<reasoning>
+1. Triggers detected: [List any event triggers]
+2. Participants and organizers: [List any actors involved]
+3. Location details: [Specify the location]
+4. Violence assessment: [Indicate if violent or non-violent]
+5. Event category determination: [State and justify the category]
+</reasoning>
+<answer>
+[Final category]
+</answer>
+"""
+```
 ## Reinforcement Learning Highlights
 Unlike traditional supervised fine-tuning (used in ConflLlama), this model uses GRPO to:
 1. **Optimize multiple reward signals** simultaneously