diff --git a/README.md b/README.md index 1d7fe0ace0dce07fcefc1833bf257efbe40d28be..cee3a2d14b0a633b406fd6c1f2fa4330846cc106 100644 --- a/README.md +++ b/README.md @@ -20,7 +20,6 @@ datasets: base_model: - Qwen/Qwen3-4B --- - # ToxiFrench: Benchmarking and Investigating SLMs and CoT Finetuning for French Toxicity Detection @@ -42,6 +41,15 @@ base_model: --- +## Table of Contents +- [Abstract](#abstract) +- [Key Contributions](#key-contributions) +- [How to use ?](#how-to-use) + - [Notations](#notations) + - [Example Usage](#example-usage) +- [License](#license) +- [Citation](#citation) + ## Abstract Despite significant progress in English toxicity detection, performance drastically degrades in other languages like French, a gap stemming from disparities in training corpora and the culturally nuanced nature of toxicity. This paper addresses this critical gap with three key contributions. First, we introduce ToxiFrench, a new public benchmark dataset for French toxicity detection, comprising 53,622 entries. This dataset was constructed using a novel annotation strategy that required manual labeling for only 10% of the data, minimizing effort and error. Second, we conducted a comprehensive evaluation of toxicity detection models. Our findings reveal that while Large Language Models (LLMs) often achieve high performance, Small Language Models (SLMs) can demonstrate greater robustness to bias, better cross-language consistency, and superior generalization to novel forms of toxicity. Third, to identify optimal transfer-learning methods, we conducted a systematic comparison of In-Context Learning (ICL), Supervised Fine-tuning (SFT), and Chain-of-Thought (CoT) reasoning using `Qwen3-4B` and analyzed the impact of data imbalance. We propose a novel approach for CoT fine-tuning that employs a dynamic weighted loss function, significantly boosting performance by ensuring the model's reasoning is faithful to its final conclusion. @@ -58,26 +66,85 @@ Despite significant progress in English toxicity detection, performance drastica --- -## Models overview +## How to use ? This repository contains the **ToxiFrench** model, a **French language model** fine-tuned for **toxic comment classification**. It is based on the [**Qwen/Qwen3-4B**](https://huggingface.co/Qwen/Qwen3-4B) architecture and is designed to detect and classify toxic comments in French text. We performed a series of experiments to evaluate the model's performance under different fine-tuning configurations, focusing on the impact of **data selection strategies** and **Chain-of-Thought (CoT)** annotations. -## Finetuning notations - -Each experiment follows a naming scheme like: **(r/o)(e/d)(a/b)(s/m/l)** -Where: - -- `r` = random order, `o` = ordered (curriculum) -- `e` = equal toxic/non-toxic, `d` = real-world imbalance -- `a` = with CoT finetuning, `b` = without CoT -- `s` = small (100), `m` = medium (1000), `l` = large (all) - -> e.g. `rdal` is the model trained on the natural distribution of toxicity (`d`), on an arbitrary order (`r`), with CoT annotations (`a`), and on the whole dataset (`l`). - -If a label like `` is present in the checkpoint name, it indicates that the CoT that was used during training did not include this specific reasoning step. - +We used QLORA adapters, make sure to specify `adapter_name` when loading the model, otherwise the base model, without any fine-tuning, will be loaded. + +### Notations + +For conciseness, we use a three-letter notation to describe the different configurations of the fine-tuning experiments. Each experiment follows a naming scheme like: **(r/o)(e/d)(c/b)** +Where: + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
ParameterCodeDescription
Data Order[r]Training data is presented in a random order.
[o]Data is ordered (Curriculum Learning).
Class Balance[e]Training set has an equal (balanced) number of toxic and non-toxic samples.
[d]Training set uses a different (imbalanced) class distribution.
Training Target[c]Finetuning on the complete Chain-of-Thought annotation.
[b]Finetuning on the final binary label only (direct classification).
+ +> e.g. `rec` is the model trained on an oversampled dataset for balance, with batches in an arbitrary order (`r`), and with CoT reasoning (`c`). + +### Example Usage + +```python +import torch +from transformers import AutoModelForCausalLM, AutoTokenizer +from peft import PeftModel + +# Choose which adapter to load +target_adapter_name = "rec" # Among the following six configurations : "odc", "oeb", "oec", "rdc", "reb", "rec" + +# Load the base model +base_model_name = "Qwen/Qwen3-4B" +model = AutoModelForCausalLM.from_pretrained(base_model_name, device_map="auto") +tokenizer = AutoTokenizer.from_pretrained(base_model_name) + +# Load the specific adapter by name from the repository +adapter_repo_id = "Naela00/ToxiFrench" +model = PeftModel.from_pretrained( + model, + adapter_repo_id, + adapter_name=target_adapter_name # Precise which experiment to load +) + +print(f"Successfully loaded the '{target_adapter_name}' adapter!") +``` --- @@ -91,6 +158,13 @@ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file If you use this project in your research, please cite it as follows: +```bibtex +@misc{delaval2025toxifrench, + title={ToxiFrench: Benchmarking and Investigating SLMs and CoT Finetuning for French Toxicity Detection}, + author={Axel Delaval}, + year={2025}, +If you use this project in your research, please cite it as follows: + ```bibtex @misc{delaval2025toxifrench, title={ToxiFrench: Benchmarking and Investigating SLMs and CoT Finetuning for French Toxicity Detection}, diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/README.md b/checkpoints/odc_Qwen3-4B/checkpoint-420/README.md similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/README.md rename to checkpoints/odc_Qwen3-4B/checkpoint-420/README.md diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/adapter_config.json b/checkpoints/odc_Qwen3-4B/checkpoint-420/adapter_config.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/adapter_config.json rename to checkpoints/odc_Qwen3-4B/checkpoint-420/adapter_config.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/adapter_model.safetensors b/checkpoints/odc_Qwen3-4B/checkpoint-420/adapter_model.safetensors similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/adapter_model.safetensors rename to checkpoints/odc_Qwen3-4B/checkpoint-420/adapter_model.safetensors diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/added_tokens.json b/checkpoints/odc_Qwen3-4B/checkpoint-420/added_tokens.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/added_tokens.json rename to checkpoints/odc_Qwen3-4B/checkpoint-420/added_tokens.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/chat_template.jinja b/checkpoints/odc_Qwen3-4B/checkpoint-420/chat_template.jinja similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/chat_template.jinja rename to checkpoints/odc_Qwen3-4B/checkpoint-420/chat_template.jinja diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/global_step417/mp_rank_00_model_states.pt b/checkpoints/odc_Qwen3-4B/checkpoint-420/global_step417/mp_rank_00_model_states.pt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/global_step417/mp_rank_00_model_states.pt rename to checkpoints/odc_Qwen3-4B/checkpoint-420/global_step417/mp_rank_00_model_states.pt diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/global_step417/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/odc_Qwen3-4B/checkpoint-420/global_step417/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/global_step417/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/odc_Qwen3-4B/checkpoint-420/global_step417/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/global_step417/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/odc_Qwen3-4B/checkpoint-420/global_step417/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/global_step417/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/odc_Qwen3-4B/checkpoint-420/global_step417/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/latest b/checkpoints/odc_Qwen3-4B/checkpoint-420/latest similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/latest rename to checkpoints/odc_Qwen3-4B/checkpoint-420/latest diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/merges.txt b/checkpoints/odc_Qwen3-4B/checkpoint-420/merges.txt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/merges.txt rename to checkpoints/odc_Qwen3-4B/checkpoint-420/merges.txt diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/rng_state_0.pth b/checkpoints/odc_Qwen3-4B/checkpoint-420/rng_state_0.pth similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/rng_state_0.pth rename to checkpoints/odc_Qwen3-4B/checkpoint-420/rng_state_0.pth diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/rng_state_1.pth b/checkpoints/odc_Qwen3-4B/checkpoint-420/rng_state_1.pth similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/rng_state_1.pth rename to checkpoints/odc_Qwen3-4B/checkpoint-420/rng_state_1.pth diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/special_tokens_map.json b/checkpoints/odc_Qwen3-4B/checkpoint-420/special_tokens_map.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/special_tokens_map.json rename to checkpoints/odc_Qwen3-4B/checkpoint-420/special_tokens_map.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/tokenizer.json b/checkpoints/odc_Qwen3-4B/checkpoint-420/tokenizer.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/tokenizer.json rename to checkpoints/odc_Qwen3-4B/checkpoint-420/tokenizer.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/tokenizer_config.json b/checkpoints/odc_Qwen3-4B/checkpoint-420/tokenizer_config.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/tokenizer_config.json rename to checkpoints/odc_Qwen3-4B/checkpoint-420/tokenizer_config.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/trainer_state.json b/checkpoints/odc_Qwen3-4B/checkpoint-420/trainer_state.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/trainer_state.json rename to checkpoints/odc_Qwen3-4B/checkpoint-420/trainer_state.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/training_args.bin b/checkpoints/odc_Qwen3-4B/checkpoint-420/training_args.bin similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/training_args.bin rename to checkpoints/odc_Qwen3-4B/checkpoint-420/training_args.bin diff --git a/output_odal_Qwen3-4B_-/checkpoint-420/vocab.json b/checkpoints/odc_Qwen3-4B/checkpoint-420/vocab.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-420/vocab.json rename to checkpoints/odc_Qwen3-4B/checkpoint-420/vocab.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/README.md b/checkpoints/odc_Qwen3-4B/checkpoint-540/README.md similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/README.md rename to checkpoints/odc_Qwen3-4B/checkpoint-540/README.md diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/adapter_config.json b/checkpoints/odc_Qwen3-4B/checkpoint-540/adapter_config.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/adapter_config.json rename to checkpoints/odc_Qwen3-4B/checkpoint-540/adapter_config.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/adapter_model.safetensors b/checkpoints/odc_Qwen3-4B/checkpoint-540/adapter_model.safetensors similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/adapter_model.safetensors rename to checkpoints/odc_Qwen3-4B/checkpoint-540/adapter_model.safetensors diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/added_tokens.json b/checkpoints/odc_Qwen3-4B/checkpoint-540/added_tokens.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/added_tokens.json rename to checkpoints/odc_Qwen3-4B/checkpoint-540/added_tokens.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/chat_template.jinja b/checkpoints/odc_Qwen3-4B/checkpoint-540/chat_template.jinja similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/chat_template.jinja rename to checkpoints/odc_Qwen3-4B/checkpoint-540/chat_template.jinja diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/global_step537/mp_rank_00_model_states.pt b/checkpoints/odc_Qwen3-4B/checkpoint-540/global_step537/mp_rank_00_model_states.pt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/global_step537/mp_rank_00_model_states.pt rename to checkpoints/odc_Qwen3-4B/checkpoint-540/global_step537/mp_rank_00_model_states.pt diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/global_step537/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/odc_Qwen3-4B/checkpoint-540/global_step537/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/global_step537/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/odc_Qwen3-4B/checkpoint-540/global_step537/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/global_step537/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/odc_Qwen3-4B/checkpoint-540/global_step537/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/global_step537/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/odc_Qwen3-4B/checkpoint-540/global_step537/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/latest b/checkpoints/odc_Qwen3-4B/checkpoint-540/latest similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/latest rename to checkpoints/odc_Qwen3-4B/checkpoint-540/latest diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/merges.txt b/checkpoints/odc_Qwen3-4B/checkpoint-540/merges.txt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/merges.txt rename to checkpoints/odc_Qwen3-4B/checkpoint-540/merges.txt diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/rng_state_0.pth b/checkpoints/odc_Qwen3-4B/checkpoint-540/rng_state_0.pth similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/rng_state_0.pth rename to checkpoints/odc_Qwen3-4B/checkpoint-540/rng_state_0.pth diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/rng_state_1.pth b/checkpoints/odc_Qwen3-4B/checkpoint-540/rng_state_1.pth similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/rng_state_1.pth rename to checkpoints/odc_Qwen3-4B/checkpoint-540/rng_state_1.pth diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/special_tokens_map.json b/checkpoints/odc_Qwen3-4B/checkpoint-540/special_tokens_map.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/special_tokens_map.json rename to checkpoints/odc_Qwen3-4B/checkpoint-540/special_tokens_map.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/tokenizer.json b/checkpoints/odc_Qwen3-4B/checkpoint-540/tokenizer.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/tokenizer.json rename to checkpoints/odc_Qwen3-4B/checkpoint-540/tokenizer.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/tokenizer_config.json b/checkpoints/odc_Qwen3-4B/checkpoint-540/tokenizer_config.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/tokenizer_config.json rename to checkpoints/odc_Qwen3-4B/checkpoint-540/tokenizer_config.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/trainer_state.json b/checkpoints/odc_Qwen3-4B/checkpoint-540/trainer_state.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/trainer_state.json rename to checkpoints/odc_Qwen3-4B/checkpoint-540/trainer_state.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/training_args.bin b/checkpoints/odc_Qwen3-4B/checkpoint-540/training_args.bin similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/training_args.bin rename to checkpoints/odc_Qwen3-4B/checkpoint-540/training_args.bin diff --git a/output_odal_Qwen3-4B_-/checkpoint-540/vocab.json b/checkpoints/odc_Qwen3-4B/checkpoint-540/vocab.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-540/vocab.json rename to checkpoints/odc_Qwen3-4B/checkpoint-540/vocab.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/README.md b/checkpoints/odc_Qwen3-4B/checkpoint-548/README.md similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/README.md rename to checkpoints/odc_Qwen3-4B/checkpoint-548/README.md diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/adapter_config.json b/checkpoints/odc_Qwen3-4B/checkpoint-548/adapter_config.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/adapter_config.json rename to checkpoints/odc_Qwen3-4B/checkpoint-548/adapter_config.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/adapter_model.safetensors b/checkpoints/odc_Qwen3-4B/checkpoint-548/adapter_model.safetensors similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/adapter_model.safetensors rename to checkpoints/odc_Qwen3-4B/checkpoint-548/adapter_model.safetensors diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/added_tokens.json b/checkpoints/odc_Qwen3-4B/checkpoint-548/added_tokens.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/added_tokens.json rename to checkpoints/odc_Qwen3-4B/checkpoint-548/added_tokens.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/chat_template.jinja b/checkpoints/odc_Qwen3-4B/checkpoint-548/chat_template.jinja similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/chat_template.jinja rename to checkpoints/odc_Qwen3-4B/checkpoint-548/chat_template.jinja diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/global_step544/mp_rank_00_model_states.pt b/checkpoints/odc_Qwen3-4B/checkpoint-548/global_step544/mp_rank_00_model_states.pt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/global_step544/mp_rank_00_model_states.pt rename to checkpoints/odc_Qwen3-4B/checkpoint-548/global_step544/mp_rank_00_model_states.pt diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/global_step544/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/odc_Qwen3-4B/checkpoint-548/global_step544/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/global_step544/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/odc_Qwen3-4B/checkpoint-548/global_step544/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/global_step544/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/odc_Qwen3-4B/checkpoint-548/global_step544/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/global_step544/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/odc_Qwen3-4B/checkpoint-548/global_step544/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/latest b/checkpoints/odc_Qwen3-4B/checkpoint-548/latest similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/latest rename to checkpoints/odc_Qwen3-4B/checkpoint-548/latest diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/merges.txt b/checkpoints/odc_Qwen3-4B/checkpoint-548/merges.txt similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/merges.txt rename to checkpoints/odc_Qwen3-4B/checkpoint-548/merges.txt diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/rng_state_0.pth b/checkpoints/odc_Qwen3-4B/checkpoint-548/rng_state_0.pth similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/rng_state_0.pth rename to checkpoints/odc_Qwen3-4B/checkpoint-548/rng_state_0.pth diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/rng_state_1.pth b/checkpoints/odc_Qwen3-4B/checkpoint-548/rng_state_1.pth similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/rng_state_1.pth rename to checkpoints/odc_Qwen3-4B/checkpoint-548/rng_state_1.pth diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/special_tokens_map.json b/checkpoints/odc_Qwen3-4B/checkpoint-548/special_tokens_map.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/special_tokens_map.json rename to checkpoints/odc_Qwen3-4B/checkpoint-548/special_tokens_map.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/tokenizer.json b/checkpoints/odc_Qwen3-4B/checkpoint-548/tokenizer.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/tokenizer.json rename to checkpoints/odc_Qwen3-4B/checkpoint-548/tokenizer.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/tokenizer_config.json b/checkpoints/odc_Qwen3-4B/checkpoint-548/tokenizer_config.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/tokenizer_config.json rename to checkpoints/odc_Qwen3-4B/checkpoint-548/tokenizer_config.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/trainer_state.json b/checkpoints/odc_Qwen3-4B/checkpoint-548/trainer_state.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/trainer_state.json rename to checkpoints/odc_Qwen3-4B/checkpoint-548/trainer_state.json diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/training_args.bin b/checkpoints/odc_Qwen3-4B/checkpoint-548/training_args.bin similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/training_args.bin rename to checkpoints/odc_Qwen3-4B/checkpoint-548/training_args.bin diff --git a/output_odal_Qwen3-4B_-/checkpoint-548/vocab.json b/checkpoints/odc_Qwen3-4B/checkpoint-548/vocab.json similarity index 100% rename from output_odal_Qwen3-4B_-/checkpoint-548/vocab.json rename to checkpoints/odc_Qwen3-4B/checkpoint-548/vocab.json diff --git a/output_odal_Qwen3-4B_-/final_adapters/README.md b/checkpoints/odc_Qwen3-4B/final_adapters/README.md similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/README.md rename to checkpoints/odc_Qwen3-4B/final_adapters/README.md diff --git a/output_odal_Qwen3-4B_-/final_adapters/adapter_config.json b/checkpoints/odc_Qwen3-4B/final_adapters/adapter_config.json similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/adapter_config.json rename to checkpoints/odc_Qwen3-4B/final_adapters/adapter_config.json diff --git a/output_odal_Qwen3-4B_-/final_adapters/adapter_model.safetensors b/checkpoints/odc_Qwen3-4B/final_adapters/adapter_model.safetensors similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/adapter_model.safetensors rename to checkpoints/odc_Qwen3-4B/final_adapters/adapter_model.safetensors diff --git a/output_odal_Qwen3-4B_-/final_adapters/added_tokens.json b/checkpoints/odc_Qwen3-4B/final_adapters/added_tokens.json similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/added_tokens.json rename to checkpoints/odc_Qwen3-4B/final_adapters/added_tokens.json diff --git a/output_odal_Qwen3-4B_-/final_adapters/chat_template.jinja b/checkpoints/odc_Qwen3-4B/final_adapters/chat_template.jinja similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/chat_template.jinja rename to checkpoints/odc_Qwen3-4B/final_adapters/chat_template.jinja diff --git a/output_odal_Qwen3-4B_-/final_adapters/merges.txt b/checkpoints/odc_Qwen3-4B/final_adapters/merges.txt similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/merges.txt rename to checkpoints/odc_Qwen3-4B/final_adapters/merges.txt diff --git a/output_odal_Qwen3-4B_-/final_adapters/special_tokens_map.json b/checkpoints/odc_Qwen3-4B/final_adapters/special_tokens_map.json similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/special_tokens_map.json rename to checkpoints/odc_Qwen3-4B/final_adapters/special_tokens_map.json diff --git a/output_odal_Qwen3-4B_-/final_adapters/tokenizer.json b/checkpoints/odc_Qwen3-4B/final_adapters/tokenizer.json similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/tokenizer.json rename to checkpoints/odc_Qwen3-4B/final_adapters/tokenizer.json diff --git a/output_odal_Qwen3-4B_-/final_adapters/tokenizer_config.json b/checkpoints/odc_Qwen3-4B/final_adapters/tokenizer_config.json similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/tokenizer_config.json rename to checkpoints/odc_Qwen3-4B/final_adapters/tokenizer_config.json diff --git a/output_odal_Qwen3-4B_-/final_adapters/training_args.bin b/checkpoints/odc_Qwen3-4B/final_adapters/training_args.bin similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/training_args.bin rename to checkpoints/odc_Qwen3-4B/final_adapters/training_args.bin diff --git a/output_odal_Qwen3-4B_-/final_adapters/vocab.json b/checkpoints/odc_Qwen3-4B/final_adapters/vocab.json similarity index 100% rename from output_odal_Qwen3-4B_-/final_adapters/vocab.json rename to checkpoints/odc_Qwen3-4B/final_adapters/vocab.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/README.md b/checkpoints/oeb_Qwen3-4B/checkpoint-100/README.md similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/README.md rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/README.md diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/adapter_config.json b/checkpoints/oeb_Qwen3-4B/checkpoint-100/adapter_config.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/adapter_config.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/adapter_config.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/adapter_model.safetensors b/checkpoints/oeb_Qwen3-4B/checkpoint-100/adapter_model.safetensors similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/adapter_model.safetensors rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/adapter_model.safetensors diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/added_tokens.json b/checkpoints/oeb_Qwen3-4B/checkpoint-100/added_tokens.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/added_tokens.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/added_tokens.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/chat_template.jinja b/checkpoints/oeb_Qwen3-4B/checkpoint-100/chat_template.jinja similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/chat_template.jinja rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/chat_template.jinja diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/global_step96/mp_rank_00_model_states.pt b/checkpoints/oeb_Qwen3-4B/checkpoint-100/global_step96/mp_rank_00_model_states.pt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/global_step96/mp_rank_00_model_states.pt rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/global_step96/mp_rank_00_model_states.pt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/global_step96/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/oeb_Qwen3-4B/checkpoint-100/global_step96/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/global_step96/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/global_step96/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/global_step96/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/oeb_Qwen3-4B/checkpoint-100/global_step96/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/global_step96/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/global_step96/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/latest b/checkpoints/oeb_Qwen3-4B/checkpoint-100/latest similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/latest rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/latest diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/merges.txt b/checkpoints/oeb_Qwen3-4B/checkpoint-100/merges.txt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/merges.txt rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/merges.txt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/rng_state_0.pth b/checkpoints/oeb_Qwen3-4B/checkpoint-100/rng_state_0.pth similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/rng_state_0.pth rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/rng_state_0.pth diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/rng_state_1.pth b/checkpoints/oeb_Qwen3-4B/checkpoint-100/rng_state_1.pth similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/rng_state_1.pth rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/rng_state_1.pth diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/special_tokens_map.json b/checkpoints/oeb_Qwen3-4B/checkpoint-100/special_tokens_map.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/special_tokens_map.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/special_tokens_map.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/tokenizer.json b/checkpoints/oeb_Qwen3-4B/checkpoint-100/tokenizer.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/tokenizer.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/tokenizer.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/tokenizer_config.json b/checkpoints/oeb_Qwen3-4B/checkpoint-100/tokenizer_config.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/tokenizer_config.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/tokenizer_config.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/trainer_state.json b/checkpoints/oeb_Qwen3-4B/checkpoint-100/trainer_state.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/trainer_state.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/trainer_state.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/training_args.bin b/checkpoints/oeb_Qwen3-4B/checkpoint-100/training_args.bin similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/training_args.bin rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/training_args.bin diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/vocab.json b/checkpoints/oeb_Qwen3-4B/checkpoint-100/vocab.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/vocab.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-100/vocab.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/README.md b/checkpoints/oeb_Qwen3-4B/checkpoint-140/README.md similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/README.md rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/README.md diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/adapter_config.json b/checkpoints/oeb_Qwen3-4B/checkpoint-140/adapter_config.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/adapter_config.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/adapter_config.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/adapter_model.safetensors b/checkpoints/oeb_Qwen3-4B/checkpoint-140/adapter_model.safetensors similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/adapter_model.safetensors rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/adapter_model.safetensors diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/added_tokens.json b/checkpoints/oeb_Qwen3-4B/checkpoint-140/added_tokens.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/added_tokens.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/added_tokens.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/chat_template.jinja b/checkpoints/oeb_Qwen3-4B/checkpoint-140/chat_template.jinja similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/chat_template.jinja rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/chat_template.jinja diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/global_step134/mp_rank_00_model_states.pt b/checkpoints/oeb_Qwen3-4B/checkpoint-140/global_step134/mp_rank_00_model_states.pt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/global_step134/mp_rank_00_model_states.pt rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/global_step134/mp_rank_00_model_states.pt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/global_step134/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/oeb_Qwen3-4B/checkpoint-140/global_step134/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/global_step134/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/global_step134/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/global_step134/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/oeb_Qwen3-4B/checkpoint-140/global_step134/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/global_step134/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/global_step134/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/latest b/checkpoints/oeb_Qwen3-4B/checkpoint-140/latest similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/latest rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/latest diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/merges.txt b/checkpoints/oeb_Qwen3-4B/checkpoint-140/merges.txt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/merges.txt rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/merges.txt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/rng_state_0.pth b/checkpoints/oeb_Qwen3-4B/checkpoint-140/rng_state_0.pth similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/rng_state_0.pth rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/rng_state_0.pth diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/rng_state_1.pth b/checkpoints/oeb_Qwen3-4B/checkpoint-140/rng_state_1.pth similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/rng_state_1.pth rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/rng_state_1.pth diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/special_tokens_map.json b/checkpoints/oeb_Qwen3-4B/checkpoint-140/special_tokens_map.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/special_tokens_map.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/special_tokens_map.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/tokenizer.json b/checkpoints/oeb_Qwen3-4B/checkpoint-140/tokenizer.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/tokenizer.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/tokenizer.json diff --git a/output_oeal_Qwen3-4B_-/final_adapters/tokenizer_config.json b/checkpoints/oeb_Qwen3-4B/checkpoint-140/tokenizer_config.json similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/tokenizer_config.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/tokenizer_config.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/trainer_state.json b/checkpoints/oeb_Qwen3-4B/checkpoint-140/trainer_state.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/trainer_state.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/trainer_state.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/training_args.bin b/checkpoints/oeb_Qwen3-4B/checkpoint-140/training_args.bin similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/training_args.bin rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/training_args.bin diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/vocab.json b/checkpoints/oeb_Qwen3-4B/checkpoint-140/vocab.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/vocab.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-140/vocab.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/README.md b/checkpoints/oeb_Qwen3-4B/checkpoint-50/README.md similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/README.md rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/README.md diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/adapter_config.json b/checkpoints/oeb_Qwen3-4B/checkpoint-50/adapter_config.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/adapter_config.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/adapter_config.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/adapter_model.safetensors b/checkpoints/oeb_Qwen3-4B/checkpoint-50/adapter_model.safetensors similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/adapter_model.safetensors rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/adapter_model.safetensors diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/added_tokens.json b/checkpoints/oeb_Qwen3-4B/checkpoint-50/added_tokens.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/added_tokens.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/added_tokens.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/chat_template.jinja b/checkpoints/oeb_Qwen3-4B/checkpoint-50/chat_template.jinja similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/chat_template.jinja rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/chat_template.jinja diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/global_step48/mp_rank_00_model_states.pt b/checkpoints/oeb_Qwen3-4B/checkpoint-50/global_step48/mp_rank_00_model_states.pt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/global_step48/mp_rank_00_model_states.pt rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/global_step48/mp_rank_00_model_states.pt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/global_step48/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/oeb_Qwen3-4B/checkpoint-50/global_step48/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/global_step48/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/global_step48/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/global_step48/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/oeb_Qwen3-4B/checkpoint-50/global_step48/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/global_step48/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/global_step48/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/latest b/checkpoints/oeb_Qwen3-4B/checkpoint-50/latest similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/latest rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/latest diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/merges.txt b/checkpoints/oeb_Qwen3-4B/checkpoint-50/merges.txt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/merges.txt rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/merges.txt diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/rng_state_0.pth b/checkpoints/oeb_Qwen3-4B/checkpoint-50/rng_state_0.pth similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/rng_state_0.pth rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/rng_state_0.pth diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/rng_state_1.pth b/checkpoints/oeb_Qwen3-4B/checkpoint-50/rng_state_1.pth similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/rng_state_1.pth rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/rng_state_1.pth diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/special_tokens_map.json b/checkpoints/oeb_Qwen3-4B/checkpoint-50/special_tokens_map.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/special_tokens_map.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/special_tokens_map.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/tokenizer.json b/checkpoints/oeb_Qwen3-4B/checkpoint-50/tokenizer.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/tokenizer.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/tokenizer.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/tokenizer_config.json b/checkpoints/oeb_Qwen3-4B/checkpoint-50/tokenizer_config.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/tokenizer_config.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/tokenizer_config.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/trainer_state.json b/checkpoints/oeb_Qwen3-4B/checkpoint-50/trainer_state.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/trainer_state.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/trainer_state.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/training_args.bin b/checkpoints/oeb_Qwen3-4B/checkpoint-50/training_args.bin similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/training_args.bin rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/training_args.bin diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/vocab.json b/checkpoints/oeb_Qwen3-4B/checkpoint-50/vocab.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/vocab.json rename to checkpoints/oeb_Qwen3-4B/checkpoint-50/vocab.json diff --git a/output_oeal_Qwen3-4B_-/final_adapters/README.md b/checkpoints/oeb_Qwen3-4B/final_adapters/README.md similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/README.md rename to checkpoints/oeb_Qwen3-4B/final_adapters/README.md diff --git a/output_oebl_Qwen3-4B_-/final_adapters/adapter_config.json b/checkpoints/oeb_Qwen3-4B/final_adapters/adapter_config.json similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/adapter_config.json rename to checkpoints/oeb_Qwen3-4B/final_adapters/adapter_config.json diff --git a/output_oebl_Qwen3-4B_-/final_adapters/adapter_model.safetensors b/checkpoints/oeb_Qwen3-4B/final_adapters/adapter_model.safetensors similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/adapter_model.safetensors rename to checkpoints/oeb_Qwen3-4B/final_adapters/adapter_model.safetensors diff --git a/output_oeal_Qwen3-4B_-/final_adapters/added_tokens.json b/checkpoints/oeb_Qwen3-4B/final_adapters/added_tokens.json similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/added_tokens.json rename to checkpoints/oeb_Qwen3-4B/final_adapters/added_tokens.json diff --git a/output_oeal_Qwen3-4B_-/final_adapters/chat_template.jinja b/checkpoints/oeb_Qwen3-4B/final_adapters/chat_template.jinja similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/chat_template.jinja rename to checkpoints/oeb_Qwen3-4B/final_adapters/chat_template.jinja diff --git a/output_oeal_Qwen3-4B_-/final_adapters/merges.txt b/checkpoints/oeb_Qwen3-4B/final_adapters/merges.txt similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/merges.txt rename to checkpoints/oeb_Qwen3-4B/final_adapters/merges.txt diff --git a/output_oeal_Qwen3-4B_-/final_adapters/special_tokens_map.json b/checkpoints/oeb_Qwen3-4B/final_adapters/special_tokens_map.json similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/special_tokens_map.json rename to checkpoints/oeb_Qwen3-4B/final_adapters/special_tokens_map.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/tokenizer.json b/checkpoints/oeb_Qwen3-4B/final_adapters/tokenizer.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/tokenizer.json rename to checkpoints/oeb_Qwen3-4B/final_adapters/tokenizer.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/tokenizer_config.json b/checkpoints/oeb_Qwen3-4B/final_adapters/tokenizer_config.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/tokenizer_config.json rename to checkpoints/oeb_Qwen3-4B/final_adapters/tokenizer_config.json diff --git a/output_oebl_Qwen3-4B_-/final_adapters/training_args.bin b/checkpoints/oeb_Qwen3-4B/final_adapters/training_args.bin similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/training_args.bin rename to checkpoints/oeb_Qwen3-4B/final_adapters/training_args.bin diff --git a/output_oeal_Qwen3-4B_-/final_adapters/vocab.json b/checkpoints/oeb_Qwen3-4B/final_adapters/vocab.json similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/vocab.json rename to checkpoints/oeb_Qwen3-4B/final_adapters/vocab.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/README.md b/checkpoints/oec_Qwen3-4B/checkpoint-600/README.md similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/README.md rename to checkpoints/oec_Qwen3-4B/checkpoint-600/README.md diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/adapter_config.json b/checkpoints/oec_Qwen3-4B/checkpoint-600/adapter_config.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/adapter_config.json rename to checkpoints/oec_Qwen3-4B/checkpoint-600/adapter_config.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/adapter_model.safetensors b/checkpoints/oec_Qwen3-4B/checkpoint-600/adapter_model.safetensors similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/adapter_model.safetensors rename to checkpoints/oec_Qwen3-4B/checkpoint-600/adapter_model.safetensors diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/added_tokens.json b/checkpoints/oec_Qwen3-4B/checkpoint-600/added_tokens.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/added_tokens.json rename to checkpoints/oec_Qwen3-4B/checkpoint-600/added_tokens.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/chat_template.jinja b/checkpoints/oec_Qwen3-4B/checkpoint-600/chat_template.jinja similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/chat_template.jinja rename to checkpoints/oec_Qwen3-4B/checkpoint-600/chat_template.jinja diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/global_step598/mp_rank_00_model_states.pt b/checkpoints/oec_Qwen3-4B/checkpoint-600/global_step598/mp_rank_00_model_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/global_step598/mp_rank_00_model_states.pt rename to checkpoints/oec_Qwen3-4B/checkpoint-600/global_step598/mp_rank_00_model_states.pt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/oec_Qwen3-4B/checkpoint-600/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/oec_Qwen3-4B/checkpoint-600/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/oec_Qwen3-4B/checkpoint-600/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/oec_Qwen3-4B/checkpoint-600/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/latest b/checkpoints/oec_Qwen3-4B/checkpoint-600/latest similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/latest rename to checkpoints/oec_Qwen3-4B/checkpoint-600/latest diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/merges.txt b/checkpoints/oec_Qwen3-4B/checkpoint-600/merges.txt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/merges.txt rename to checkpoints/oec_Qwen3-4B/checkpoint-600/merges.txt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/rng_state_0.pth b/checkpoints/oec_Qwen3-4B/checkpoint-600/rng_state_0.pth similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/rng_state_0.pth rename to checkpoints/oec_Qwen3-4B/checkpoint-600/rng_state_0.pth diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/rng_state_1.pth b/checkpoints/oec_Qwen3-4B/checkpoint-600/rng_state_1.pth similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/rng_state_1.pth rename to checkpoints/oec_Qwen3-4B/checkpoint-600/rng_state_1.pth diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/special_tokens_map.json b/checkpoints/oec_Qwen3-4B/checkpoint-600/special_tokens_map.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/special_tokens_map.json rename to checkpoints/oec_Qwen3-4B/checkpoint-600/special_tokens_map.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/tokenizer.json b/checkpoints/oec_Qwen3-4B/checkpoint-600/tokenizer.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/tokenizer.json rename to checkpoints/oec_Qwen3-4B/checkpoint-600/tokenizer.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/tokenizer_config.json b/checkpoints/oec_Qwen3-4B/checkpoint-600/tokenizer_config.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/tokenizer_config.json rename to checkpoints/oec_Qwen3-4B/checkpoint-600/tokenizer_config.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/trainer_state.json b/checkpoints/oec_Qwen3-4B/checkpoint-600/trainer_state.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/trainer_state.json rename to checkpoints/oec_Qwen3-4B/checkpoint-600/trainer_state.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-600/training_args.bin b/checkpoints/oec_Qwen3-4B/checkpoint-600/training_args.bin similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-600/training_args.bin rename to checkpoints/oec_Qwen3-4B/checkpoint-600/training_args.bin diff --git a/output_oebl_Qwen3-4B_-/checkpoint-100/vocab.json b/checkpoints/oec_Qwen3-4B/checkpoint-600/vocab.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-100/vocab.json rename to checkpoints/oec_Qwen3-4B/checkpoint-600/vocab.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/README.md b/checkpoints/oec_Qwen3-4B/checkpoint-825/README.md similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/README.md rename to checkpoints/oec_Qwen3-4B/checkpoint-825/README.md diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/adapter_config.json b/checkpoints/oec_Qwen3-4B/checkpoint-825/adapter_config.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/adapter_config.json rename to checkpoints/oec_Qwen3-4B/checkpoint-825/adapter_config.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/adapter_model.safetensors b/checkpoints/oec_Qwen3-4B/checkpoint-825/adapter_model.safetensors similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/adapter_model.safetensors rename to checkpoints/oec_Qwen3-4B/checkpoint-825/adapter_model.safetensors diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/added_tokens.json b/checkpoints/oec_Qwen3-4B/checkpoint-825/added_tokens.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/added_tokens.json rename to checkpoints/oec_Qwen3-4B/checkpoint-825/added_tokens.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/chat_template.jinja b/checkpoints/oec_Qwen3-4B/checkpoint-825/chat_template.jinja similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/chat_template.jinja rename to checkpoints/oec_Qwen3-4B/checkpoint-825/chat_template.jinja diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/global_step823/mp_rank_00_model_states.pt b/checkpoints/oec_Qwen3-4B/checkpoint-825/global_step823/mp_rank_00_model_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/global_step823/mp_rank_00_model_states.pt rename to checkpoints/oec_Qwen3-4B/checkpoint-825/global_step823/mp_rank_00_model_states.pt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/global_step823/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/oec_Qwen3-4B/checkpoint-825/global_step823/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/global_step823/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/oec_Qwen3-4B/checkpoint-825/global_step823/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/global_step823/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/oec_Qwen3-4B/checkpoint-825/global_step823/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/global_step823/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/oec_Qwen3-4B/checkpoint-825/global_step823/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/latest b/checkpoints/oec_Qwen3-4B/checkpoint-825/latest similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/latest rename to checkpoints/oec_Qwen3-4B/checkpoint-825/latest diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/merges.txt b/checkpoints/oec_Qwen3-4B/checkpoint-825/merges.txt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/merges.txt rename to checkpoints/oec_Qwen3-4B/checkpoint-825/merges.txt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/rng_state_0.pth b/checkpoints/oec_Qwen3-4B/checkpoint-825/rng_state_0.pth similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/rng_state_0.pth rename to checkpoints/oec_Qwen3-4B/checkpoint-825/rng_state_0.pth diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/rng_state_1.pth b/checkpoints/oec_Qwen3-4B/checkpoint-825/rng_state_1.pth similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/rng_state_1.pth rename to checkpoints/oec_Qwen3-4B/checkpoint-825/rng_state_1.pth diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/special_tokens_map.json b/checkpoints/oec_Qwen3-4B/checkpoint-825/special_tokens_map.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/special_tokens_map.json rename to checkpoints/oec_Qwen3-4B/checkpoint-825/special_tokens_map.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/tokenizer.json b/checkpoints/oec_Qwen3-4B/checkpoint-825/tokenizer.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/tokenizer.json rename to checkpoints/oec_Qwen3-4B/checkpoint-825/tokenizer.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/tokenizer_config.json b/checkpoints/oec_Qwen3-4B/checkpoint-825/tokenizer_config.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/tokenizer_config.json rename to checkpoints/oec_Qwen3-4B/checkpoint-825/tokenizer_config.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/trainer_state.json b/checkpoints/oec_Qwen3-4B/checkpoint-825/trainer_state.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/trainer_state.json rename to checkpoints/oec_Qwen3-4B/checkpoint-825/trainer_state.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-825/training_args.bin b/checkpoints/oec_Qwen3-4B/checkpoint-825/training_args.bin similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-825/training_args.bin rename to checkpoints/oec_Qwen3-4B/checkpoint-825/training_args.bin diff --git a/output_oebl_Qwen3-4B_-/checkpoint-140/vocab.json b/checkpoints/oec_Qwen3-4B/checkpoint-825/vocab.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-140/vocab.json rename to checkpoints/oec_Qwen3-4B/checkpoint-825/vocab.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/README.md b/checkpoints/oec_Qwen3-4B/checkpoint-840/README.md similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/README.md rename to checkpoints/oec_Qwen3-4B/checkpoint-840/README.md diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/adapter_config.json b/checkpoints/oec_Qwen3-4B/checkpoint-840/adapter_config.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/adapter_config.json rename to checkpoints/oec_Qwen3-4B/checkpoint-840/adapter_config.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/adapter_model.safetensors b/checkpoints/oec_Qwen3-4B/checkpoint-840/adapter_model.safetensors similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/adapter_model.safetensors rename to checkpoints/oec_Qwen3-4B/checkpoint-840/adapter_model.safetensors diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/added_tokens.json b/checkpoints/oec_Qwen3-4B/checkpoint-840/added_tokens.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/added_tokens.json rename to checkpoints/oec_Qwen3-4B/checkpoint-840/added_tokens.json diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/chat_template.jinja b/checkpoints/oec_Qwen3-4B/checkpoint-840/chat_template.jinja similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/chat_template.jinja rename to checkpoints/oec_Qwen3-4B/checkpoint-840/chat_template.jinja diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/global_step838/mp_rank_00_model_states.pt b/checkpoints/oec_Qwen3-4B/checkpoint-840/global_step838/mp_rank_00_model_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/global_step838/mp_rank_00_model_states.pt rename to checkpoints/oec_Qwen3-4B/checkpoint-840/global_step838/mp_rank_00_model_states.pt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/global_step838/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/oec_Qwen3-4B/checkpoint-840/global_step838/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/global_step838/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/oec_Qwen3-4B/checkpoint-840/global_step838/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/global_step838/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/oec_Qwen3-4B/checkpoint-840/global_step838/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/global_step838/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/oec_Qwen3-4B/checkpoint-840/global_step838/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/latest b/checkpoints/oec_Qwen3-4B/checkpoint-840/latest similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/latest rename to checkpoints/oec_Qwen3-4B/checkpoint-840/latest diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/merges.txt b/checkpoints/oec_Qwen3-4B/checkpoint-840/merges.txt similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/merges.txt rename to checkpoints/oec_Qwen3-4B/checkpoint-840/merges.txt diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/rng_state_0.pth b/checkpoints/oec_Qwen3-4B/checkpoint-840/rng_state_0.pth similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/rng_state_0.pth rename to checkpoints/oec_Qwen3-4B/checkpoint-840/rng_state_0.pth diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/rng_state_1.pth b/checkpoints/oec_Qwen3-4B/checkpoint-840/rng_state_1.pth similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/rng_state_1.pth rename to checkpoints/oec_Qwen3-4B/checkpoint-840/rng_state_1.pth diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/special_tokens_map.json b/checkpoints/oec_Qwen3-4B/checkpoint-840/special_tokens_map.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/special_tokens_map.json rename to checkpoints/oec_Qwen3-4B/checkpoint-840/special_tokens_map.json diff --git a/output_oebl_Qwen3-4B_-/final_adapters/tokenizer.json b/checkpoints/oec_Qwen3-4B/checkpoint-840/tokenizer.json similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/tokenizer.json rename to checkpoints/oec_Qwen3-4B/checkpoint-840/tokenizer.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/tokenizer_config.json b/checkpoints/oec_Qwen3-4B/checkpoint-840/tokenizer_config.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/tokenizer_config.json rename to checkpoints/oec_Qwen3-4B/checkpoint-840/tokenizer_config.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/trainer_state.json b/checkpoints/oec_Qwen3-4B/checkpoint-840/trainer_state.json similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/trainer_state.json rename to checkpoints/oec_Qwen3-4B/checkpoint-840/trainer_state.json diff --git a/output_oeal_Qwen3-4B_-/checkpoint-840/training_args.bin b/checkpoints/oec_Qwen3-4B/checkpoint-840/training_args.bin similarity index 100% rename from output_oeal_Qwen3-4B_-/checkpoint-840/training_args.bin rename to checkpoints/oec_Qwen3-4B/checkpoint-840/training_args.bin diff --git a/output_oebl_Qwen3-4B_-/checkpoint-50/vocab.json b/checkpoints/oec_Qwen3-4B/checkpoint-840/vocab.json similarity index 100% rename from output_oebl_Qwen3-4B_-/checkpoint-50/vocab.json rename to checkpoints/oec_Qwen3-4B/checkpoint-840/vocab.json diff --git a/output_oebl_Qwen3-4B_-/final_adapters/README.md b/checkpoints/oec_Qwen3-4B/final_adapters/README.md similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/README.md rename to checkpoints/oec_Qwen3-4B/final_adapters/README.md diff --git a/output_oeal_Qwen3-4B_-/final_adapters/adapter_config.json b/checkpoints/oec_Qwen3-4B/final_adapters/adapter_config.json similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/adapter_config.json rename to checkpoints/oec_Qwen3-4B/final_adapters/adapter_config.json diff --git a/output_oeal_Qwen3-4B_-/final_adapters/adapter_model.safetensors b/checkpoints/oec_Qwen3-4B/final_adapters/adapter_model.safetensors similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/adapter_model.safetensors rename to checkpoints/oec_Qwen3-4B/final_adapters/adapter_model.safetensors diff --git a/output_oebl_Qwen3-4B_-/final_adapters/added_tokens.json b/checkpoints/oec_Qwen3-4B/final_adapters/added_tokens.json similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/added_tokens.json rename to checkpoints/oec_Qwen3-4B/final_adapters/added_tokens.json diff --git a/output_oebl_Qwen3-4B_-/final_adapters/chat_template.jinja b/checkpoints/oec_Qwen3-4B/final_adapters/chat_template.jinja similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/chat_template.jinja rename to checkpoints/oec_Qwen3-4B/final_adapters/chat_template.jinja diff --git a/output_oeal_Qwen3-4B_-/final_adapters/global_step598/mp_rank_00_model_states.pt b/checkpoints/oec_Qwen3-4B/final_adapters/global_step598/mp_rank_00_model_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/global_step598/mp_rank_00_model_states.pt rename to checkpoints/oec_Qwen3-4B/final_adapters/global_step598/mp_rank_00_model_states.pt diff --git a/output_oeal_Qwen3-4B_-/final_adapters/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/oec_Qwen3-4B/final_adapters/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/oec_Qwen3-4B/final_adapters/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_oeal_Qwen3-4B_-/final_adapters/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/oec_Qwen3-4B/final_adapters/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/oec_Qwen3-4B/final_adapters/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_oeal_Qwen3-4B_-/final_adapters/latest b/checkpoints/oec_Qwen3-4B/final_adapters/latest similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/latest rename to checkpoints/oec_Qwen3-4B/final_adapters/latest diff --git a/output_oebl_Qwen3-4B_-/final_adapters/merges.txt b/checkpoints/oec_Qwen3-4B/final_adapters/merges.txt similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/merges.txt rename to checkpoints/oec_Qwen3-4B/final_adapters/merges.txt diff --git a/output_oeal_Qwen3-4B_-/final_adapters/rng_state_0.pth b/checkpoints/oec_Qwen3-4B/final_adapters/rng_state_0.pth similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/rng_state_0.pth rename to checkpoints/oec_Qwen3-4B/final_adapters/rng_state_0.pth diff --git a/output_oeal_Qwen3-4B_-/final_adapters/rng_state_1.pth b/checkpoints/oec_Qwen3-4B/final_adapters/rng_state_1.pth similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/rng_state_1.pth rename to checkpoints/oec_Qwen3-4B/final_adapters/rng_state_1.pth diff --git a/output_oebl_Qwen3-4B_-/final_adapters/special_tokens_map.json b/checkpoints/oec_Qwen3-4B/final_adapters/special_tokens_map.json similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/special_tokens_map.json rename to checkpoints/oec_Qwen3-4B/final_adapters/special_tokens_map.json diff --git a/output_oeal_Qwen3-4B_-/final_adapters/tokenizer.json b/checkpoints/oec_Qwen3-4B/final_adapters/tokenizer.json similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/tokenizer.json rename to checkpoints/oec_Qwen3-4B/final_adapters/tokenizer.json diff --git a/output_oebl_Qwen3-4B_-/final_adapters/tokenizer_config.json b/checkpoints/oec_Qwen3-4B/final_adapters/tokenizer_config.json similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/tokenizer_config.json rename to checkpoints/oec_Qwen3-4B/final_adapters/tokenizer_config.json diff --git a/output_oeal_Qwen3-4B_-/final_adapters/trainer_state.json b/checkpoints/oec_Qwen3-4B/final_adapters/trainer_state.json similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/trainer_state.json rename to checkpoints/oec_Qwen3-4B/final_adapters/trainer_state.json diff --git a/output_oeal_Qwen3-4B_-/final_adapters/training_args.bin b/checkpoints/oec_Qwen3-4B/final_adapters/training_args.bin similarity index 100% rename from output_oeal_Qwen3-4B_-/final_adapters/training_args.bin rename to checkpoints/oec_Qwen3-4B/final_adapters/training_args.bin diff --git a/output_oebl_Qwen3-4B_-/final_adapters/vocab.json b/checkpoints/oec_Qwen3-4B/final_adapters/vocab.json similarity index 100% rename from output_oebl_Qwen3-4B_-/final_adapters/vocab.json rename to checkpoints/oec_Qwen3-4B/final_adapters/vocab.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/README.md b/checkpoints/rdc_Qwen3-4B/checkpoint-300/README.md similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/README.md rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/README.md diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/adapter_config.json b/checkpoints/rdc_Qwen3-4B/checkpoint-300/adapter_config.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/adapter_config.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/adapter_config.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/adapter_model.safetensors b/checkpoints/rdc_Qwen3-4B/checkpoint-300/adapter_model.safetensors similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/adapter_model.safetensors rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/adapter_model.safetensors diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/added_tokens.json b/checkpoints/rdc_Qwen3-4B/checkpoint-300/added_tokens.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/added_tokens.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/added_tokens.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/chat_template.jinja b/checkpoints/rdc_Qwen3-4B/checkpoint-300/chat_template.jinja similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/chat_template.jinja rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/chat_template.jinja diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/global_step298/mp_rank_00_model_states.pt b/checkpoints/rdc_Qwen3-4B/checkpoint-300/global_step298/mp_rank_00_model_states.pt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/global_step298/mp_rank_00_model_states.pt rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/global_step298/mp_rank_00_model_states.pt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/global_step298/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/rdc_Qwen3-4B/checkpoint-300/global_step298/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/global_step298/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/global_step298/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/global_step298/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/rdc_Qwen3-4B/checkpoint-300/global_step298/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/global_step298/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/global_step298/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/latest b/checkpoints/rdc_Qwen3-4B/checkpoint-300/latest similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/latest rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/latest diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/merges.txt b/checkpoints/rdc_Qwen3-4B/checkpoint-300/merges.txt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/merges.txt rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/merges.txt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/rng_state_0.pth b/checkpoints/rdc_Qwen3-4B/checkpoint-300/rng_state_0.pth similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/rng_state_0.pth rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/rng_state_0.pth diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/rng_state_1.pth b/checkpoints/rdc_Qwen3-4B/checkpoint-300/rng_state_1.pth similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/rng_state_1.pth rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/rng_state_1.pth diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/special_tokens_map.json b/checkpoints/rdc_Qwen3-4B/checkpoint-300/special_tokens_map.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/special_tokens_map.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/special_tokens_map.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/tokenizer.json b/checkpoints/rdc_Qwen3-4B/checkpoint-300/tokenizer.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/tokenizer.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/tokenizer.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/tokenizer_config.json b/checkpoints/rdc_Qwen3-4B/checkpoint-300/tokenizer_config.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/tokenizer_config.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/tokenizer_config.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/trainer_state.json b/checkpoints/rdc_Qwen3-4B/checkpoint-300/trainer_state.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/trainer_state.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/trainer_state.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/training_args.bin b/checkpoints/rdc_Qwen3-4B/checkpoint-300/training_args.bin similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/training_args.bin rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/training_args.bin diff --git a/output_rdal_Qwen3-4B_-/checkpoint-300/vocab.json b/checkpoints/rdc_Qwen3-4B/checkpoint-300/vocab.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-300/vocab.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-300/vocab.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/README.md b/checkpoints/rdc_Qwen3-4B/checkpoint-540/README.md similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/README.md rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/README.md diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/adapter_config.json b/checkpoints/rdc_Qwen3-4B/checkpoint-540/adapter_config.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/adapter_config.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/adapter_config.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/adapter_model.safetensors b/checkpoints/rdc_Qwen3-4B/checkpoint-540/adapter_model.safetensors similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/adapter_model.safetensors rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/adapter_model.safetensors diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/added_tokens.json b/checkpoints/rdc_Qwen3-4B/checkpoint-540/added_tokens.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/added_tokens.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/added_tokens.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/chat_template.jinja b/checkpoints/rdc_Qwen3-4B/checkpoint-540/chat_template.jinja similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/chat_template.jinja rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/chat_template.jinja diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/global_step537/mp_rank_00_model_states.pt b/checkpoints/rdc_Qwen3-4B/checkpoint-540/global_step537/mp_rank_00_model_states.pt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/global_step537/mp_rank_00_model_states.pt rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/global_step537/mp_rank_00_model_states.pt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/global_step537/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/rdc_Qwen3-4B/checkpoint-540/global_step537/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/global_step537/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/global_step537/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/global_step537/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/rdc_Qwen3-4B/checkpoint-540/global_step537/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/global_step537/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/global_step537/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/latest b/checkpoints/rdc_Qwen3-4B/checkpoint-540/latest similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/latest rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/latest diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/merges.txt b/checkpoints/rdc_Qwen3-4B/checkpoint-540/merges.txt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/merges.txt rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/merges.txt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/rng_state_0.pth b/checkpoints/rdc_Qwen3-4B/checkpoint-540/rng_state_0.pth similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/rng_state_0.pth rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/rng_state_0.pth diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/rng_state_1.pth b/checkpoints/rdc_Qwen3-4B/checkpoint-540/rng_state_1.pth similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/rng_state_1.pth rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/rng_state_1.pth diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/special_tokens_map.json b/checkpoints/rdc_Qwen3-4B/checkpoint-540/special_tokens_map.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/special_tokens_map.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/special_tokens_map.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/tokenizer.json b/checkpoints/rdc_Qwen3-4B/checkpoint-540/tokenizer.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/tokenizer.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/tokenizer.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/tokenizer_config.json b/checkpoints/rdc_Qwen3-4B/checkpoint-540/tokenizer_config.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/tokenizer_config.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/tokenizer_config.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/trainer_state.json b/checkpoints/rdc_Qwen3-4B/checkpoint-540/trainer_state.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/trainer_state.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/trainer_state.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/training_args.bin b/checkpoints/rdc_Qwen3-4B/checkpoint-540/training_args.bin similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/training_args.bin rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/training_args.bin diff --git a/output_rdal_Qwen3-4B_-/checkpoint-540/vocab.json b/checkpoints/rdc_Qwen3-4B/checkpoint-540/vocab.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-540/vocab.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-540/vocab.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/README.md b/checkpoints/rdc_Qwen3-4B/checkpoint-548/README.md similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/README.md rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/README.md diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/adapter_config.json b/checkpoints/rdc_Qwen3-4B/checkpoint-548/adapter_config.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/adapter_config.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/adapter_config.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/adapter_model.safetensors b/checkpoints/rdc_Qwen3-4B/checkpoint-548/adapter_model.safetensors similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/adapter_model.safetensors rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/adapter_model.safetensors diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/added_tokens.json b/checkpoints/rdc_Qwen3-4B/checkpoint-548/added_tokens.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/added_tokens.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/added_tokens.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/chat_template.jinja b/checkpoints/rdc_Qwen3-4B/checkpoint-548/chat_template.jinja similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/chat_template.jinja rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/chat_template.jinja diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/global_step544/mp_rank_00_model_states.pt b/checkpoints/rdc_Qwen3-4B/checkpoint-548/global_step544/mp_rank_00_model_states.pt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/global_step544/mp_rank_00_model_states.pt rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/global_step544/mp_rank_00_model_states.pt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/global_step544/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/rdc_Qwen3-4B/checkpoint-548/global_step544/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/global_step544/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/global_step544/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/global_step544/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/rdc_Qwen3-4B/checkpoint-548/global_step544/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/global_step544/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/global_step544/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/latest b/checkpoints/rdc_Qwen3-4B/checkpoint-548/latest similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/latest rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/latest diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/merges.txt b/checkpoints/rdc_Qwen3-4B/checkpoint-548/merges.txt similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/merges.txt rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/merges.txt diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/rng_state_0.pth b/checkpoints/rdc_Qwen3-4B/checkpoint-548/rng_state_0.pth similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/rng_state_0.pth rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/rng_state_0.pth diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/rng_state_1.pth b/checkpoints/rdc_Qwen3-4B/checkpoint-548/rng_state_1.pth similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/rng_state_1.pth rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/rng_state_1.pth diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/special_tokens_map.json b/checkpoints/rdc_Qwen3-4B/checkpoint-548/special_tokens_map.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/special_tokens_map.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/special_tokens_map.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/tokenizer.json b/checkpoints/rdc_Qwen3-4B/checkpoint-548/tokenizer.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/tokenizer.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/tokenizer.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/tokenizer_config.json b/checkpoints/rdc_Qwen3-4B/checkpoint-548/tokenizer_config.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/tokenizer_config.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/tokenizer_config.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/trainer_state.json b/checkpoints/rdc_Qwen3-4B/checkpoint-548/trainer_state.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/trainer_state.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/trainer_state.json diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/training_args.bin b/checkpoints/rdc_Qwen3-4B/checkpoint-548/training_args.bin similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/training_args.bin rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/training_args.bin diff --git a/output_rdal_Qwen3-4B_-/checkpoint-548/vocab.json b/checkpoints/rdc_Qwen3-4B/checkpoint-548/vocab.json similarity index 100% rename from output_rdal_Qwen3-4B_-/checkpoint-548/vocab.json rename to checkpoints/rdc_Qwen3-4B/checkpoint-548/vocab.json diff --git a/output_rdal_Qwen3-4B_-/final_adapters/README.md b/checkpoints/rdc_Qwen3-4B/final_adapters/README.md similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/README.md rename to checkpoints/rdc_Qwen3-4B/final_adapters/README.md diff --git a/output_rdal_Qwen3-4B_-/final_adapters/adapter_config.json b/checkpoints/rdc_Qwen3-4B/final_adapters/adapter_config.json similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/adapter_config.json rename to checkpoints/rdc_Qwen3-4B/final_adapters/adapter_config.json diff --git a/output_rdal_Qwen3-4B_-/final_adapters/adapter_model.safetensors b/checkpoints/rdc_Qwen3-4B/final_adapters/adapter_model.safetensors similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/adapter_model.safetensors rename to checkpoints/rdc_Qwen3-4B/final_adapters/adapter_model.safetensors diff --git a/output_rdal_Qwen3-4B_-/final_adapters/added_tokens.json b/checkpoints/rdc_Qwen3-4B/final_adapters/added_tokens.json similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/added_tokens.json rename to checkpoints/rdc_Qwen3-4B/final_adapters/added_tokens.json diff --git a/output_rdal_Qwen3-4B_-/final_adapters/chat_template.jinja b/checkpoints/rdc_Qwen3-4B/final_adapters/chat_template.jinja similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/chat_template.jinja rename to checkpoints/rdc_Qwen3-4B/final_adapters/chat_template.jinja diff --git a/output_rdal_Qwen3-4B_-/final_adapters/merges.txt b/checkpoints/rdc_Qwen3-4B/final_adapters/merges.txt similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/merges.txt rename to checkpoints/rdc_Qwen3-4B/final_adapters/merges.txt diff --git a/output_rdal_Qwen3-4B_-/final_adapters/special_tokens_map.json b/checkpoints/rdc_Qwen3-4B/final_adapters/special_tokens_map.json similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/special_tokens_map.json rename to checkpoints/rdc_Qwen3-4B/final_adapters/special_tokens_map.json diff --git a/output_rdal_Qwen3-4B_-/final_adapters/tokenizer.json b/checkpoints/rdc_Qwen3-4B/final_adapters/tokenizer.json similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/tokenizer.json rename to checkpoints/rdc_Qwen3-4B/final_adapters/tokenizer.json diff --git a/output_rdal_Qwen3-4B_-/final_adapters/tokenizer_config.json b/checkpoints/rdc_Qwen3-4B/final_adapters/tokenizer_config.json similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/tokenizer_config.json rename to checkpoints/rdc_Qwen3-4B/final_adapters/tokenizer_config.json diff --git a/output_rdal_Qwen3-4B_-/final_adapters/training_args.bin b/checkpoints/rdc_Qwen3-4B/final_adapters/training_args.bin similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/training_args.bin rename to checkpoints/rdc_Qwen3-4B/final_adapters/training_args.bin diff --git a/output_rdal_Qwen3-4B_-/final_adapters/vocab.json b/checkpoints/rdc_Qwen3-4B/final_adapters/vocab.json similarity index 100% rename from output_rdal_Qwen3-4B_-/final_adapters/vocab.json rename to checkpoints/rdc_Qwen3-4B/final_adapters/vocab.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/README.md b/checkpoints/reb_Qwen3-4B/checkpoint-1035/README.md similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/README.md rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/README.md diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/adapter_config.json b/checkpoints/reb_Qwen3-4B/checkpoint-1035/adapter_config.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/adapter_config.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/adapter_config.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/adapter_model.safetensors b/checkpoints/reb_Qwen3-4B/checkpoint-1035/adapter_model.safetensors similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/adapter_model.safetensors rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/adapter_model.safetensors diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/added_tokens.json b/checkpoints/reb_Qwen3-4B/checkpoint-1035/added_tokens.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/added_tokens.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/added_tokens.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/chat_template.jinja b/checkpoints/reb_Qwen3-4B/checkpoint-1035/chat_template.jinja similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/chat_template.jinja rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/chat_template.jinja diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/global_step1032/mp_rank_00_model_states.pt b/checkpoints/reb_Qwen3-4B/checkpoint-1035/global_step1032/mp_rank_00_model_states.pt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/global_step1032/mp_rank_00_model_states.pt rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/global_step1032/mp_rank_00_model_states.pt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/global_step1032/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/reb_Qwen3-4B/checkpoint-1035/global_step1032/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/global_step1032/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/global_step1032/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/global_step1032/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/reb_Qwen3-4B/checkpoint-1035/global_step1032/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/global_step1032/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/global_step1032/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/latest b/checkpoints/reb_Qwen3-4B/checkpoint-1035/latest similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/latest rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/latest diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/merges.txt b/checkpoints/reb_Qwen3-4B/checkpoint-1035/merges.txt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/merges.txt rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/merges.txt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/rng_state_0.pth b/checkpoints/reb_Qwen3-4B/checkpoint-1035/rng_state_0.pth similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/rng_state_0.pth rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/rng_state_0.pth diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/rng_state_1.pth b/checkpoints/reb_Qwen3-4B/checkpoint-1035/rng_state_1.pth similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/rng_state_1.pth rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/rng_state_1.pth diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/special_tokens_map.json b/checkpoints/reb_Qwen3-4B/checkpoint-1035/special_tokens_map.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/special_tokens_map.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/special_tokens_map.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/tokenizer.json b/checkpoints/reb_Qwen3-4B/checkpoint-1035/tokenizer.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/tokenizer.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/tokenizer.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/tokenizer_config.json b/checkpoints/reb_Qwen3-4B/checkpoint-1035/tokenizer_config.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/tokenizer_config.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/tokenizer_config.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/trainer_state.json b/checkpoints/reb_Qwen3-4B/checkpoint-1035/trainer_state.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/trainer_state.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/trainer_state.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/training_args.bin b/checkpoints/reb_Qwen3-4B/checkpoint-1035/training_args.bin similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/training_args.bin rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/training_args.bin diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/vocab.json b/checkpoints/reb_Qwen3-4B/checkpoint-1035/vocab.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/vocab.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1035/vocab.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/README.md b/checkpoints/reb_Qwen3-4B/checkpoint-1048/README.md similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/README.md rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/README.md diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/adapter_config.json b/checkpoints/reb_Qwen3-4B/checkpoint-1048/adapter_config.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/adapter_config.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/adapter_config.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/adapter_model.safetensors b/checkpoints/reb_Qwen3-4B/checkpoint-1048/adapter_model.safetensors similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/adapter_model.safetensors rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/adapter_model.safetensors diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/added_tokens.json b/checkpoints/reb_Qwen3-4B/checkpoint-1048/added_tokens.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/added_tokens.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/added_tokens.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/chat_template.jinja b/checkpoints/reb_Qwen3-4B/checkpoint-1048/chat_template.jinja similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/chat_template.jinja rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/chat_template.jinja diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/global_step1044/mp_rank_00_model_states.pt b/checkpoints/reb_Qwen3-4B/checkpoint-1048/global_step1044/mp_rank_00_model_states.pt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/global_step1044/mp_rank_00_model_states.pt rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/global_step1044/mp_rank_00_model_states.pt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/global_step1044/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/reb_Qwen3-4B/checkpoint-1048/global_step1044/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/global_step1044/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/global_step1044/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/global_step1044/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/reb_Qwen3-4B/checkpoint-1048/global_step1044/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/global_step1044/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/global_step1044/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/latest b/checkpoints/reb_Qwen3-4B/checkpoint-1048/latest similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/latest rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/latest diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/merges.txt b/checkpoints/reb_Qwen3-4B/checkpoint-1048/merges.txt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/merges.txt rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/merges.txt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/rng_state_0.pth b/checkpoints/reb_Qwen3-4B/checkpoint-1048/rng_state_0.pth similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/rng_state_0.pth rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/rng_state_0.pth diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/rng_state_1.pth b/checkpoints/reb_Qwen3-4B/checkpoint-1048/rng_state_1.pth similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/rng_state_1.pth rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/rng_state_1.pth diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/special_tokens_map.json b/checkpoints/reb_Qwen3-4B/checkpoint-1048/special_tokens_map.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/special_tokens_map.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/special_tokens_map.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/tokenizer.json b/checkpoints/reb_Qwen3-4B/checkpoint-1048/tokenizer.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/tokenizer.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/tokenizer.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/tokenizer_config.json b/checkpoints/reb_Qwen3-4B/checkpoint-1048/tokenizer_config.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/tokenizer_config.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/tokenizer_config.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/trainer_state.json b/checkpoints/reb_Qwen3-4B/checkpoint-1048/trainer_state.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/trainer_state.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/trainer_state.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/training_args.bin b/checkpoints/reb_Qwen3-4B/checkpoint-1048/training_args.bin similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/training_args.bin rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/training_args.bin diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/vocab.json b/checkpoints/reb_Qwen3-4B/checkpoint-1048/vocab.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/vocab.json rename to checkpoints/reb_Qwen3-4B/checkpoint-1048/vocab.json diff --git a/output_real_Qwen3-4B_-/checkpoint-915/README.md b/checkpoints/reb_Qwen3-4B/checkpoint-600/README.md similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/README.md rename to checkpoints/reb_Qwen3-4B/checkpoint-600/README.md diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/adapter_config.json b/checkpoints/reb_Qwen3-4B/checkpoint-600/adapter_config.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/adapter_config.json rename to checkpoints/reb_Qwen3-4B/checkpoint-600/adapter_config.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/adapter_model.safetensors b/checkpoints/reb_Qwen3-4B/checkpoint-600/adapter_model.safetensors similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/adapter_model.safetensors rename to checkpoints/reb_Qwen3-4B/checkpoint-600/adapter_model.safetensors diff --git a/output_real_Qwen3-4B_-/checkpoint-915/added_tokens.json b/checkpoints/reb_Qwen3-4B/checkpoint-600/added_tokens.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/added_tokens.json rename to checkpoints/reb_Qwen3-4B/checkpoint-600/added_tokens.json diff --git a/output_real_Qwen3-4B_-/checkpoint-915/chat_template.jinja b/checkpoints/reb_Qwen3-4B/checkpoint-600/chat_template.jinja similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/chat_template.jinja rename to checkpoints/reb_Qwen3-4B/checkpoint-600/chat_template.jinja diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/global_step598/mp_rank_00_model_states.pt b/checkpoints/reb_Qwen3-4B/checkpoint-600/global_step598/mp_rank_00_model_states.pt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/global_step598/mp_rank_00_model_states.pt rename to checkpoints/reb_Qwen3-4B/checkpoint-600/global_step598/mp_rank_00_model_states.pt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/reb_Qwen3-4B/checkpoint-600/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/reb_Qwen3-4B/checkpoint-600/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/reb_Qwen3-4B/checkpoint-600/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/reb_Qwen3-4B/checkpoint-600/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/latest b/checkpoints/reb_Qwen3-4B/checkpoint-600/latest similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/latest rename to checkpoints/reb_Qwen3-4B/checkpoint-600/latest diff --git a/output_real_Qwen3-4B_-/checkpoint-915/merges.txt b/checkpoints/reb_Qwen3-4B/checkpoint-600/merges.txt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/merges.txt rename to checkpoints/reb_Qwen3-4B/checkpoint-600/merges.txt diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/rng_state_0.pth b/checkpoints/reb_Qwen3-4B/checkpoint-600/rng_state_0.pth similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/rng_state_0.pth rename to checkpoints/reb_Qwen3-4B/checkpoint-600/rng_state_0.pth diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/rng_state_1.pth b/checkpoints/reb_Qwen3-4B/checkpoint-600/rng_state_1.pth similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/rng_state_1.pth rename to checkpoints/reb_Qwen3-4B/checkpoint-600/rng_state_1.pth diff --git a/output_real_Qwen3-4B_-/checkpoint-915/special_tokens_map.json b/checkpoints/reb_Qwen3-4B/checkpoint-600/special_tokens_map.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/special_tokens_map.json rename to checkpoints/reb_Qwen3-4B/checkpoint-600/special_tokens_map.json diff --git a/output_real_Qwen3-4B_-/checkpoint-915/tokenizer.json b/checkpoints/reb_Qwen3-4B/checkpoint-600/tokenizer.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/tokenizer.json rename to checkpoints/reb_Qwen3-4B/checkpoint-600/tokenizer.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/tokenizer_config.json b/checkpoints/reb_Qwen3-4B/checkpoint-600/tokenizer_config.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/tokenizer_config.json rename to checkpoints/reb_Qwen3-4B/checkpoint-600/tokenizer_config.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/trainer_state.json b/checkpoints/reb_Qwen3-4B/checkpoint-600/trainer_state.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/trainer_state.json rename to checkpoints/reb_Qwen3-4B/checkpoint-600/trainer_state.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/training_args.bin b/checkpoints/reb_Qwen3-4B/checkpoint-600/training_args.bin similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/training_args.bin rename to checkpoints/reb_Qwen3-4B/checkpoint-600/training_args.bin diff --git a/output_real_Qwen3-4B_-/checkpoint-915/vocab.json b/checkpoints/reb_Qwen3-4B/checkpoint-600/vocab.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/vocab.json rename to checkpoints/reb_Qwen3-4B/checkpoint-600/vocab.json diff --git a/output_real_Qwen3-4B_-/final_adapters/README.md b/checkpoints/reb_Qwen3-4B/final_adapters/README.md similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/README.md rename to checkpoints/reb_Qwen3-4B/final_adapters/README.md diff --git a/output_rebl_Qwen3-4B_-/final_adapters/adapter_config.json b/checkpoints/reb_Qwen3-4B/final_adapters/adapter_config.json similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/adapter_config.json rename to checkpoints/reb_Qwen3-4B/final_adapters/adapter_config.json diff --git a/output_rebl_Qwen3-4B_-/final_adapters/adapter_model.safetensors b/checkpoints/reb_Qwen3-4B/final_adapters/adapter_model.safetensors similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/adapter_model.safetensors rename to checkpoints/reb_Qwen3-4B/final_adapters/adapter_model.safetensors diff --git a/output_real_Qwen3-4B_-/final_adapters/added_tokens.json b/checkpoints/reb_Qwen3-4B/final_adapters/added_tokens.json similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/added_tokens.json rename to checkpoints/reb_Qwen3-4B/final_adapters/added_tokens.json diff --git a/output_real_Qwen3-4B_-/final_adapters/chat_template.jinja b/checkpoints/reb_Qwen3-4B/final_adapters/chat_template.jinja similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/chat_template.jinja rename to checkpoints/reb_Qwen3-4B/final_adapters/chat_template.jinja diff --git a/output_real_Qwen3-4B_-/final_adapters/merges.txt b/checkpoints/reb_Qwen3-4B/final_adapters/merges.txt similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/merges.txt rename to checkpoints/reb_Qwen3-4B/final_adapters/merges.txt diff --git a/output_real_Qwen3-4B_-/final_adapters/special_tokens_map.json b/checkpoints/reb_Qwen3-4B/final_adapters/special_tokens_map.json similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/special_tokens_map.json rename to checkpoints/reb_Qwen3-4B/final_adapters/special_tokens_map.json diff --git a/output_real_Qwen3-4B_-/final_adapters/tokenizer.json b/checkpoints/reb_Qwen3-4B/final_adapters/tokenizer.json similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/tokenizer.json rename to checkpoints/reb_Qwen3-4B/final_adapters/tokenizer.json diff --git a/output_rebl_Qwen3-4B_-/final_adapters/tokenizer_config.json b/checkpoints/reb_Qwen3-4B/final_adapters/tokenizer_config.json similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/tokenizer_config.json rename to checkpoints/reb_Qwen3-4B/final_adapters/tokenizer_config.json diff --git a/output_rebl_Qwen3-4B_-/final_adapters/training_args.bin b/checkpoints/reb_Qwen3-4B/final_adapters/training_args.bin similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/training_args.bin rename to checkpoints/reb_Qwen3-4B/final_adapters/training_args.bin diff --git a/output_real_Qwen3-4B_-/final_adapters/vocab.json b/checkpoints/reb_Qwen3-4B/final_adapters/vocab.json similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/vocab.json rename to checkpoints/reb_Qwen3-4B/final_adapters/vocab.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/README.md b/checkpoints/rec_Qwen3-4B/checkpoint-1095/README.md similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/README.md rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/README.md diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/adapter_config.json b/checkpoints/rec_Qwen3-4B/checkpoint-1095/adapter_config.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/adapter_config.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/adapter_config.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/adapter_model.safetensors b/checkpoints/rec_Qwen3-4B/checkpoint-1095/adapter_model.safetensors similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/adapter_model.safetensors rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/adapter_model.safetensors diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/added_tokens.json b/checkpoints/rec_Qwen3-4B/checkpoint-1095/added_tokens.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/added_tokens.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/added_tokens.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/chat_template.jinja b/checkpoints/rec_Qwen3-4B/checkpoint-1095/chat_template.jinja similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/chat_template.jinja rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/chat_template.jinja diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/global_step1091/mp_rank_00_model_states.pt b/checkpoints/rec_Qwen3-4B/checkpoint-1095/global_step1091/mp_rank_00_model_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/global_step1091/mp_rank_00_model_states.pt rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/global_step1091/mp_rank_00_model_states.pt diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/global_step1091/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/rec_Qwen3-4B/checkpoint-1095/global_step1091/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/global_step1091/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/global_step1091/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/global_step1091/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/rec_Qwen3-4B/checkpoint-1095/global_step1091/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/global_step1091/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/global_step1091/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/latest b/checkpoints/rec_Qwen3-4B/checkpoint-1095/latest similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/latest rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/latest diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/merges.txt b/checkpoints/rec_Qwen3-4B/checkpoint-1095/merges.txt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/merges.txt rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/merges.txt diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/rng_state_0.pth b/checkpoints/rec_Qwen3-4B/checkpoint-1095/rng_state_0.pth similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/rng_state_0.pth rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/rng_state_0.pth diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/rng_state_1.pth b/checkpoints/rec_Qwen3-4B/checkpoint-1095/rng_state_1.pth similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/rng_state_1.pth rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/rng_state_1.pth diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/special_tokens_map.json b/checkpoints/rec_Qwen3-4B/checkpoint-1095/special_tokens_map.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/special_tokens_map.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/special_tokens_map.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/tokenizer.json b/checkpoints/rec_Qwen3-4B/checkpoint-1095/tokenizer.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/tokenizer.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/tokenizer.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/tokenizer_config.json b/checkpoints/rec_Qwen3-4B/checkpoint-1095/tokenizer_config.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/tokenizer_config.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/tokenizer_config.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/trainer_state.json b/checkpoints/rec_Qwen3-4B/checkpoint-1095/trainer_state.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/trainer_state.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/trainer_state.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1095/training_args.bin b/checkpoints/rec_Qwen3-4B/checkpoint-1095/training_args.bin similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1095/training_args.bin rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/training_args.bin diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1035/vocab.json b/checkpoints/rec_Qwen3-4B/checkpoint-1095/vocab.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1035/vocab.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1095/vocab.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/README.md b/checkpoints/rec_Qwen3-4B/checkpoint-1110/README.md similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/README.md rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/README.md diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/adapter_config.json b/checkpoints/rec_Qwen3-4B/checkpoint-1110/adapter_config.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/adapter_config.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/adapter_config.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/adapter_model.safetensors b/checkpoints/rec_Qwen3-4B/checkpoint-1110/adapter_model.safetensors similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/adapter_model.safetensors rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/adapter_model.safetensors diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/added_tokens.json b/checkpoints/rec_Qwen3-4B/checkpoint-1110/added_tokens.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/added_tokens.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/added_tokens.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/chat_template.jinja b/checkpoints/rec_Qwen3-4B/checkpoint-1110/chat_template.jinja similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/chat_template.jinja rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/chat_template.jinja diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/global_step1106/mp_rank_00_model_states.pt b/checkpoints/rec_Qwen3-4B/checkpoint-1110/global_step1106/mp_rank_00_model_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/global_step1106/mp_rank_00_model_states.pt rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/global_step1106/mp_rank_00_model_states.pt diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/global_step1106/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/rec_Qwen3-4B/checkpoint-1110/global_step1106/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/global_step1106/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/global_step1106/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/global_step1106/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/rec_Qwen3-4B/checkpoint-1110/global_step1106/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/global_step1106/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/global_step1106/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/latest b/checkpoints/rec_Qwen3-4B/checkpoint-1110/latest similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/latest rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/latest diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/merges.txt b/checkpoints/rec_Qwen3-4B/checkpoint-1110/merges.txt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/merges.txt rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/merges.txt diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/rng_state_0.pth b/checkpoints/rec_Qwen3-4B/checkpoint-1110/rng_state_0.pth similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/rng_state_0.pth rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/rng_state_0.pth diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/rng_state_1.pth b/checkpoints/rec_Qwen3-4B/checkpoint-1110/rng_state_1.pth similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/rng_state_1.pth rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/rng_state_1.pth diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/special_tokens_map.json b/checkpoints/rec_Qwen3-4B/checkpoint-1110/special_tokens_map.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/special_tokens_map.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/special_tokens_map.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/tokenizer.json b/checkpoints/rec_Qwen3-4B/checkpoint-1110/tokenizer.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/tokenizer.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/tokenizer.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/tokenizer_config.json b/checkpoints/rec_Qwen3-4B/checkpoint-1110/tokenizer_config.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/tokenizer_config.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/tokenizer_config.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/trainer_state.json b/checkpoints/rec_Qwen3-4B/checkpoint-1110/trainer_state.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/trainer_state.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/trainer_state.json diff --git a/output_real_Qwen3-4B_-/checkpoint-1110/training_args.bin b/checkpoints/rec_Qwen3-4B/checkpoint-1110/training_args.bin similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-1110/training_args.bin rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/training_args.bin diff --git a/output_rebl_Qwen3-4B_-/checkpoint-1048/vocab.json b/checkpoints/rec_Qwen3-4B/checkpoint-1110/vocab.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-1048/vocab.json rename to checkpoints/rec_Qwen3-4B/checkpoint-1110/vocab.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/README.md b/checkpoints/rec_Qwen3-4B/checkpoint-915/README.md similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/README.md rename to checkpoints/rec_Qwen3-4B/checkpoint-915/README.md diff --git a/output_real_Qwen3-4B_-/checkpoint-915/adapter_config.json b/checkpoints/rec_Qwen3-4B/checkpoint-915/adapter_config.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/adapter_config.json rename to checkpoints/rec_Qwen3-4B/checkpoint-915/adapter_config.json diff --git a/output_real_Qwen3-4B_-/checkpoint-915/adapter_model.safetensors b/checkpoints/rec_Qwen3-4B/checkpoint-915/adapter_model.safetensors similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/adapter_model.safetensors rename to checkpoints/rec_Qwen3-4B/checkpoint-915/adapter_model.safetensors diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/added_tokens.json b/checkpoints/rec_Qwen3-4B/checkpoint-915/added_tokens.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/added_tokens.json rename to checkpoints/rec_Qwen3-4B/checkpoint-915/added_tokens.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/chat_template.jinja b/checkpoints/rec_Qwen3-4B/checkpoint-915/chat_template.jinja similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/chat_template.jinja rename to checkpoints/rec_Qwen3-4B/checkpoint-915/chat_template.jinja diff --git a/output_real_Qwen3-4B_-/checkpoint-915/global_step912/mp_rank_00_model_states.pt b/checkpoints/rec_Qwen3-4B/checkpoint-915/global_step912/mp_rank_00_model_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/global_step912/mp_rank_00_model_states.pt rename to checkpoints/rec_Qwen3-4B/checkpoint-915/global_step912/mp_rank_00_model_states.pt diff --git a/output_real_Qwen3-4B_-/checkpoint-915/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/rec_Qwen3-4B/checkpoint-915/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/rec_Qwen3-4B/checkpoint-915/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_real_Qwen3-4B_-/checkpoint-915/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/rec_Qwen3-4B/checkpoint-915/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/rec_Qwen3-4B/checkpoint-915/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_real_Qwen3-4B_-/checkpoint-915/latest b/checkpoints/rec_Qwen3-4B/checkpoint-915/latest similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/latest rename to checkpoints/rec_Qwen3-4B/checkpoint-915/latest diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/merges.txt b/checkpoints/rec_Qwen3-4B/checkpoint-915/merges.txt similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/merges.txt rename to checkpoints/rec_Qwen3-4B/checkpoint-915/merges.txt diff --git a/output_real_Qwen3-4B_-/checkpoint-915/rng_state_0.pth b/checkpoints/rec_Qwen3-4B/checkpoint-915/rng_state_0.pth similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/rng_state_0.pth rename to checkpoints/rec_Qwen3-4B/checkpoint-915/rng_state_0.pth diff --git a/output_real_Qwen3-4B_-/checkpoint-915/rng_state_1.pth b/checkpoints/rec_Qwen3-4B/checkpoint-915/rng_state_1.pth similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/rng_state_1.pth rename to checkpoints/rec_Qwen3-4B/checkpoint-915/rng_state_1.pth diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/special_tokens_map.json b/checkpoints/rec_Qwen3-4B/checkpoint-915/special_tokens_map.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/special_tokens_map.json rename to checkpoints/rec_Qwen3-4B/checkpoint-915/special_tokens_map.json diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/tokenizer.json b/checkpoints/rec_Qwen3-4B/checkpoint-915/tokenizer.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/tokenizer.json rename to checkpoints/rec_Qwen3-4B/checkpoint-915/tokenizer.json diff --git a/output_real_Qwen3-4B_-/checkpoint-915/tokenizer_config.json b/checkpoints/rec_Qwen3-4B/checkpoint-915/tokenizer_config.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/tokenizer_config.json rename to checkpoints/rec_Qwen3-4B/checkpoint-915/tokenizer_config.json diff --git a/output_real_Qwen3-4B_-/checkpoint-915/trainer_state.json b/checkpoints/rec_Qwen3-4B/checkpoint-915/trainer_state.json similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/trainer_state.json rename to checkpoints/rec_Qwen3-4B/checkpoint-915/trainer_state.json diff --git a/output_real_Qwen3-4B_-/checkpoint-915/training_args.bin b/checkpoints/rec_Qwen3-4B/checkpoint-915/training_args.bin similarity index 100% rename from output_real_Qwen3-4B_-/checkpoint-915/training_args.bin rename to checkpoints/rec_Qwen3-4B/checkpoint-915/training_args.bin diff --git a/output_rebl_Qwen3-4B_-/checkpoint-600/vocab.json b/checkpoints/rec_Qwen3-4B/checkpoint-915/vocab.json similarity index 100% rename from output_rebl_Qwen3-4B_-/checkpoint-600/vocab.json rename to checkpoints/rec_Qwen3-4B/checkpoint-915/vocab.json diff --git a/output_rebl_Qwen3-4B_-/final_adapters/README.md b/checkpoints/rec_Qwen3-4B/final_adapters/README.md similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/README.md rename to checkpoints/rec_Qwen3-4B/final_adapters/README.md diff --git a/output_real_Qwen3-4B_-/final_adapters/adapter_config.json b/checkpoints/rec_Qwen3-4B/final_adapters/adapter_config.json similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/adapter_config.json rename to checkpoints/rec_Qwen3-4B/final_adapters/adapter_config.json diff --git a/output_real_Qwen3-4B_-/final_adapters/adapter_model.safetensors b/checkpoints/rec_Qwen3-4B/final_adapters/adapter_model.safetensors similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/adapter_model.safetensors rename to checkpoints/rec_Qwen3-4B/final_adapters/adapter_model.safetensors diff --git a/output_rebl_Qwen3-4B_-/final_adapters/added_tokens.json b/checkpoints/rec_Qwen3-4B/final_adapters/added_tokens.json similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/added_tokens.json rename to checkpoints/rec_Qwen3-4B/final_adapters/added_tokens.json diff --git a/output_rebl_Qwen3-4B_-/final_adapters/chat_template.jinja b/checkpoints/rec_Qwen3-4B/final_adapters/chat_template.jinja similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/chat_template.jinja rename to checkpoints/rec_Qwen3-4B/final_adapters/chat_template.jinja diff --git a/output_real_Qwen3-4B_-/final_adapters/global_step912/mp_rank_00_model_states.pt b/checkpoints/rec_Qwen3-4B/final_adapters/global_step912/mp_rank_00_model_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/global_step912/mp_rank_00_model_states.pt rename to checkpoints/rec_Qwen3-4B/final_adapters/global_step912/mp_rank_00_model_states.pt diff --git a/output_real_Qwen3-4B_-/final_adapters/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt b/checkpoints/rec_Qwen3-4B/final_adapters/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt rename to checkpoints/rec_Qwen3-4B/final_adapters/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt diff --git a/output_real_Qwen3-4B_-/final_adapters/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt b/checkpoints/rec_Qwen3-4B/final_adapters/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt rename to checkpoints/rec_Qwen3-4B/final_adapters/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt diff --git a/output_real_Qwen3-4B_-/final_adapters/latest b/checkpoints/rec_Qwen3-4B/final_adapters/latest similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/latest rename to checkpoints/rec_Qwen3-4B/final_adapters/latest diff --git a/output_rebl_Qwen3-4B_-/final_adapters/merges.txt b/checkpoints/rec_Qwen3-4B/final_adapters/merges.txt similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/merges.txt rename to checkpoints/rec_Qwen3-4B/final_adapters/merges.txt diff --git a/output_real_Qwen3-4B_-/final_adapters/rng_state_0.pth b/checkpoints/rec_Qwen3-4B/final_adapters/rng_state_0.pth similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/rng_state_0.pth rename to checkpoints/rec_Qwen3-4B/final_adapters/rng_state_0.pth diff --git a/output_real_Qwen3-4B_-/final_adapters/rng_state_1.pth b/checkpoints/rec_Qwen3-4B/final_adapters/rng_state_1.pth similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/rng_state_1.pth rename to checkpoints/rec_Qwen3-4B/final_adapters/rng_state_1.pth diff --git a/output_rebl_Qwen3-4B_-/final_adapters/special_tokens_map.json b/checkpoints/rec_Qwen3-4B/final_adapters/special_tokens_map.json similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/special_tokens_map.json rename to checkpoints/rec_Qwen3-4B/final_adapters/special_tokens_map.json diff --git a/output_rebl_Qwen3-4B_-/final_adapters/tokenizer.json b/checkpoints/rec_Qwen3-4B/final_adapters/tokenizer.json similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/tokenizer.json rename to checkpoints/rec_Qwen3-4B/final_adapters/tokenizer.json diff --git a/output_real_Qwen3-4B_-/final_adapters/tokenizer_config.json b/checkpoints/rec_Qwen3-4B/final_adapters/tokenizer_config.json similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/tokenizer_config.json rename to checkpoints/rec_Qwen3-4B/final_adapters/tokenizer_config.json diff --git a/output_real_Qwen3-4B_-/final_adapters/trainer_state.json b/checkpoints/rec_Qwen3-4B/final_adapters/trainer_state.json similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/trainer_state.json rename to checkpoints/rec_Qwen3-4B/final_adapters/trainer_state.json diff --git a/output_real_Qwen3-4B_-/final_adapters/training_args.bin b/checkpoints/rec_Qwen3-4B/final_adapters/training_args.bin similarity index 100% rename from output_real_Qwen3-4B_-/final_adapters/training_args.bin rename to checkpoints/rec_Qwen3-4B/final_adapters/training_args.bin diff --git a/output_rebl_Qwen3-4B_-/final_adapters/vocab.json b/checkpoints/rec_Qwen3-4B/final_adapters/vocab.json similarity index 100% rename from output_rebl_Qwen3-4B_-/final_adapters/vocab.json rename to checkpoints/rec_Qwen3-4B/final_adapters/vocab.json diff --git a/odc/README.md b/odc/README.md new file mode 100644 index 0000000000000000000000000000000000000000..8a9591cde2bc956137b9f7d13e4fe8be718b6a61 --- /dev/null +++ b/odc/README.md @@ -0,0 +1,202 @@ +--- +base_model: Qwen/Qwen3-4B +library_name: peft +--- + +# Model Card for Model ID + + + + + +## Model Details + +### Model Description + + + + + +- **Developed by:** [More Information Needed] +- **Funded by [optional]:** [More Information Needed] +- **Shared by [optional]:** [More Information Needed] +- **Model type:** [More Information Needed] +- **Language(s) (NLP):** [More Information Needed] +- **License:** [More Information Needed] +- **Finetuned from model [optional]:** [More Information Needed] + +### Model Sources [optional] + + + +- **Repository:** [More Information Needed] +- **Paper [optional]:** [More Information Needed] +- **Demo [optional]:** [More Information Needed] + +## Uses + + + +### Direct Use + + + +[More Information Needed] + +### Downstream Use [optional] + + + +[More Information Needed] + +### Out-of-Scope Use + + + +[More Information Needed] + +## Bias, Risks, and Limitations + + + +[More Information Needed] + +### Recommendations + + + +Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. + +## How to Get Started with the Model + +Use the code below to get started with the model. + +[More Information Needed] + +## Training Details + +### Training Data + + + +[More Information Needed] + +### Training Procedure + + + +#### Preprocessing [optional] + +[More Information Needed] + + +#### Training Hyperparameters + +- **Training regime:** [More Information Needed] + +#### Speeds, Sizes, Times [optional] + + + +[More Information Needed] + +## Evaluation + + + +### Testing Data, Factors & Metrics + +#### Testing Data + + + +[More Information Needed] + +#### Factors + + + +[More Information Needed] + +#### Metrics + + + +[More Information Needed] + +### Results + +[More Information Needed] + +#### Summary + + + +## Model Examination [optional] + + + +[More Information Needed] + +## Environmental Impact + + + +Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). + +- **Hardware Type:** [More Information Needed] +- **Hours used:** [More Information Needed] +- **Cloud Provider:** [More Information Needed] +- **Compute Region:** [More Information Needed] +- **Carbon Emitted:** [More Information Needed] + +## Technical Specifications [optional] + +### Model Architecture and Objective + +[More Information Needed] + +### Compute Infrastructure + +[More Information Needed] + +#### Hardware + +[More Information Needed] + +#### Software + +[More Information Needed] + +## Citation [optional] + + + +**BibTeX:** + +[More Information Needed] + +**APA:** + +[More Information Needed] + +## Glossary [optional] + + + +[More Information Needed] + +## More Information [optional] + +[More Information Needed] + +## Model Card Authors [optional] + +[More Information Needed] + +## Model Card Contact + +[More Information Needed] +### Framework versions + +- PEFT 0.15.2 \ No newline at end of file diff --git a/odc/adapter_config.json b/odc/adapter_config.json new file mode 100644 index 0000000000000000000000000000000000000000..be253dec68498dcbfdb5ee8017ebd9f58db38f48 --- /dev/null +++ b/odc/adapter_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c7230bbb1d85421684376a57f17cb053f867a001c5ca3ff30f574021e3e3c167 +size 828 diff --git a/odc/adapter_model.safetensors b/odc/adapter_model.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..7dff035cfef683187ab13cd728266b27648a5624 --- /dev/null +++ b/odc/adapter_model.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9a220a58078aa7dbeedaece01045895702dc1f4341e6eb68eb372147c69363b1 +size 1579102000 diff --git a/odc/added_tokens.json b/odc/added_tokens.json new file mode 100644 index 0000000000000000000000000000000000000000..2b76afd7cb1e665182c309e62849419f75306b75 --- /dev/null +++ b/odc/added_tokens.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c0284b582e14987fbd3d5a2cb2bd139084371ed9acbae488829a1c900833c680 +size 707 diff --git a/odc/chat_template.jinja b/odc/chat_template.jinja new file mode 100644 index 0000000000000000000000000000000000000000..d85ff3045a68273657a9f871cdb9938c940b110b --- /dev/null +++ b/odc/chat_template.jinja @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a55ee1b1660128b7098723e0abcd92caa0788061051c62d51cbe87d9cf1974d8 +size 4168 diff --git a/odc/merges.txt b/odc/merges.txt new file mode 100644 index 0000000000000000000000000000000000000000..80c1a19fae38f8f4c9ab32cc9d4e145c241147e6 --- /dev/null +++ b/odc/merges.txt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8831e4f1a044471340f7c0a83d7bd71306a5b867e95fd870f74d0c5308a904d5 +size 1671853 diff --git a/odc/special_tokens_map.json b/odc/special_tokens_map.json new file mode 100644 index 0000000000000000000000000000000000000000..ee3f6af3ba18b7d721f02bdf3fbca2814b7eda35 --- /dev/null +++ b/odc/special_tokens_map.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:76862e765266b85aa9459767e33cbaf13970f327a0e88d1c65846c2ddd3a1ecd +size 613 diff --git a/odc/tokenizer.json b/odc/tokenizer.json new file mode 100644 index 0000000000000000000000000000000000000000..cd71f61a15a522601badb3dc960d800d9cb3766c --- /dev/null +++ b/odc/tokenizer.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:aeb13307a71acd8fe81861d94ad54ab689df773318809eed3cbe794b4492dae4 +size 11422654 diff --git a/odc/tokenizer_config.json b/odc/tokenizer_config.json new file mode 100644 index 0000000000000000000000000000000000000000..0d8ad241b789c1419579e4b54e31b0befbad9882 --- /dev/null +++ b/odc/tokenizer_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:443bfa629eb16387a12edbf92a76f6a6f10b2af3b53d87ba1550adfcf45f7fa0 +size 5404 diff --git a/odc/training_args.bin b/odc/training_args.bin new file mode 100644 index 0000000000000000000000000000000000000000..8e4cae759b1a71b73c7998972c9abc7b50ff8b9f --- /dev/null +++ b/odc/training_args.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a27bfb6651c77744bd50f9d980f357ec5e3e1a52718e7cd7cbf37b8716ffd1a0 +size 7889 diff --git a/odc/vocab.json b/odc/vocab.json new file mode 100644 index 0000000000000000000000000000000000000000..6c49fc63bcb109de13abe49e58f85a4cdba7b679 --- /dev/null +++ b/odc/vocab.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ca10d7e9fb3ed18575dd1e277a2579c16d108e32f27439684afa0e10b1440910 +size 2776833 diff --git a/oeb/README.md b/oeb/README.md new file mode 100644 index 0000000000000000000000000000000000000000..8a9591cde2bc956137b9f7d13e4fe8be718b6a61 --- /dev/null +++ b/oeb/README.md @@ -0,0 +1,202 @@ +--- +base_model: Qwen/Qwen3-4B +library_name: peft +--- + +# Model Card for Model ID + + + + + +## Model Details + +### Model Description + + + + + +- **Developed by:** [More Information Needed] +- **Funded by [optional]:** [More Information Needed] +- **Shared by [optional]:** [More Information Needed] +- **Model type:** [More Information Needed] +- **Language(s) (NLP):** [More Information Needed] +- **License:** [More Information Needed] +- **Finetuned from model [optional]:** [More Information Needed] + +### Model Sources [optional] + + + +- **Repository:** [More Information Needed] +- **Paper [optional]:** [More Information Needed] +- **Demo [optional]:** [More Information Needed] + +## Uses + + + +### Direct Use + + + +[More Information Needed] + +### Downstream Use [optional] + + + +[More Information Needed] + +### Out-of-Scope Use + + + +[More Information Needed] + +## Bias, Risks, and Limitations + + + +[More Information Needed] + +### Recommendations + + + +Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. + +## How to Get Started with the Model + +Use the code below to get started with the model. + +[More Information Needed] + +## Training Details + +### Training Data + + + +[More Information Needed] + +### Training Procedure + + + +#### Preprocessing [optional] + +[More Information Needed] + + +#### Training Hyperparameters + +- **Training regime:** [More Information Needed] + +#### Speeds, Sizes, Times [optional] + + + +[More Information Needed] + +## Evaluation + + + +### Testing Data, Factors & Metrics + +#### Testing Data + + + +[More Information Needed] + +#### Factors + + + +[More Information Needed] + +#### Metrics + + + +[More Information Needed] + +### Results + +[More Information Needed] + +#### Summary + + + +## Model Examination [optional] + + + +[More Information Needed] + +## Environmental Impact + + + +Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). + +- **Hardware Type:** [More Information Needed] +- **Hours used:** [More Information Needed] +- **Cloud Provider:** [More Information Needed] +- **Compute Region:** [More Information Needed] +- **Carbon Emitted:** [More Information Needed] + +## Technical Specifications [optional] + +### Model Architecture and Objective + +[More Information Needed] + +### Compute Infrastructure + +[More Information Needed] + +#### Hardware + +[More Information Needed] + +#### Software + +[More Information Needed] + +## Citation [optional] + + + +**BibTeX:** + +[More Information Needed] + +**APA:** + +[More Information Needed] + +## Glossary [optional] + + + +[More Information Needed] + +## More Information [optional] + +[More Information Needed] + +## Model Card Authors [optional] + +[More Information Needed] + +## Model Card Contact + +[More Information Needed] +### Framework versions + +- PEFT 0.15.2 \ No newline at end of file diff --git a/oeb/adapter_config.json b/oeb/adapter_config.json new file mode 100644 index 0000000000000000000000000000000000000000..c5dd95413f985030423481af19caaa519c45d1de --- /dev/null +++ b/oeb/adapter_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d24c948342ca579064152f31c52b4669fab0b418f1f45d441e54177b8feb8a94 +size 828 diff --git a/oeb/adapter_model.safetensors b/oeb/adapter_model.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..6b7d83422464f61533a39d24379179244e436211 --- /dev/null +++ b/oeb/adapter_model.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9fd5dce1e22abf8efabaf6830723a55ab608fa3a814109a0d7b42b9ab0bfcd51 +size 1579102000 diff --git a/oeb/added_tokens.json b/oeb/added_tokens.json new file mode 100644 index 0000000000000000000000000000000000000000..2b76afd7cb1e665182c309e62849419f75306b75 --- /dev/null +++ b/oeb/added_tokens.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c0284b582e14987fbd3d5a2cb2bd139084371ed9acbae488829a1c900833c680 +size 707 diff --git a/oeb/chat_template.jinja b/oeb/chat_template.jinja new file mode 100644 index 0000000000000000000000000000000000000000..d85ff3045a68273657a9f871cdb9938c940b110b --- /dev/null +++ b/oeb/chat_template.jinja @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a55ee1b1660128b7098723e0abcd92caa0788061051c62d51cbe87d9cf1974d8 +size 4168 diff --git a/oeb/merges.txt b/oeb/merges.txt new file mode 100644 index 0000000000000000000000000000000000000000..80c1a19fae38f8f4c9ab32cc9d4e145c241147e6 --- /dev/null +++ b/oeb/merges.txt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8831e4f1a044471340f7c0a83d7bd71306a5b867e95fd870f74d0c5308a904d5 +size 1671853 diff --git a/oeb/special_tokens_map.json b/oeb/special_tokens_map.json new file mode 100644 index 0000000000000000000000000000000000000000..ee3f6af3ba18b7d721f02bdf3fbca2814b7eda35 --- /dev/null +++ b/oeb/special_tokens_map.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:76862e765266b85aa9459767e33cbaf13970f327a0e88d1c65846c2ddd3a1ecd +size 613 diff --git a/oeb/tokenizer.json b/oeb/tokenizer.json new file mode 100644 index 0000000000000000000000000000000000000000..cd71f61a15a522601badb3dc960d800d9cb3766c --- /dev/null +++ b/oeb/tokenizer.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:aeb13307a71acd8fe81861d94ad54ab689df773318809eed3cbe794b4492dae4 +size 11422654 diff --git a/oeb/tokenizer_config.json b/oeb/tokenizer_config.json new file mode 100644 index 0000000000000000000000000000000000000000..0d8ad241b789c1419579e4b54e31b0befbad9882 --- /dev/null +++ b/oeb/tokenizer_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:443bfa629eb16387a12edbf92a76f6a6f10b2af3b53d87ba1550adfcf45f7fa0 +size 5404 diff --git a/oeb/training_args.bin b/oeb/training_args.bin new file mode 100644 index 0000000000000000000000000000000000000000..73bf85771f819b3a8169ab8d1e49fa3f13cf1544 --- /dev/null +++ b/oeb/training_args.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ffbe4648d61459b6bf1672387ea81ae4255c9993a6f3c26550f43d5c98b3dbc9 +size 7544 diff --git a/oeb/vocab.json b/oeb/vocab.json new file mode 100644 index 0000000000000000000000000000000000000000..6c49fc63bcb109de13abe49e58f85a4cdba7b679 --- /dev/null +++ b/oeb/vocab.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ca10d7e9fb3ed18575dd1e277a2579c16d108e32f27439684afa0e10b1440910 +size 2776833 diff --git a/oec/README.md b/oec/README.md new file mode 100644 index 0000000000000000000000000000000000000000..8a9591cde2bc956137b9f7d13e4fe8be718b6a61 --- /dev/null +++ b/oec/README.md @@ -0,0 +1,202 @@ +--- +base_model: Qwen/Qwen3-4B +library_name: peft +--- + +# Model Card for Model ID + + + + + +## Model Details + +### Model Description + + + + + +- **Developed by:** [More Information Needed] +- **Funded by [optional]:** [More Information Needed] +- **Shared by [optional]:** [More Information Needed] +- **Model type:** [More Information Needed] +- **Language(s) (NLP):** [More Information Needed] +- **License:** [More Information Needed] +- **Finetuned from model [optional]:** [More Information Needed] + +### Model Sources [optional] + + + +- **Repository:** [More Information Needed] +- **Paper [optional]:** [More Information Needed] +- **Demo [optional]:** [More Information Needed] + +## Uses + + + +### Direct Use + + + +[More Information Needed] + +### Downstream Use [optional] + + + +[More Information Needed] + +### Out-of-Scope Use + + + +[More Information Needed] + +## Bias, Risks, and Limitations + + + +[More Information Needed] + +### Recommendations + + + +Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. + +## How to Get Started with the Model + +Use the code below to get started with the model. + +[More Information Needed] + +## Training Details + +### Training Data + + + +[More Information Needed] + +### Training Procedure + + + +#### Preprocessing [optional] + +[More Information Needed] + + +#### Training Hyperparameters + +- **Training regime:** [More Information Needed] + +#### Speeds, Sizes, Times [optional] + + + +[More Information Needed] + +## Evaluation + + + +### Testing Data, Factors & Metrics + +#### Testing Data + + + +[More Information Needed] + +#### Factors + + + +[More Information Needed] + +#### Metrics + + + +[More Information Needed] + +### Results + +[More Information Needed] + +#### Summary + + + +## Model Examination [optional] + + + +[More Information Needed] + +## Environmental Impact + + + +Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). + +- **Hardware Type:** [More Information Needed] +- **Hours used:** [More Information Needed] +- **Cloud Provider:** [More Information Needed] +- **Compute Region:** [More Information Needed] +- **Carbon Emitted:** [More Information Needed] + +## Technical Specifications [optional] + +### Model Architecture and Objective + +[More Information Needed] + +### Compute Infrastructure + +[More Information Needed] + +#### Hardware + +[More Information Needed] + +#### Software + +[More Information Needed] + +## Citation [optional] + + + +**BibTeX:** + +[More Information Needed] + +**APA:** + +[More Information Needed] + +## Glossary [optional] + + + +[More Information Needed] + +## More Information [optional] + +[More Information Needed] + +## Model Card Authors [optional] + +[More Information Needed] + +## Model Card Contact + +[More Information Needed] +### Framework versions + +- PEFT 0.15.2 \ No newline at end of file diff --git a/oec/adapter_config.json b/oec/adapter_config.json new file mode 100644 index 0000000000000000000000000000000000000000..cd249f2932b8f8b3c4a8cd8984ec276952c599b8 --- /dev/null +++ b/oec/adapter_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:15769b4da00c79f0e2677ff72e6e8ae172b3155e62b376e58a93c8952efb6995 +size 828 diff --git a/oec/adapter_model.safetensors b/oec/adapter_model.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..6d5bc7dcd4aa1c7cb99c97149e7f78a4e108c2b9 --- /dev/null +++ b/oec/adapter_model.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3e56fe94c129a90a2cce427c5a16d0c7a8a3dcb5dd33db50a0a4da7ae9f7b392 +size 1579102000 diff --git a/oec/added_tokens.json b/oec/added_tokens.json new file mode 100644 index 0000000000000000000000000000000000000000..2b76afd7cb1e665182c309e62849419f75306b75 --- /dev/null +++ b/oec/added_tokens.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c0284b582e14987fbd3d5a2cb2bd139084371ed9acbae488829a1c900833c680 +size 707 diff --git a/oec/chat_template.jinja b/oec/chat_template.jinja new file mode 100644 index 0000000000000000000000000000000000000000..d85ff3045a68273657a9f871cdb9938c940b110b --- /dev/null +++ b/oec/chat_template.jinja @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a55ee1b1660128b7098723e0abcd92caa0788061051c62d51cbe87d9cf1974d8 +size 4168 diff --git a/oec/global_step598/mp_rank_00_model_states.pt b/oec/global_step598/mp_rank_00_model_states.pt new file mode 100644 index 0000000000000000000000000000000000000000..5d995e0372057203167379d42e1a3daab3b751a7 --- /dev/null +++ b/oec/global_step598/mp_rank_00_model_states.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ae1390bd0c5adb4508d3c6d79fbc1b1ced1fec86355db6e0e041ec7c494321c2 +size 861285289 diff --git a/oec/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt b/oec/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt new file mode 100644 index 0000000000000000000000000000000000000000..394edad0dc87a646636323a9c8346e11c29682b2 --- /dev/null +++ b/oec/global_step598/zero_pp_rank_0_mp_rank_00_optim_states.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:766446e3e181e416fafe2ce7b72f9a198e297cb4c16bce846bc77fa3b34e5a64 +size 77884216 diff --git a/oec/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt b/oec/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt new file mode 100644 index 0000000000000000000000000000000000000000..20e0ec762f31ad2ac1e4dfb804dc7d16cce9884d --- /dev/null +++ b/oec/global_step598/zero_pp_rank_1_mp_rank_00_optim_states.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0521582f76826e13cd387f41808bc76398e001100005cf16e813f4ea94fceaa3 +size 77884344 diff --git a/oec/latest b/oec/latest new file mode 100644 index 0000000000000000000000000000000000000000..8a87fdc3e8d4d236bfa15c87e3acae6c027d8460 --- /dev/null +++ b/oec/latest @@ -0,0 +1 @@ +global_step598 \ No newline at end of file diff --git a/oec/merges.txt b/oec/merges.txt new file mode 100644 index 0000000000000000000000000000000000000000..80c1a19fae38f8f4c9ab32cc9d4e145c241147e6 --- /dev/null +++ b/oec/merges.txt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8831e4f1a044471340f7c0a83d7bd71306a5b867e95fd870f74d0c5308a904d5 +size 1671853 diff --git a/oec/rng_state_0.pth b/oec/rng_state_0.pth new file mode 100644 index 0000000000000000000000000000000000000000..d7eff0c6f8bbdccfdc6598329e45f13796e054a3 --- /dev/null +++ b/oec/rng_state_0.pth @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:233b2beced6e46147e0d535df7bc94f047fc8d7a5b3828b381e1077e8f5af631 +size 14917 diff --git a/oec/rng_state_1.pth b/oec/rng_state_1.pth new file mode 100644 index 0000000000000000000000000000000000000000..e728608c4aa596f9c27f17e87e5a2c0dc51dcfef --- /dev/null +++ b/oec/rng_state_1.pth @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:105643a0823f635ae4241293ce5ef99154425d151fa6cccca68105b0514afe8e +size 14917 diff --git a/oec/special_tokens_map.json b/oec/special_tokens_map.json new file mode 100644 index 0000000000000000000000000000000000000000..ee3f6af3ba18b7d721f02bdf3fbca2814b7eda35 --- /dev/null +++ b/oec/special_tokens_map.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:76862e765266b85aa9459767e33cbaf13970f327a0e88d1c65846c2ddd3a1ecd +size 613 diff --git a/oec/tokenizer.json b/oec/tokenizer.json new file mode 100644 index 0000000000000000000000000000000000000000..38dda57c0535c5ded5ad9b8a5b8c4e4a89aba355 --- /dev/null +++ b/oec/tokenizer.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:abfbf1d13376ad75c980dc37b59f5030a8eac0ced7b8fef55166f88a7c9cac34 +size 11422922 diff --git a/oec/tokenizer_config.json b/oec/tokenizer_config.json new file mode 100644 index 0000000000000000000000000000000000000000..0d8ad241b789c1419579e4b54e31b0befbad9882 --- /dev/null +++ b/oec/tokenizer_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:443bfa629eb16387a12edbf92a76f6a6f10b2af3b53d87ba1550adfcf45f7fa0 +size 5404 diff --git a/oec/trainer_state.json b/oec/trainer_state.json new file mode 100644 index 0000000000000000000000000000000000000000..d6762f439a816a01e58b795420316b017ad3cb67 --- /dev/null +++ b/oec/trainer_state.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3ab9184fef1f179e65d79c352fd52ae7bdaec8e57010e8d814c3c2dcfb0365ec +size 16728 diff --git a/oec/training_args.bin b/oec/training_args.bin new file mode 100644 index 0000000000000000000000000000000000000000..970918e1fdea383f7bbb41f529e5a496c104d052 --- /dev/null +++ b/oec/training_args.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9564a20f74071dd6173ca96b7e99f3be3226a0e2e863768a3896c383c6d2b1d8 +size 7889 diff --git a/oec/vocab.json b/oec/vocab.json new file mode 100644 index 0000000000000000000000000000000000000000..6c49fc63bcb109de13abe49e58f85a4cdba7b679 --- /dev/null +++ b/oec/vocab.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ca10d7e9fb3ed18575dd1e277a2579c16d108e32f27439684afa0e10b1440910 +size 2776833 diff --git a/rdc/README.md b/rdc/README.md new file mode 100644 index 0000000000000000000000000000000000000000..8a9591cde2bc956137b9f7d13e4fe8be718b6a61 --- /dev/null +++ b/rdc/README.md @@ -0,0 +1,202 @@ +--- +base_model: Qwen/Qwen3-4B +library_name: peft +--- + +# Model Card for Model ID + + + + + +## Model Details + +### Model Description + + + + + +- **Developed by:** [More Information Needed] +- **Funded by [optional]:** [More Information Needed] +- **Shared by [optional]:** [More Information Needed] +- **Model type:** [More Information Needed] +- **Language(s) (NLP):** [More Information Needed] +- **License:** [More Information Needed] +- **Finetuned from model [optional]:** [More Information Needed] + +### Model Sources [optional] + + + +- **Repository:** [More Information Needed] +- **Paper [optional]:** [More Information Needed] +- **Demo [optional]:** [More Information Needed] + +## Uses + + + +### Direct Use + + + +[More Information Needed] + +### Downstream Use [optional] + + + +[More Information Needed] + +### Out-of-Scope Use + + + +[More Information Needed] + +## Bias, Risks, and Limitations + + + +[More Information Needed] + +### Recommendations + + + +Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. + +## How to Get Started with the Model + +Use the code below to get started with the model. + +[More Information Needed] + +## Training Details + +### Training Data + + + +[More Information Needed] + +### Training Procedure + + + +#### Preprocessing [optional] + +[More Information Needed] + + +#### Training Hyperparameters + +- **Training regime:** [More Information Needed] + +#### Speeds, Sizes, Times [optional] + + + +[More Information Needed] + +## Evaluation + + + +### Testing Data, Factors & Metrics + +#### Testing Data + + + +[More Information Needed] + +#### Factors + + + +[More Information Needed] + +#### Metrics + + + +[More Information Needed] + +### Results + +[More Information Needed] + +#### Summary + + + +## Model Examination [optional] + + + +[More Information Needed] + +## Environmental Impact + + + +Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). + +- **Hardware Type:** [More Information Needed] +- **Hours used:** [More Information Needed] +- **Cloud Provider:** [More Information Needed] +- **Compute Region:** [More Information Needed] +- **Carbon Emitted:** [More Information Needed] + +## Technical Specifications [optional] + +### Model Architecture and Objective + +[More Information Needed] + +### Compute Infrastructure + +[More Information Needed] + +#### Hardware + +[More Information Needed] + +#### Software + +[More Information Needed] + +## Citation [optional] + + + +**BibTeX:** + +[More Information Needed] + +**APA:** + +[More Information Needed] + +## Glossary [optional] + + + +[More Information Needed] + +## More Information [optional] + +[More Information Needed] + +## Model Card Authors [optional] + +[More Information Needed] + +## Model Card Contact + +[More Information Needed] +### Framework versions + +- PEFT 0.15.2 \ No newline at end of file diff --git a/rdc/adapter_config.json b/rdc/adapter_config.json new file mode 100644 index 0000000000000000000000000000000000000000..3b1a1bc8092d9144e19f34860219fc41770d9df7 --- /dev/null +++ b/rdc/adapter_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:268ce1b414ef5c505b6c1f46312767afe7ac9d0817b48b3cc4d7351749e2c2cb +size 828 diff --git a/rdc/adapter_model.safetensors b/rdc/adapter_model.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..e0431276e820dc85e12117970c576d128b0247ee --- /dev/null +++ b/rdc/adapter_model.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bd2aa90f44e9311177737da6ea384dd5286e0e91b5af4cdc9d8ee79241c84f3f +size 1579102000 diff --git a/rdc/added_tokens.json b/rdc/added_tokens.json new file mode 100644 index 0000000000000000000000000000000000000000..2b76afd7cb1e665182c309e62849419f75306b75 --- /dev/null +++ b/rdc/added_tokens.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c0284b582e14987fbd3d5a2cb2bd139084371ed9acbae488829a1c900833c680 +size 707 diff --git a/rdc/chat_template.jinja b/rdc/chat_template.jinja new file mode 100644 index 0000000000000000000000000000000000000000..d85ff3045a68273657a9f871cdb9938c940b110b --- /dev/null +++ b/rdc/chat_template.jinja @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a55ee1b1660128b7098723e0abcd92caa0788061051c62d51cbe87d9cf1974d8 +size 4168 diff --git a/rdc/merges.txt b/rdc/merges.txt new file mode 100644 index 0000000000000000000000000000000000000000..80c1a19fae38f8f4c9ab32cc9d4e145c241147e6 --- /dev/null +++ b/rdc/merges.txt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8831e4f1a044471340f7c0a83d7bd71306a5b867e95fd870f74d0c5308a904d5 +size 1671853 diff --git a/rdc/special_tokens_map.json b/rdc/special_tokens_map.json new file mode 100644 index 0000000000000000000000000000000000000000..ee3f6af3ba18b7d721f02bdf3fbca2814b7eda35 --- /dev/null +++ b/rdc/special_tokens_map.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:76862e765266b85aa9459767e33cbaf13970f327a0e88d1c65846c2ddd3a1ecd +size 613 diff --git a/rdc/tokenizer.json b/rdc/tokenizer.json new file mode 100644 index 0000000000000000000000000000000000000000..cd71f61a15a522601badb3dc960d800d9cb3766c --- /dev/null +++ b/rdc/tokenizer.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:aeb13307a71acd8fe81861d94ad54ab689df773318809eed3cbe794b4492dae4 +size 11422654 diff --git a/rdc/tokenizer_config.json b/rdc/tokenizer_config.json new file mode 100644 index 0000000000000000000000000000000000000000..0d8ad241b789c1419579e4b54e31b0befbad9882 --- /dev/null +++ b/rdc/tokenizer_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:443bfa629eb16387a12edbf92a76f6a6f10b2af3b53d87ba1550adfcf45f7fa0 +size 5404 diff --git a/rdc/training_args.bin b/rdc/training_args.bin new file mode 100644 index 0000000000000000000000000000000000000000..b7afdb141adad72b781a9154c76e1bacafc146e9 --- /dev/null +++ b/rdc/training_args.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0586b3dd15eb1b60e40c5650b3c7344d1abdd741de8e4b60484753137df60d5f +size 7889 diff --git a/rdc/vocab.json b/rdc/vocab.json new file mode 100644 index 0000000000000000000000000000000000000000..6c49fc63bcb109de13abe49e58f85a4cdba7b679 --- /dev/null +++ b/rdc/vocab.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ca10d7e9fb3ed18575dd1e277a2579c16d108e32f27439684afa0e10b1440910 +size 2776833 diff --git a/reb/README.md b/reb/README.md new file mode 100644 index 0000000000000000000000000000000000000000..8a9591cde2bc956137b9f7d13e4fe8be718b6a61 --- /dev/null +++ b/reb/README.md @@ -0,0 +1,202 @@ +--- +base_model: Qwen/Qwen3-4B +library_name: peft +--- + +# Model Card for Model ID + + + + + +## Model Details + +### Model Description + + + + + +- **Developed by:** [More Information Needed] +- **Funded by [optional]:** [More Information Needed] +- **Shared by [optional]:** [More Information Needed] +- **Model type:** [More Information Needed] +- **Language(s) (NLP):** [More Information Needed] +- **License:** [More Information Needed] +- **Finetuned from model [optional]:** [More Information Needed] + +### Model Sources [optional] + + + +- **Repository:** [More Information Needed] +- **Paper [optional]:** [More Information Needed] +- **Demo [optional]:** [More Information Needed] + +## Uses + + + +### Direct Use + + + +[More Information Needed] + +### Downstream Use [optional] + + + +[More Information Needed] + +### Out-of-Scope Use + + + +[More Information Needed] + +## Bias, Risks, and Limitations + + + +[More Information Needed] + +### Recommendations + + + +Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. + +## How to Get Started with the Model + +Use the code below to get started with the model. + +[More Information Needed] + +## Training Details + +### Training Data + + + +[More Information Needed] + +### Training Procedure + + + +#### Preprocessing [optional] + +[More Information Needed] + + +#### Training Hyperparameters + +- **Training regime:** [More Information Needed] + +#### Speeds, Sizes, Times [optional] + + + +[More Information Needed] + +## Evaluation + + + +### Testing Data, Factors & Metrics + +#### Testing Data + + + +[More Information Needed] + +#### Factors + + + +[More Information Needed] + +#### Metrics + + + +[More Information Needed] + +### Results + +[More Information Needed] + +#### Summary + + + +## Model Examination [optional] + + + +[More Information Needed] + +## Environmental Impact + + + +Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). + +- **Hardware Type:** [More Information Needed] +- **Hours used:** [More Information Needed] +- **Cloud Provider:** [More Information Needed] +- **Compute Region:** [More Information Needed] +- **Carbon Emitted:** [More Information Needed] + +## Technical Specifications [optional] + +### Model Architecture and Objective + +[More Information Needed] + +### Compute Infrastructure + +[More Information Needed] + +#### Hardware + +[More Information Needed] + +#### Software + +[More Information Needed] + +## Citation [optional] + + + +**BibTeX:** + +[More Information Needed] + +**APA:** + +[More Information Needed] + +## Glossary [optional] + + + +[More Information Needed] + +## More Information [optional] + +[More Information Needed] + +## Model Card Authors [optional] + +[More Information Needed] + +## Model Card Contact + +[More Information Needed] +### Framework versions + +- PEFT 0.15.2 \ No newline at end of file diff --git a/reb/adapter_config.json b/reb/adapter_config.json new file mode 100644 index 0000000000000000000000000000000000000000..b7b5283e0a45eae761ecf7499b7d7e4dd8f8f10b --- /dev/null +++ b/reb/adapter_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d184256a3ccb78e67ec3d01340569d527e9539de90a81532bdad51f881de2535 +size 828 diff --git a/reb/adapter_model.safetensors b/reb/adapter_model.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..bbd4b89e8d8f3ff27fa9552be188c3246a157133 --- /dev/null +++ b/reb/adapter_model.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7ae45d20e708f57e03e68a15392e4cb668ed5476039899b1f6b932f27446ffa3 +size 1579102000 diff --git a/reb/added_tokens.json b/reb/added_tokens.json new file mode 100644 index 0000000000000000000000000000000000000000..2b76afd7cb1e665182c309e62849419f75306b75 --- /dev/null +++ b/reb/added_tokens.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c0284b582e14987fbd3d5a2cb2bd139084371ed9acbae488829a1c900833c680 +size 707 diff --git a/reb/chat_template.jinja b/reb/chat_template.jinja new file mode 100644 index 0000000000000000000000000000000000000000..d85ff3045a68273657a9f871cdb9938c940b110b --- /dev/null +++ b/reb/chat_template.jinja @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a55ee1b1660128b7098723e0abcd92caa0788061051c62d51cbe87d9cf1974d8 +size 4168 diff --git a/reb/merges.txt b/reb/merges.txt new file mode 100644 index 0000000000000000000000000000000000000000..80c1a19fae38f8f4c9ab32cc9d4e145c241147e6 --- /dev/null +++ b/reb/merges.txt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8831e4f1a044471340f7c0a83d7bd71306a5b867e95fd870f74d0c5308a904d5 +size 1671853 diff --git a/reb/special_tokens_map.json b/reb/special_tokens_map.json new file mode 100644 index 0000000000000000000000000000000000000000..ee3f6af3ba18b7d721f02bdf3fbca2814b7eda35 --- /dev/null +++ b/reb/special_tokens_map.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:76862e765266b85aa9459767e33cbaf13970f327a0e88d1c65846c2ddd3a1ecd +size 613 diff --git a/reb/tokenizer.json b/reb/tokenizer.json new file mode 100644 index 0000000000000000000000000000000000000000..cd71f61a15a522601badb3dc960d800d9cb3766c --- /dev/null +++ b/reb/tokenizer.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:aeb13307a71acd8fe81861d94ad54ab689df773318809eed3cbe794b4492dae4 +size 11422654 diff --git a/reb/tokenizer_config.json b/reb/tokenizer_config.json new file mode 100644 index 0000000000000000000000000000000000000000..0d8ad241b789c1419579e4b54e31b0befbad9882 --- /dev/null +++ b/reb/tokenizer_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:443bfa629eb16387a12edbf92a76f6a6f10b2af3b53d87ba1550adfcf45f7fa0 +size 5404 diff --git a/reb/training_args.bin b/reb/training_args.bin new file mode 100644 index 0000000000000000000000000000000000000000..18fe4e7ff998c19517bc692d9827aaa0ba935a08 --- /dev/null +++ b/reb/training_args.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:34a8ae223a9be91f945faad6045997f8a35870054555f96eb02fccba0cd4f86a +size 7889 diff --git a/reb/vocab.json b/reb/vocab.json new file mode 100644 index 0000000000000000000000000000000000000000..6c49fc63bcb109de13abe49e58f85a4cdba7b679 --- /dev/null +++ b/reb/vocab.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ca10d7e9fb3ed18575dd1e277a2579c16d108e32f27439684afa0e10b1440910 +size 2776833 diff --git a/rec/README.md b/rec/README.md new file mode 100644 index 0000000000000000000000000000000000000000..8a9591cde2bc956137b9f7d13e4fe8be718b6a61 --- /dev/null +++ b/rec/README.md @@ -0,0 +1,202 @@ +--- +base_model: Qwen/Qwen3-4B +library_name: peft +--- + +# Model Card for Model ID + + + + + +## Model Details + +### Model Description + + + + + +- **Developed by:** [More Information Needed] +- **Funded by [optional]:** [More Information Needed] +- **Shared by [optional]:** [More Information Needed] +- **Model type:** [More Information Needed] +- **Language(s) (NLP):** [More Information Needed] +- **License:** [More Information Needed] +- **Finetuned from model [optional]:** [More Information Needed] + +### Model Sources [optional] + + + +- **Repository:** [More Information Needed] +- **Paper [optional]:** [More Information Needed] +- **Demo [optional]:** [More Information Needed] + +## Uses + + + +### Direct Use + + + +[More Information Needed] + +### Downstream Use [optional] + + + +[More Information Needed] + +### Out-of-Scope Use + + + +[More Information Needed] + +## Bias, Risks, and Limitations + + + +[More Information Needed] + +### Recommendations + + + +Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. + +## How to Get Started with the Model + +Use the code below to get started with the model. + +[More Information Needed] + +## Training Details + +### Training Data + + + +[More Information Needed] + +### Training Procedure + + + +#### Preprocessing [optional] + +[More Information Needed] + + +#### Training Hyperparameters + +- **Training regime:** [More Information Needed] + +#### Speeds, Sizes, Times [optional] + + + +[More Information Needed] + +## Evaluation + + + +### Testing Data, Factors & Metrics + +#### Testing Data + + + +[More Information Needed] + +#### Factors + + + +[More Information Needed] + +#### Metrics + + + +[More Information Needed] + +### Results + +[More Information Needed] + +#### Summary + + + +## Model Examination [optional] + + + +[More Information Needed] + +## Environmental Impact + + + +Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). + +- **Hardware Type:** [More Information Needed] +- **Hours used:** [More Information Needed] +- **Cloud Provider:** [More Information Needed] +- **Compute Region:** [More Information Needed] +- **Carbon Emitted:** [More Information Needed] + +## Technical Specifications [optional] + +### Model Architecture and Objective + +[More Information Needed] + +### Compute Infrastructure + +[More Information Needed] + +#### Hardware + +[More Information Needed] + +#### Software + +[More Information Needed] + +## Citation [optional] + + + +**BibTeX:** + +[More Information Needed] + +**APA:** + +[More Information Needed] + +## Glossary [optional] + + + +[More Information Needed] + +## More Information [optional] + +[More Information Needed] + +## Model Card Authors [optional] + +[More Information Needed] + +## Model Card Contact + +[More Information Needed] +### Framework versions + +- PEFT 0.15.2 \ No newline at end of file diff --git a/rec/adapter_config.json b/rec/adapter_config.json new file mode 100644 index 0000000000000000000000000000000000000000..4d572b96b2c3584e44e8f5ccdaca9c34f63e5a67 --- /dev/null +++ b/rec/adapter_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:60211157f9d0e1f4b6c558d6c492c22492b3b7be840f92778705e07232158147 +size 828 diff --git a/rec/adapter_model.safetensors b/rec/adapter_model.safetensors new file mode 100644 index 0000000000000000000000000000000000000000..d8aa5122b3a1fce069627d0532cba5df80503a63 --- /dev/null +++ b/rec/adapter_model.safetensors @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c35aebc5a80bdb30788d9ddd6f4c75b51ad71baec6802f7b777240e0aea4ec3e +size 1579102000 diff --git a/rec/added_tokens.json b/rec/added_tokens.json new file mode 100644 index 0000000000000000000000000000000000000000..2b76afd7cb1e665182c309e62849419f75306b75 --- /dev/null +++ b/rec/added_tokens.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c0284b582e14987fbd3d5a2cb2bd139084371ed9acbae488829a1c900833c680 +size 707 diff --git a/rec/chat_template.jinja b/rec/chat_template.jinja new file mode 100644 index 0000000000000000000000000000000000000000..d85ff3045a68273657a9f871cdb9938c940b110b --- /dev/null +++ b/rec/chat_template.jinja @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a55ee1b1660128b7098723e0abcd92caa0788061051c62d51cbe87d9cf1974d8 +size 4168 diff --git a/rec/global_step912/mp_rank_00_model_states.pt b/rec/global_step912/mp_rank_00_model_states.pt new file mode 100644 index 0000000000000000000000000000000000000000..886209bcf5afcef45735cb2c299a7ebcf1d71610 --- /dev/null +++ b/rec/global_step912/mp_rank_00_model_states.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bbb053ecf39b418319c89124f95c945f2e189bbb2c20f4d5507ab7fe16783eec +size 861285289 diff --git a/rec/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt b/rec/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt new file mode 100644 index 0000000000000000000000000000000000000000..c0cae81c682a12a7d1538b59a04e20a1e8fa42bb --- /dev/null +++ b/rec/global_step912/zero_pp_rank_0_mp_rank_00_optim_states.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1b9fe5b649c74b1192809fdec3cc6cfbc2923ef17346f6c90ccef43d8fc883c0 +size 77884216 diff --git a/rec/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt b/rec/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt new file mode 100644 index 0000000000000000000000000000000000000000..555c83118127db1b0c658fbcf96097e7841dc004 --- /dev/null +++ b/rec/global_step912/zero_pp_rank_1_mp_rank_00_optim_states.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:42604aafd02835293829e079f3adf5d7bb788fdb0f7e6aa8a94f27441ac66598 +size 77884344 diff --git a/rec/latest b/rec/latest new file mode 100644 index 0000000000000000000000000000000000000000..4afd592634f97e88eb5ee1e76d3bc17c633ec221 --- /dev/null +++ b/rec/latest @@ -0,0 +1 @@ +global_step912 \ No newline at end of file diff --git a/rec/merges.txt b/rec/merges.txt new file mode 100644 index 0000000000000000000000000000000000000000..80c1a19fae38f8f4c9ab32cc9d4e145c241147e6 --- /dev/null +++ b/rec/merges.txt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8831e4f1a044471340f7c0a83d7bd71306a5b867e95fd870f74d0c5308a904d5 +size 1671853 diff --git a/rec/rng_state_0.pth b/rec/rng_state_0.pth new file mode 100644 index 0000000000000000000000000000000000000000..246a8ba2df47d0428386c9cb6fe481b85ef0fbd2 --- /dev/null +++ b/rec/rng_state_0.pth @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:944814b8416a775767c2d5554eaf498fcb3b9eabc9421bc5e539229bc02e6d42 +size 14917 diff --git a/rec/rng_state_1.pth b/rec/rng_state_1.pth new file mode 100644 index 0000000000000000000000000000000000000000..3d492dacb2c39fb597482331b4f6f4ba3cb23919 --- /dev/null +++ b/rec/rng_state_1.pth @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6a0c933bfefb8f95545100b963dd1d5f3ecc7579953faaba48a0129afcdc4285 +size 14917 diff --git a/rec/special_tokens_map.json b/rec/special_tokens_map.json new file mode 100644 index 0000000000000000000000000000000000000000..ee3f6af3ba18b7d721f02bdf3fbca2814b7eda35 --- /dev/null +++ b/rec/special_tokens_map.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:76862e765266b85aa9459767e33cbaf13970f327a0e88d1c65846c2ddd3a1ecd +size 613 diff --git a/rec/tokenizer.json b/rec/tokenizer.json new file mode 100644 index 0000000000000000000000000000000000000000..cd71f61a15a522601badb3dc960d800d9cb3766c --- /dev/null +++ b/rec/tokenizer.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:aeb13307a71acd8fe81861d94ad54ab689df773318809eed3cbe794b4492dae4 +size 11422654 diff --git a/rec/tokenizer_config.json b/rec/tokenizer_config.json new file mode 100644 index 0000000000000000000000000000000000000000..5c0cfc2d0ad23392bd84cad162c5d524f389f509 --- /dev/null +++ b/rec/tokenizer_config.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:502723b70f4fd98e2ee7aa681501d73ed0cbebd40fa8bc6f0585a61c3a14e156 +size 5597 diff --git a/rec/trainer_state.json b/rec/trainer_state.json new file mode 100644 index 0000000000000000000000000000000000000000..29951e60ff18ed21be11b76a1d60093ff08f1739 --- /dev/null +++ b/rec/trainer_state.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8f572793c8560574d45448b7f4f12887274286c86ec3a258bd374982e1226435 +size 24979 diff --git a/rec/training_args.bin b/rec/training_args.bin new file mode 100644 index 0000000000000000000000000000000000000000..0016dbc8c99357d1a3177558b3ce141136ea4c80 --- /dev/null +++ b/rec/training_args.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:344362c00cd71936c4828a55897fe77cd516512a502116f243d4e6cdd38d6a20 +size 7889 diff --git a/rec/vocab.json b/rec/vocab.json new file mode 100644 index 0000000000000000000000000000000000000000..6c49fc63bcb109de13abe49e58f85a4cdba7b679 --- /dev/null +++ b/rec/vocab.json @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ca10d7e9fb3ed18575dd1e277a2579c16d108e32f27439684afa0e10b1440910 +size 2776833