Training in progress, step 500

Files changed (5)

README.md CHANGED

@@ -1,18 +1,17 @@
 ---
 base_model: openai-community/gpt2
-datasets: rajpurkar/squad
 library_name: transformers
 model_name: gpt2-qat
 tags:
 - generated_from_trainer
-- sft
 - trl
+- sft
 licence: license
 ---
 # Model Card for gpt2-qat
-This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on the [rajpurkar/squad](https://huggingface.co/datasets/rajpurkar/squad) dataset.
+This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
@@ -28,7 +27,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/shoubing-apple/huggingface/runs/qxm8cp32)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/shoubing-apple/huggingface/runs/8j5wiaig)
 This model was trained with SFT.

adapter_config.json CHANGED

@@ -27,8 +27,8 @@
   "revision": null,
   "target_modules": [
     "c_attn",
-    "c_fc",
-    "c_proj"
+    "c_proj",
+    "c_fc"
   ],
   "task_type": "CAUSAL_LM",
   "trainable_token_indices": null,
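
The `target_modules` hunk above lists the same three module names on both sides, only in a different order. A minimal sketch of that check follows. The variable names are illustrative and not part of the commit, and treating the list as an unordered collection is an assumption about how the adapter config is consumed.

```python
# Module names copied from the two sides of the adapter_config.json hunk.
old_modules = ["c_attn", "c_fc", "c_proj"]
new_modules = ["c_attn", "c_proj", "c_fc"]

# Compared as sets, the two sides are identical.
same_set = set(old_modules) == set(new_modules)

# Compared as lists, they differ, which is why the diff reports a change.
same_order = old_modules == new_modules

print(f"same modules: {same_set}, same order: {same_order}")
```

If the set comparison holds, as it does here, the hunk is likely serialization noise and not a change to which layers carry adapters.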

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0dac91bba90b0ac1e5c9c8b474f583a3224247b7692f382eca37ff06049ebb16
+oid sha256:1a98eaf7d06e050fbc2f36fc815b58408bdf7b6e67fe47bf1bc9eb4ff3666b70
 size 9449344

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f712ff63cc9a70b692b1ec3bac12c33ac5162dc25ba202c8d7c5018ea3c147f5
+oid sha256:66ae79c362a52c043fea3c4401af6ce7449adf5cba325b88f998de68ae080b54
 size 497774208

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:271ff6fde2951f2ce463377f88eb6b9cf60b392e9cf9eba2f9175dd748f6b309
+oid sha256:d4c4dc2ccc1968527222ae4b48d920c514561a5ec43d5513429641c2f5fbdce7
 size 5752
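
The binary files in this commit are stored as Git LFS pointers, so each diff shows only a changed `oid` with an unchanged `size`. The sketch below parses two pointer texts and compares them, using the `training_args.bin` values from the hunk above. The helper `parse_lfs_pointer` is a hypothetical name introduced here for illustration, not part of any library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }

# Pointer contents copied from the training_args.bin diff.
old = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:271ff6fde2951f2ce463377f88eb6b9cf60b392e9cf9eba2f9175dd748f6b309\n"
    "size 5752\n"
)
new = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:d4c4dc2ccc1968527222ae4b48d920c514561a5ec43d5513429641c2f5fbdce7\n"
    "size 5752\n"
)

content_changed = old["digest"] != new["digest"]
size_changed = old["size"] != new["size"]
print(f"content changed: {content_changed}, size changed: {size_changed}")
```

A changed digest with an identical size is what one would expect when a fixed-shape checkpoint is overwritten with new values, though the pointers alone do not show what changed inside the files.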