shoubing35/gpt2-qat

Text Generation · Generated from Trainer · text-generation-inference

shoubing35 committed on Jun 30

Commit b8dda94 · verified · 1 Parent(s): fcd8ca9

Training in progress, step 40

Files changed (4)

README.md +2 -3
adapter_config.json +2 -2
adapter_model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -1,6 +1,5 @@
 ---
 base_model: openai-community/gpt2
-datasets: rajpurkar/squad
 library_name: transformers
 model_name: gpt2-qat
 tags:
@@ -12,7 +11,7 @@ licence: license
 # Model Card for gpt2-qat
-This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on the [rajpurkar/squad](https://huggingface.co/datasets/rajpurkar/squad) dataset.
+This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
@@ -28,7 +27,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/shoubing-apple/huggingface/runs/mw9xfzt9)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/shoubing-apple/huggingface/runs/2o8in7w7)
 This model was trained with SFT.

adapter_config.json CHANGED

@@ -26,9 +26,9 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "c_attn",
     "c_fc",
-    "c_proj"
+    "c_proj",
+    "c_attn"
   ],
   "task_type": "CAUSAL_LM",
   "trainable_token_indices": null,

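The adapter_config.json hunk only reorders the entries of `target_modules`; the same three names appear on both sides. A minimal sketch, assuming the order of entries in this field carries no meaning (an assumption; the commit itself does not say so), compares the two lists as sets. The helper name `same_modules` is hypothetical.

```python
# Entries copied from the removed and added sides of the adapter_config.json hunk.
old_target_modules = ["c_attn", "c_fc", "c_proj"]
new_target_modules = ["c_fc", "c_proj", "c_attn"]


def same_modules(a, b):
    """Return True when both lists name the same modules, ignoring order."""
    return set(a) == set(b)


# Assumption: order is irrelevant, so only set membership is compared.
same = same_modules(old_target_modules, new_target_modules)
print(same)
```

Under that assumption, this part of the commit would not change which layers the adapter targets.
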
adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:12a59246df7d018a86b2c6631eb5a0c44a3a1b05521b1b27cd6af522755bdb00
+oid sha256:5f99b958f3a2583c2fdd66daff5ef70600b583639da35179eaefd549478489e8
 size 9449344

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ee8d1a3b9fe120913111e94e849aa2f4bb28ee3359f4d37678c591eafa00016b
+oid sha256:f8dc717f2de20299a3a7fabc46c01a5c37b6ec18f6838d7a42388baf751107d7
 size 5752
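
Both binary files are stored as Git LFS pointers: a `version` line, an `oid sha256:` line, and a `size` line. In each diff the size is unchanged while the oid differs, so the file contents changed without changing length. The sketch below parses a pointer and checks a byte string against it; the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict of strings."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(data, pointer):
    """Return True when data has the size and SHA-256 digest named by the pointer."""
    algo, _, digest = pointer["oid"].partition(":")
    if algo != "sha256":
        return False
    if len(data) != int(pointer["size"]):
        return False
    return hashlib.sha256(data).hexdigest() == digest


# New-side pointer for training_args.bin, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:f8dc717f2de20299a3a7fabc46c01a5c37b6ec18f6838d7a42388baf751107d7\n"
    "size 5752\n"
)
print(pointer["oid"], pointer["size"])
```

To check a downloaded file, read it in binary mode and pass the bytes to `matches_pointer` together with the parsed pointer.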