shoubing35 commited on
Commit
1caf0f3
·
verified ·
1 Parent(s): 61af4f0

Training in progress, step 500

Browse files
README.md CHANGED
@@ -1,18 +1,17 @@
1
  ---
2
  base_model: openai-community/gpt2
3
- datasets: rajpurkar/squad
4
  library_name: transformers
5
  model_name: gpt2-qat
6
  tags:
7
  - generated_from_trainer
8
- - sft
9
  - trl
 
10
  licence: license
11
  ---
12
 
13
  # Model Card for gpt2-qat
14
 
15
- This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on the [rajpurkar/squad](https://huggingface.co/datasets/rajpurkar/squad) dataset.
16
  It has been trained using [TRL](https://github.com/huggingface/trl).
17
 
18
  ## Quick start
@@ -28,7 +27,7 @@ print(output["generated_text"])
28
 
29
  ## Training procedure
30
 
31
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/shoubing-apple/huggingface/runs/qxm8cp32)
32
 
33
 
34
  This model was trained with SFT.
 
1
  ---
2
  base_model: openai-community/gpt2
 
3
  library_name: transformers
4
  model_name: gpt2-qat
5
  tags:
6
  - generated_from_trainer
 
7
  - trl
8
+ - sft
9
  licence: license
10
  ---
11
 
12
  # Model Card for gpt2-qat
13
 
14
+ This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2).
15
  It has been trained using [TRL](https://github.com/huggingface/trl).
16
 
17
  ## Quick start
 
27
 
28
  ## Training procedure
29
 
30
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/shoubing-apple/huggingface/runs/8j5wiaig)
31
 
32
 
33
  This model was trained with SFT.
adapter_config.json CHANGED
@@ -27,8 +27,8 @@
27
  "revision": null,
28
  "target_modules": [
29
  "c_attn",
30
- "c_fc",
31
- "c_proj"
32
  ],
33
  "task_type": "CAUSAL_LM",
34
  "trainable_token_indices": null,
 
27
  "revision": null,
28
  "target_modules": [
29
  "c_attn",
30
+ "c_proj",
31
+ "c_fc"
32
  ],
33
  "task_type": "CAUSAL_LM",
34
  "trainable_token_indices": null,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0dac91bba90b0ac1e5c9c8b474f583a3224247b7692f382eca37ff06049ebb16
3
  size 9449344
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1a98eaf7d06e050fbc2f36fc815b58408bdf7b6e67fe47bf1bc9eb4ff3666b70
3
  size 9449344
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f712ff63cc9a70b692b1ec3bac12c33ac5162dc25ba202c8d7c5018ea3c147f5
3
  size 497774208
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:66ae79c362a52c043fea3c4401af6ce7449adf5cba325b88f998de68ae080b54
3
  size 497774208
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:271ff6fde2951f2ce463377f88eb6b9cf60b392e9cf9eba2f9175dd748f6b309
3
  size 5752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4c4dc2ccc1968527222ae4b48d920c514561a5ec43d5513429641c2f5fbdce7
3
  size 5752