XelotX commited on
Commit
0faf0b2
1 Parent(s): 860459a

Upload folder using huggingface_hub

Browse files
This view is limited to 50 files because it contains too many changes.   See raw diff
Files changed (50) hide show
  1. README.md +63 -0
  2. config.json +28 -0
  3. generation_config.json +12 -0
  4. model-00001-of-00162.safetensors +3 -0
  5. model-00002-of-00162.safetensors +3 -0
  6. model-00003-of-00162.safetensors +3 -0
  7. model-00004-of-00162.safetensors +3 -0
  8. model-00005-of-00162.safetensors +3 -0
  9. model-00006-of-00162.safetensors +3 -0
  10. model-00007-of-00162.safetensors +3 -0
  11. model-00008-of-00162.safetensors +3 -0
  12. model-00009-of-00162.safetensors +3 -0
  13. model-00010-of-00162.safetensors +3 -0
  14. model-00011-of-00162.safetensors +3 -0
  15. model-00012-of-00162.safetensors +3 -0
  16. model-00013-of-00162.safetensors +3 -0
  17. model-00014-of-00162.safetensors +3 -0
  18. model-00015-of-00162.safetensors +3 -0
  19. model-00016-of-00162.safetensors +3 -0
  20. model-00017-of-00162.safetensors +3 -0
  21. model-00018-of-00162.safetensors +3 -0
  22. model-00019-of-00162.safetensors +3 -0
  23. model-00020-of-00162.safetensors +3 -0
  24. model-00021-of-00162.safetensors +3 -0
  25. model-00022-of-00162.safetensors +3 -0
  26. model-00023-of-00162.safetensors +3 -0
  27. model-00024-of-00162.safetensors +3 -0
  28. model-00025-of-00162.safetensors +3 -0
  29. model-00026-of-00162.safetensors +3 -0
  30. model-00027-of-00162.safetensors +3 -0
  31. model-00028-of-00162.safetensors +3 -0
  32. model-00029-of-00162.safetensors +3 -0
  33. model-00030-of-00162.safetensors +3 -0
  34. model-00031-of-00162.safetensors +3 -0
  35. model-00032-of-00162.safetensors +3 -0
  36. model-00033-of-00162.safetensors +3 -0
  37. model-00034-of-00162.safetensors +3 -0
  38. model-00035-of-00162.safetensors +3 -0
  39. model-00036-of-00162.safetensors +3 -0
  40. model-00037-of-00162.safetensors +3 -0
  41. model-00038-of-00162.safetensors +3 -0
  42. model-00039-of-00162.safetensors +3 -0
  43. model-00040-of-00162.safetensors +3 -0
  44. model-00041-of-00162.safetensors +3 -0
  45. model-00042-of-00162.safetensors +3 -0
  46. model-00043-of-00162.safetensors +3 -0
  47. model-00044-of-00162.safetensors +3 -0
  48. model-00045-of-00162.safetensors +3 -0
  49. model-00046-of-00162.safetensors +3 -0
  50. model-00047-of-00162.safetensors +3 -0
README.md ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3.1
3
+ base_model: meta-llama/Meta-Llama-3.1-70B-Instruct
4
+ pipeline_tag: text-generation
5
+ library_name: transformers
6
+ ---
7
+ # Reflection Llama-3.1 70B
8
+
9
+ **Reflection Llama-3.1 70B is (currently) the world's top open-source LLM, trained with a new technique called Reflection-Tuning that teaches a LLM to detect mistakes in its reasoning and correct course.**
10
+
11
+ The model was trained on synthetic data generated by [Glaive](https://glaive.ai). If you're training a model, Glaive is incredible — use them.
12
+
13
+ You can [try the model here](https://reflection-playground-production.up.railway.app/).
14
+
15
+ ## Benchmarks
16
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/60518f3731c5be7f3dd5ebc3/zNs-ZFs0SbnomH7mikiOU.png)
17
+
18
+ All benchmarks tested have been checked for contamination by running [LMSys's LLM Decontaminator](https://github.com/lm-sys/llm-decontaminator). When benchmarking, we isolate the `<output>` and benchmark on solely that section.
19
+
20
+ Trained from Llama 3.1 70B Instruct, you can sample from Reflection Llama-3.1 70B using the same code, pipelines, etc. as any other Llama model. It even uses the stock Llama 3.1 chat template format (though, we've trained in a few new special tokens to aid in reasoning and reflection).
21
+
22
+ During sampling, the model will start by outputting reasoning inside `<thinking>` and `</thinking>` tags, and then once it is satisfied with its reasoning, it will output the final answer inside `<output>` and `</output>` tags. Each of these tags are special tokens, trained into the model.
23
+
24
+ This enables the model to separate its internal thoughts and reasoning from its final answer, improving the experience for the user.
25
+
26
+ Inside the `<thinking>` section, the model may output one or more `<reflection>` tags, which signals the model has caught an error in its reasoning and will attempt to correct it before providing a final answer.
27
+
28
+ ## System Prompt
29
+
30
+ The system prompt used for training this model is:
31
+
32
+ ```
33
+ You are a world-class AI system, capable of complex reasoning and reflection. Reason through the query inside <thinking> tags, and then provide your final response inside <output> tags. If you detect that you made a mistake in your reasoning at any point, correct yourself inside <reflection> tags.
34
+ ```
35
+
36
+ We recommend using this exact system prompt to get the best results from Reflection Llama-3.1 70B. You may also want to experiment combining this system prompt with your own custom instructions to customize the behavior of the model.
37
+
38
+ ## Chat Format
39
+
40
+ As mentioned above, the model uses the standard Llama 3.1 chat format. Here’s an example:
41
+
42
+ ```
43
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
44
+
45
+ You are a world-class AI system, capable of complex reasoning and reflection. Reason through the query inside <thinking> tags, and then provide your final response inside <output> tags. If you detect that you made a mistake in your reasoning at any point, correct yourself inside <reflection> tags.<|eot_id|><|start_header_id|>user<|end_header_id|>
46
+
47
+ what is 2+2?<|eot_id|><|start_header_id|>assistant<|end_header_id|>
48
+ ```
49
+
50
+ ## Tips for Performance
51
+
52
+ - We are initially recommending a `temperature` of `.7` and a `top_p` of `.95`.
53
+ - For increased accuracy, append `Think carefully.` at the end of your messages.
54
+
55
+ ## Dataset / Report
56
+
57
+ Both the dataset and a brief report detailing how we trained this model will be released next week, alongside our Reflection 405B model that we expect will be the top-performing LLM in the world, including closed-source models.
58
+
59
+ ---
60
+
61
+ Thanks to Jason Kuperberg and Josh Bickett from the [HyperWrite](https://hyperwriteai.com) team for reviewing drafts of the report we'll be releasing next week.
62
+
63
+ Also, we know right now the model is split into a ton of files. We'll condense this soon to make the model easier to download and work with!
config.json ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "meta-llama/Meta-Llama-3-70B-Instruct",
3
+ "architectures": [
4
+ "LlamaForCausalLM"
5
+ ],
6
+ "attention_bias": false,
7
+ "attention_dropout": 0.0,
8
+ "bos_token_id": 128000,
9
+ "eos_token_id": 128009,
10
+ "hidden_act": "silu",
11
+ "hidden_size": 8192,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 28672,
14
+ "max_position_embeddings": 8192,
15
+ "model_type": "llama",
16
+ "num_attention_heads": 64,
17
+ "num_hidden_layers": 80,
18
+ "num_key_value_heads": 8,
19
+ "pretraining_tp": 1,
20
+ "rms_norm_eps": 1e-05,
21
+ "rope_scaling": null,
22
+ "rope_theta": 500000.0,
23
+ "tie_word_embeddings": false,
24
+ "torch_dtype": "float32",
25
+ "transformers_version": "4.40.0",
26
+ "use_cache": true,
27
+ "vocab_size": 128262
28
+ }
generation_config.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token_id": 128000,
3
+ "do_sample": true,
4
+ "eos_token_id": [
5
+ 128001,
6
+ 128009
7
+ ],
8
+ "max_length": 4096,
9
+ "temperature": 0.6,
10
+ "top_p": 0.9,
11
+ "transformers_version": "4.40.0"
12
+ }
model-00001-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f809c805d669903d3de4fa7e016c6f477a2abde4feea8cdd24beaea30c894aa
3
+ size 4202889352
model-00002-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:44821dc7a19e2ff5c15461bdb3cf45648b2d167dca2e103e491fad5ad8559c87
3
+ size 1543504480
model-00003-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:242634933a6eb95c58e27345c378c5f80286d832b300533182b6adbf87ac85d5
3
+ size 1879114208
model-00004-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fbf99f1ca533a697d6bb27d2ffff50c3e9120b37d94e08d256204d84006e3208
3
+ size 1543504480
model-00005-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:caa075ce0ccc45b44afe182d0d064a34766f3bc15d2ffbc6ae0295c0c3b3bcc5
3
+ size 1879114208
model-00006-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d3fad36e66abf09860c6c026adeeee0c527922a474e1b419056cb910acb9a2ec
3
+ size 1543504480
model-00007-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e9257bdb60a4825e167b474a76021fb2dc58b5ef929f1840c4de93b761dc7bde
3
+ size 1879114208
model-00008-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1e117cbf79b7c8142417444d64cdf162edf4887e59039df1ab5430c608a7d361
3
+ size 1543504480
model-00009-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ea6e795e1aea9889541f39faed08af077399068c5470159eab35563d0169cbc9
3
+ size 1879114208
model-00010-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:22ccaa07394e718f5891b3d4a85c2bd369e2df86e3f42ba6a5e70103adbe215e
3
+ size 1543504480
model-00011-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:720378d2fadef6bb733ffc4266842358f8f540c215252cdc5a2c0077b7ac5c26
3
+ size 1879114208
model-00012-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:65cfa32187488131cd3d99be28965df1be5793107eec125a40354ec11bdd8788
3
+ size 1543504480
model-00013-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8ee493198ac58b82ba9db612979723927c5d1b505af7bec4de3eaca5a34abc63
3
+ size 1879114208
model-00014-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5b2b105825f41a1393a248396ce1a932f02b65a4e2d352afa6bd83d8ee57d4be
3
+ size 1543504480
model-00015-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:96d31dd63129e54cdf34db5263dcf0dab36a27a745c815f6cb3bde75a78d819f
3
+ size 1879114208
model-00016-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:231a84636210ff2726d2d50ce065356eea18cacbcb3cebe5a85432405280edd4
3
+ size 1543504480
model-00017-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c6d115675f5b0d3ebca0e580f3f177eb68bffb71e664ba26fb01cea253e6e87d
3
+ size 1879114208
model-00018-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c07583b2fab739fb973c264a507fba881945bc51264b8b24656a6d809b82bc79
3
+ size 1543504480
model-00019-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df6d5867db9f1b6381fb7b5a0028811f947f81cbc77afdac565ad72ee3184c8f
3
+ size 1879114208
model-00020-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3596fd7261b56ec897b4c0d957f031cc2663c67ca1c4a36b8949bb81616cf026
3
+ size 1543504480
model-00021-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b8d78241036d4ae35651bf463d00fe2ba08b5072521ec45f33a4a1cc6cd4b8f8
3
+ size 1879114208
model-00022-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f6e2f9cb78e01da982ef2a89cb239566b4d03a374e9506ca1b86822cf0681465
3
+ size 1543504488
model-00023-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a1fa97677beaabf459a6e6888cb3cfdbb3cd78a9a860ed3982dbe67f3de1f4d3
3
+ size 1879114208
model-00024-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d7e9c7808ff0f707d21d831c7be8f7908a36207e60efb19581a4317ee34477d0
3
+ size 1543504488
model-00025-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:66414420b04c67d2397ac9f8bc1a49882ea5e9853d2adbd61b8e7484ba7bb79c
3
+ size 1879114208
model-00026-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:339035476299b3c936c22c9406a607033e6da9380ab69b0f9ef1b7d0a2a6af2b
3
+ size 1543504488
model-00027-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c307f06094239e7e37a04353d508a4e516414ac5cd76ec4f26a26f6a2b484d7a
3
+ size 1879114208
model-00028-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5adeb0fb794a775ea83aeaa0c32ded964ef8f910aaff510eebc9a6d6ced47f5a
3
+ size 1543504488
model-00029-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2a5febe564290a3c570da54342128efbdc9cb2b646ef384597252e04d3736848
3
+ size 1879114208
model-00030-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fa88625c23c929f03970a6380883c3cbfc6c24cf58bf1351fb9c08fe67f71a98
3
+ size 1543504488
model-00031-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:666b233e63540fd20f34d5ffb12a4cc9457cdc765ecc2dff22aa71970d6e962d
3
+ size 1879114208
model-00032-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cb2eccbe44efe9736f4982c4ebc1f358c416de80df47c4127df1cc0677113892
3
+ size 1543504488
model-00033-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6bae3cb6e0101d00e20f8618a9bf6948aa350506877708ac419c3ed31d983913
3
+ size 1879114208
model-00034-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e019f5ddd347087db938ba4880a5f8cf014d248ab8bdca436028d57d9586af3f
3
+ size 1543504488
model-00035-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:91009dff3feb99e10c98ae735f382e2015d06e0e74e4f3f69acdf6ebea138b8f
3
+ size 1879114208
model-00036-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:82ce2b573295f4b7bc6ee25f290bd5f6fd7142a43f45c6698d8ad8ff72e3ed9b
3
+ size 1543504488
model-00037-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8b909066f1bbac069f56ba563b28cfa2e29a39ffed6493ed8ecee3fa05045e81
3
+ size 1879114208
model-00038-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bb0f55c0c9adf45844cbed1255d7d7403bd60255694896c3892b08f42c506977
3
+ size 1543504488
model-00039-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ddb0b3997fe468f87b386b21ed1b206bfc051d5b240cb3b26a80935e9c3be92f
3
+ size 1879114208
model-00040-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a77e4e295e3033964e52dfced378067a39d5a74fd354fedda545a3a1f333a1c1
3
+ size 1543504488
model-00041-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e7249b344a6e9cca7dc11b32fe424d5b9e1f913f069ab9299217d303327f00a9
3
+ size 1879114208
model-00042-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:79c19a7a99701cd40eed3be151fdae45a6980eb8e33116f52c3ff1b903a33271
3
+ size 1543504488
model-00043-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7839554a2797b0035ec3877edce3c52d2e15ac5d1b4a5ee7d317e92a9bc8c67e
3
+ size 1879114208
model-00044-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:814ba11743a5fd231ce20a1b4e26958c71f3a7841060a61b02edccdc1c222e9c
3
+ size 3087008976
model-00045-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:da63d6468d174ec50efb7c483f5c84b47eb1a6a6a90247ae85d3484ddb598d36
3
+ size 1879114208
model-00046-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:65ccff7133fd4937b82d04a6be1d9a1c05ff86e15d752710153d354d0fc0c166
3
+ size 1543504488
model-00047-of-00162.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:69458a8a4d8001af69038a1a8b2e807f054716dab885befb5d2112bfeeed6c6f
3
+ size 1879114208