Mahesh Sinha
committed on
Commit 0aeb7c7 · 1 Parent(s): 9492d4e
Fixing Data Readme
- data/README.md +4 -4
data/README.md
CHANGED
@@ -10,10 +10,10 @@ curl https://github.com/teknium1/GPTeacher/blob/main/Roleplay/roleplay-similarit
 ## Convert the JSON data files to JSONL.

 ```shell
-python3 ./scripts/alpaca_json_to_jsonl.py --
-python3 ./scripts/alpaca_json_to_jsonl.py --
-python3 ./scripts/alpaca_json_to_jsonl.py --
-python3 ./scripts/alpaca_json_to_jsonl.py --
+python3 ./scripts/alpaca_json_to_jsonl.py --file data/alpaca_data_gpt4.json --output data/alpaca_data_gpt4.jsonl
+python3 ./scripts/alpaca_json_to_jsonl.py --file data/raw/vicuna_cleaned.json --output data/vicuna_cleaned.jsonl
+python3 ./scripts/alpaca_json_to_jsonl.py --file data/raw/roleplay-similarity_0.6-instruct-dataset.json --output data/roleplay-similarity_0.6-instruct-dataset.jsonl
+python3 ./scripts/alpaca_json_to_jsonl.py --file data/raw/gpt4-instruct-similarity-0.6-dataset.json --output data/gpt4-instruct-similarity-0.6-dataset.jsonl
 ```
 ---
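For readers unfamiliar with the conversion these commands perform, the following is a minimal sketch of what a JSON-to-JSONL step typically looks like. It is not the repository's `alpaca_json_to_jsonl.py`, whose contents are not shown in this commit. It assumes each input file holds a single top-level JSON array of records and reuses the `--file` and `--output` flag names from the README; the function names `json_to_jsonl` and `main` are hypothetical.

```python
import argparse
import json


def json_to_jsonl(src_path, dst_path):
    """Read a top-level JSON array and write one JSON object per line.

    Assumption: the input is a single JSON array of records (as in
    Alpaca-style datasets). Returns the number of records written.
    """
    with open(src_path, encoding="utf-8") as src:
        records = json.load(src)
    if not isinstance(records, list):
        raise ValueError("expected a top-level JSON array of records")
    with open(dst_path, "w", encoding="utf-8") as dst:
        for record in records:
            # json.dumps emits no raw newlines, so each record stays on one line.
            dst.write(json.dumps(record, ensure_ascii=False) + "\n")
    return len(records)


def main(argv=None):
    # Flag names mirror the README commands; everything else is illustrative.
    parser = argparse.ArgumentParser(description="Convert a JSON array file to JSONL.")
    parser.add_argument("--file", required=True, help="path to the input .json file")
    parser.add_argument("--output", required=True, help="path to the output .jsonl file")
    args = parser.parse_args(argv)
    count = json_to_jsonl(args.file, args.output)
    print(f"wrote {count} records to {args.output}")
```

Calling `main(["--file", "data/alpaca_data_gpt4.json", "--output", "data/alpaca_data_gpt4.jsonl"])` would mirror the first README command. The sketch loads the whole array into memory, which is simple but may be unsuitable for very large files.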