Update README.md
README.md
CHANGED
@@ -513,4 +513,6 @@ We took chunks of size 250 tokens, 500 tokens, and 1000 tokens randomly for each
 
 We then used these chunks to generate questions and answers based on this text using a state-of-the-art LLM.
 
-Finally, we selected negatives for each chunk using the similarity from the dense embeddings of the [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3) model.
+Finally, we selected negatives for each chunk using the similarity from the dense embeddings of the [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3) model.
+
+The training data for this model can be found at [lightblue/kurage_training_data](https://huggingface.co/datasets/lightblue/kurage_training_data)
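
As an illustration of the negative-selection step mentioned in the diff, here is a minimal sketch of ranking candidate chunks by dense-embedding similarity with [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3) (via the FlagEmbedding package). This is not the actual data-preparation script: the function name, the question-vs-chunk similarity choice, and the top-k cutoff are all assumptions.

```python
# Minimal sketch of similarity-based negative selection (not the actual data-prep script).
# Assumption: negatives are the candidate chunks most similar to the question, excluding the positive chunk.
import numpy as np
from FlagEmbedding import BGEM3FlagModel

model = BGEM3FlagModel("BAAI/bge-m3", use_fp16=True)

def select_negatives(question, positive_chunk, candidate_chunks, k=5):
    """Return the k chunks most similar to the question, skipping the positive chunk."""
    dense = model.encode([question] + candidate_chunks)["dense_vecs"]  # dense embeddings (unit-normalised)
    query_vec, chunk_vecs = dense[0], dense[1:]
    scores = chunk_vecs @ query_vec   # cosine similarity via dot product
    ranked = np.argsort(-scores)      # most similar first
    negatives = [candidate_chunks[i] for i in ranked if candidate_chunks[i] != positive_chunk]
    return negatives[:k]
```

Choosing the most similar non-positive chunks yields hard negatives, which is the usual motivation for mining negatives with an embedding model rather than sampling them at random.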