Correct typos in datasets.py (#639)
Browse files — src/axolotl/datasets.py (+2 −2)
src/axolotl/datasets.py
CHANGED
@@ -22,7 +22,7 @@ class TokenizedPromptDataset(Dataset):
     """
     Dataset that returns tokenized prompts from a stream of text files.
         Args:
-            prompt_tokenizer (PromptTokenizingStrategy): The prompt tokenizing method for
+            prompt_tokenizer (PromptTokenizingStrategy): The prompt tokenizing method for processing the data.
             dataset (dataset.Dataset): Dataset with text files.
     """

@@ -55,7 +55,7 @@ class ConstantLengthDataset(IterableDataset):
     """
     Iterable dataset that returns constant length chunks of tokens from stream of text files.
         Args:
-            tokenizer (Tokenizer): The processor used for
+            tokenizer (Tokenizer): The processor used for processing the data.
             dataset (dataset.Dataset): Dataset with text files.
             seq_length (int): Length of token sequences to return.
     """