FineWeb2 Edu Japanese: A high-quality, filtered Japanese dataset (120M texts, 89.3B tokens) for educational AI training.
Yuichi Tateno PRO
hotchpotch
AI & ML interests
Information Retrieval with LLMs
Recent Activity
updated
a dataset
5 days ago
hotchpotch/JFWIR
updated
a collection
6 days ago
FineWeb2 Edu Japanese
liked
a model
6 days ago
ibm-granite/granite-embedding-107m-multilingual
Organizations
Collections
4
japanese-reranker
日本語rerankerシリーズ
-
hotchpotch/japanese-reranker-tiny-v2
Text Ranking • Updated • 2.41k • 4 -
hotchpotch/japanese-reranker-xsmall-v2
Text Ranking • Updated • 3.99k • 1 -
hotchpotch/japanese-reranker-cross-encoder-xsmall-v1
Text Ranking • Updated • 5.23k • 7 -
hotchpotch/japanese-bge-reranker-v2-m3-v1
Text Ranking • Updated • 1.82k • 15
FineWeb2 Edu Japanese
FineWeb2 Edu Japanese: A high-quality, filtered Japanese dataset (120M texts, 89.3B tokens) for educational AI training.
japanese-reranker
日本語rerankerシリーズ
-
hotchpotch/japanese-reranker-tiny-v2
Text Ranking • Updated • 2.41k • 4 -
hotchpotch/japanese-reranker-xsmall-v2
Text Ranking • Updated • 3.99k • 1 -
hotchpotch/japanese-reranker-cross-encoder-xsmall-v1
Text Ranking • Updated • 5.23k • 7 -
hotchpotch/japanese-bge-reranker-v2-m3-v1
Text Ranking • Updated • 1.82k • 15
models
34

hotchpotch/japanese-reranker-xsmall-v2
Text Ranking
•
Updated
•
3.99k
•
1

hotchpotch/japanese-reranker-tiny-v2
Text Ranking
•
Updated
•
2.41k
•
4

hotchpotch/japanese-bge-reranker-v2-m3-v1
Text Ranking
•
Updated
•
1.82k
•
15

hotchpotch/japanese-reranker-cross-encoder-large-v1
Text Ranking
•
Updated
•
4.52k
•
15

hotchpotch/japanese-reranker-cross-encoder-base-v1
Text Ranking
•
Updated
•
1.87k
•
1

hotchpotch/japanese-reranker-cross-encoder-xsmall-v1
Text Ranking
•
Updated
•
5.23k
•
7

hotchpotch/japanese-reranker-cross-encoder-small-v1
Text Ranking
•
Updated
•
66
•
3

hotchpotch/query-crafter-japanese-Qwen3-1.7B
Updated
•
1.72k
•
9

hotchpotch/query-crafter-japanese-Qwen3-4B
Updated
•
44

hotchpotch/tmp-mb-jp-30m-reranker
Updated
•
10
datasets
23
hotchpotch/JFWIR
Viewer
•
Updated
•
128M
•
147
•
1
hotchpotch/fineweb-2-edu-japanese
Viewer
•
Updated
•
262M
•
2.87k
•
12
hotchpotch/japanese-query-crafter-reasoning-80k
Viewer
•
Updated
•
83.3k
•
45
•
2
hotchpotch/tmp-5M-qa-small-tokens-cleaned
Viewer
•
Updated
•
5M
•
86
hotchpotch/japanese-qa-reasoning-100k
Viewer
•
Updated
•
106k
•
73
•
1
hotchpotch/fineweb-2-edu-japanese-noise-detect-raw
Viewer
•
Updated
•
64.2M
•
62
hotchpotch/fineweb-2-japanese-noise-spans
Viewer
•
Updated
•
344k
•
13
hotchpotch/fineweb-2-edu-japanese-scores
Viewer
•
Updated
•
313k
•
23
•
1
hotchpotch/sentence_transformer_japanese
Viewer
•
Updated
•
13.2M
•
510
•
5
hotchpotch/JQaRA
Viewer
•
Updated
•
278k
•
413
•
19