FineWeb2 Edu Japanese: A high-quality, filtered Japanese dataset (120M texts, 89.3B tokens) for educational AI training.
Yuichi Tateno PRO
hotchpotch
AI & ML interests
Information Retrieval with LLMs
Recent Activity
liked
a model
3 days ago
hotchpotch/japanese-reranker-base-v2
liked
a model
3 days ago
hotchpotch/japanese-reranker-small-v2
upvoted
an
article
19 days ago
Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders
Organizations
japanese-reranker
日本語rerankerシリーズ
-
hotchpotch/japanese-reranker-tiny-v2
Text Ranking • 0.0B • Updated • 1.41k • 4 -
hotchpotch/japanese-reranker-xsmall-v2
Text Ranking • 0.0B • Updated • 32.2k • 1 -
hotchpotch/japanese-reranker-small-v2
Text Ranking • 0.1B • Updated • 53 • 1 -
hotchpotch/japanese-reranker-base-v2
Text Ranking • 0.1B • Updated • 355 • 2
FineWeb2 Edu Japanese
FineWeb2 Edu Japanese: A high-quality, filtered Japanese dataset (120M texts, 89.3B tokens) for educational AI training.
japanese-reranker
日本語rerankerシリーズ
-
hotchpotch/japanese-reranker-tiny-v2
Text Ranking • 0.0B • Updated • 1.41k • 4 -
hotchpotch/japanese-reranker-xsmall-v2
Text Ranking • 0.0B • Updated • 32.2k • 1 -
hotchpotch/japanese-reranker-small-v2
Text Ranking • 0.1B • Updated • 53 • 1 -
hotchpotch/japanese-reranker-base-v2
Text Ranking • 0.1B • Updated • 355 • 2
models
36

hotchpotch/japanese-reranker-xsmall-v2
Text Ranking
•
0.0B
•
Updated
•
32.2k
•
1

hotchpotch/japanese-reranker-base-v2
Text Ranking
•
0.1B
•
Updated
•
355
•
2

hotchpotch/japanese-reranker-small-v2
Text Ranking
•
0.1B
•
Updated
•
53
•
1

hotchpotch/japanese-reranker-tiny-v2
Text Ranking
•
0.0B
•
Updated
•
1.41k
•
4

hotchpotch/japanese-reranker-cross-encoder-small-v1
Text Ranking
•
0.1B
•
Updated
•
3.45k
•
3

hotchpotch/japanese-reranker-cross-encoder-base-v1
Text Ranking
•
0.1B
•
Updated
•
1.16k
•
1

hotchpotch/japanese-reranker-cross-encoder-large-v1
Text Ranking
•
0.3B
•
Updated
•
4.92k
•
15

hotchpotch/japanese-bge-reranker-v2-m3-v1
Text Ranking
•
0.6B
•
Updated
•
722
•
15

hotchpotch/japanese-reranker-cross-encoder-xsmall-v1
Text Ranking
•
0.1B
•
Updated
•
4.12k
•
7

hotchpotch/query-crafter-japanese-Qwen3-1.7B
2B
•
Updated
•
136
•
9
datasets
24
hotchpotch/miracl-hf-unified
Viewer
•
Updated
•
106M
•
315
hotchpotch/JFWIR
Viewer
•
Updated
•
128M
•
332
•
1
hotchpotch/fineweb-2-edu-japanese
Viewer
•
Updated
•
262M
•
1.57k
•
16
hotchpotch/japanese-query-crafter-reasoning-80k
Viewer
•
Updated
•
83.3k
•
55
•
3
hotchpotch/tmp-5M-qa-small-tokens-cleaned
Viewer
•
Updated
•
5M
•
90
hotchpotch/japanese-qa-reasoning-100k
Viewer
•
Updated
•
106k
•
42
•
1
hotchpotch/fineweb-2-edu-japanese-noise-detect-raw
Viewer
•
Updated
•
64.2M
•
212
hotchpotch/fineweb-2-japanese-noise-spans
Viewer
•
Updated
•
344k
•
40
hotchpotch/fineweb-2-edu-japanese-scores
Viewer
•
Updated
•
313k
•
48
•
1
hotchpotch/sentence_transformer_japanese
Viewer
•
Updated
•
13.2M
•
123
•
5