Post
331
On the verge of releasing Poseidon-Reasoning-5M, a dataset built to excel in general thought processes, mathematics, and science across a diverse mixture of domains, Iβm also dropping the Gargantua-R1-Compact dataset, a collection of over six million high-quality reasoning QA pair traces. π€π
β¦ Gargantua-R1-Compact : prithivMLmods/Gargantua-R1-Compact
Additionally, Iβm adding the mini version of Gargantua β the Gargantua-R1-Wee : prithivMLmods/Gargantua-R1-Wee
The composition spans 73.93% core mathematical reasoning involving problems, proofs, and computational challenges, 12.11% across diverse scientific domains such as physics, chemistry, biology, and interdisciplinary topics, 11.35% in competitive coding covering algorithms and data structures, 1.37% in academic science focusing on research-level methodology, 0.95% in creative and analytical reasoning through logic puzzles and problem-solving tasks, 0.25% in specialized technical areas like MLOps, LLMs, diffusion models, and CUDA, and 0.06% involving data from graphs and charts converted into structured JSON formats. Designed with both rich contextual depth and formal structural clarity, Gargantua-R1-Compact is an optimal resource for advancing research in symbolic reasoning, interpretability, and high-precision question answering in mathematical domains.
β¦ Collection : prithivMLmods/gargantua-r1-mod-6896bfd7834e82b89ad2b38b
To know more about it, visit the dataset card of the respective dataset. !!
β¦ Gargantua-R1-Compact : prithivMLmods/Gargantua-R1-Compact
from datasets import load_dataset
dataset = load_dataset("prithivMLmods/Gargantua-R1-Compact", split="train")
Additionally, Iβm adding the mini version of Gargantua β the Gargantua-R1-Wee : prithivMLmods/Gargantua-R1-Wee
from datasets import load_dataset
dataset = load_dataset("prithivMLmods/Gargantua-R1-Wee", split="train")
The composition spans 73.93% core mathematical reasoning involving problems, proofs, and computational challenges, 12.11% across diverse scientific domains such as physics, chemistry, biology, and interdisciplinary topics, 11.35% in competitive coding covering algorithms and data structures, 1.37% in academic science focusing on research-level methodology, 0.95% in creative and analytical reasoning through logic puzzles and problem-solving tasks, 0.25% in specialized technical areas like MLOps, LLMs, diffusion models, and CUDA, and 0.06% involving data from graphs and charts converted into structured JSON formats. Designed with both rich contextual depth and formal structural clarity, Gargantua-R1-Compact is an optimal resource for advancing research in symbolic reasoning, interpretability, and high-precision question answering in mathematical domains.
β¦ Collection : prithivMLmods/gargantua-r1-mod-6896bfd7834e82b89ad2b38b
To know more about it, visit the dataset card of the respective dataset. !!