Kazuki Fujii's picture

Kazuki Fujii

kazukifujii

·

AI & ML interests

None yet

Organizations

upvoted a paper 16 days ago

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Paper • 2507.16815 • Published 17 days ago • 35

upvoted 2 collections 18 days ago

🏟️ Long Code Arena

All the resources for our Long Code Arena benchmark! • 13 items • Updated Mar 14 • 6

Common Pile v0.1 Raw Data

8TB of public domain and openly licensed text • 30 items • Updated Jun 6 • 18

upvoted 2 collections 19 days ago

Essential-Web v1.0

10 items • Updated Jun 18 • 8

OpenReasoning-Nemotron

Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 9 days ago • 39

upvoted an article 21 days ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

By

and 3 others •

21 days ago

• 47

upvoted a paper 25 days ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17 • 42

upvoted an article 27 days ago

Article

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

By

and 6 others •

Jun 12

• 120

upvoted a collection 29 days ago

Llama-3.1-Swallow

11 items • Updated Jul 2 • 10

upvoted 2 collections about 1 month ago

Gemma-2-Swallow

6 items • Updated May 18 • 4

👩‍💻 OlympicCoder

Reasoning datasets and models for competitive coding • 4 items • Updated May 13 • 19

upvoted a paper about 2 months ago

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Paper • 2406.07522 • Published Jun 11, 2024 • 41

upvoted a collection about 2 months ago

Llama-3.1-Swallow-v0.5

2 items • Updated Jun 21 • 1

upvoted a paper about 2 months ago

LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Paper • 2505.16933 • Published May 22 • 33

upvoted a collection about 2 months ago

Qwen3

84 items • Updated 2 days ago • 1.04k

upvoted an article about 2 months ago

Article

SmolLM - blazingly fast and remarkably powerful

By

and 2 others •

Jul 16, 2024

• 404

upvoted a collection 2 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 18 days ago • 331

upvoted an article 2 months ago

Article

Announcing the Common Pile and Comma v0.1

By

•

Jun 6

• 14

upvoted a paper 2 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 126

upvoted a collection 2 months ago

Medical QA Datasets

A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22 • 45