Shubham Toshniwal

stoshniwal
https://shtoshni.github.io/
  • shtoshni

AI & ML interests

NLP, LLM

Recent Activity

upvoted a paper about 9 hours ago
Is Human-Written Data Enough? The Challenge of Teaching Reasoning to LLMs Without RL or Distillation
liked a dataset 1 day ago
nvidia/Nemotron-Math-HumanReasoning
commented on a paper about 2 months ago
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Organizations

NVIDIA

authored a paper 2 months ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2 • 38
authored 5 papers 3 months ago

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Paper • 2206.04615 • Published Jun 9, 2022 • 5

Nemotron-4 340B Technical Report

Paper • 2406.11704 • Published Jun 17, 2024

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Paper • 2410.01560 • Published Oct 2, 2024 • 4

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4 • 13

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

Paper • 2504.16891 • Published Apr 23 • 22