Soumye Singhal

soumye

AI & ML interests

LLM Post-training

Recent Activity

upvoted an article 10 days ago

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

liked a model 20 days ago

nvidia/Nemotron-H-47B-Reasoning-128K

liked a model 24 days ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

View all activity

Organizations

upvoted an article 10 days ago

Article

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

and 3 others •

16 days ago

• 6

liked a model 20 days ago

nvidia/Nemotron-H-47B-Reasoning-128K

Text Generation • Updated 20 days ago • 310 • 14

liked a model 24 days ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Text Generation • Updated 22 days ago • 8.9k • • 169

liked a model 27 days ago

nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1

Text Generation • Updated 23 days ago • 21.2k • 90

liked a model about 1 month ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1-FP8

Text Generation • Updated May 13 • 6.94k • 6

upvoted a paper about 2 months ago

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

Paper • 2504.16891 • Published Apr 23 • 22

upvoted 3 collections about 2 months ago

authored 6 papers about 2 months ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2 • 35

Effective Backdoor Mitigation in Vision-Language Models Depends on the Pre-training Objective

Paper • 2311.14948 • Published Nov 25, 2023

Adversarial Training of Reward Models

Paper • 2504.06141 • Published Apr 8

Countering Language Drift with Seeded Iterated Learning

Paper • 2003.12694 • Published Mar 28, 2020 • 1

Recall Traces: Backtracking Models for Efficient Reinforcement Learning

Paper • 1804.00379 • Published Apr 2, 2018

Supervised Seeded Iterated Learning for Interactive Language Learning

Paper • 2010.02975 • Published Oct 6, 2020

upvoted 2 papers about 2 months ago

Countering Language Drift with Seeded Iterated Learning

Paper • 2003.12694 • Published Mar 28, 2020 • 1

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2 • 35

liked a model about 2 months ago

nvidia/Llama-3_1-Nemotron-Ultra-253B-v1-FP8

Text Generation • Updated May 8 • 2.39k • 7

upvoted a paper about 2 months ago

Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment

Paper • 2502.00203 • Published Jan 31 • 2

authored a paper 2 months ago

Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment

Paper • 2502.00203 • Published Jan 31 • 2

Soumye Singhal

AI & ML interests

Recent Activity

Organizations

soumye's activity

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B