Arunkumar Venkataramanan's picture

89 204

Arunkumar Venkataramanan

ArunkumarVR

·

https://arunkumarramanan.github.io

AI & ML interests

AGI Research: Reasoning, Safety & Alignment (Superalignment), Generative AI (GenAI), Multi-Modal Foundation Models (FMs), Large Language Models (LLMs), Transformers & Diffusion Models, Open LLM Training, Optimization & Finetuning, Serving & Inference

Recent Activity

liked a dataset 1 day ago

HuggingFaceFW/fineweb-edu

liked a dataset 1 day ago

HuggingFaceTB/finemath

liked a dataset 1 day ago

HuggingFaceTB/smoltalk

View all activity

Organizations

upvoted 3 collections 1 day ago

Reasoning datasets

24 items • Updated May 22 • 5

SmolLM3 evaluation datasets

Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry • 13 items • Updated 2 days ago • 4

SmolLM3 pretraining datasets

datasets used in SmolLM3 pretraining • 14 items • Updated 2 days ago • 11

upvoted an article 1 day ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

3 days ago

• 433

upvoted a paper 3 months ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16 • 74

upvoted 2 collections 3 months ago

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 8 days ago • 45

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 567

upvoted a paper 3 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 161

upvoted a collection 4 months ago

Google's Gemma models family

319 items • Updated about 5 hours ago • 367

upvoted an article 4 months ago

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 295

upvoted 3 collections 4 months ago

Gemma 3 Release

24 items • Updated about 5 hours ago • 400

QwQ

Qwen with Questions • 6 items • Updated Apr 28 • 97

Model Optimizer

A collection of generative models quantized and optimized with TensorRT Model Optimizer. • 21 items • Updated 3 days ago • 23

upvoted a paper 5 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 142

upvoted 3 collections 5 months ago

RLHFlow MATH Process Reward Model

This is a collection of datasets and models of process reward modeling. • 15 items • Updated Nov 9, 2024 • 11

Skywork-o1-Open

Skywork o1 open model collections • 3 items • Updated 28 days ago • 20

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Apr 28 • 82

upvoted an article 5 months ago

Article

Open R1: Update #2

By

and 6 others •

Feb 10

• 216

upvoted 2 collections 5 months ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 123

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 61