Raja Biswas

rbiswasfc

AI & ML interests

NLP, Generative AI

rbiswasfc's activity

upvoted a paper 8 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published 11 days ago • 87

upvoted an article 11 days ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

Article • Feb 4 • 77

liked a model 11 days ago

nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct

Text Generation • Updated 11 days ago • 4.24k • 103

liked a model 14 days ago

agentica-org/DeepCoder-14B-Preview

Text Generation • Updated 18 days ago • 44.7k • 613

upvoted a collection 22 days ago

SigLIP2

36 items • Updated 25 days ago • 67

upvoted 3 papers 26 days ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 114

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 182

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 122

upvoted a paper 27 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 147

upvoted 2 collections 27 days ago

RLVR

Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated 28 days ago • 11

ReSearch

Trained models as described in the paper "ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning" • 5 items • Updated Mar 27 • 5

updated a dataset 29 days ago

rbiswasfc/r1-7b

Viewer • Updated 29 days ago • 64 • 41

upvoted a paper about 1 month ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 161

liked a model about 1 month ago

nvidia/GR00T-N1-2B

Robotics • Updated Mar 18 • 3.65k • 284

upvoted a collection about 1 month ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 236

upvoted an article about 1 month ago

The N Implementation Details of RLHF with PPO

Article • Oct 24, 2023 • 50

liked a dataset about 2 months ago

qihoo360/Light-R1-SFTData

Viewer • Updated Mar 17 • 79.4k • 1.31k • 40

upvoted a paper about 2 months ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 98

upvoted an article about 2 months ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • Mar 12 • 401

upvoted a collection about 2 months ago

Gemma 3 Release

24 items • Updated 10 days ago • 346