Hao Jiang's picture

Hao Jiang

TechxGenus

·

https://techxgenus.github.io/

TechxGenus

AI & ML interests

Code Intelligence; Large Language Model; AI Alignment; Efficient Inference

Recent Activity

liked a model 14 days ago

mistralai/Magistral-Small-2506

upvoted a paper 15 days ago

Reinforcement Pre-Training

liked a Space 17 days ago

webml-community/conversational-webgpu

View all activity

Organizations

None yet

upvoted a paper 15 days ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published 15 days ago • 222

upvoted a collection 2 months ago

OpenMathReasoning

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 7 days ago • 40

upvoted 3 papers 2 months ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 170

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 60

Sleep-time Compute: Beyond Inference Scaling at Test-time

Paper • 2504.13171 • Published Apr 17 • 15

upvoted a paper 3 months ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published Apr 3 • 55

upvoted 2 collections 3 months ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated May 21 • 144

Gemma 3 Release

24 items • Updated 26 days ago • 391

upvoted a paper 4 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 193

upvoted a collection 4 months ago

The Ultimate Collection of Code Classifiers

🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated May 5 • 11

upvoted a paper 4 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 142

upvoted 2 papers 5 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 404

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 104

upvoted 2 collections 6 months ago

DeepSeek-V3

4 items • Updated Mar 25 • 261

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 144

upvoted 2 collections 7 months ago

DeepSeek-V2.5

2 items • Updated Dec 10, 2024 • 41

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Apr 30 • 86

upvoted 2 papers 7 months ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 79

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 126

upvoted a collection 7 months ago

OpenCoder Datasets

OpenCoder datasets! • 6 items • Updated Nov 15, 2024 • 40