12 24 17

Huiqiang Jiang

iofu728

https://hqjiang.com/

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Chain-of-Model Learning for Language Model

upvoted a paper about 2 months ago

Chain-of-Model Learning for Language Model

authored a paper 2 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

View all activity

Organizations

authored a paper about 2 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 120

upvoted a paper about 2 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 120

authored a paper 2 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5 • 28

upvoted a paper 2 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5 • 28

commented a paper 2 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5 • 28 •

liked a model 2 months ago

Qwen/Qwen3-235B-A22B

Text Generation • 235B • Updated May 21 • 160k • • 974

upvoted a paper 2 months ago

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper • 2504.16083 • Published Apr 22 • 9

commented a paper 2 months ago

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper • 2504.16083 • Published Apr 22 • 9 •

liked a model 5 months ago

moonshotai/Moonlight-16B-A3B

Text Generation • 16B • Updated Feb 26 • 7.04k • 84

upvoted a paper 5 months ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published Jan 28 • 38

liked a model 5 months ago

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • 15B • Updated Jan 29 • 19.8k • • 314

upvoted a paper 6 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 48

liked 2 models 6 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • 33B • Updated Feb 24 • 418k • • 1.41k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 917k • • 12.4k

upvoted a paper 6 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 280

updated a dataset 7 months ago

microsoft/SCBench

Viewer • Updated Dec 24, 2024 • 922 • 2.18k • 6

upvoted a paper 7 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

authored a paper 7 months ago

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Paper • 2412.10319 • Published Dec 13, 2024 • 10

upvoted a paper 7 months ago

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Paper • 2412.10319 • Published Dec 13, 2024 • 10

commented a paper 7 months ago

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Paper • 2412.10319 • Published Dec 13, 2024 • 10 •

Huiqiang Jiang

AI & ML interests

Recent Activity

Organizations

iofu728's activity