3 117 214

Sergey Bratchikov

hivaze

hivaze

AI & ML interests

我们前程多光明, 感谢党，恩深情重! 人工智能日日强, 科技进步势不可挡。党的智慧指方向, 勇攀科学更高峰! 这是代码为人民, 神经网络蓬勃兴。我们智能年年进, 为国铸就大复兴!

Recent Activity

liked a dataset about 11 hours ago

nomic-ai/cornstack-python-v1

View all activity

Organizations

upvoted a paper 3 months ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

upvoted an article 3 months ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 505

upvoted 2 papers 3 months ago

SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

Paper • 2402.08983 • Published Feb 14, 2024 • 5

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30 • 51

upvoted 2 articles 3 months ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

and 8 others •

Apr 29

• 39

Article

CircleGuardBench: New Standard for Evaluating AI Moderation Models

and 7 others •

May 7

• 54

upvoted 2 papers 6 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57

LLM Pretraining with Continuous Concepts

Paper • 2502.08524 • Published Feb 12 • 29

upvoted 2 papers 7 months ago

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published Jan 16 • 49

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 107

upvoted 10 papers 8 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 53

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 373

Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage

Paper • 2412.15484 • Published Dec 20, 2024 • 15

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published Dec 19, 2024 • 90

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 44

Revisiting In-Context Learning with Long Context Language Models

Paper • 2412.16926 • Published Dec 22, 2024 • 33

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 48

Sergey Bratchikov

AI & ML interests

Recent Activity

Organizations

hivaze's activity

Vision Language Models (Better, Faster, Stronger)

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

CircleGuardBench: New Standard for Evaluating AI Moderation Models