byteprobe (忍者)

upvoted a changelog 3 months ago

Changelog

Organization and User profiles now include repository listing pages

Jun 20

• 126

upvoted 8 papers 3 months ago

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published Jun 12 • 74

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11 • 99

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 263

Magistral

Paper • 2506.10910 • Published Jun 12 • 64

upvoted 4 changelogs 3 months ago

Changelog

Add MCP-Compatible Spaces to Your Tools

Jun 17

• 82

Changelog

New Model Filtering Options on the Hub

Jun 16

• 74

Changelog

New Inference Providers Dashboard

Jun 5

• 64

Changelog

Connect Your MCP Client to the Hugging Face Hub

Jun 6

• 108

upvoted 7 papers 3 months ago

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28 • 55

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Paper • 2505.11594 • Published May 16 • 76

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20 • 76

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 89

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26 • 88

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 122

忍者

AI & ML interests

Organizations

Organization and User profiles now include repository listing pages

Scaling Test-time Compute for LLM Agents

Essential-Web v1.0: 24T tokens of organized web data

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Magistral

Add MCP-Compatible Spaces to Your Tools

New Model Filtering Options on the Hub

New Inference Providers Dashboard

Connect Your MCP Client to the Hugging Face Hub

Skywork Open Reasoner 1 Technical Report

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Scaling Law for Quantization-Aware Training

Distilling LLM Agent into Small Models with Retrieval and Code Tools

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Chain-of-Model Learning for Language Model

忍者

AI & ML interests

Organizations

byteprobe's activity

Organization and User profiles now include repository listing pages

Add MCP-Compatible Spaces to Your Tools

New Model Filtering Options on the Hub

New Inference Providers Dashboard

Connect Your MCP Client to the Hugging Face Hub