- InfiAlign: A Scalable and Sample-Efficient Framework for Aligning LLMs to Enhance Reasoning Capabilities
  Paper • 2508.05496 • Published • 9
- InfiX-ai/InfiAlign-Qwen-7B-SFT
  8B • Updated • 163 • 4
- InfiX-ai/InfiAlign-Qwen-7B-DPO
  Text Generation • 8B • Updated • 177 • 3
- InfiX-ai/InfiAlign-Qwen-7B-DPO-Eval-Response
  Preview • Updated • 95
The comprehensive model fusion strategies
The comprehensive model fusion strategies, including SFT fusion, DPO fusion, and new merging.
- InfiX-ai/InfiFusion-14B
  Updated • 2.38k • 4
- InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion
  Paper • 2501.02795 • Published
- InfiX-ai/InfiGFusion-14B
  Updated • 3.3k • 6
- InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
  Paper • 2505.13893 • Published
InfiGUI-G1 enhances GUI grounding with Adaptive Exploration Policy Optimization (AEPO) to overcome exploration bottlenecks.
- InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
  Paper • 2504.14239 • Published • 14
- InfiX-ai/InfiGUI-R1-3B
  Image-Text-to-Text • 4B • Updated • 1.43k • 6
- InfiX-ai/android_control_train
  Viewer • Updated • 13.6k • 78
- InfiX-ai/android_control_test
  Updated • 107 • 1
InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
- InfiX-ai/InfiR-1B-Base
  Text Generation • 1B • Updated • 2.28k • 6
- InfiX-ai/InfiR-1B-Instruct
  Text Generation • 1B • Updated • 2.42k • 8
- InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
  Paper • 2502.11573 • Published • 9
- InfiX-ai/InfiAlign-Qwen-7B-SFT
  8B • Updated • 163 • 4