congkai commited on
Commit
fb40b16
Β·
verified Β·
1 Parent(s): 49bb411

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -3
README.md CHANGED
@@ -27,16 +27,19 @@ Our Mission: **Generative AI for all, intelligence in every task.**
27
 
28
  - [InfiR](https://huggingface.co/papers/2502.11573): **InfiR** aims to advance AI systems by improving reasoning, reducing adoption barriers, and addressing privacy concerns through smaller model sizes.
29
  - [InfiR-FP8](https://huggingface.co/InfiX-ai): **InfiR-FP8** is a smaller reasoning-enhanced model trained from scratch using FP8 precision, achieving successful convergence while reducing memory usage by 10% and improving training speed by 20% during the training process. The model will be released in mid-September.
30
- - [InfiAlign](http://huggingface.co/collections/InfiX-ai/infialign-688600ca363ab9ef0b78f7a6): **InfiAlign** is a scalable and data-efficient post-training framework that combines supervised fine-tuning (SFT) and reinforcement learning (RL) with a high-quality data selection pipeline to enhance reasoning in large language models.
31
- - [InfiMMR](https://arxiv.org/abs/2505.23091): **InfiMMR** is a novel three-phase curriculum framework that systematically enhances multimodal reasoning capabilities in small language models through foundational reasoning activation, cross-modal adaptation, and multimodal reasoning enhancement.
32
 
33
  #### πŸ–₯️ Advanced Vision-Native Agent for GUI Interaction
34
  - [InfiGUIAgent](https://huggingface.co/papers/2501.04575): **InfiGUIAgent** is a GUI agent that embeds native hierarchical and expectation-reflection reasoning through a unique two-stage supervised pipeline, enabling robust, multi-step GUI task automation.
35
- - [InfiGUI-R1](https://huggingface.co/InfiX-ai/InfiGUI-R1-3B): **InfiGUI-R1** is a GUI agent developed via the Actor2Reasoner framework, which evolves a reactive model into a deliberative reasoner capable of sophisticated planning and error recovery through spatial reasoning distillation and reinforcement learning.
 
36
 
37
  ---
38
  ### πŸ“° News
39
 
 
 
40
  - πŸ”₯[2025/5/20] Our paper "[InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
41
  ](https://arxiv.org/abs/2505.13893)" released. More information can be found in [the repository](https://github.com/InfiXAI/InfiGFusion). Model is available [here](https://huggingface.co/InfiX-ai/InfiGFusion-14B)
42
  - πŸ”₯[2025/5/20] Our paper "[InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models
 
27
 
28
  - [InfiR](https://huggingface.co/papers/2502.11573): **InfiR** aims to advance AI systems by improving reasoning, reducing adoption barriers, and addressing privacy concerns through smaller model sizes.
29
  - [InfiR-FP8](https://huggingface.co/InfiX-ai): **InfiR-FP8** is a smaller reasoning-enhanced model trained from scratch using FP8 precision, achieving successful convergence while reducing memory usage by 10% and improving training speed by 20% during the training process. The model will be released in mid-September.
30
+ - [InfiAlign](https://huggingface.co/papers/2508.05496): **InfiAlign** is a scalable and data-efficient post-training framework that combines supervised fine-tuning (SFT) and reinforcement learning (RL) with a high-quality data selection pipeline to enhance reasoning in large language models.
31
+ - [InfiMMR](https://huggingface.co/papers/2505.23091): **InfiMMR** is a novel three-phase curriculum framework that systematically enhances multimodal reasoning capabilities in small language models through foundational reasoning activation, cross-modal adaptation, and multimodal reasoning enhancement.
32
 
33
  #### πŸ–₯️ Advanced Vision-Native Agent for GUI Interaction
34
  - [InfiGUIAgent](https://huggingface.co/papers/2501.04575): **InfiGUIAgent** is a GUI agent that embeds native hierarchical and expectation-reflection reasoning through a unique two-stage supervised pipeline, enabling robust, multi-step GUI task automation.
35
+ - [InfiGUI-R1](https://huggingface.co/papers/2504.14239v1): **InfiGUI-R1** is a GUI agent developed via the Actor2Reasoner framework, which evolves a reactive model into a deliberative reasoner capable of sophisticated planning and error recovery through spatial reasoning distillation and reinforcement learning.
36
+ - [InfiGUI-G1](https://huggingface.co/papers/2508.05731): **InfiGUI-G1** is a multimodal GUI agent that employs Adaptive Exploration Policy Optimization (AEPO) to improve semantic alignment in GUI grounding. The novel training framework achieves up to **8.3%** relative improvement over baseline methods.
37
 
38
  ---
39
  ### πŸ“° News
40
 
41
+ - πŸ”₯[2025/8/11] Our paper "[InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization
42
+ ](https://arxiv.org/abs/2508.05731)" released. More information can be found in [the repository](https://github.com/InfiXAI/InfiGUI-G1). Model is available [here](https://huggingface.co/InfiX-ai/InfiGUI-G1-7B)
43
  - πŸ”₯[2025/5/20] Our paper "[InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
44
  ](https://arxiv.org/abs/2505.13893)" released. More information can be found in [the repository](https://github.com/InfiXAI/InfiGFusion). Model is available [here](https://huggingface.co/InfiX-ai/InfiGFusion-14B)
45
  - πŸ”₯[2025/5/20] Our paper "[InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models