AI & ML interests

None defined yet.

Recent Activity

QiushiSunΒ  updated a collection about 21 hours ago
OS-Genesis
QiushiSunΒ  updated a collection about 21 hours ago
OS-Genesis
QiushiSunΒ  updated a collection about 21 hours ago
OS-Genesis
View all activity

OS-Copilot's activity

Symbol-LLMΒ 
posted an update about 1 month ago
view post
Post
947
πŸ₯³ Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning !

πŸ“• Title: Vision-Language Models Can Self-Improve Reasoning via Reflection

πŸ”— Link: Vision-Language Models Can Self-Improve Reasoning via Reflection (2411.00855)

πŸ˜‡Takeaways:

- We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing.

- Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !
Symbol-LLMΒ 
posted an update about 2 months ago
view post
Post
2151
πŸš€ Excited to introduce a new member of the OS-Copilot family: OS-Atlas - an open-sourced foundational action model for GUI agents

πŸ“˜ Paper: OS-ATLAS: A Foundation Action Model for Generalist GUI Agents (2410.23218)
πŸ”— Website: https://osatlas.github.io

πŸ˜‡ TL;DR: OS-Atlas offers:
1. State-of-the-Art GUI Grounding: Helps GUI agents accurately locate GUI elements.
2. Strong OOD Performance and Cross-platform Compatibility: Excels in out-of-domain agentic tasks across MacOS, Windows, Linux, Android, and Web.
3. Complete Infrastructure for GUI Data Synthesis:
You can easily build your own OS agent upon it!

Symbol-LLMΒ 
posted an update 5 months ago
view post
Post
2119
πŸ”₯Thrilled to release our 8B version of Symbol-LLM-Instruct !

It follows the two-stage training strategy proposed in the original paper and is continually optimized on LLaMA3-Chat-8B model.

Symbol-LLM was accepted by ACL'24 main conference ! See you in Thailand !

Paper link: https://arxiv.org/abs/2311.09278
Paper Title: Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models
  • 1 reply
Β·
Symbol-LLMΒ 
posted an update 6 months ago
view post
Post
1914
πŸ“Excited to make public a series of checkpoints !

- Final checkpoints after self-training with ENVISIONS framework
- Cover math, logic, and agent domains
- Include 7B / 13B

πŸ“• Check our paper:
Title: Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models
Link: https://arxiv.org/abs/2406.11736
  • 2 replies
Β·