Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
34
5
75
Rajiv Shah
rajistics
Follow
kramp's profile picture
ducatte's profile picture
rengjey's profile picture
80 followers
·
31 following
https://www.rajivshah.com
rajistics
rajshah4
rajistics
AI & ML interests
None yet
Recent Activity
reacted
to
Kseniase
's
post
with ❤️
about 1 month ago
9 Recent advances in Multi-Agent Systems (all open-source) The idea to split tasks across multiple agents instead of relying on one universal agent is now seen as one of the most effective ways to build an AI stack. Concepts like “agent swarms” were highlighted at the AI Engineer Code Summit in NYC (Nov 20–21) as the winning architecture. And this trend is not only about coding and software. It applies across all AI domains. So here is some recent research that helps keep multi-agent systems (MAS) better and up-to-date: 1. LatentMAS → https://huggingface.co/papers/2511.20639 AI agents share their hidden "thoughts" directly in latent space instead of talking through text. This makes collaboration and reasoning way faster and accurate (no extra training needed) 2. Puppeteer → https://huggingface.co/papers/2505.19591 Uses a “puppeteer” LLM that dynamically decides which agents (“puppets”) to call and in what order. By learning this orchestration with reinforcement learning (RL), the system solves complex tasks more efficiently and with fewer compute costs 3. MADD → https://huggingface.co/papers/2511.08217 A MAS with 4 agents for drug discovery. It lets researchers describe a drug discovery task in plain language. Then MADD automatically builds and runs the full hit-identification pipeline, making AI-driven drug design a simple end-to-end workflow 4. Multi-Agent Tool-Integrated Policy Optimization (MATPO) → https://huggingface.co/papers/2510.04678 Lets one LLM act as multiple agents (like a planner and a worker) by using different prompts and training them together with RL. So you get the benefits of a multi-agent system without needing multiple models If you're interested in trends in multi-agent for software development of the future, explore my article with the emergent playbook. This is super interesting → https://www.turingpost.com/p/aisoftwarestack Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe Read further below ⬇️
liked
a Space
about 1 month ago
otellm/leaderboard
upvoted
a
collection
about 2 months ago
DR Tulu
View all activity
Organizations
rajistics
's models
45
Sort: Recently updated
rajistics/layoutlmv3-finetuned-cord_500
Token Classification
•
Updated
Aug 28, 2022
•
25
•
1
rajistics/layoutlmv3-finetuned-cord_800
Token Classification
•
Updated
Aug 28, 2022
•
7
rajistics/test
Updated
Aug 28, 2022
rajistics/layoutlmv2-finetuned-cord_100
Token Classification
•
Updated
Aug 28, 2022
•
6
rajistics/layoutlmv2-finetuned-cord_200
Token Classification
•
Updated
Aug 28, 2022
•
5
rajistics/layoutlmv2-finetuned-cord_300
Token Classification
•
Updated
Aug 28, 2022
•
3
rajistics/layoutlmv2-finetuned-cord_500
Token Classification
•
Updated
Aug 28, 2022
•
6
rajistics/layoutlmv2-finetuned-cord
Token Classification
•
Updated
Aug 27, 2022
•
6
rajistics/testpyramidsrnd
Reinforcement Learning
•
Updated
Jul 13, 2022
•
15
rajistics/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Jun 19, 2022
•
1
rajistics/h2o_gbm_prostate
Updated
Jun 6, 2022
rajistics/autotrain-Adult-934630783
Tabular Classification
•
Updated
May 31, 2022
•
7
•
2
rajistics/q-FrozenLake-v1-8x8-slippery
Reinforcement Learning
•
Updated
May 21, 2022
rajistics/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
May 8, 2022
rajistics/TEST2ppo-LunarLander-v2
Reinforcement Learning
•
Updated
May 8, 2022
Previous
1
2
Next