Rajiv Shah's picture

Rajiv Shah

rajistics

·

https://www.rajivshah.com

AI & ML interests

None yet

Recent Activity

reacted to Kseniase's post with ❤️ about 1 month ago

9 Recent advances in Multi-Agent Systems (all open-source) The idea to split tasks across multiple agents instead of relying on one universal agent is now seen as one of the most effective ways to build an AI stack. Concepts like “agent swarms” were highlighted at the AI Engineer Code Summit in NYC (Nov 20–21) as the winning architecture. And this trend is not only about coding and software. It applies across all AI domains. So here is some recent research that helps keep multi-agent systems (MAS) better and up-to-date: 1. LatentMAS → https://huggingface.co/papers/2511.20639 AI agents share their hidden "thoughts" directly in latent space instead of talking through text. This makes collaboration and reasoning way faster and accurate (no extra training needed) 2. Puppeteer → https://huggingface.co/papers/2505.19591 Uses a “puppeteer” LLM that dynamically decides which agents (“puppets”) to call and in what order. By learning this orchestration with reinforcement learning (RL), the system solves complex tasks more efficiently and with fewer compute costs 3. MADD → https://huggingface.co/papers/2511.08217 A MAS with 4 agents for drug discovery. It lets researchers describe a drug discovery task in plain language. Then MADD automatically builds and runs the full hit-identification pipeline, making AI-driven drug design a simple end-to-end workflow 4. Multi-Agent Tool-Integrated Policy Optimization (MATPO) → https://huggingface.co/papers/2510.04678 Lets one LLM act as multiple agents (like a planner and a worker) by using different prompts and training them together with RL. So you get the benefits of a multi-agent system without needing multiple models If you're interested in trends in multi-agent for software development of the future, explore my article with the emergent playbook. This is super interesting → https://www.turingpost.com/p/aisoftwarestack Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe Read further below ⬇️

liked a Space about 1 month ago

otellm/leaderboard

upvoted a collection about 2 months ago

View all activity

Organizations

rajistics 's models 45

rajistics/layoutlmv3-finetuned-cord_500

Token Classification • Updated Aug 28, 2022 • 25 • 1

rajistics/layoutlmv3-finetuned-cord_800

Token Classification • Updated Aug 28, 2022 • 7

rajistics/test

Updated Aug 28, 2022

rajistics/layoutlmv2-finetuned-cord_100

Token Classification • Updated Aug 28, 2022 • 6

rajistics/layoutlmv2-finetuned-cord_200

Token Classification • Updated Aug 28, 2022 • 5

rajistics/layoutlmv2-finetuned-cord_300

Token Classification • Updated Aug 28, 2022 • 3

rajistics/layoutlmv2-finetuned-cord_500

Token Classification • Updated Aug 28, 2022 • 6

rajistics/layoutlmv2-finetuned-cord

Token Classification • Updated Aug 27, 2022 • 6

rajistics/testpyramidsrnd

Reinforcement Learning • Updated Jul 13, 2022 • 15

rajistics/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Jun 19, 2022 • 1

rajistics/h2o_gbm_prostate

Updated Jun 6, 2022

rajistics/autotrain-Adult-934630783

Tabular Classification • Updated May 31, 2022 • 7 • 2

rajistics/q-FrozenLake-v1-8x8-slippery

Reinforcement Learning • Updated May 21, 2022

rajistics/ppo-LunarLander-v2

Reinforcement Learning • Updated May 8, 2022

rajistics/TEST2ppo-LunarLander-v2

Reinforcement Learning • Updated May 8, 2022