A compilation of the best multi-agent papers
🐦 Twitter • 📢 Discord • Swarms Platform • 📙 Framework
A compilation of the best multi-agent papers by the Swarms Team. Our mission is to democratize multi-agent systems to automate the world economy with agents and usher in a post-scarcity Human civilization. Join our community now!
- [Paper Name] ([PDF PAPER LINK ]) bibtex short name
- K-Level Reasoning with Large Language Models
- More Agents is All You Need
- LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration
- AgentScope: A Flexible yet Robust Multi-Agent Platform
- Learning to Decode Collaboratively with Multiple Language Models
- AIOS: LLM Agent Operating System
- AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
- Chain of Agents: Large Language Models Collaborating on Long-Context Tasks
- Mixture-of-Agents Enhances Large Language Model Capabilities
- EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms
- Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
- Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning
- Optimizing Collaboration of LLM based Agents
- LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework
- Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System
- AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
- Very Large-Scale Multi-Agent Simulation in AgentScope
- AgentClinic: A Multimodal Agent Benchmark for AI in Clinical Environments
- MultiAgentBench: Evaluating the Collaboration and Competition of LLM Agents
- TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
- BoxingGym: Benchmarking Progress in Automated Experimental Design
- Automated Unit Test Improvement using Large Language Models
- Experiential Co-Learning of Software-Developing Agents
- ChatDev: Communicative Agents for Software Development
- MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution
- CodeR: Issue Resolving with Multi-Agent and Task Graphs
- From LLMs to LLM-based Agents for Software Engineering: A Survey
- CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
- Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
- Large Language Model-Based Agents for Software Engineering: A Survey
- AutoSafeCoder: A Multi-Agent Framework for Securing LLM Code Generation
- RGD: Multi-LLM Based Agent Debugger via Refinement and Generation Guidance
- Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents
- MEDCO: Medical Education Copilots Based on A Multi-Agent Framework
- Multi Agent based Medical Assistant for Edge Devices
- LAMBDA: A Large Model Based Data Agent
- Agentic Retrieval-Augmented Generation for Time Series Analysis
- Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining
- AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML
- AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
- DataLab: A Unified Platform for LLM-Powered Business Intelligence
- Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
- Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation
- Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
- MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
- PC-Agent: A Hierarchical Multi-Agent Framework for Complex Task Automation on PC
- Human-level play in Diplomacy by combining language models with strategic reasoning
- CulturePark: Boosting Cross-cultural Understanding in Large Language Models
- Beyond Human Translation: Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
- FanCric: Multi-Agentic Framework for Crafting Fantasy 11 Cricket Teams
- Can Large Language Models Grasp Legal Theories? Enhance Legal Reasoning with Multi-Agent Collaboration
- Wisdom of the Silicon Crowd: LLM Ensemble Prediction Capabilities Match Human Crowd Accuracy
- Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
- Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
- Evolutionary Optimization of Model Merging Recipes
- Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
- Constitutional AI: Harmlessness from AI Feedback
- On scalable oversight with weak LLMs judging strong LLMs
- ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate
- RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing
- Agent-as-a-Judge: Evaluate Agents with Agents
- Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates
- MALT: Improving Reasoning with Multi-Agent LLM Training
- Why Do Multi-Agent LLM Systems Fail?
- Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent
- Generative Agents: Interactive Simulacra of Human Behavior
- SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents
- Scaling Instructable Agents Across Many Simulated Worlds
- Scaling Synthetic Data Creation with 1,000,000,000 Personas
- Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents
- From Text to Life: On the Reciprocal Relationship between Artificial Life and Large Language Models
- Mindstorms in Natural Language-Based Societies of Mind
- The AI Scientist: The world's first AI system for automating scientific research
- Agents' Room: Narrative Generation through Multi-step Collaboration
- GenSim: A General Social Simulation Platform with Large Language Model based Agents
- Large Language Models can Achieve Social Balance
- Cultural Evolution of Cooperation among LLM Agents
- SDPO: Segment-Level Direct Preference Optimization for Social Agents
- AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents
- OASIS: Open Agent Social Interaction Simulations with One Million Agents
- AllHands: Ask Me Anything on Large-scale Verbatim Feedback via Large Language Models
- AgentInstruct: Toward Generative Teaching with Agentic Flows
- SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning
- Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI Experts
- AFlow: Automating Agentic Workflow Generation
- Agents Thinking Fast and Slow: A Talker-Reasoner Architecture
- DynaSaur: Large Language Agents Beyond Predefined Actions
- LLMs as Method Actors: A Model for Prompt Engineering and Architecture
- Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
- Automated Design of Agentic Systems
- The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization
- Multi-agent Architecture Search via Agentic Supernet
- Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems
- When One LLM Drools, Multi-LLM Collaboration Rules
- Enhancing Reasoning with Collaboration and Memory
- Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning
- Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence
In the arxiv_bibtex.bib file, you can find the bibtex citations for all the papers in this repository.
Join the multi-agent community now on discord HERE