Aussie AI
Agent AI Architectures
-
Last Updated 27 February, 2025
-
by David Spuler, Ph.D.
Agents. Ah, yes, the final phase of AI. You tell your AI assistant to go away and book you a vacation, and then you're done. Sorted! (What could go wrong?)
Agent architectures are a very powerful extension of LLMs, especially if you use the trendy "agentic" terminology. There are literally dozens of startups building them, but no-one really agrees what they are. Here's one way to look at it using computer programmer vernacular:
- Read-only agents — go away and research, coming back to give you more detailed answers.
- Read-write agents — they go away and research it, book your vacation, and then tell you about it.
Actually, to further confuse issues, there are a few agent-like architectures already available and in common usage:
- "Plug-ins" (such as for OpenAI's ChatGPT) — access third-party data sources in your LLM queries, which is a RAG-like architecture whereby the AI engine gets more data from the integrated data source, and then uses the LLM to complete the answer.
- "Tools" — your AI actually launches some executable tools behind-the-scenes, such as a clock to answer time-related queries, a calculator for arithmetic, and so on. The LLM has to be trained to know when to launch a tool, and which ones to use for what queries.
There's also a time-based categorization possible:
- Interactive agents — launch now, do their best to handle it quickly, and then report on what they found (or what they did). For example, you tell your agent to send a text message or an email.
- Launched agents — the idea is longer-running agents, such as one that goes off and books your concert tickets.
- Scheduled, Autonomous, or Perpetual agents — the idea is for an agent to sit there, always watching, and then report on this, or get triggered by something (e.g. it watches a newsfeed and summarizes them for you each morning; or an agent watches stock prices, and then trades for you as your own personal HFT).
Some of the other practical issues involved in running an agent include:
- Approval-needed versus unattended execution. (It's all about trust, or lack thereof.)
- Follow-up questions, whereby the AI determines what you're asking, but then reverses the power balance in the relationship by asking for clarification of your wishes ("window or aisle?").
- Personalization — it really should already know that you want a window seat.
- Context — the agent would ideally know your geo location, or what's on your screen, or where you're standing (e.g. it really should look out through your camera and know that you're in the bathroom, as everyone would want).
Related areas of LLM research include:
Survey Papers on AI Agents
Survey papers on agents:
- Z. Xi, W. Chen, X. Guo, W. He, Y. Ding, B. Hong, M. Zhang, J. Wang, S. Jin, E. Zhou et al., “The rise and potential of large language model based agents: A survey,” arXiv preprint arXiv:2309.07864, 2023. https://arxiv.org/abs/2309.07864
- T. Guo, X. Chen, Y. Wang, R. Chang, S. Pei, N. V. Chawla, O. Wiest, and X. Zhang, “Large language model based multi agents: A survey of progress and challenges,” arXiv preprint arXiv:2402.01680, 2024. https://arxiv.org/abs/2402.01680
- Junwei Liu, Kaixin Wang, Yixuan Chen, Xin Peng, Zhenpeng Chen, Lingming Zhang, Yiling Lou, 4 Sep 2024, Large Language Model-Based Agents for Software Engineering: A Survey, https://arxiv.org/abs/2409.02977 Project: https://github.com/FudanSELab/Agent4SE-Paper-List
- Yanxian Huang, Wanjun Zhong, Ensheng Shi, Min Yang, Jiachi Chen, Hui Li, Yuchi Ma, Qianxiang Wang, Zibin Zheng, Yanlin Wang, 13 Sep 2024, Agents in Software Engineering: Survey, Landscape, and Vision, https://arxiv.org/abs/2409.09030 https://github.com/DeepSoftwareAnalytics/Awesome-Agent4SE
- Biao Wu, Yanda Li, Meng Fang, Zirui Song, Zhiwei Zhang, Yunchao Wei, Ling Chen, 4 Nov 2024, Foundations and Recent Trends in Multimodal Mobile Agents: A Survey, https://arxiv.org/abs/2411.02006 https://github.com/aialt/awesome-mobile-agents
- Shuai Wang, Weiwen Liu, Jingxuan Chen, Weinan Gan, Xingshan Zeng, Shuai Yu, Xinlong Hao, Kun Shao, Yasheng Wang, Ruiming Tang, 7 Nov 2024, GUI Agents with Foundation Models: A Comprehensive Survey, https://arxiv.org/abs/2411.04890
- Chris Sypherd, Vaishak Belle, 5 Dec 2024, Practical Considerations for Agentic LLM Systems, https://arxiv.org/abs/2412.04093
- Dang Nguyen, Jian Chen, Yu Wang, Gang Wu, Namyong Park, Zhengmian Hu, Hanjia Lyu, Junda Wu, Ryan Aponte, Yu Xia, Xintong Li, Jing Shi, Hongjie Chen, Viet Dac Lai, Zhouhang Xie, Sungchul Kim, Ruiyi Zhang, Tong Yu, Mehrab Tanjim, Nesreen K. Ahmed, Puneet Mathur, Seunghyun Yoon, Lina Yao, Branislav Kveton, Thien Huu Nguyen, Trung Bui, Tianyi Zhou, Ryan A. Rossi, Franck Dernoncourt, 18 Dec 2024, GUI Agents: A Survey, https://arxiv.org/abs/2412.13501
- Shuaihang Chen, Yuanxing Liu, Wei Han, Weinan Zhang, Ting Liu, 23 Dec 2024. A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers. https://arxiv.org/abs/2412.17481
- Aditi Singh, Abul Ehtesham, Saket Kumar, Tala Talaei Khoei, 15 Jan 2025, Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG, https://arxiv.org/abs/2501.09136
- Xinzhe Li, Jan 2025, A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning, Proceedings of the 31st International Conference on Computational Linguistics, pages 9760–9779, January 19–24, 2025. ©2025 Association for Computational Linguistics, https://aclanthology.org/2025.coling-main.652.pdf https://github.com/xinzhel/LLM-Agent-Survey
Building AI Agents
Research papers on how to build AI agents:
- Julian Yip, Apr 2, 2024, Build Autonomous AI Agents with Function Calling: Transform your chatbot into an agent that can interact with external APIs, https://towardsdatascience.com/build-autonomous-ai-agents-with-function-calling-0bb483753975 (Implement agents via models that output a JSON object that describes the API to call and the parmaeters to send.)
- Anton Antich, May 17, 2024, Anatomy of an AI Multi-Agent: How do we build a useful AI agent? https://medium.com/superstringtheory/anatomy-of-an-ai-multi-agent-e2cfedc3b050
- Patrick Dougherty, Jun 4, 2024, Building AI Agents: Lessons Learned over the past Year, https://medium.com/@cpdough/building-ai-agents-lessons-learned-over-the-past-year-41dc4725d8e5
- Lak Lakshmanan, March 7, 2024, Building an AI Assistant with DSPy: A way to program and tune prompt-agnostic LLM agent pipelines, https://towardsdatascience.com/building-an-ai-assistant-with-dspy-2e1e749a1a95
- Valentina Alto, May 2024, Building LLM Powered Applications: Create intelligent apps and agents with large language models, Packt Publishing, https://www.amazon.com/Building-LLM-Apps-Intelligent-Language/dp/1835462316/
- Irene Weber, 13 Jun 2024, Large Language Models as Software Components: A Taxonomy for LLM-Integrated Applications, https://arxiv.org/abs/2406.10300
- Assaf Elovic, May 10, 2024, How to Build the Ultimate AI Automation with Multi-Agent Collaboration, https://medium.com/@assafelovic/how-to-build-the-ultimate-ai-automation-with-multi-agent-collaboration-ed61a1ea8f3b
- Eddie Forson, Apr 29, 2024, Why I’m building my own AI Agent library, https://medium.com/@Ed_Forson/why-im-building-my-own-ai-agent-library-e20ec9aa3647
- Mistral AI Team, Aug 7, 2024, Build, tweak, repeat: Making it easier to develop and share generative AI applications, https://mistral.ai/news/build-tweak-repeat/
- Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu, 8 May 2024 (v2), Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security, https://arxiv.org/abs/2401.05459 https://github.com/MobileLLM/Personal_LLM_Agents_Survey
- Giancarlo Mori, Aug 14, 2024, Top 5 AI Agent Platforms You Should Know, https://gcmori.medium.com/top-5-ai-agent-platforms-you-should-know-418e7e7e24a9
- Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi, 9 Aug 2024, AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems, https://arxiv.org/abs/2408.15247 https://github.com/microsoft/autogen/tree/main/samples/apps/autogen-studio
- Zhiwei Liu, Weiran Yao, Jianguo Zhang, Liangwei Yang, Zuxin Liu, Juntao Tan, Prafulla K. Choubey, Tian Lan, Jason Wu, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese, 23 Feb 2024, AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System, https://arxiv.org/abs/2402.15538 https://github.com/SalesforceAIResearch/AgentLite
- Kai Mei, Zelong Li, Shuyuan Xu, Ruosong Ye, Yingqiang Ge, Yongfeng Zhang, 26 Mar 2024 (v2), AIOS: LLM Agent Operating System, https://arxiv.org/abs/2403.16971 https://github.com/agiresearch/AIOS https://aios.readthedocs.io/en/latest/
- Emilia David, September 10, 2024, ServiceNow introduces a library of enterprise AI agents you can customize to fit your workflow, https://venturebeat.com/ai/servicenow-introduces-a-library-of-enterprise-ai-agents-you-can-customize-to-fit-your-workflow/
- Cobus Greyling, Sep 2024, An AI Agent Architecture & Framework Is Emerging, https://cobusgreyling.medium.com/an-ai-agent-architecture-framework-is-emerging-addae3804f23
- Grant Gross, 19 Sep 2024, Thinking of building your own AI agents? Don’t do it, advisors say, CIO, https://www.cio.com/article/3529740/thinking-of-building-your-own-ai-agents-dont-do-it-advisors-say.html
- Shuyan Zhou, July 25, 2024, Solving Real-World Tasks with AI Agents Ph.D. Thesis, CMU-LTI-24-014, Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, https://kilthub.cmu.edu/ndownloader/files/48699703
- Aparna Dhinakaran, Sep 2024, Choosing Between LLM Agent Frameworks. The tradeoffs between building bespoke code-based agents and the major agent frameworks. https://towardsdatascience.com/choosing-between-llm-agent-frameworks-69019493b259
- Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
- David Gewirtz, Oct. 21, 2024, Microsoft's upgraded Copilot Studio is like a LEGO set for building AI agents, https://www.zdnet.com/article/microsofts-upgraded-copilot-studio-is-like-a-lego-set-for-building-ai-agents/
- Charles Lamanna, Oct 21, 2024, Unlocking autonomous agent capabilities with Microsoft Copilot Studio, https://www.microsoft.com/en-us/microsoft-copilot/blog/copilot-studio/unlocking-autonomous-agent-capabilities-with-microsoft-copilot-studio/
- Dawei Gao, Zitao Li, Xuchen Pan, Weirui Kuang, Zhijian Ma, Bingchen Qian, Fei Wei, Wenhao Zhang, Yuexiang Xie, Daoyuan Chen, Liuyi Yao, Hongyi Peng, Zeyu Zhang, Lin Zhu, Chen Cheng, Hongzhu Shi, Yaliang Li, Bolin Ding, Jingren Zhou, 20 May 2024 (v2), AgentScope: A Flexible yet Robust Multi-Agent Platform, https://arxiv.org/abs/2402.14034 https://github.com/modelscope/agentscope
- Bryson Masse, October 31, 2024, Microsoft’s agentic AI tool OmniParser rockets up the open source charts, https://venturebeat.com/ai/microsofts-agentic-ai-tool-omniparser-rockets-up-the-open-source-charts/
- Jared Spataro, November 19, 2024, Introducing Copilot Actions, new agents, and tools to empower IT teams, https://www.microsoft.com/en-us/microsoft-365/blog/2024/11/19/introducing-copilot-actions-new-agents-and-tools-to-empower-it-teams/ ("Copilot is the UI for AI")
- Sahar Mor, Nov 28, 2024, The Open-Source Toolkit for Building AI Agents. Curated frameworks, tools, and libraries every developer needs to build functional and efficient AI agents, https://www.aitidbits.ai/p/open-source-agents
- Prasad Thammineni, Oct 14, 2024, From No-Code Builders to Agent-First Apps: 7 Trends Shaping the Future of AI Agents, https://medium.com/towards-generative-ai-applications/from-no-code-builders-to-agent-first-apps-trends-shaping-the-future-of-ai-agents-b423a76c55b7
- Sahar Mor, Dec 22, 2024, The Voice Agents Toolkit for Builders: Curated frameworks, tools, and libraries to launch reliable and efficient voice agents, https://www.aitidbits.ai/p/voice-agents-toolkit
- Jens Kohl, Luisa Gloger, Rui Costa, Otto Kruse, Manuel P. Luitz, David Katz, Gonzalo Barbeito, Markus Schweier, Ryan French, Jonas Schroeder, Thomas Riedl, Raphael Perri, Youssef Mostafa, 18 Dec 2024, Generative AI Toolkit -- a framework for increasing the quality of LLM-based applications over their whole life cycle, https://arxiv.org/abs/2412.14215 https://github.com/awslabs/generative-ai-toolkit
Research on AI Agent Architectures
There are plenty of research papers on agent extensions to AI.
- Jon Turow, June 5, 2024, The Rise of AI Agent Infrastructure, Madrona, https://www.madrona.com/the-rise-of-ai-agent-infrastructure/
- Julian Horsey, May 2, 2024, How to start building AI Agents in 2024, Geeky Gadgets, https://www.geeky-gadgets.com/building-ai-agents-pathway/
- Sirui Hong, Yizhang Lin, Bang Liu, Bangbang Liu, Binhao Wu, Danyang Li, Jiaqi Chen, Jiayi Zhang, Jinlin Wang, Li Zhang, Lingyao Zhang, Min Yang, Mingchen Zhuge, Taicheng Guo, Tuo Zhou, Wei Tao, Wenyi Wang, Xiangru Tang, Xiangtao Lu, Xiawu Zheng, Xinbing Liang, Yaying Fei, Yuheng Cheng, Zongze Xu, Chenglin Wu, 12 Mar 2024 (v3), Data Interpreter: An LLM Agent For Data Science, https://arxiv.org/abs/2402.18679 Code: https://github.com/geekan/MetaGPT
- Zelong Li, Wenyue Hua, Hao Wang, He Zhu, Yongfeng Zhang, 4 Feb 2024 (v2), Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents, https://arxiv.org/abs/2402.00798 Code: https://github.com/agiresearch/Formal-LLM
- Qiusi Zhan, Zhixiang Liang, Zifan Ying, Daniel Kang, 25 Mar 2024 (v2), InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents, https://arxiv.org/abs/2403.02691
- Quentin Gallouédec, Edward Beeching, ClementRomac, Thomas Wolf, April 22, 2024, Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent, Hugging Face Blog, https://huggingface.co/blog/jat
- Juyong Lee, Taywon Min, Minyong An, Changyeon Kim, Kimin Lee, 25 Apr 2024, Benchmarking Mobile Device Control Agents across Diverse Configurations, https://arxiv.org/abs/2404.16660 Code: https://b-moca.github.io/
- S Zahedi Jahromi, Feb 2024, Conversational QA Agents with Session Management, Master's Degree Thesis, Computer Engineering, Politechnico di Torino, Italy, https://webthesis.biblio.polito.it/secure/31037/1/tesi.pdf
- Benjamin A Newman, Chris Paxton, Kris Kitani, Henny Admoni, 16 Apr 2024, Bootstrapping Linear Models for Fast Online Adaptation in Human-Agent Collaboration, https://arxiv.org/abs/2404.10733
- Mengkang Hu, Yao Mu, Xinmiao Yu, Mingyu Ding, Shiguang Wu, Wenqi Shao, Qiguang Chen, Bin Wang, Yu Qiao, and Ping Luo. 2023a. Tree-planner: Efficient close-loop task planning with large language models. arXiv preprint arXiv:2310.08582. https://arxiv.org/abs/2310.08582
- Noah Shinn, Federico Cassano, Ashwin Gopinath, Karthik R Narasimhan, and Shunyu Yao. 2023. Reflexion: Language agents with verbal reinforcement learning. In Thirty-seventh Conference on Neural Information Processing Systems. https://arxiv.org/abs/2303.11366
- Yujia Qin, Shihao Liang, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, et al. 2023b. ToolLLM: Facilitating large language models to master 16000+ real-world apis. arXiv preprint arXiv:2307.16789. https://arxiv.org/abs/2307.16789
- Lei Wang, Chen Ma, Xueyang Feng, Zeyu Zhang, Hao Yang, Jingsen Zhang, Zhiyuan Chen, Jiakai Tang, XuChen, Yankai Lin, et al. 2023c. A survey on large language model based autonomous agents. arXiv preprint arXiv:2308.11432. https://arxiv.org/abs/2308.11432
- Joongwon Kim, Bhargavi Paranjape, Tushar Khot, Hannaneh Hajishirzi, 10 Jun 2024, Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning, https://arxiv.org/abs/2406.06469 Code: https://github.com/agent-husky/Husky-v1
- David Linthicum, June 7, 2024, Generative AI agents will revolutionize AI architecture, InfoWorld, https://www.infoworld.com/article/3715457/generative-ai-agents-will-revolutionize-ai-architecture.html
- Difei Gao, Lei Ji, Zechen Bai, Mingyu Ouyang, Peiran Li, Dongxing Mao, Qinchen Wu, Weichen Zhang, Peiyi Wang, Xiangwu Guo, Hengxu Wang, Luowei Zhou, Mike Zheng Shou, 2024, ASSISTGUI:Task-Oriented PC Graphical User Interface Automation, https://openaccess.thecvf.com/content/CVPR2024/papers/Gao_AssistGUI_Task-Oriented_PC_Graphical_User_Interface_Automation_CVPR_2024_paper.pdf Code: https://showlab.github.io/assistgui/
- Hesam Sheikh, Jun 1, 2024, Towards AI Build Blog Writer and Researcher AI Agents with Ollama (100% local): Creating AI agents with Crewai and using Ollama to run them 100% locally in 5 very easy steps!, https://pub.towardsai.net/build-your-first-ai-agent-in-5-easy-steps-100-local-2fb771438a8f
- Simeon Emanuilov, Apr 4, 2024 LLM agent operating system (AIOS) and the future of LLM-powered agents, https://medium.com/@simeon.emanuilov/llm-agent-operating-system-aios-and-the-future-of-llm-powered-agents-3d08b4e91c34 https://unfoldai.com/aios-llm-powered-agents/
- Wei Chen, Zhiyuan Li, 3 Apr 2024 (v2), Octopus v2: On-device language model for super agent, https://arxiv.org/abs/2404.01744
- Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu, 10 Jan 2024, Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security, https://arxiv.org/abs/2401.05459 Code: https://github.com/MobileLLM/Personal_LLM_Agents_Survey
- Jesus Rodriguez, Jan 5, 2024, Inside MemGPT: An LLM Framework for Autonomous Agents Inspired by Operating Systems Architectures, Towards AI, https://pub.towardsai.net/inside-memgpt-an-llm-framework-for-autonomous-agents-inspired-by-operating-systems-architectures-674b7bcca6a5
- Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-Huei Lee, Sergey Levine, Yao Lu, Linda Luu, Carolina Parada, Peter Pastor, Jornell Quiambao, Kanishka Rao, Jarek Rettinghouse, Diego Reyes, Pierre Sermanet, Nicolas Sievers, Clayton Tan, Alexander Toshev, Vincent Vanhoucke, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Mengyuan Yan, Andy Zeng, Aug 2022, Do As I Can, Not As I Say: Grounding Language in Robotic Affordances https://arxiv.org/abs/2204.01691 Project: https://say-can.github.io/
- Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman, Jun 2022, WebGPT: Browser-assisted question-answering with human feedback https://arxiv.org/abs/2112.09332
- James Nguyen, Nov 19, 2023, Forget RAG: Embrace agent design for a more intelligent grounded ChatGPT! https://james-tn.medium.com/forget-rag-embrace-agent-design-for-a-more-intelligent-grounded-chatgpt-6c562d903c61
- Julian Yip, Apr 2, 2024, Build Autonomous AI Agents with Function Calling: Transform your chatbot into an agent that can interact with external APIs, https://towardsdatascience.com/build-autonomous-ai-agents-with-function-calling-0bb483753975 (Implement agents via models that output a JSON object that describes the API to call and the parmaeters to send.)
- Anton Antich, May 17, 2024, Anatomy of an AI Multi-Agent: How do we build a useful AI agent? https://medium.com/superstringtheory/anatomy-of-an-ai-multi-agent-e2cfedc3b050
- Luyuan Wang, Yongyu Deng, Yiwei Zha, Guodong Mao, Qinmin Wang, Tianchen Min, Wei Chen, Shoufa Chen, 12 Jun 2024, MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents, https://arxiv.org/abs/2406.08184 Project: https://mobileagentbench.github.io/ Code: https://mobileagentbench.github.io/
- Patrick Dougherty, Jun 4, 2024, Building AI Agents: Lessons Learned over the past Year, https://medium.com/@cpdough/building-ai-agents-lessons-learned-over-the-past-year-41dc4725d8e5
- Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su, 22 Feb 2024, Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments, https://arxiv.org/abs/2402.14672
- 8 Jun 2024 (v2), A Survey on Efficient Inference for Large Language Models, Zixuan Zhou, Xuefei Ning, Ke Hong, Tianyu Fu, Jiaming Xu, Shiyao Li, Yuming Lou, Luning Wang, Zhihang Yuan, Xiuhong Li, Shengen Yan, Guohao Dai, Xiao-Ping Zhang, Yuhan Dong, Yu Wang, https://arxiv.org/abs/2404.14294
- Sandi Besen, Apr 24, 2024, The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey, Towards Data Science, https://towardsdatascience.com/the-landscape-of-emerging-ai-agent-architectures-for-reasoning-planning-and-tool-calling-a-a95214b743c1
- Junlin Wang, Jue Wang, Ben Athiwaratkun, Ce Zhang, James Zou, 7 Jun 2024, Mixture-of-Agents Enhances Large Language Model Capabilities, https://arxiv.org/abs/2406.04692
- Krishankant Singhal, Apr 12, 2024, CrewAI: Aframework for building and orchestrating multi-agent AI systems. https://krishankantsinghal.medium.com/crewai-ae3eb2a9e87a
- Lak Lakshmanan, March 7, 2024, Building an AI Assistant with DSPy: A way to program and tune prompt-agnostic LLM agent pipelines, https://towardsdatascience.com/building-an-ai-assistant-with-dspy-2e1e749a1a95
- Tianlin Shi, Andrej Karpathy, Linxi Fan, Jonathan Hernandez, Percy Liang, 2017, World of Bits: An Open-Domain Platform for Web-Based Agents, Proceedings of the 34th International Conference on Machine Learning, PMLR 70:3135-3144, https://proceedings.mlr.press/v70/shi17a.html
- Peter C Humphreys, David Raposo, Toby Pohlen, Gregory Thornton, Rachita Chhaparia, Alistair Muldal, Josh Abramson, Petko Georgiev, Alex Goldin, Adam Santoro, Timothy Lillicrap, 11 Nov 2022 (v2), A data-driven approach for learning to control computers, https://arxiv.org/abs/2202.08137
- Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Luke Zettlemoyer, Nicola Cancedda, Thomas Scialom, 9 Feb 2023, Toolformer: Language Models Can Teach Themselves to Use Tools, https://arxiv.org/abs/2302.04761
- Z. Xi, W. Chen, X. Guo, W. He, Y. Ding, B. Hong, M. Zhang, J. Wang, S. Jin, E. Zhou et al., “The rise and potential of large language model based agents: A survey,” arXiv preprint arXiv:2309.07864, 2023. https://arxiv.org/abs/2309.07864
- T. Guo, X. Chen, Y. Wang, R. Chang, S. Pei, N. V. Chawla, O. Wiest, and X. Zhang, “Large language model based multi agents: A survey of progress and challenges,” arXiv preprint arXiv:2402.01680, 2024. https://arxiv.org/abs/2402.01680
- James O'Donnell, May 1, 2024, Sam Altman says helpful agents are poised to become AI’s killer function, https://www.technologyreview.com/2024/05/01/1091979/sam-altman-says-helpful-agents-are-poised-to-become-ais-killer-function/
- Fabian Both, June 2024, why we no longer use LangChain for building our AI agents , https://www.octomind.dev/blog/why-we-no-longer-use-langchain-for-building-our-ai-agents (Replaces LangChain with their own more-focused internal tool sets.)
- Janelle Teng, June 21, 2024, State of the Cloud 2024, The Legacy Cloud is dead — long live AI Cloud! https://nextbigteng.substack.com/p/state-of-the-cloud-2024
- Ignacio de Gregorio, June 2024, Mixture-of-Agents Beats ChatGPT-4o: Collaboration is Intelligence, https://medium.com/@ignacio.de.gregorio.noblejas/mixture-of-agents-beats-chatgpt-4o-6470a74f1525
- Wes Brewer, Ana Gainaru, Frédéric Suter, Feiyi Wang, Murali Emani, Shantenu Jha, 20 Jun 2024, AI-coupled HPC Workflow Applications, Middleware and Performance, (Examines integrations of various workflows into LLMs.) https://arxiv.org/abs/2406.14315
- Valentina Alto, May 2024, Building LLM Powered Applications: Create intelligent apps and agents with large language models, Packt Publishing, https://www.amazon.com/Building-LLM-Apps-Intelligent-Language/dp/1835462316/
- Irene Weber, 13 Jun 2024, Large Language Models as Software Components: A Taxonomy for LLM-Integrated Applications, https://arxiv.org/abs/2406.10300
- Honghua Dong, Qidong Su, Yubo Gao, Zhaoyu Li, Yangjun Ruan, Gennady Pekhimenko, Chris J. Maddison, Xujie Si, 19 Jun 2024, APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts, https://arxiv.org/abs/2406.13161 Code: https://github.com/appl-team/appl (A Python-like script language for prompt engineering integration into applications and agents.)
- Kapil Raina, Jun 3, 2024, Analytics with GenAI Agents (Text2SQL, CrewAI, LangChain, GPT4o), https://kapil-raina.medium.com/analytics-with-genai-agents-text2sql-crewai-langchain-gpt4o-314985bf6dc6
- Louis-François Bouchard, Louie Peters, May 2024, Chapter 9: Agents, Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG, https://www.amazon.com/Building-LLMs-Production-Reliability-Fine-Tuning/dp/B0D4FFPFW8/
- Aarushi Kansal, Chapter 3: Chains, Tools and Agents Building Generative AI-Powered Apps: A Hands-on Guide for Developers, Apress, https://www.amazon.com/Building-Generative-AI-Powered-Apps-Hands-ebook/dp/B0CTXXP1S4/
- Vishal Rajput, Apr 11, 2024, What’s next for AI: AI agentic workflows? https://medium.com/aiguys/next-for-llms-and-rag-ai-agentic-workflows-1869ba0a6796
- Assaf Elovic, May 10, 2024, How to Build the Ultimate AI Automation with Multi-Agent Collaboration, https://medium.com/@assafelovic/how-to-build-the-ultimate-ai-automation-with-multi-agent-collaboration-ed61a1ea8f3b
- Rachel Metz, July 12, 2024, OpenAI Scale Ranks Progress Toward ‘Human-Level’ Problem Solving: The company believes its technology is approaching the second level of five on the path to artificial general intelligence, Bloomberg, https://www.bloomberg.com/news/articles/2024-07-11/openai-sets-levels-to-track-progress-toward-superintelligent-ai?sref=P6Q0mxvj
- Asankhaya Sharma, 26 Jul 2024, Patched MOA: optimizing inference for diverse software development tasks, https://arxiv.org/abs/2407.18521
- Eddie Forson, Apr 29, 2024, Why I’m building my own AI Agent library, https://medium.com/@Ed_Forson/why-im-building-my-own-ai-agent-library-e20ec9aa3647
- Dexola, Jul 29, 2024, Autonomous AI Agents: From Concept to Real-World Application, https://generativeai.pub/autonomous-ai-agents-from-concept-to-real-world-application-593546229cdf
- Haolin Jin, Linghan Huang, Haipeng Cai, Jun Yan, Bo Li, Huaming Chen, 5 Aug 2024, From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future, https://arxiv.org/abs/2408.02479
- Lakshmi narayana .U, Jul 28, 2024, STORM: Stanford’s Revolutionary Research Tool Harnessing the Power of Agents and Agentic Workflows, https://blog.stackademic.com/storm-stanfords-revolutionary-research-tool-harnessing-the-power-of-agents-and-agentic-workflows-a2fa0e1a7fe3
- Sayash Kapoor, Benedikt Stroebl, Zachary S. Siegel, Nitya Nadgir, Arvind Narayanan, 1 Jul 2024, AI Agents That Matter, https://arxiv.org/abs/2407.01502 (Examining measuring both accuracy and cost in building agent-based systems.)
- Cognine, 2024, Why 2024 is the Year of AI Agents and Compound AI Systems? https://cognine.com/why-2024-is-the-year-of-ai-agents-and-compound-ai-systems/
- Zoumana Keita, Aug 12, 2024, AI Agents — From Concepts to Practical Implementation in Python: This will change the way you think about AI and its capabilities, https://towardsdatascience.com/ai-agents-from-concepts-to-practical-implementation-in-python-fb26789b1560
- Vala Afshar, Aug. 25, 2024, Six levels of autonomous work: How AI augments, then replaces: In the next decade, the six levels of autonomous work will drive augmentation capabilities at first, and then full replacement of tasks, roles, teams, and - ultimately - lines-of-business, https://www.zdnet.com/article/six-levels-of-autonomous-work-how-ai-augments-then-replaces/
- Grant Gross, 30 Aug 2024, Agentic AI: Decisive, operational AI arrives in business, https://www.cio.com/article/3496519/agentic-ai-decisive-operational-ai-arrives-in-business.html
- Giancarlo Mori, Jul 11, 2024, Designing and Implementing AI Agents in Your Enterprise, https://gcmori.medium.com/designing-and-implementing-ai-agents-in-your-enterprise-3330bd034091
- Daniel Chin, Yuxuan Wang, Gus Xia, 19 May 2024, Human-Centered LLM-Agent User Interface: A Position Paper, https://arxiv.org/abs/2405.13050
- Giancarlo Mori, Jul 2, 2024, Intro to AI Agents and Architectures, https://gcmori.medium.com/intro-to-ai-agents-and-architectures-3dfa8c887425
- Lutfi Eren Erdogan, Nicholas Lee, Siddharth Jha, Sehoon Kim, Ryan Tabrizi, Suhong Moon, Coleman Hooper, Gopala Anumanchipalli, Kurt Keutzer, Amir Gholami, 1 Sep 2024, TinyAgent: Function Calling at the Edge, https://arxiv.org/abs/2409.00608 https://github.com/SqueezeAILab/TinyAgent
- Junwei Liu, Kaixin Wang, Yixuan Chen, Xin Peng, Zhenpeng Chen, Lingming Zhang, Yiling Lou, 4 Sep 2024, Large Language Model-Based Agents for Software Engineering: A Survey, https://arxiv.org/abs/2409.02977 Project: https://github.com/FudanSELab/Agent4SE-Paper-List
- Emilia David, September 10, 2024, ServiceNow introduces a library of enterprise AI agents you can customize to fit your workflow, https://venturebeat.com/ai/servicenow-introduces-a-library-of-enterprise-ai-agents-you-can-customize-to-fit-your-workflow/
- Michael Nuñez, September 13, 2024, Microsoft’s Windows Agent Arena: Teaching AI assistants to navigate your PC, https://venturebeat.com/ai/microsofts-windows-agent-arena-teaching-ai-assistants-to-navigate-your-pc/
- Jeremy Kahn, September 17, 2024, Microsoft introduces AI agents and updates to Copilot 365 apps as the war to make AI more useful intensifies, https://fortune.com/2024/09/16/microsoft-launches-ai-agents-updates-to-copilot-365-apps/
- Cobus Greyling, Sep 2024, An AI Agent Architecture & Framework Is Emerging, https://cobusgreyling.medium.com/an-ai-agent-architecture-framework-is-emerging-addae3804f23
- Yanxian Huang, Wanjun Zhong, Ensheng Shi, Min Yang, Jiachi Chen, Hui Li, Yuchi Ma, Qianxiang Wang, Zibin Zheng, Yanlin Wang, 13 Sep 2024, Agents in Software Engineering: Survey, Landscape, and Vision, https://arxiv.org/abs/2409.09030 https://github.com/DeepSoftwareAnalytics/Awesome-Agent4SE
- Grant Gross, 19 Sep 2024, Thinking of building your own AI agents? Don’t do it, advisors say, CIO, https://www.cio.com/article/3529740/thinking-of-building-your-own-ai-agents-dont-do-it-advisors-say.html
- Shuyan Zhou, July 25, 2024, Solving Real-World Tasks with AI Agents Ph.D. Thesis, CMU-LTI-24-014, Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, https://kilthub.cmu.edu/ndownloader/files/48699703
- Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
- Mareike Hartmann, Alexander Koller, 27 Sep 2024, A Survey on Complex Tasks for Goal-Directed Interactive Agents, https://arxiv.org/abs/2409.18538 https://coli-saar.github.io/interactive-agents
- Bessemer, Oct 17, 2024, Part II: Multimodal capabilities unlock new opportunities in Vertical AI. Vertical AI applications born out of novel audio, voice, and vision capabilities will fundamentally change the way we work. https://www.bvp.com/atlas/part-ii-multimodal-capabilities-unlock-new-opportunities-in-vertical-ai
- Charles Lamanna, Oct 21, 2024, Unlocking autonomous agent capabilities with Microsoft Copilot Studio, https://www.microsoft.com/en-us/microsoft-copilot/blog/copilot-studio/unlocking-autonomous-agent-capabilities-with-microsoft-copilot-studio/
- LangChain, Oct 19, 2024, Memory for agents, https://blog.langchain.dev/memory-for-agents/
- LangChain, Jul 20, 2024, Planning for Agents, https://blog.langchain.dev/planning-for-agents/
- Taryn Plumb, October 28, 2024 , Gartner predicts AI agents will transform work, but disillusionment is growing, https://venturebeat.com/ai/gartner-predicts-ai-agents-will-transform-work-but-disillusionment-is-growing/
- Matthew Finnegan, 21 Nov 2024, AI agents are coming to work — here’s what businesses need to know, https://www.computerworld.com/article/3609764/ai-agents-are-coming-to-work-heres-what-businesses-need-to-know.html
- Carl Franzen, November 21, 2024, Google Cloud launches AI Agent Space amid rising competition, https://venturebeat.com/ai/google-cloud-launches-ai-agent-space-amid-rising-competition/
- Ziyang Huang, Jun Zhao, Kang Liu, 1 Dec 2024, Towards Adaptive Mechanism Activation in Language Agent, https://arxiv.org/abs/2412.00722
- Lu Wang, Fangkai Yang, Chaoyun Zhang, Junting Lu, Jiaxu Qian, Shilin He, Pu Zhao, Bo Qiao, Ray Huang, Si Qin, Qisheng Su, Jiayi Ye, Yudi Zhang, Jian-Guang Lou, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 13 Dec 2024, Large Action Models: From Inception to Implementation, https://arxiv.org/abs/2412.10047 https://github.com/microsoft/UFO/tree/main/dataflow https://microsoft.github.io/UFO/dataflow/overview/
- Wenchao Xu, Jinyu Chen, Peirong Zheng, Xiaoquan Yi, Tianyi Tian, Wenhui Zhu, Quan Wan, Haozhao Wang, Yunfeng Fan, Qinliang Su, Xuemin Shen, https://arxiv.org/abs/2412.13437 18 Dec 2024, Deploying Foundation Model Powered Agent Services: A Survey, (A survey of not just deployment, but many inference optimization techniques.)
- Julia Wiesinger, Patrick Marlow and Vladimir Vuskovic, Sep 2024, Agents, Google Whitepaper, https://www.kaggle.com/whitepaper-agents
- Frank Landymore, Jan 25, 2025, OpenAI's Agent Has a Problem: Before It Does Anything Important, You Have to Double-Check It Hasn't Screwed Up: Not as hands-off as you might hope, https://futurism.com/openai-asks-permission-important
Multi-Agent Architectures
Research papers on multi-agent architectures:
- Anton Antich, May 17, 2024, Anatomy of an AI Multi-Agent: How do we build a useful AI agent? https://medium.com/superstringtheory/anatomy-of-an-ai-multi-agent-e2cfedc3b050
- Sandi Besen, Apr 24, 2024, The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey, Towards Data Science, https://towardsdatascience.com/the-landscape-of-emerging-ai-agent-architectures-for-reasoning-planning-and-tool-calling-a-a95214b743c1
- Junlin Wang, Jue Wang, Ben Athiwaratkun, Ce Zhang, James Zou, 7 Jun 2024, Mixture-of-Agents Enhances Large Language Model Capabilities, https://arxiv.org/abs/2406.04692
- Ignacio de Gregorio, June 2024, Mixture-of-Agents Beats ChatGPT-4o: Collaboration is Intelligence, https://medium.com/@ignacio.de.gregorio.noblejas/mixture-of-agents-beats-chatgpt-4o-6470a74f1525
- Honghua Dong, Qidong Su, Yubo Gao, Zhaoyu Li, Yangjun Ruan, Gennady Pekhimenko, Chris J. Maddison, Xujie Si, 19 Jun 2024, APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts, https://arxiv.org/abs/2406.13161 Code: https://github.com/appl-team/appl (A Python-like script language for prompt engineering integration into applications and agents.)
- Mariya Mansurova, Jun 17, 2024, Multi AI Agent Systems 101: Automating Routine Tasks in Data Source Management with CrewAI, https://towardsdatascience.com/multi-ai-agent-systems-101-bac58e3bcc47
- Assaf Elovic, May 10, 2024, How to Build the Ultimate AI Automation with Multi-Agent Collaboration, https://medium.com/@assafelovic/how-to-build-the-ultimate-ai-automation-with-multi-agent-collaboration-ed61a1ea8f3b
- Haolin Jin, Linghan Huang, Haipeng Cai, Jun Yan, Bo Li, Huaming Chen, 5 Aug 2024, From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future, https://arxiv.org/abs/2408.02479
- Lakshmi narayana .U, Jul 28, 2024, STORM: Stanford’s Revolutionary Research Tool Harnessing the Power of Agents and Agentic Workflows, https://blog.stackademic.com/storm-stanfords-revolutionary-research-tool-harnessing-the-power-of-agents-and-agentic-workflows-a2fa0e1a7fe3
- Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi, 9 Aug 2024, AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems, https://arxiv.org/abs/2408.15247
- Vipin Nair, Aug 10, 2024, A Simple Guide to Collaborative AI Agents with LangGraph, https://medium.com/aitech/a-simple-guide-to-collaborative-ai-agents-with-langgraph-d6b89e13560f
- Junwei Liu, Kaixin Wang, Yixuan Chen, Xin Peng, Zhenpeng Chen, Lingming Zhang, Yiling Lou, 4 Sep 2024, Large Language Model-Based Agents for Software Engineering: A Survey, https://arxiv.org/abs/2409.02977 Project: https://github.com/FudanSELab/Agent4SE-Paper-List
- Federico Berto, Chuanbo Hua, Laurin Luttmann, Jiwoo Son, Junyoung Park, Kyuree Ahn, Changhyun Kwon, Lin Xie, Jinkyoo Park, 5 Sep 2024, PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization, https://arxiv.org/abs/2409.03811 https://github.com/ai4co/parco
- Ilan Bigio, Oct 10, 2024, Orchestrating Agents: Routines and Handoffs, https://cookbook.openai.com/examples/orchestrating_agents
- Guibin Zhang, Yanwei Yue, Zhixun Li, Sukwon Yun, Guancheng Wan, Kun Wang, Dawei Cheng, Jeffrey Xu Yu, Tianlong Chen, 3 Oct 2024, Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems, https://arxiv.org/abs/2410.02506 (Address the inter-agent communication bottleneck in multi-agent systems.)
- Anita Kirkovska, David Vargas, Jul 11, 2024, Agentic Workflows in 2024: The ultimate guide, https://www.vellum.ai/blog/agentic-workflows-emerging-architectures-and-design-patterns
- A. Singh, A. Ehtesham, S. Kumar and T. T. Khoei, "Enhancing AI Systems with Agentic Workflows Patterns in Large Language Model," 2024 IEEE World AI IoT Congress (AIIoT), Seattle, WA, USA, 2024, pp. 527-532, doi: 10.1109/AIIoT61789.2024.10578990. https://ieeexplore.ieee.org/abstract/document/10578990
- Chawla, Chhavi; Chatterjee, Siddharth; Gadadinni, Sanketh Siddanna; Verma, Pulkit; Banerjee, Sourav, 2024, Agentic AI: The building blocks of sophisticated AI business applications, Journal of AI, Robotics & Workplace Automation, Volume 3 / Number 3 / Summer 2024, pp. 1-15(15), Henry Stewart Publications, DOI: https://doi.org/10.69554/XEHZ1946 https://www.ingentaconnect.com/content/hsp/airwa/2024/00000003/00000003/art00001
- Dawei Gao, Zitao Li, Xuchen Pan, Weirui Kuang, Zhijian Ma, Bingchen Qian, Fei Wei, Wenhao Zhang, Yuexiang Xie, Daoyuan Chen, Liuyi Yao, Hongyi Peng, Zeyu Zhang, Lin Zhu, Chen Cheng, Hongzhu Shi, Yaliang Li, Bolin Ding, Jingren Zhou, 20 May 2024 (v2), AgentScope: A Flexible yet Robust Multi-Agent Platform, https://arxiv.org/abs/2402.14034 https://github.com/modelscope/agentscope
- Haoyang Su, Renqi Chen, Shixiang Tang, Xinzhe Zheng, Jingzhe Li, Zhenfei Yin, Wanli Ouyang, Nanqing Dong, 12 Oct 2024, Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation, https://arxiv.org/abs/2410.09403
- Jiarui Ji, Runlin Lei, Jialing Bi, Zhewei Wei, Yankai Lin, Xuchen Pan, Yaliang Li, Bolin Ding, 13 Oct 2024, Dynamic and Textual Graph Generation Via Large-Scale LLM-based Agent Simulation, https://arxiv.org/abs/2410.09824
- Yuwei Hu, Runlin Lei, Xinyi Huang, Zhewei Wei, Yongchao Liu, 7 Oct 2024, Scalable and Accurate Graph Reasoning with LLM-based Multi-Agents, https://arxiv.org/abs/2410.05130
- Xuchen Pan, Dawei Gao, Yuexiang Xie, Zhewei Wei, Yaliang Li, Bolin Ding, Ji-Rong Wen, Jingren Zhou, 25 Jul 2024, Very Large-Scale Multi-Agent Simulation in AgentScope, https://arxiv.org/abs/2407.17789
- Qian Wang, Tianyu Wang, Qinbin Li, Jingsheng Liang, Bingsheng He, 20 Aug 2024 (v2), MegaAgent: A Practical Framework for Autonomous Cooperation in Large-Scale LLM Agent Systems, https://arxiv.org/abs/2408.09955 https://anonymous.4open.science/r/MegaAgent-81F3
- Mohammadreza Doostmohammadian, Sérgio Pequito, 27 Oct 2024, Logarithmically Quantized Distributed Optimization over Dynamic Multi-Agent Networks. https://arxiv.org/abs/2410.20345
- Jonas Becker, 30 Oct 2024, Multi-Agent Large Language Models for Conversational Task-Solving, https://arxiv.org/abs/2410.22932
- Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen, 1 Nov 2024, Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models, https://arxiv.org/abs/2411.00492
- Rogerio Bonatti, Dan Zhao, Francesco Bonacci, Dillon Dupont, Sara Abdali, Yinheng Li, Yadong Lu, Justin Wagle, Kazuhito Koishida, Arthur Bucker, Lawrence Jang, Zack Hui, 13 Sep 2024 (v2), Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale, https://arxiv.org/abs/2409.08264
- Biao Wu, Yanda Li, Meng Fang, Zirui Song, Zhiwei Zhang, Yunchao Wei, Ling Chen, 4 Nov 2024, Foundations and Recent Trends in Multimodal Mobile Agents: A Survey, https://arxiv.org/abs/2411.02006 https://github.com/aialt/awesome-mobile-agents
- Eric Broda, Nov 2024, Agentic Mesh: The Future of Generative AI-Enabled Autonomous Agent Ecosystems https://towardsdatascience.com/agentic-mesh-the-future-of-generative-ai-enabled-autonomous-agent-ecosystems-d6a11381c979
- Mohammed Lubbad, Oct 11, 2024, Top 4 Agentic AI Architecture Design Patterns, https://mlubbad.medium.com/top-4-agentic-ai-architecture-design-patterns-2ad890a543e8
- Zhiqiang Xie, Hao Kang, Ying Sheng, Tushar Krishna, Kayvon Fatahalian, Christos Kozyrakis, 5 Nov 2024, AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution https://arxiv.org/abs/2411.03519 (Scheduling multiple agents.)
- Shubham Gandhi, Manasi Patwardhan, Lovekesh Vig, Gautam Shroff, 12 Nov 2024, BudgetMLAgent: A Cost-Effective LLM Multi-Agent system for Automating Machine Learning Tasks, https://arxiv.org/abs/2411.07464
- Jared Spataro, November 19, 2024, Introducing Copilot Actions, new agents, and tools to empower IT teams, https://www.microsoft.com/en-us/microsoft-365/blog/2024/11/19/introducing-copilot-actions-new-agents-and-tools-to-empower-it-teams/ ("Copilot is the UI for AI")
- Yingxuan Yang, Qiuying Peng, Jun Wang, Weinan Zhang, 21 Nov 2024, Multi-LLM-Agent Systems: Techniques and Business Perspectives, https://arxiv.org/abs/2411.14033
- Ziyang Huang, Jun Zhao, Kang Liu, 1 Dec 2024, Towards Adaptive Mechanism Activation in Language Agent, https://arxiv.org/abs/2412.00722
- Agnostiq, Dec 2024, multi-agent-llm: LLM based Multi-Agent methods: Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT), https://github.com/AgnostiqHQ/multi-agent-llm
- Wenchao Xu, Jinyu Chen, Peirong Zheng, Xiaoquan Yi, Tianyi Tian, Wenhui Zhu, Quan Wan, Haozhao Wang, Yunfeng Fan, Qinliang Su, Xuemin Shen, https://arxiv.org/abs/2412.13437 18 Dec 2024, Deploying Foundation Model Powered Agent Services: A Survey, (A survey of not just deployment, but many inference optimization techniques.)
- Siddharth Narayanan, James D. Braza, Ryan-Rhys Griffiths, Manu Ponnapati, Albert Bou, Jon Laurent, Ori Kabeli, Geemi Wellawatte, Sam Cox, Samuel G. Rodriques, Andrew D. White, 30 Dec 2024. Aviary: training language agents on challenging scientific tasks, https://arxiv.org/abs/2412.21154
- Mayi Xu, Yunfeng Ning, Yongqi Li, Jianhao Chen, Jintao Wen, Yao Xiao, Shen Zhou, Birong Pan, Zepeng Bao, Xin Miao, Hankun Kang, Ke Sun, Tieyun Qian, 2 Jan 2025, Reasoning based on symbolic and parametric knowledge bases: a survey, https://arxiv.org/abs/2501.01030 (Extensive survey of reasoning from CoT to knowledge graphs to table-based reasoning.)
- Chirag Shah, Ryen W. White, 19 Dec 2024, Agents Are Not Enough, https://www.arxiv.org/abs/2412.16241
- Austin Starks, Jan 2025, You are an absolute moron for believing in the hype of “AI Agents”. https://medium.com/@austin-starks/you-are-an-absolute-moron-for-believing-in-the-hype-of-ai-agents-c0f760e7e48e
- Manish Sanwal, 3 Feb 2025 (v2), Layered Chain-of-Thought Prompting for Multi-Agent LLM Systems: A Comprehensive Approach to Explainable Large Language Models, https://arxiv.org/abs/2501.18645
Agentic Workflow
- Arun Shankar, Oct 2024, Designing Cognitive Architectures: Agentic Workflow Patterns from Scratch, https://medium.com/google-cloud/designing-cognitive-architectures-agentic-workflow-patterns-from-scratch-63baa74c54bc
- AI Agent Workflows: A Complete Guide on Whether to Build With LangGraph or LangChain, Sandi Besen, Oct 2024, https://towardsdatascience.com/ai-agent-workflows-a-complete-guide-on-whether-to-build-with-langgraph-or-langchain-117025509fa0
- Anita Kirkovska, David Vargas, Jul 11, 2024, Agentic Workflows in 2024: The ultimate guide, https://www.vellum.ai/blog/agentic-workflows-emerging-architectures-and-design-patterns
- Shuofei Qiao, Runnan Fang, Zhisong Qiu, Xiaobin Wang, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen, 10 Oct 2024, Benchmarking Agentic Workflow Generation, https://arxiv.org/abs/2410.07869
- A. Singh, A. Ehtesham, S. Kumar and T. T. Khoei, "Enhancing AI Systems with Agentic Workflows Patterns in Large Language Model," 2024 IEEE World AI IoT Congress (AIIoT), Seattle, WA, USA, 2024, pp. 527-532, doi: 10.1109/AIIoT61789.2024.10578990. https://ieeexplore.ieee.org/abstract/document/10578990
- Chawla, Chhavi; Chatterjee, Siddharth; Gadadinni, Sanketh Siddanna; Verma, Pulkit; Banerjee, Sourav, 2024, Agentic AI: The building blocks of sophisticated AI business applications, Journal of AI, Robotics & Workplace Automation, Volume 3 / Number 3 / Summer 2024, pp. 1-15(15), Henry Stewart Publications, DOI: https://doi.org/10.69554/XEHZ1946 https://www.ingentaconnect.com/content/hsp/airwa/2024/00000003/00000003/art00001
- Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu, Fengwei Teng, Xionghui Chen, Jiaqi Chen, Mingchen Zhuge, Xin Cheng, Sirui Hong, Jinlin Wang, Bingnan Zheng, Bang Liu, Yuyu Luo, Chenglin Wu, 14 Oct 2024, AFlow: Automating Agentic Workflow Generation, https://arxiv.org/abs/2410.10762 https://github.com/geekan/MetaGPT
- Ruixuan Xiao, Wentao Ma, Ke Wang, Yuchuan Wu, Junbo Zhao, Haobo Wang, Fei Huang, Yongbin Li, 21 Jun 2024, FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents, https://arxiv.org/abs/2406.14884
- Dawei Gao, Zitao Li, Xuchen Pan, Weirui Kuang, Zhijian Ma, Bingchen Qian, Fei Wei, Wenhao Zhang, Yuexiang Xie, Daoyuan Chen, Liuyi Yao, Hongyi Peng, Zeyu Zhang, Lin Zhu, Chen Cheng, Hongzhu Shi, Yaliang Li, Bolin Ding, Jingren Zhou, 20 May 2024 (v2), AgentScope: A Flexible yet Robust Multi-Agent Platform, https://arxiv.org/abs/2402.14034 https://github.com/modelscope/agentscope
- Omer Mahmood, Dec 25, 2024, Getting Started With Agentic Workflows: Moving beyond AI tools to automating high-value processes! https://pub.towardsai.net/getting-started-with-agentic-workflows-9703ac6ded62
- Chirag Shah, Ryen W. White, 19 Dec 2024, Agents Are Not Enough, https://www.arxiv.org/abs/2412.16241
- Fengli Xu, Qianyue Hao, Zefang Zong, Jingwei Wang, Yunke Zhang, Jingyi Wang, Xiaochong Lan, Jiahui Gong, Tianjian Ouyang, Fanjin Meng, Chenyang Shao, Yuwei Yan, Qinglong Yang, Yiwen Song, Sijian Ren, Xinyuan Hu, Yu Li, Jie Feng, Chen Gao, Yong Li, 17 Jan 2025 (v2), Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities, https://arxiv.org/abs/2501.09686
Agentic Architectures
- Arun Shankar, Oct 2024, Designing Cognitive Architectures: Agentic Workflow Patterns from Scratch, https://medium.com/google-cloud/designing-cognitive-architectures-agentic-workflow-patterns-from-scratch-63baa74c54bc
- Anita Kirkovska, David Vargas, Jul 11, 2024, Agentic Workflows in 2024: The ultimate guide, https://www.vellum.ai/blog/agentic-workflows-emerging-architectures-and-design-patterns
- Shuofei Qiao, Runnan Fang, Zhisong Qiu, Xiaobin Wang, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen, 10 Oct 2024, Benchmarking Agentic Workflow Generation, https://arxiv.org/abs/2410.07869
- A. Singh, A. Ehtesham, S. Kumar and T. T. Khoei, "Enhancing AI Systems with Agentic Workflows Patterns in Large Language Model," 2024 IEEE World AI IoT Congress (AIIoT), Seattle, WA, USA, 2024, pp. 527-532, doi: 10.1109/AIIoT61789.2024.10578990. https://ieeexplore.ieee.org/abstract/document/10578990
- Chawla, Chhavi; Chatterjee, Siddharth; Gadadinni, Sanketh Siddanna; Verma, Pulkit; Banerjee, Sourav, 2024, Agentic AI: The building blocks of sophisticated AI business applications, Journal of AI, Robotics & Workplace Automation, Volume 3 / Number 3 / Summer 2024, pp. 1-15(15), Henry Stewart Publications, DOI: https://doi.org/10.69554/XEHZ1946 https://www.ingentaconnect.com/content/hsp/airwa/2024/00000003/00000003/art00001
- Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu, Fengwei Teng, Xionghui Chen, Jiaqi Chen, Mingchen Zhuge, Xin Cheng, Sirui Hong, Jinlin Wang, Bingnan Zheng, Bang Liu, Yuyu Luo, Chenglin Wu, 14 Oct 2024, AFlow: Automating Agentic Workflow Generation, https://arxiv.org/abs/2410.10762 https://github.com/geekan/MetaGPT
- Ruixuan Xiao, Wentao Ma, Ke Wang, Yuchuan Wu, Junbo Zhao, Haobo Wang, Fei Huang, Yongbin Li, 21 Jun 2024, FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents, https://arxiv.org/abs/2406.14884
- Dawei Gao, Zitao Li, Xuchen Pan, Weirui Kuang, Zhijian Ma, Bingchen Qian, Fei Wei, Wenhao Zhang, Yuexiang Xie, Daoyuan Chen, Liuyi Yao, Hongyi Peng, Zeyu Zhang, Lin Zhu, Chen Cheng, Hongzhu Shi, Yaliang Li, Bolin Ding, Jingren Zhou, 20 May 2024 (v2), AgentScope: A Flexible yet Robust Multi-Agent Platform, https://arxiv.org/abs/2402.14034 https://github.com/modelscope/agentscope
- Bryson Masse, October 31, 2024, Microsoft’s agentic AI tool OmniParser rockets up the open source charts, https://venturebeat.com/ai/microsofts-agentic-ai-tool-omniparser-rockets-up-the-open-source-charts/
- Eric Broda, Nov 2024, Agentic Mesh: The Future of Generative AI-Enabled Autonomous Agent Ecosystems https://towardsdatascience.com/agentic-mesh-the-future-of-generative-ai-enabled-autonomous-agent-ecosystems-d6a11381c979
- Mohammed Lubbad, Oct 11, 2024, Top 4 Agentic AI Architecture Design Patterns, https://mlubbad.medium.com/top-4-agentic-ai-architecture-design-patterns-2ad890a543e8
- Shirin Ghaffary and Rachel Metz November 14, 2024, OpenAI Nears Launch of AI Agent Tool to Automate Tasks for Users. The new software, codenamed “Operator,” is set to be released in January. https://www.bloomberg.com/news/articles/2024-11-13/openai-nears-launch-of-ai-agents-to-automate-tasks-for-users
- Shubham Sharma. November 12, 2024, How agentic RAG can be a game-changer for data processing and retrieval, https://venturebeat.com/ai/how-agentic-rag-can-be-a-game-changer-for-data-processing-and-retrieval/
- Mike Elgan, 22 Nov 2024, AI agents are unlike any technology ever, https://www.computerworld.com/article/3608973/ai-agents-are-unlike-any-technology-ever.html
- Chris Sypherd, Vaishak Belle, 5 Dec 2024, Practical Considerations for Agentic LLM Systems, https://arxiv.org/abs/2412.04093
- Benj Edwards, 12 Dec 2024, Google goes “agentic” with Gemini 2.0’s ambitious AI agent features. Google barrels ahead with a push into AI systems that take action for you. https://arstechnica.com/information-technology/2024/12/google-goes-agentic-with-gemini-2-0s-ambitious-ai-agent-features/
- Lu Wang, Fangkai Yang, Chaoyun Zhang, Junting Lu, Jiaxu Qian, Shilin He, Pu Zhao, Bo Qiao, Ray Huang, Si Qin, Qisheng Su, Jiayi Ye, Yudi Zhang, Jian-Guang Lou, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 13 Dec 2024, Large Action Models: From Inception to Implementation, https://arxiv.org/abs/2412.10047 https://github.com/microsoft/UFO/tree/main/dataflow https://microsoft.github.io/UFO/dataflow/overview/
- Julia Wiesinger, Patrick Marlow and Vladimir Vuskovic, Sep 2024, Agents, Google Whitepaper, https://www.kaggle.com/whitepaper-agents
Agent Frameworks and Platforms
Agent frameworks, or agent platforms, are overall LLM infrastructure to create and run LLM agents. Various commercial and open-source agent platforms have gained prominence, and there is much more to come in this space.
- Giancarlo Mori, Aug 14, 2024, Top 5 AI Agent Platforms You Should Know, https://gcmori.medium.com/top-5-ai-agent-platforms-you-should-know-418e7e7e24a9
- Aparna Dhinakaran, Sep 2024, Choosing Between LLM Agent Frameworks. The tradeoffs between building bespoke code-based agents and the major agent frameworks. https://towardsdatascience.com/choosing-between-llm-agent-frameworks-69019493b259
- Rogerio Bonatti, Dan Zhao, Francesco Bonacci, Dillon Dupont, Sara Abdali, Yinheng Li, Yadong Lu, Justin Wagle, Kazuhito Koishida, Arthur Bucker, Lawrence Jang, Zack Hui, 13 Sep 2024 (v2), Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale, https://arxiv.org/abs/2409.08264
- Letta, November 14, 2024, The AI agents stack, https://www.letta.com/blog/ai-agents-stack
- Jared Spataro, November 19, 2024, Introducing Copilot Actions, new agents, and tools to empower IT teams, https://www.microsoft.com/en-us/microsoft-365/blog/2024/11/19/introducing-copilot-actions-new-agents-and-tools-to-empower-it-teams/ ("Copilot is the UI for AI")
- Emilia David, November 19, 2024, Orchestrator agents: Integration, human interaction, and enterprise knowledge at the core, https://venturebeat.com/ai/orchestrator-agents-integration-human-interaction-and-enterprise-knowledge-at-the-core/
- Carl Franzen, November 21, 2024, Google Cloud launches AI Agent Space amid rising competition, https://venturebeat.com/ai/google-cloud-launches-ai-agent-space-amid-rising-competition/
- Sahar Mor, Nov 28, 2024, The Open-Source Toolkit for Building AI Agents. Curated frameworks, tools, and libraries every developer needs to build functional and efficient AI agents, https://www.aitidbits.ai/p/open-source-agents
- M K Pavan Kumar, Dec 2024, Navigating the AI Agent Landscape: In-Depth Analysis of Autogen, CrewAI, LlamaIndex, and LangChain Frameworks, https://medium.com/@manthapavankumar11/navigating-the-ai-agent-landscape-in-depth-analysis-of-autogen-crewai-llamaindex-and-langchain-2a3bcd932abc
- Composio, Dec 2024, Introduction: Overview. https://docs.composio.dev/introduction/intro/overview
- Amos Gyamfi, Nov 26, 2024, Best 5 Frameworks To Build Multi-Agent AI Applications, https://medium.com/@amosgyamfi/best-5-frameworks-to-build-multi-agent-ai-applications-1f88530ef8d8
- Isaac Sacolick, Jul 29, 2024, How to choose the right low-code, no-code, or process automation platform, https://www.infoworld.com/article/3476848/how-to-choose-the-right-low-code-no-code-or-process-automation-platform.html
- Ziyang Huang, Jun Zhao, Kang Liu, 1 Dec 2024, Towards Adaptive Mechanism Activation in Language Agent, https://arxiv.org/abs/2412.00722
- Wenchao Xu, Jinyu Chen, Peirong Zheng, Xiaoquan Yi, Tianyi Tian, Wenhui Zhu, Quan Wan, Haozhao Wang, Yunfeng Fan, Qinliang Su, Xuemin Shen, https://arxiv.org/abs/2412.13437 18 Dec 2024, Deploying Foundation Model Powered Agent Services: A Survey, (A survey of not just deployment, but many inference optimization techniques.)
- Shuaihang Chen, Yuanxing Liu, Wei Han, Weinan Zhang, Ting Liu, 23 Dec 2024. A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers. https://arxiv.org/abs/2412.17481
- Siddharth Narayanan, James D. Braza, Ryan-Rhys Griffiths, Manu Ponnapati, Albert Bou, Jon Laurent, Ori Kabeli, Geemi Wellawatte, Sam Cox, Samuel G. Rodriques, Andrew D. White, 30 Dec 2024. Aviary: training language agents on challenging scientific tasks, https://arxiv.org/abs/2412.21154
- Chirag Shah, Ryen W. White, 19 Dec 2024, Agents Are Not Enough, https://www.arxiv.org/abs/2412.16241
- Ori Ziv, Jan 2025, How AI Agents Will Disrupt SaaS in 2025, https://medium.com/@oriziv4/how-ai-agents-will-disrupt-saas-in-2025-7567d793ca68
- Meta, Jan 2025 (accessed), Llama Stack: Composable building blocks to build Llama Apps, https://github.com/meta-llama/llama-stack
- Elad Levi, Ilan Kadar, 19 Jan 2025, IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems, https://arxiv.org/abs/2501.11067 https://github.com/plurai-ai/intellagent
- Carl Franzen, January 28, 2025, Jack Dorsey is back with Goose, a new, ultra-simple open-source AI agent-building platform from his startup Block, https://venturebeat.com/programming-development/jack-dorsey-is-back-with-goose-a-new-ultra-simple-open-source-ai-agent-building-platform-from-his-startup-block/ (Open-source agent building framework seemingly initially focused on software developer agents.)
More AI Research
Read more about:
- Advanced AI Mathematics
- Zero-Multiplication Models
- Matrix Algebra
- Logarithmic Models
- Inference Optimizations
- « Research Home