Aussie AI

Applications of Generative AI

Last Updated 26 March, 2025

by David Spuler, Ph.D.

Apps Built on AI

Timo Lehto, June 2024, Developing LLM-powered Applications Using Modern Frameworks, Bachelor’s Thesis, Information and Communications Technology, Jamk University of Applied Sciences, Finland, June 2024, 53 pages., https://www.theseus.fi/bitstream/handle/10024/862271/Lehto_Timo.pdf?sequence=2 (Building LLM-based applications in RAG architecture using LangChain.)
Evelyn Cheng Apr 17, 2024 Baidu releases new AI tools to promote application development, https://www.cnbc.com/2024/04/18/baidu-releases-new-ai-tools-to-promote-application-development.html
David Cahn, Sep 20, 2023, AI’s $200B Question: GPU capacity is getting overbuilt. Long-term, this is good. Short-term, things could get messy, https://www.sequoiacap.com/article/follow-the-gpus-perspective/
Kirill Kolodiazhnyi, May 15, 2020, Hands-On Machine Learning with C++: Build, train, and deploy end-to-end machine learning and deep learning pipelines, https://www.amazon.com/Hands-Machine-Learning-end-end/dp/1789955335/
Prakash Joshi Pax, Aug 26, 2024, Fabric: The Best AI Tool That Nobody is Talking About. An open-source AI tool to automate every day tasks https://beingpax.medium.com/why-fabric-ai-can-change-the-way-you-use-ai-973e725354da
Andrew Ng, Sep 2024, X post, https://x.com/AndrewYNg/status/1829190549842321758 (Dropping token prices for LLMs means developers can focus on the app layer.)
Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
Sonya Huang, Pat Grady, and o1, Sequoia, October 9, 2024 Generative AI’s Act o1, https://www.sequoiacap.com/article/generative-ais-act-o1/
Apple, December 16, 2024, Apple reveals 2024’s most downloaded apps and games on the App Store, https://www.apple.com/newsroom/2024/12/apple-reveals-2024s-most-downloaded-apps-and-games-on-the-app-store/
Sarah Perez, December 16, 2024, Temu is the most downloaded app on the US App Store in 2024, https://techcrunch.com/2024/12/16/temu-is-the-most-downloaded-app-on-the-u-s-app-store-in-2024/
Jess Weatherbed, Dec 10, 2024, AI is booming on the App Store, and developers are taking advantage of it. Many high-ranking AI apps feel like an attempted cash grab, and it’s not easy to spot the trash from the treasure. https://www.theverge.com/2024/12/9/24314972/apple-app-store-ai-apps-art-design-photography

Building Applications for Generative AI

Research on building Gen AI apps:

Metin Karatas, June 25, 2024, Developing AI Applications: An Introduction (New Edition), Rheinwerk Computing; New edition, https://www.amazon.com/Developing-AI-Applications-Metin-Karatas/dp/1493226010/
Mistral AI Team, Aug 7, 2024, Build, tweak, repeat: Making it easier to develop and share generative AI applications, https://mistral.ai/news/build-tweak-repeat/
Yorick Sens, Henriette Knopp, Sven Peldszus, Thorsten Berger, 12 Aug 2024, A Large-Scale Study of Model Integration in ML-Enabled Software Systems, https://arxiv.org/abs/2408.06226
Google, 2024, L’Oréal: Launching Gen AI as a Service in 3 months with Cloud Run and LangChain, https://services.google.com/fh/files/misc/google_loreal_with_langchain_case_study.pdf
Raymond Lo, Jul 10, 2024, How to Build Faster GenAI Apps with Fewer Lines of Code using OpenVINO™ GenAI API, https://medium.com/openvino-toolkit/how-to-build-faster-genai-apps-with-fewer-lines-of-code-using-openvino-genai-api-5dd5fcabea17
Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi, 9 Aug 2024, AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems, https://arxiv.org/abs/2408.15247
Abhimanyu Bambhaniya, Ritik Raj, Geonhwa Jeong, Souvik Kundu, Sudarshan Srinivasan, Midhilesh Elavazhagan, Madhu Kumar, Tushar Krishna, 3 Jun 2024, Demystifying Platform Requirements for Diverse LLM Inference Use Cases, https://arxiv.org/abs/2406.01698 Code: https://github.com/abhibambhaniya/GenZ-LLM-Analyzer (Analysis of cost of serving LLMs, including separate profiles of prefill versus decoding phases, and the cost of extra prompt processing in RAG architectures with prepended information.)
Timo Lehto, June 2024, Developing LLM-powered Applications Using Modern Frameworks, Bachelor’s Thesis, Information and Communications Technology, Jamk University of Applied Sciences, Finland, June 2024, 53 pages., https://www.theseus.fi/bitstream/handle/10024/862271/Lehto_Timo.pdf?sequence=2 (Building LLM-based applications in RAG architecture using LangChain.)
Fareed Khan, March 2024, BasicLINGUA: LLM Based NLP Library, https://github.com/FareedKhan-dev/basiclingua-LLM-Based-NLP
Eugene Yan, Bryan Bischof, Charles Frye, Hamel Husain, Jason Liu and Shreya Shankar, May 28, 2024, What We Learned from a Year of Building with LLMs (Part I), https://www.oreilly.com/radar/what-we-learned-from-a-year-of-building-with-llms-part-i/
Dell Technologies, May 20, 2024, Dell Technologies Expands Dell AI Factory with NVIDIA to Turbocharge AI Adoption, PR Newswire, https://www.prnewswire.com/news-releases/dell-technologies-expands-dell-ai-factory-with-nvidia-to-turbocharge-ai-adoption-302150245.html
JH Jones, May 2024, A Quantitative Comparison of Pre-Trained Model Registries to Traditional Software Package Registries, Masters Thesis, Electrical and Computer Engineering, Purdue University, https://hammer.purdue.edu/articles/thesis/A_Quantitative_Comparison_of_Pre-Trained_Model_Registries_to_Traditional_Software_Package_Registries/25686447/1 PDF: https://hammer.purdue.edu/ndownloader/files/46096152
Evelyn Cheng Apr 17, 2024 Baidu releases new AI tools to promote application development, https://www.cnbc.com/2024/04/18/baidu-releases-new-ai-tools-to-promote-application-development.html
Priyank Rathod, May 21, 2024, Efficient Usage of RAG Systems in the World of LLMs, https://www.techrxiv.org/doi/full/10.36227/techrxiv.171625877.73379410/v1
Kirill Kolodiazhnyi, May 15, 2020, Hands-On Machine Learning with C++: Build, train, and deploy end-to-end machine learning and deep learning pipelines, https://www.amazon.com/Hands-Machine-Learning-end-end/dp/1789955335/
Mozilla, June 3, 2024, Announcing Mozilla Builders: 2024 Accelerator Theme: Local AI, https://future.mozilla.org/builders/blog/announcing-mozilla-builders/
June 2024 (accessed), R2R: The ultimate open-source RAG framework, https://github.com/SciPhi-AI/R2R
Hesam Sheikh, Jun 1, 2024, Towards AI Build Blog Writer and Researcher AI Agents with Ollama (100% local): Creating AI agents with Crewai and using Ollama to run them 100% locally in 5 very easy steps!, https://pub.towardsai.net/build-your-first-ai-agent-in-5-easy-steps-100-local-2fb771438a8f
Simeon Emanuilov, Apr 4, 2024 LLM agent operating system (AIOS) and the future of LLM-powered agents, https://medium.com/@simeon.emanuilov/llm-agent-operating-system-aios-and-the-future-of-llm-powered-agents-3d08b4e91c34 https://unfoldai.com/aios-llm-powered-agents/
Grant Gross, 13 Jun 2024, IT leaders go small for purpose-built AI, https://www.cio.com/article/2139985/it-leaders-go-small-for-purpose-built-ai.html
Will Larson, April 8, 2024, Notes on how to use LLMs in your product. https://lethain.com/mental-model-for-how-to-use-llms-in-products/
Matt Murphy, Tim Tully, Grace Ge, Derek Xiao, Katie Keller, January 18, 2024, The Modern AI Stack: Design Principles for the Future of Enterprise AI Architectures, https://menlovc.com/perspective/the-modern-ai-stack-design-principles-for-the-future-of-enterprise-ai-architectures/?tpcc=NL_Marketing
NLUX: The 𝗣𝗼𝘄𝗲𝗿𝗳𝘂𝗹 Conversational AI JavaScript Library, https://github.com/nlkitai/nlux
Jesse Clayton, Kedar Potdar and Annamalai Chockalingam, Jun 02, 2024, Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs, NVIDIA Technical Blog, https://developer.nvidia.com/blog/streamline-ai-powered-app-development-with-nvidia-rtx-ai-toolkit-for-windows-rtx-pcs/
John Borthwick, May 28, 2024, Announcing AI Camp: Native Applications, https://render.betaworks.com/announcing-ai-camp-native-applications-e1358061c601
Julian Yip, Apr 2, 2024, Build Autonomous AI Agents with Function Calling: Transform your chatbot into an agent that can interact with external APIs, https://towardsdatascience.com/build-autonomous-ai-agents-with-function-calling-0bb483753975 (Implement agents via models that output a JSON object that describes the API to call and the parmaeters to send.)
Benedict Evans, 2024, Building AI products, https://www.ben-evans.com/benedictevans/2024/6/8/building-ai-products
David Spuler, March 2024, Generative AI in C++: Coding Transformers and LLMs, https://www.amazon.com/dp/B0CXJKCWX9
Ben Auffarth, Dec 22, 2023 Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT and other LLMs,https://www.amazon.com/Generative-AI-LangChain-language-ChatGPT/dp/1835083463/
Olivier Caelen and Marie-Alice Blete, Oct 3, 2023 Developing Apps with GPT-4 and ChatGPT: Build Intelligent Chatbots, Content Generators, and More, https://www.amazon.com/Developing-Apps-GPT-4-ChatGPT-Intelligent/dp/1098152484/
Douglas C. Youvan , June 15, 2024, Developing and Deploying AI Applications on NVIDIA Jetson Orin NX: A Comprehensive Guide, https://www.researchgate.net/profile/Douglas-Youvan/publication/381434888_Developing_and_Deploying_AI_Applications_on_NVIDIA_Jetson_Orin_NX_A_Comprehensive_Guide/links/666d7390de777205a32fceb6/Developing-and-Deploying-AI-Applications-on-NVIDIA-Jetson-Orin-NX-A-Comprehensive-Guide.pdf
Lak Lakshmanan, March 7, 2024, Building an AI Assistant with DSPy: A way to program and tune prompt-agnostic LLM agent pipelines, https://towardsdatascience.com/building-an-ai-assistant-with-dspy-2e1e749a1a95
Michael Lin, June 2024, How to Successfully Manage AI Software Projects: The 4 Phases of AI Projects I Shared at VixulCon https://medium.com/@_michaellin/how-to-successfully-manage-ai-software-projects-a8344b5b76a9
Fabian Both, June 2024, why we no longer use LangChain for building our AI agents , https://www.octomind.dev/blog/why-we-no-longer-use-langchain-for-building-our-ai-agents (Replaces LangChain with their own more-focused internal tool sets.)
Charles Lamanna, March 28, 2023, Companies innovate with low-code and fusion development, Microsoft, https://www.microsoft.com/en-us/industry/microsoft-in-business/business-transformation/2023/03/28/companies-innovate-with-low-code-and-fusion-development/ (States that 750 million new apps are required in the next two years, but there are only 4 million developers.)
McKinsey & Company, June 14, 2024, Scott Johnston on designing and building scalable platforms, https://www.mckinsey.com/industries/technology-media-and-telecommunications/our-insights/scott-johnston-on-designing-and-building-scalable-platforms (Docker CEO states that 750 million new apps are required.)
Valentina Alto, May 2024, Building LLM Powered Applications: Create intelligent apps and agents with large language models, Packt Publishing, https://www.amazon.com/Building-LLM-Apps-Intelligent-Language/dp/1835462316/
Irene Weber, 13 Jun 2024, Large Language Models as Software Components: A Taxonomy for LLM-Integrated Applications, https://arxiv.org/abs/2406.10300
Aarushi Kansal, Building Generative AI-Powered Apps: A Hands-on Guide for Developers, Apress, https://www.amazon.com/Building-Generative-AI-Powered-Apps-Hands-ebook/dp/B0CTXXP1S4/
Louis-François Bouchard, Louie Peters, May 2024, Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG, https://www.amazon.com/Building-LLMs-Production-Reliability-Fine-Tuning/dp/B0D4FFPFW8/
Kristian McCann July 15, 2024, AWS Unveils AI Service That Makes Enterprise Apps in Minutes, https://aimagazine.com/articles/aws-unveils-ai-service-that-builds-enterprise-apps-in-minute (Low-code enterprise AI app builder from AWS.)
Gene Rapoport, Sanjin Bicanic, Jue Wang, Richard Lichtenstein, Arjun Dutt, June 20, 2024, AI Survey: Four Themes Emerging: If 2023 was about experimentation, 2024 is all about results. Bain & Company, https://www.bain.com/insights/ai-survey-four-themes-emerging/ (Bain reports that use cases have been broadly successful in the use cases of sales, sales operations, software development, marketing, customer service, and customer onboarding, but less successful in HR, operations and legal. Interestingly, the main reason for AI project failures was that it couldn't perform the necessary task.)
Chip Huyen, Jul 25, 2024, Building A Generative AI Platform, https://huyenchip.com/2024/07/25/genai-platform.html
Juan Pablo Bottaro, April 25, 2024, Musings on building a Generative AI product, https://www.linkedin.com/blog/engineering/generative-ai/musings-on-building-a-generative-ai-product?_l=en_US
Writer, Aug 2024 (accessed), Writer AI Studio: The fastest way to build AI apps, https://writer.com/product/ai-studio/
OpenAI, Aug 2024 (accessed), .NET library, https://platform.openai.com/docs/libraries/dotnet-library https://github.com/openai/openai-dotnet
Travis Wilson, Jun 07 2024, Azure OpenAI Service expands .NET SDK support, https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-openai-service-expands-net-sdk-support/ba-p/4162940
Makhkamova, Ozoda, and Doohyun Kim. 2021. "A Conversation History-Based Q&A Cache Mechanism for Multi-Layered Chatbot Services" Applied Sciences 11, no. 21: 9981. https://doi.org/10.3390/app11219981 https://www.mdpi.com/2076-3417/11/21/9981 https://www.mdpi.com/2076-3417/11/21/9981/pdf
Yanxi Chen, Yaliang Li, Bolin Ding, Jingren Zhou, 20 Jul 2024, On the Design and Analysis of LLM-Based Algorithms, https://arxiv.org/abs/2407.14788 https://github.com/modelscope/agentscope/tree/main/examples/paper_llm_based_algorithm
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
Xinyi Hou, Yanjie Zhao, Haoyu Wang, 3 Aug 2024, Voices from the Frontier: A Comprehensive Analysis of the OpenAI Developer Forum, https://arxiv.org/abs/2408.01687
Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, Yong Liu, 12 Aug 2024 (v2), An Empirical Study on Challenges for LLM Developers, https://arxiv.org/abs/2408.05002
Chaojun Xiao, Zhengyan Zhang, Chenyang Song, Dazhi Jiang, Feng Yao, Xu Han, Xiaozhi Wang, Shuo Wang, Yufei Huang, Guanyu Lin, Yingfa Chen, Weilin Zhao, Yuge Tu, Zexuan Zhong, Ao Zhang, Chenglei Si, Khai Hao Moo, Chenyang Zhao, Huimin Chen, Yankai Lin, Zhiyuan Liu, Jingbo Shang, Maosong Sun, Sep 2024, Configurable Foundation Models: Building LLMs from a Modular Perspective, https://arxiv.org/pdf/2409.02877
Lior Solomon, Sep 2024, Gen AI testing strategies and tools, https://medium.com/ai-in-grc/gen-ai-testing-strategies-and-tools-257383e5cbfb
Carl Franzen, September 13, 2024, What OpenAI’s new o1-preview and o1-mini models mean for developers, https://venturebeat.com/programming-development/what-openais-new-o1-preview-and-o1-mini-models-mean-for-developers/
Ellie Ko, Sep 25, 2024, A Survey of Python Frameworks, https://ploomber.io/blog/survey-python-frameworks/
Timothy Mugayi, Sep 2024, LLM Practical Ideas to Build Your Next AI-Powered Application: Realistic Use Cases to Unleash the Power of AI in Your Next Project, https://levelup.gitconnected.com/llm-practical-ideas-to-build-your-next-ai-powered-application-9379feba6cbc
Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
Kif Leswing, Fri, Oct 4 2024, As Apple enters AI race, iPhone maker turns to its army of developers for an edge, https://www.cnbc.com/2024/10/04/apple-is-turning-to-its-army-of-developers-for-an-edge-in-the-ai-race.html
Nicola Sessions, Oct 15, 2024, DataStax Announces New AI Development Platform, Built with NVIDIA AI, https://developer.nvidia.com/blog/datastax-announces-new-ai-development-platform-built-with-nvidia-ai/
Anurag Guda and Shruthii Sathyanarayanan, Oct 16, 2024, Simplify AI Application Development with NVIDIA Cloud Native Stack, https://developer.nvidia.com/blog/simplify-ai-application-development-with-nvidia-cloud-native-stack/
Sid Chatterjee, Matt Silverlock, Celso Martinho, 2024-10-24, Build durable applications on Cloudflare Workers: you write the Workflows, we take care of the rest, https://blog.cloudflare.com/building-workflows-durable-execution-on-workers/
LangChain, Nov 7, 2024. SCIPE - Systematic Chain Improvement and Problem Evaluation, https://blog.langchain.dev/scipe-systematic-chain-improvement-and-problem-evaluation/ https://github.com/garg-ankush/scipe/tree/main
Lak Lakshmanan, Oct 4, 2024, How to Choose the Architecture for Your GenAI Application. A framework to select the simplest, fastest, cheapest architecture that will balance LLMs’ creativity and risk, https://towardsdatascience.com/how-to-choose-the-architecture-for-your-genai-application-6053e862c457
Siyun Zhao, Yuqing Yang, Zilong Wang, Zhiyuan He, Luna K. Qiu, Lili Qiu, 23 Sep 2024, Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely, https://arxiv.org/abs/2409.14924
Dhavalkumar Patel, Ganesh Raut, Satya Narayan Cheetirala, Girish N Nadkarni, Robert Freeman, Benjamin S. Glicksberg, Eyal Klang, Prem Timsina, 8 Dec 2024, Cloud Platforms for Developing Generative AI Solutions: A Scoping Review of Tools and Services, https://arxiv.org/abs/2412.06044
Isabel Hulseman and Ruchika Kharwar, Dec 11, 2024, Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint, https://developer.nvidia.com/blog/three-building-blocks-for-creating-ai-virtual-assistants-for-customer-service-with-an-nvidia-nim-agent-blueprint/
Jason Redmond, Jan 2025, Microsoft CEO Nadella forms new AI group to build and run apps for customers. Microsoft hired DeepMind co-founder Mustafa Suleyman to lead Copilot AI initiatives last year. https://www.nbcnews.com/business/business-news/microsoft-ceo-nadella-forms-new-ai-group-build-run-apps-customers-rcna187506
Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, and Yong Liu. 2025. An Empirical Study on Challenges for LLM Application Developers. ACM Trans. Softw. Eng. Methodol. Just Accepted (January 2025). https://doi.org/10.1145/3715007 https://dl.acm.org/doi/pdf/10.1145/3715007
Bharani Subramaniam, 13 February 2025, Emerging Patterns in Building GenAI Products, https://martinfowler.com/articles/gen-ai-patterns/

Inference Frameworks

Research papers include:

Yiheng Liu, Hao He, Tianle Han, Xu Zhang, Mengyuan Liu, Jiaming Tian, Yutong Zhang, Jiaqi Wang, Xiaohui Gao, Tianyang Zhong, Yi Pan, Shaochen Xu, Zihao Wu, Zhengliang Liu, Xin Zhang, Shu Zhang, Xintao Hu, Tuo Zhang, Ning Qiang, Tianming Liu, Bao Ge, Jan 2024, Understanding LLMs: A Comprehensive Overview from Training to Inference https://arxiv.org/abs/2401.02038
MLC team. 2023. MLC-LLM. https://github.com/mlc-ai/mlc-llm
tinygrad. 2023. Tinygrad. https://github.com/tinygrad/tinygrad
Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph E. Gonzalez, Hao Zhang, Ion Stoica, Oct 2023, Efficient Memory Management for Large Language Model Serving with PagedAttention, SOSP ’23, October 23–26, 2023, Koblenz, Germany, https://dl.acm.org/doi/pdf/10.1145/3600006.3613165 (The original Paged Attention and vLLM paper, focusing on optimizing memory size of the KV cache using methods similar to operating-system memory paging.)
Vince Lam, Mar 12, 2024, 50+ Open-Source Options for Running LLMs Locally, https://medium.com/thedeephub/50-open-source-options-for-running-llms-locally-db1ec6f5a54f
Jason Perlow, Aug. 6, 2024, How to run dozens of AI models on your Mac or PC - no third-party cloud needed, https://www.zdnet.com/article/how-to-run-dozens-of-ai-models-on-your-mac-or-pc-no-third-party-cloud-needed/
Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng, 6 Jun 2024 (v2), SGLang: Efficient Execution of Structured Language Model Programs, https://arxiv.org/abs/2312.07104 https://github.com/sgl-project/sglang
The SGLang Team, Jul 25, 2024, Achieving Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, vLLM), https://lmsys.org/blog/2024-07-25-sglang-llama3/
Anna Popovych, Sofiya Merenych, February 16, 2024, Top AI Frameworks in 2024: Comparison of Artificial Intelligence Frameworks, https://clockwise.software/blog/artificial-intelligence-framework/
Hugging Face, 2024, Text Generation Inference, https://huggingface.co/docs/text-generation-inference/index
ZML, Sep 2024, ZML: High performance AI inference stack. Built for productionl https://docs.zml.ai/ https://github.com/zml/zml?tab=readme-ov-file
Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Hongyi Jin, Tianqi Chen, Zhihao Jia, 23 Dec 2023, Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems, https://arxiv.org/abs/2312.15234
Ruihao Gong, Yifu Ding, Zining Wang, Chengtao Lv, Xingyu Zheng, Jinyang Du, Haotong Qin, Jinyang Guo, Michele Magno, Xianglong Liu, 25 Sep 2024, A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms, https://arxiv.org/abs/2409.16694
Sebastian Petrus, Sep 4, 2024, Top 10 RAG Frameworks Github Repos 2024, https://sebastian-petrus.medium.com/top-10-rag-frameworks-github-repos-2024-12b2a81f4a49
Rick Zhou, Larme Zhao, Bo Jiang, and Sean Sheng, June 5, 2024, Benchmarking LLM Inference Backends: vLLM, LMDeploy, MLC-LLM, TensorRT-LLM, and TGI, https://www.bentoml.com/blog/benchmarking-llm-inference-backends
Wenchao Xu, Jinyu Chen, Peirong Zheng, Xiaoquan Yi, Tianyi Tian, Wenhui Zhu, Quan Wan, Haozhao Wang, Yunfeng Fan, Qinliang Su, Xuemin Shen, https://arxiv.org/abs/2412.13437 18 Dec 2024, Deploying Foundation Model Powered Agent Services: A Survey, (A survey of not just deployment, but many inference optimization techniques.)
Meta, Jan 2025 (accessed), Llama Stack: Composable building blocks to build Llama Apps, https://github.com/meta-llama/llama-stack
Mozhgan Navardi, Romina Aalishah, Yuzhe Fu, Yueqian Lin, Hai Li, Yiran Chen, Tinoosh Mohsenin, 19 Feb 2025, GenAI at the Edge: Comprehensive Survey on Empowering Edge Devices, https://arxiv.org/abs/2502.15816
Amr Elmeleegy, Harry Kim, David Zier, Kyle Kranen, Neelay Shah, Ryan Olson and Omri Kahalon, Mar 18, 2025, Introducing NVIDIA Dynamo, A Low-Latency Distributed Inference Framework for Scaling Reasoning AI Models, https://developer.nvidia.com/blog/introducing-nvidia-dynamo-a-low-latency-distributed-inference-framework-for-scaling-reasoning-ai-models/

Orchestration Frameworks

Research papers include:

Konstantinos Papaioannou, Thaleia Dimitra Doudali, April 2024, The Importance of Workload Choice in Evaluating LLM Inference Systems, EuroMLSys '24: Proceedings of the 4th Workshop on Machine Learning and Systems, April 2024, Pages 39–46, https://doi.org/10.1145/3642970.3655823 https://dl.acm.org/doi/abs/10.1145/3642970.3655823
Jacob Robbins, January 4, 2024, Why generative AI orchestration startups are poised for growth in 2024, Pitch Book, https://pitchbook.com/news/articles/generative-ai-orchestration-startups-venture-capital-unicorns
Xin Tan, Yimin Jiang, Yitao Yang, Hong Xu, 29 Jun 2024, Teola: Towards End-to-End Optimization of LLM-based Applications, https://arxiv.org/abs/2407.00326
Chip Huyen, Jul 25, 2024, Building A Generative AI Platform, https://huyenchip.com/2024/07/25/genai-platform.html
Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng, 6 Jun 2024 (v2), SGLang: Efficient Execution of Structured Language Model Programs, https://arxiv.org/abs/2312.07104 https://github.com/sgl-project/sglang
The SGLang Team, Jul 25, 2024, Achieving Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, vLLM), https://lmsys.org/blog/2024-07-25-sglang-llama3/
An Efficient Network Orchestrator for Distributed Compound Language Model Systems Muhammad Shahir Abdurrahman, Stanford University, Stanford, California, USA, https://www.scs.stanford.edu/24sp-cs244b/projects/An_Efficient_Network_Orchestrator_for_Distributed_Compound_Language_Model_Systems.pdf
Melissa Malec, June 5, 2024, AI Orchestration Explained: The What, Why & How for 2024, https://hatchworks.com/blog/gen-ai/ai-orchestration/
Manish Kochar, May 19, 2024, Compounding GenAI Success: Why Orchestration is the Key to Mastering Generative AI, https://medium.com/@mkochar/compounding-genai-success-why-orchestration-is-the-key-to-mastering-generative-ai-543a2952acfe
Carl Franzen, August 23, 2024, Grok-2 gets a speed bump after developers rewrite code in three days, https://venturebeat.com/ai/grok-2-gets-a-speed-bump-after-developers-rewrite-code-in-three-days/ (Inference speed improvement by rewriting using the SGLang orchestration framework.)
Gary Grossman, September 8, 2024, AI orchestration: Crafting harmony or creating dependency? https://venturebeat.com/ai/ai-orchestration-crafting-harmony-or-creating-dependency/
A. R. Ali, K. Kumar, M. A. Siddiqui and M. Zahid, 2024, An Open-source Cross-Industry and Cloud-agnostic Generative AI Platform, 2024 International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan, 2024, pp. 1-10, doi: 10.1109/IJCNN60899.2024.10650688, https://ieeexplore.ieee.org/abstract/document/10650688
LiLMod, Aug 27, 2024, Haystack: the new LLM framework that is shaking its competitors, https://ai.plainenglish.io/haystack-the-new-llm-framework-that-is-shaking-its-competitors-1a083a153fd9
Yiyuan He, Minxian Xu, Jingfeng Wu, Wanyi Zheng, Kejiang Ye, Chengzhong Xu, 24 Sep 2024 (v2), UELLM: A Unified and Efficient Approach for LLM Inference Serving, https://arxiv.org/abs/2409.14961
Michael Nuñez, September 25, 2024, AI for all: Meta’s ‘Llama Stack’ promises to simplify enterprise adoption, https://venturebeat.com/ai/ai-for-all-meta-llama-stack-promises-to-simplify-enterprise-ai-adoption/
Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
Kabir Nagrecha, Oct 2024, Thesis, Orchestration Systems to Support Deep Learning at Scale Doctor of Philosophy, Computer Science, University of California San Diego, https://escholarship.org/content/qt3pp6k1p4/qt3pp6k1p4_noSplash_457f4c7c0435172a3d0a17428455894c.pdf (Pipeline and data parallelism systems.)
Emilia David, November 19, 2024, Orchestrator agents: Integration, human interaction, and enterprise knowledge at the core, https://venturebeat.com/ai/orchestrator-agents-integration-human-interaction-and-enterprise-knowledge-at-the-core/

LangChain

LangChain is an AI orchestration framework that allows "chaining" of multiple components in a sequence. Research papers on LangChain usage:

Timo Lehto, June 2024, Developing LLM-powered Applications Using Modern Frameworks, Bachelor’s Thesis, Information and Communications Technology, Jamk University of Applied Sciences, Finland, June 2024, 53 pages., https://www.theseus.fi/bitstream/handle/10024/862271/Lehto_Timo.pdf?sequence=2 (Building LLM-based applications in RAG architecture using LangChain.)
Ben Auffarth, Dec 22, 2023 Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT and other LLMs,https://www.amazon.com/Generative-AI-LangChain-language-ChatGPT/dp/1835083463/
Fabian Both, June 2024, why we no longer use LangChain for building our AI agents , https://www.octomind.dev/blog/why-we-no-longer-use-langchain-for-building-our-ai-agents (Replaces LangChain with their own more-focused internal tool sets.)
Louis-François Bouchard, Louie Peters, May 2024, Chapter 4: Prompting, and Chapter 6, Prompting with LangChain, Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG, https://www.amazon.com/Building-LLMs-Production-Reliability-Fine-Tuning/dp/B0D4FFPFW8/
Aarushi Kansal, Chapter 2: LangChain: Your Swiss Army Knife, Building Generative AI-Powered Apps: A Hands-on Guide for Developers, Apress, https://www.amazon.com/Building-Generative-AI-Powered-Apps-Hands-ebook/dp/B0CTXXP1S4/
Eddie Forson, Apr 29, 2024, Why I’m building my own AI Agent library, https://medium.com/@Ed_Forson/why-im-building-my-own-ai-agent-library-e20ec9aa3647
AI Agent Workflows: A Complete Guide on Whether to Build With LangGraph or LangChain, Sandi Besen, Oct 2024, https://towardsdatascience.com/ai-agent-workflows-a-complete-guide-on-whether-to-build-with-langgraph-or-langchain-117025509fa0
R Szilágyi, 2024, OpenSource alternatives of Generative Artifical Intelligence for SME's, Journal of Agricultural Informatics, Vol. 15 No. 2 (2024), https://doi.org/10.17700/jai.2024.15.2.733 https://journal.magisz.org/index.php/jai/article/view/733 https://journal.magisz.org/index.php/jai/article/view/733/412

Wrap Architectures for Gen AI Applications

The simplest architectures for AI applications are those that simply "wrap" around LLMs, whether it is commercial LLMs like GPT, or open source LLMs like Mistral or Llama.

A16Z, April 2nd, 2024 (accessed), AI Getting Started https://github.com/a16z-infra/ai-getting-started (Javascript wrapper kits for several commercial AI APIs.)
Ben Auffarth, Dec 22, 2023 Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT and other LLMs,https://www.amazon.com/Generative-AI-LangChain-language-ChatGPT/dp/1835083463/
Thiyagarajan Maruthavan (Rajan), Apr 12, 2024, So what if it is a thin wrapper on OpenAI? https://medium.com/@mtrajan/so-what-if-it-is-a-thin-wrapper-on-openai-274dd005b6d3
Adva Nakash Peleg, May 30, 2024, An LLM Journey: From POC to Production, https://medium.com/cyberark-engineering/an-llm-journey-from-poc-to-production-6c5ec6a172fb
Apurv Sibal, February 26, 2025, Hands-On Prompt Engineering: Learning to Program ChatGPT Using OpenAI APIs, Wiley, https://www.amazon.com/Hands-Prompt-Engineering-Learning-Program/dp/1394210760/
Dr Kris Jamsa, Dec 2023, OpenAI and ChatGPT Programming: Using Python to Unlock OpenAI and ChatGPT, https://www.amazon.com/OpenAI-ChatGPT-Programming-Python-Unlock/dp/B0CQK41P6B/
Cuantum Technologies, May 2023, ChatGPT API Bible: Mastering Python Programming for Conversational AI: Build Intelligent Chatbots and AI Applications with ChatGPT API and Python (Mastering AI and Python), https://www.amazon.com/ChatGPT-API-Bible-Conversational-Applications/dp/B0C47NWRT7/
Mike Gold, October 6, 2023, Crafting Applications with ChatGPT API: Using Python, Green Belt Book LLC, https://www.amazon.com/Crafting-Applications-ChatGPT-API-Python-ebook/dp/B0CHJX36X3/
Henry Habib, Paul Siegel, March 12, 2024, OpenAI API Cookbook: Build intelligent applications including chatbots, virtual assistants, and content generators, Packt Publishing, https://www.amazon.com/OpenAI-API-Cookbook-intelligent-applications-ebook/dp/B0CT8W7B79/
Olivier Caelen, Marie-Alice Blete, August 13, 2024, Developing Apps with GPT-4 and ChatGPT: Build Intelligent Chatbots, Content Generators, and More, 2nd edition, O'Reilly Media; https://www.amazon.com/Developing-Apps-GPT-4-ChatGPT-Intelligent/dp/1098168100/
Michael J. Lever, Aug 2024, AI or API? | Chatbot cuckoos are bloating tech OpenAI wrappers are becoming a shortcut for start-ups, but are they sustainable? https://medium.com/future-ux/ai-or-api-chatbot-cuckoos-are-bloating-tech-d6b8d8255279
Yorick Sens, Henriette Knopp, Sven Peldszus, Thorsten Berger, 12 Aug 2024, A Large-Scale Study of Model Integration in ML-Enabled Software Systems, https://arxiv.org/abs/2408.06226
Raymond Lo, Jul 10, 2024, How to Build Faster GenAI Apps with Fewer Lines of Code using OpenVINO™ GenAI API, https://medium.com/openvino-toolkit/how-to-build-faster-genai-apps-with-fewer-lines-of-code-using-openvino-genai-api-5dd5fcabea17
Rachel Curry, Aug 28 2024, Why companies including JPMorgan and Walmart are opting for internal gen AI assistants after initially restricting usage, https://www.cnbc.com/2024/08/28/why-jpmorgan-and-walmart-are-opting-for-internal-gen-ai-assistants.html
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
Xinyi Hou, Yanjie Zhao, Haoyu Wang, 3 Aug 2024, Voices from the Frontier: A Comprehensive Analysis of the OpenAI Developer Forum, https://arxiv.org/abs/2408.01687
Chandra Irugalbandara, Ashish Mahendra, Roland Daynauth, Tharuka Kasthuri Arachchige, Jayanaka Dantanarayana, Krisztian Flautner, Lingjia Tang, Yiping Kang, Jason Mars, 16 Apr 2024 (v3), Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production, https://arxiv.org/abs/2312.14972
Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, Yong Liu, 12 Aug 2024 (v2), An Empirical Study on Challenges for LLM Developers, https://arxiv.org/abs/2408.05002
Dennis Rall, Bernhard Bauer, Thomas Fraunholz, 8 Nov 2023, Towards Democratizing AI: A Comparative Analysis of AI as a Service Platforms and the Open Space for Machine Learning Approach, https://arxiv.org/abs/2311.04518
David Spuler, March 2024, API Wrapper Architecture Optimizations, in Generative AI in C++, https://www.aussieai.com/book/ch7-api-wrapper-optimizations
Andrew Zuo, Sep 2024, Don’t Judge An LLM Only By The Web App, https://andrewzuo.com/dont-judge-an-llm-only-by-the-web-app-0a47d29390c3
Emilia David, September 3, 2024, Anthropic to release system prompts for Artifacts, latest Claude family prompts found incomplete, https://venturebeat.com/ai/anthropic-to-release-system-prompts-for-artifacts-latest-claude-family-prompts-found-incomplete/
Emilia David, August 27, 2024, Anthropic releases AI model system prompts, winning praise for transparency, https://venturebeat.com/ai/anthropic-releases-ai-model-system-prompts-winning-praise-for-transparency/
Gian Segato, September 2024, The dawn of a new startup era, https://giansegato.com/essays/dawn-new-startup-era
Kris Ograbek, Aug 30, 2024, 6 Hard-learned Lessons from My First Project as a Freelance AI Engineer, https://ai.gopubby.com/6-hard-learned-lessons-from-my-first-project-as-a-freelance-ai-engineer-9519e6edee90
Asankhaya Sharma (codelion), Sep 2024, Optillm: Optimizing inference proxy for LLMs, https://github.com/codelion/optillm
Xiaoxia Liu, Jingyi Wang, Jun Sun, Xiaohan Yuan, Guoliang Dong, Peng Di, Wenhai Wang, Dongxia Wang, 21 Nov 2023, Prompting Frameworks for Large Language Models: A Survey, https://arxiv.org/abs/2311.12785
Carl Franzen, September 13, 2024, What OpenAI’s new o1-preview and o1-mini models mean for developers, https://venturebeat.com/programming-development/what-openais-new-o1-preview-and-o1-mini-models-mean-for-developers/
Sascha Heyer, Sep 2024, RAG API: 30 lines of code is all you need for RAG. The easiest way to get started with RAG. https://medium.com/google-cloud/google-cloud-rag-api-c7e3c9931b3e
Kyle Wiggers, September 16, 2024, Runway announces an API for its video-generating AI models, https://techcrunch.com/2024/09/16/runway-announces-an-api-for-its-video-generating-models/
Kyle Wiggers, October 21, 2024, xAI, Elon Musk’s AI startup, launches an API, https://techcrunch.com/2024/10/21/xai-elon-musks-ai-startup-launches-an-api/
Quang H. Nguyen, Duy C. Hoang, Juliette Decugis, Saurav Manchanda, Nitesh V. Chawla, Khoa D. Doan, 24 Jul 2024 (v2), MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs, https://arxiv.org/abs/2407.10834
K. Balázs Neszlényi, A. Milos and A. Kiss, "AssistantGPT: Enhancing User Interaction with LLM Integration," 2024 IEEE 22nd Jubilee International Symposium on Intelligent Systems and Informatics (SISY), Pula, Croatia, 2024, pp. 000619-000624, doi: 10.1109/SISY62279.2024.10737548. https://ieeexplore.ieee.org/abstract/document/10737548
swyx, Sep 2024, What Works in AI UX (lightning talk + Q&A), https://www.youtube.com/watch?v=PkHjoihjo6U
Latent Space, Nov 2024, Why GPT Wrappers Are Good, Actually, https://www.latent.space/p/gpt-wrappers
Tegan Jones, 22 November, 2024, Neural Notes: Stop building AI startups with “the same crap” as everyone else. In this edition: warnings for startups relying too heavily on generic AI models and how AI has changed the relationship between VCs and founders. https://www.smartcompany.com.au/artificial-intelligence/neural-notes-stop-building-ai-startups-same-crap-everyone-else/
Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su, 22 Feb 2024, Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments, https://arxiv.org/abs/2402.14672
Narcisa Guran, Florian Knauf, Man Ngo, Stefan Petrescu, Jan S. Rellermeyer, 21 Nov 2024, Towards a Middleware for Large Language Models, https://arxiv.org/abs/2411.14513
Andrew Ng, Nov 2024, Simple, unified interface to multiple Generative AI providers, https://github.com/andrewyng/aisuite
Angular Ventures, December 03, 2024, Engines or plastics? How we talk about LLMs and how we use them. The Angle Issue #249, https://newsletter.angularventures.com/p/engines-or-plastics-how-we-talk-about-llms-and-how-we-use-them
Chris Pedregal, December 9, 2024, How to Build a Truly Useful AI Product. Generative AI breaks the old startup playbook, https://every.to/thesis/how-to-build-a-truly-useful-ai-product
Anshul Ramachandran, Jul 08, 2023, How to Make AI UX Your Moat. Design great AI Products that go beyond "just LLM Wrappers": make AI more present, more practical, and then more powerful. https://www.latent.space/p/ai-ux-moat
Lester Mapp, Feb. 6, 2025, From zero to millions? How regular people are cashing in on AI. Every day people are using AI in ways you wouldn't expect. You can too. Here's how, https://www.zdnet.com/article/from-zero-to-millions-how-regular-people-are-cashing-in-on-ai/
Andrew Chen, Feb 05, 2025, Revenge of the GPT Wrappers: Defensibility in a world of commoditized AI models: Why network effects and distribution will be king, once more, https://andrewchen.substack.com/p/revenge-of-the-gpt-wrappers-defensibility
Mandar Karhade, Feb 2025, Tired of LLM Chaos? LiteLLM Should Be Your Default. Stop juggling multiple LLM APIs and their “standards”. https://pub.towardsai.net/tired-of-llm-chaos-litellm-should-be-your-default-e04730b3c33c
Alex Fazio, Feb 2025, How to Build an LLM Chat App: The New Litmus Test for Junior Devs, https://x.com/alxfazio/status/1893242657331101976 (How to build a wrapper chat app that scales by taking care of message queueing, API rate limits, history database management, caching, and other real-world deployment issues.)
Jovan Cicmil, Feb 2025, Your ‘AI Startup’ Is Just OpenAI’s API: Why 99% of AI Companies Are Just Wrappers Around GPT and Will Die a Quick Death, https://blog.startupstash.com/your-ai-startup-is-just-openai-s-api-85940e81d2bd
Mary Ann Azevedo, February 27, 2025, Stripe says AI startups are growing faster than SaaS ever did, and calling them wrappers ‘misses the point’, https://techcrunch.com/2025/02/27/stripe-ceo-says-ai-startups-are-growing-faster-than-saas-ever-did-and-calling-them-wrappers-misses-the-point/
John Webber, January 6, 2025, Building an AI Wrapper SaaS in 2025: Opportunities and Challenges, https://saasminded.dev/building-an-ai-wrapper-saas-in-2025-opportunities-and-challenges/ ("...a faster route to market, the ability to tap into cutting-edge technology, and the potential for rapid scaling... requires a deep understanding of the market dynamics, a commitment to continuous innovation, a strategic approach to building defensibility, and a relentless focus on delivering unique and irreplaceable value to users.")
Wil Chung, 21 Nov 2024, The moats are in the GPT-wrappers, https://interjectedfuture.com/the-moats-are-in-the-gpt-wrappers/ ("...the GPT-wrapper application layer is where the value accrues.")
Stewart Townsend, 16 August 2024, The Future of AI Wrapper Companies: Will They Survive in 2024? https://stewarttownsend.com/the-future-of-ai-wrapper-companies-will-they-survive-in-2024/ ("Conclusion: The future of AI wrapper companies indeed looks promising and intriguing.")
Kate Clark Fri, March 7, 2025, The Hottest AI Companies Right Now Are ‘Apps’, Bloomberg, https://finance.yahoo.com/news/hottest-ai-companies-now-apps-140037730.html
Supreeth Koundinya, March 10, 2025, Manus is a Wrapper of Anthropic’s Claude, and It’s Okay, https://analyticsindiamag.com/ai-features/manus-is-a-wrapper-of-anthropics-claude-and-its-okay/ (“Manus didn’t just slap an API on a model. They built an autonomous system that can execute deep research, deep thinking, and multi-step tasks in a way that no other AI have.”)
Garry Tan, March 2025, X post, https://x.com/garrytan/status/1898949767335752019 ("... the models are plenty smart already and all the alpha is in custom prompting, tool use, clever workflow and evals.")

OpenAI API Applications

One particular type of "wrap" AI application is to use the OpenAI API (e.g. for ChatGPT).

Dr Kris Jamsa, Dec 2023, OpenAI and ChatGPT Programming: Using Python to Unlock OpenAI and ChatGPT, https://www.amazon.com/OpenAI-ChatGPT-Programming-Python-Unlock/dp/B0CQK41P6B/
Cuantum Technologies, May 2023, ChatGPT API Bible: Mastering Python Programming for Conversational AI: Build Intelligent Chatbots and AI Applications with ChatGPT API and Python (Mastering AI and Python), https://www.amazon.com/ChatGPT-API-Bible-Conversational-Applications/dp/B0C47NWRT7/
Mike Gold, October 6, 2023, Crafting Applications with ChatGPT API: Using Python, Green Belt Book LLC, https://www.amazon.com/Crafting-Applications-ChatGPT-API-Python-ebook/dp/B0CHJX36X3/
Henry Habib, Paul Siegel, March 12, 2024, OpenAI API Cookbook: Build intelligent applications including chatbots, virtual assistants, and content generators, Packt Publishing, https://www.amazon.com/OpenAI-API-Cookbook-intelligent-applications-ebook/dp/B0CT8W7B79/
Olivier Caelen, Marie-Alice Blete, August 13, 2024, Developing Apps with GPT-4 and ChatGPT: Build Intelligent Chatbots, Content Generators, and More, 2nd edition, O'Reilly Media; https://www.amazon.com/Developing-Apps-GPT-4-ChatGPT-Intelligent/dp/1098168100/

Batch API for Inference

Michael Nuñez, October 8, 2024, Anthropic challenges OpenAI with affordable batch processing, https://venturebeat.com/ai/anthropic-challenges-openai-with-affordable-batch-processing/
Microsoft Nov 2024, Getting started with Azure OpenAI global batch deployments, https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/batch
OpenAI, Nov 2024, Batch API FAQ. Batch API endpoint for asynchronous batch processing, https://help.openai.com/en/articles/9197833-batch-api-faq
Anthropic, 9 Oct 2024, Introducing the Message Batches API, https://www.anthropic.com/news/message-batches-api
Katia Gil Guzman Apr 24, 2024, Batch processing with the Batch API, https://cookbook.openai.com/examples/batch_processing
Lunary, Oct 22, 2024, Using the Batch API with Azure OpenAI, https://lunary.ai/blog/batch-api-azure-openai
Sukalp Tripathi, Sep 8, 2024, Batch API: OpenAI, https://sukalp.medium.com/batch-api-openai-831a0b09690c
Google, Nov 2024, Get batch predictions for Gemini, https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/batch-prediction-api
Google, Nov 2024, Send a batch process documents request, https://cloud.google.com/document-ai/docs/samples/documentai-batch-process-document
Gibion AI, Jan 15, 2024, Efficient Batch Processing with LangChain and OpenAI: Overcoming RateLimitError, https://medium.com/@hey_16878/efficient-batch-processing-with-langchain-and-openai-overcoming-ratelimiterror-daa9de4bbd8b
Bingli Liao, Danilo Vasconcellos Vargas, 13 Jul 2024, Beyond KV Caching: Shared Attention for Efficient LLMs, https://arxiv.org/abs/2407.12866 (Layerwise weight sharing in attention.)

Application Layer

The "application layer" is the whole range of applications that can be built on top of generative AI and its LLMs as building blocks. Research includes:

Ashu Garg, Oct 25, 2024, Why OpenAI’s $157B valuation misreads AI’s future, https://foundationcapital.com/why-openais-157b-valuation-misreads-ais-future/ (Bullish on the "application layer" saying "The top of the stack is where I see the most promise. ...the most valuable companies of the AI era don’t exist yet."... "The cloud era created over 20 application companies with $1B+ revenue. In AI, we believe this number could exceed 100.")
Akash Bajwa, Nov 18, 2024, Opinionated AI Products: Strong Technologies Forms Beliefs, https://akashbajwa.substack.com/p/opinionated-ai-products
Meno Ventures, Nov 2024, 2024: The State of Generative AI in the Enterprise: The enterprise AI landscape is being rewritten in real time, https://menlovc.com/2024-the-state-of-generative-ai-in-the-enterprise/
Tegan Jones, 22 November, 2024, Neural Notes: Stop building AI startups with “the same crap” as everyone else. In this edition: warnings for startups relying too heavily on generic AI models and how AI has changed the relationship between VCs and founders. https://www.smartcompany.com.au/artificial-intelligence/neural-notes-stop-building-ai-startups-same-crap-everyone-else/
Angular Ventures, December 03, 2024, Engines or plastics? How we talk about LLMs and how we use them. The Angle Issue #249, https://newsletter.angularventures.com/p/engines-or-plastics-how-we-talk-about-llms-and-how-we-use-them
Charles Rollet, December 4, 2024, Key leaders behind Google’s viral NotebookLM are leaving to create their own startup, https://techcrunch.com/2024/12/04/key-leaders-behind-googles-viral-notebooklm-are-leaving-to-create-their-own-startup/ ("As the frontier models and their capabilities continue to grow, thoughtful products are required to make the benefits of this technology accessible, useful, and obvious to everyday people — so our team is going to be focused on building a user-first AI product...the team wanted to create something that leverages the latest AI models to build something useful to regular people.")
Leah Hodgson, December 7, 2024, Where are all the consumer AI startups—and why aren’t VCs funding them? https://pitchbook.com/news/articles/where-are-all-the-consumer-ai-startups-and-why-arent-vcs-funding-them ("...consumer AI market by 2032 will be twice the size of the enterprise market for AI."..."According to Zion Market Research, the market size for consumer AI is predicted to grow to around $1.3 trillion by 2032. For enterprise, it is estimated to reach only around $560 billion by the same year, according to Precedence research.")
Kevin Mahaffey, Dec 13, 2024, Defensibility: Applications. Part 7: Where bucks are born, https://writing.snr.vc/p/defensibility-applications
Chris Pedregal, December 9, 2024, How to Build a Truly Useful AI Product. Generative AI breaks the old startup playbook, https://every.to/thesis/how-to-build-a-truly-useful-ai-product
Anshul Ramachandran, Jul 08, 2023, How to Make AI UX Your Moat. Design great AI Products that go beyond "just LLM Wrappers": make AI more present, more practical, and then more powerful. https://www.latent.space/p/ai-ux-moat
Ignacio de Gregorio Noblejas, December 15, 2024, The AI Trillion-Dollar Product, https://thewhitebox.beehiiv.com/p/the-ai-trillion-dollar-product
Akash Bajwa, Dec 16, 2024, Vertical Integration: Model vs Product Companies: The False Dichotomy of Model & App Layer, https://akashbajwa.substack.com/p/vertical-integration-model-vs-product
Apple, December 16, 2024, Apple reveals 2024’s most downloaded apps and games on the App Store, https://www.apple.com/newsroom/2024/12/apple-reveals-2024s-most-downloaded-apps-and-games-on-the-app-store/
Sarah Perez, December 16, 2024, Temu is the most downloaded app on the US App Store in 2024, https://techcrunch.com/2024/12/16/temu-is-the-most-downloaded-app-on-the-u-s-app-store-in-2024/
Jess Weatherbed, Dec 10, 2024, AI is booming on the App Store, and developers are taking advantage of it. Many high-ranking AI apps feel like an attempted cash grab, and it’s not easy to spot the trash from the treasure. https://www.theverge.com/2024/12/9/24314972/apple-app-store-ai-apps-art-design-photography
Johan Uddståhl, Jan 2, 2025, …when all we ever needed was a text box, or how 2025 will be back to basics for the web, https://medium.com/@baktakt/when-all-we-ever-needed-was-a-text-box-c672c52a0dca
Rex Woodbury, Jun 05, 2024, The Consumer Renaissance: From Predicting Consumer AI Applications to Analyzing Consumer Spend, https://www.digitalnative.tech/p/the-consumer-renaissance
Rex Woodbury, Jun 13, 2024, The Consumer Renaissance (Part II): Shopping, Consumer Health, and Patterns of Household Spend, https://www.digitalnative.tech/p/the-consumer-renaissance-part-ii
James Currier, Jan 2025, Consumer is Back – And Why It’s Been So Hard Since 2014, https://www.nfx.com/post/consumer-is-back
Alex Kantrowitz, Jan 28, 2025, Notes on DeepSeek: Generative AI is All About the Applications Now: Building with AI might cost 5% of what it did a week ago, so what gets built has never been more important. https://www.bigtechnology.com/p/notes-on-deepseek-generative-ai-is
What’s 🔥 in Enterprise IT/VC #431, Feb 01, 2025, 🙏🏼 DeepSeek - years compressed into days - the cost 💰 of intelligence 🧠 has dramatically 📉 - the time to build 🏗️ is now! https://www.whatshotit.vc/p/whats-in-enterprise-itvc-431
Alex Kantrowitz, Feb 01, 2025, OpenAI is an App Company Now. After DeepSeek, OpenAI is an app builder above all else. Perhaps that was always the way. https://www.bigtechnology.com/p/openai-is-an-app-company-now
Olivia Moore, Feb 20025, AI Voice Agent Update - 2025, A16Z, https://a16z.com/ai-voice-agents-2025-update/ https://gamma.app/docs/a16z-AI-Voice-Update-2025--ttkorld8iy6wfnj?mode=doc (Thesis that voice will be the primary AI interface for consumers.)
AL Anany, Feb 2025, Now That AI is Affordable — It’s Time To Build. It is time for perfect use cases. https://entreprenal.com/now-that-ai-is-affordable-its-time-to-build-8e84337355eb
Tanay Jaipuria, Feb 11, 2025, How Big Tech Sees DeepSeek: Five Key Takeaways: On diffusion of innovation, the need for strong business models, lower inference costs benefiting apps and investing in infrastructure as a strategic advantage, https://www.tanayj.com/p/how-big-tech-sees-deepseek-five-key
Leah Hodgson, February 8, 2025, DeepSeek's gift to the AI app space: DeepSeek might be just what the AI app space needs, https://pitchbook.com/news/articles/deepseek-might-be-just-what-the-ai-app-space-needs
Andrew Chen, Feb 05, 2025, Revenge of the GPT Wrappers: Defensibility in a world of commoditized AI models: Why network effects and distribution will be king, once more, https://andrewchen.substack.com/p/revenge-of-the-gpt-wrappers-defensibility
Jan Kammerath, Feb 11, 2025, Programmers’ New Goldrush: Seizing Opportunities With Local AI, https://medium.com/@jankammerath/programmers-new-goldrush-seizing-opportunities-with-local-ai-12b1a3e2692f
Yaakov Carno, Feb 24, 2025, The surprising patterns behind viral AI products: A deep dive into Bolt, Cursor, Granola, PhotoRoom, Replit and more, https://open.substack.com/pub/kylepoyar/p/ai-ux-patterns (The "surprising pattern" in successful AI products is that they all have a slick UI.)
Kyle Wiggers, February 25, 2025, Quora’s Poe now lets users create and share custom AI-powered apps, https://techcrunch.com/2025/02/25/quoras-poe-now-lets-users-create-and-share-custom-ai-powered-apps/
CNBC, Feb 2025, Hugging Face co-founder: The next step in AI will be applications, https://www.msn.com/en-au/money/other/hugging-face-co-founder-the-next-step-in-ai-will-be-applications/vi-AA1xFd8X
Joe McKendrick, Feb. 20, 2025, Brace yourself: The era of 'citizen developers' creating apps is here, thanks to AI, https://www.zdnet.com/article/brace-yourself-the-era-of-citizen-developers-creating-apps-is-here-thanks-to-ai/
Craig Le Clair, Oct 23 2024, Predictions 2025: GenAI, Citizen Developers, And Caution Influence Automation, https://www.forrester.com/blogs/predictions-2025-automation/
Rex Woodbury, Feb 26, 2025, The ChatGPT Prompts That Can Be $1B+ Companies: The Unbundling of ChatGPT? https://open.substack.com/pub/digitalnative/p/the-chatgpt-prompts-that-can-be-1b
Rex Woodbury, Feb 20, 2025, How Consumer Psychology Informs AI Product Design: The IKEA Effect, the Paradox of Choice, and AI's Interface Problem, https://www.digitalnative.tech/p/how-consumer-psychology-informs-ai
John Webber, January 6, 2025, Building an AI Wrapper SaaS in 2025: Opportunities and Challenges, https://saasminded.dev/building-an-ai-wrapper-saas-in-2025-opportunities-and-challenges/ ("...a faster route to market, the ability to tap into cutting-edge technology, and the potential for rapid scaling... requires a deep understanding of the market dynamics, a commitment to continuous innovation, a strategic approach to building defensibility, and a relentless focus on delivering unique and irreplaceable value to users.")
Wil Chung, 21 Nov 2024, The moats are in the GPT-wrappers, https://interjectedfuture.com/the-moats-are-in-the-gpt-wrappers/ ("...the GPT-wrapper application layer is where the value accrues.")
Stewart Townsend, 16 August 2024, The Future of AI Wrapper Companies: Will They Survive in 2024? https://stewarttownsend.com/the-future-of-ai-wrapper-companies-will-they-survive-in-2024/ ("Conclusion: The future of AI wrapper companies indeed looks promising and intriguing.")
Kate Clark Fri, March 7, 2025, The Hottest AI Companies Right Now Are ‘Apps’, Bloomberg, https://finance.yahoo.com/news/hottest-ai-companies-now-apps-140037730.html
Garry Tan, March 2025, X post, https://x.com/garrytan/status/1898949767335752019 ("... the models are plenty smart already and all the alpha is in custom prompting, tool use, clever workflow and evals.")
Julio Pessan, Mar 7, 2025, Don’t Sell AI Agents, Sell AI Infrastructures Instead — The Billion-Dollar Opportunity, https://medium.com/@julio.pessan.pessan/dont-sell-ai-agents-sell-ai-infrastructures-instead-the-billion-dollar-opportunity-04eb7166b3d9

Code Generation Applications of Generative AI

Hadi Ghaemi, Zakieh Alizadehsani, Amin Shahraki, Juan M. Corchado, June 2024, Transformers in source code generation: A comprehensive survey, Journal of Systems Architecture, 103193, https://www.sciencedirect.com/science/article/abs/pii/S1383762124001309
Franklin Huang, May 17, 2024, Machine Learning Systems with Reduced Memory Requirements, Masters of Science, Electrical Engineering and Computer Sciences, University of California, Berkeley, Technical Report No. UCB/EECS-2024-120 http://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-120.html https://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-120.pdf Code: https://github.com/hongyihuang/spec-mcts/blob/main/triton (Broad paper that examines a lot of different optimizations that reduce memory costs, including quantization, kernel fusion, sparsity, MatMul optimizations, KV cache compression, and various other methods.)
Lianghong Guo, Yanlin Wang, Ensheng Shi, Wanjun Zhong, Hongyu Zhang, Jiachi Chen, Ruikai Zhang, Yuchi Ma, Zibin Zheng, 29 Jul 2024, When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention, https://arxiv.org/abs/2407.20042 Code: https://github.com/DeepSoftwareAnalytics/CodeFast
AIM, 2024, Mistral AI Unveils Mistral Large 2, Beats Llama 3.1 on Code and Math, https://analyticsindiamag.com/ai-news-updates/mistral-ai-unveils-mistral-large-2-beats-llama-3-1-on-code-and-math/
Kevin Zhang, Jun 26, 2024, Investing in the Age of Generative AI, https://eastwind.substack.com/p/investing-in-the-age-of-generative
by Nicholas Carlini, 2024-08-01, How I Use "AI", https://nicholas.carlini.com/writing/2024/how-i-use-ai.html (Generative AI and LLM use cases are "unglamorous" but useful to software developers.)
Haolin Jin, Linghan Huang, Haipeng Cai, Jun Yan, Bo Li, Huaming Chen, 5 Aug 2024, From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future, https://arxiv.org/abs/2408.02479
Grant Gross, 30 Aug 2024, Agentic AI: Decisive, operational AI arrives in business, https://www.cio.com/article/3496519/agentic-ai-decisive-operational-ai-arrives-in-business.html
Hao Zhou, Chengming Hu, Ye Yuan, Yufei Cui, Yili Jin, Can Chen, Haolun Wu, Dun Yuan, Li Jiang, Di Wu, Xue Liu, Charlie Zhang, Xianbin Wang, Jiangchuan Liu, 17 May 2024, Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities, https://arxiv.org/abs/2405.10825
Yaroslav Zharov, Yury Khudyakov, Evgeniia Fedotova, Evgeny Grigorenko, Egor Bogomolov, 18 Feb 2024, Tool-Augmented LLMs as a Universal Interface for IDEs, https://arxiv.org/abs/2402.11635
Advait Sarkar, 1 Nov 2023, Will Code Remain a Relevant User Interface for End-User Programming with Generative AI Models? https://arxiv.org/abs/2311.00382
Liwenhan Xie, Chengbo Zheng, Haijun Xia, Huamin Qu, Chen Zhu-Tian, 3 Aug 2024, WaitGPT: Monitoring and Steering Conversational LLM Agent in Data Analysis with On-the-Fly Code Visualization, https://arxiv.org/abs/2408.01703
Madhumita Murgia, August 23 2024, AI-powered coding pulls in almost $1bn of funding to claim ‘killer app’ status, https://www.ft.com/content/4868bd38-613c-4fa9-ba9d-1ed8fa8a40c8
Hesam Sheikh, Aug 2024, The Smarter Way of Using AI in Programming, https://towardsdatascience.com/the-smarter-way-of-using-ai-in-programming-0492ac610385
Pragmatic Coders, Sep 2024, Best AI tools for developers in 2024: AI-powered coding, https://medium.com/@pragmaticcoders/best-ai-tools-for-developers-in-2024-ai-powered-coding-32e31dff6024
Zheyuan (Kevin) Cui, Mert Demirer, Sonia Jaffe, Leon Musolff, Sida Peng, Tobias Salz, September 03, 2024, The Effects of Generative AI on High Skilled Work: Evidence from Three Field Experiments with Software Developers, https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4945566 https://papers.ssrn.com/sol3/Delivery.cfm/4945566.pdf?abstractid=4945566&mirid=1
Asif Razzaq, September 5, 2024, Yi-Coder Released by 01.AI: A Powerful Small-Scale Code LLM Series, Delivering Exceptional Performance in Code Generation, Editing, and Long-Context Comprehension, https://www.marktechpost.com/2024/09/05/yi-coder-released-by-01-ai-a-powerful-small-scale-code-llm-series-delivering-exceptional-performance-in-code-generation-editing-and-long-context-comprehension/
OpenAI, September 12, 2024, Learning to Reason with LLMs, https://openai.com/index/learning-to-reason-with-llms/
Grant Gross, 12 Sep 2024, AI coding assistants wave goodbye to junior developers, https://www.cio.com/article/3509174/ai-coding-assistants-wave-goodbye-to-junior-developers.html
Evan Wang, Federico Cassano, Catherine Wu, Yunfeng Bai, Will Song, Vaskar Nath, Ziwen Han, Sean Hendryx, Summer Yue, Hugh Zhang, 5 Sep 2024, Planning In Natural Language Improves LLM Search For Code Generation, https://arxiv.org/abs/2409.03733
Michael Nuñez, September 19, 2024, Microsoft’s GRIN-MoE AI model takes on coding and math, beating competitors in key benchmarks, https://venturebeat.com/ai/microsofts-grin-moe-ai-model-takes-on-coding-and-math-beating-competitors-in-key-benchmarks/
Yanxian Huang, Wanjun Zhong, Ensheng Shi, Min Yang, Jiachi Chen, Hui Li, Yuchi Ma, Qianxiang Wang, Zibin Zheng, Yanlin Wang, 13 Sep 2024, Agents in Software Engineering: Survey, Landscape, and Vision, https://arxiv.org/abs/2409.09030 https://github.com/DeepSoftwareAnalytics/Awesome-Agent4SE
Grant Gross, 26 Sep 2024, Devs gaining little (if anything) from AI coding assistants, https://www.cio.com/article/3540579/devs-gaining-little-if-anything-from-ai-coding-assistants.html
Ulyana Piterbarg, Lerrel Pinto, Rob Fergus, 3 Oct 2024, Training Language Models on Synthetic Edit Sequences Improves Code Synthesis, https://arxiv.org/abs/2410.02749
https://www.cio.com/article/3567138/ai-native-software-engineering-may-be-closer-than-developers-think.html
C Thiede, M Taeumel, L Böhme, R Hirschfeld, 2024, Talking to Objects in Natural Language: Toward Semantic Tools for Exploratory Programming, Onward! ’24, October 23–25, 2024, Pasadena, CA, USA, https://dl.acm.org/doi/pdf/10.1145/3689492.3690049
Aki Ranin, Sep 2, 2024, The Code Canaries Are Singing — Our Path Toward AGI: How the fate of human software developers reveals our path toward AGI, https://akiranin.medium.com/the-code-canaries-are-singing-our-path-toward-agi-6c234cae0189
Jose Yapur, 29 OCT 2024, Introducing the next-level of AI-powered workflows with Amazon Q Developer inline chat, https://aws.amazon.com/blogs/devops/amazon-q-developer-inline-chat/
GitHub, Oct 2024, Bringing developer choice to Copilot with Anthropic’s Claude 3.5 Sonnet, Google’s Gemini 1.5 Pro, and OpenAI’s o1-preview, https://github.blog/news-insights/product-news/bringing-developer-choice-to-copilot/
John Wang, Oct 2024, How we saved hundreds of engineering hours by writing tests with LLMs, https://www.assembled.com/blog/how-we-saved-hundreds-of-engineering-hours-by-writing-tests-with-llms
Shihan Dou, Jiazheng Zhang, Jianxiang Zang, Yunbo Tao, Haoxiang Jia, Shichun Liu, Yuming Yang, Shenxi Wu, Shaoqing Zhang, Muling Wu, Changze Lv, Limao Xiong, Wenyu Zhan, Lin Zhang, Rongxiang Weng, Jingang Wang, Xunliang Cai, Yueming Wu, Ming Wen, Rui Zheng, Tao Ji, Yixin Cao, Tao Gui, Xipeng Qiu, Qi Zhang, Xuanjing Huang, 30 Oct 2024, Multi-Programming Language Sandbox for LLMs, https://arxiv.org/abs/2410.23074
David Gewirtz, September 27, 2024, The best AI for coding, and a bunch that failed miserably, https://www.zdnet.com/article/the-best-ai-for-coding/
Jason Perlow, Nov. 6, 2024, The best open-source AI models: All your free-to-use options explained: Here are the best open-source and free-to-use AI models for text, images, and audio, organized by type, application, and licensing considerations. https://www.zdnet.com/article/the-best-open-source-ai-models-all-your-free-to-use-options-explained/
Fali Wang, Zhiwei Zhang, Xianren Zhang, Zongyu Wu, Tzuhao Mo, Qiuhao Lu, Wanjing Wang, Rui Li, Junjie Xu, Xianfeng Tang, Qi He, Yao Ma, Ming Huang, Suhang Wang, 4 Nov 2024, A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness, https://arxiv.org/abs/2411.03350
Qwen Team, November 12, 2024, Qwen2.5-Coder Series: Powerful, Diverse, Practical, https://qwenlm.github.io/blog/qwen2.5-coder-family/
Evan Doyle, Nov 14, 2024, AI Makes Tech Debt More Expensive, https://www.gauge.sh/blog/ai-makes-tech-debt-more-expensive
Haoxiang Zhang, Shi Chang, Arthur Leung, Kishanthan Thangarajah, Boyuan Chen, Hanan Lutfiyya, Ahmed E. Hassan, 14 Nov 2024, Software Performance Engineering for Foundation Model-Powered Software (FMware), https://arxiv.org/abs/2411.09580
Josh Fruhlinger, Dec 02, 2024, Refactoring AI code: The good, the bad, and the weird, https://www.infoworld.com/article/3610521/refactoring-ai-code-the-good-the-bad-and-the-weird.html
Joe McKendrick, Nov. 27, 2024, Gen AI gives software developers surge in productivity - but it's not for everyone, https://www.zdnet.com/article/gen-ai-gives-software-developers-surge-in-productivity-but-its-not-for-everyone/
Cory Hymel, Dec 02, 2024, 5 ways AI will change the software development life cycle, https://www.infoworld.com/article/3609988/5-ways-ai-will-change-the-software-development-life-cycle.html
Paul Heltzel, 03 Dec 2024, 5 dead-end IT skills — and how to avoid becoming obsolete, https://www.cio.com/article/188985/6-dead-end-it-skills-and-how-to-avoid-becoming-obsolete.html
Google, Dec 2024, Welcome to Project IDX, a new web-based development workspace from Google. IDX is designed to make it faster and easier to build, ship, and manage full-stack, multiplatform apps from the comfort of your browser. https://idx.google.com/
Giordano d'Aloisio, Luca Traini, Federica Sarro, Antinisca Di Marco, 18 Dec 2024, On the Compression of Language Models for Code: An Empirical Study on CodeBERT, https://arxiv.org/abs/2412.13737 (Quantization, pruning and distillation on code generation models.)
Francisco Durán, Matias Martinez, Patricia Lago, Silverio Martínez-Fernández, 19 Dec 2024, Energy consumption of code small language models serving with runtime engines and execution providers, https://arxiv.org/abs/2412.15441
David Gewirtz, Nov. 27, 2024, 25 AI tips to boost your programming productivity with ChatGPT. With ChatGPT in your toolkit, coding can be faster and smoother. I share the best ways of using AI to overcome common coding challenges, so you can streamline your development projects. https://www.zdnet.com/article/25-ai-tips-to-boost-your-programming-productivity-with-chatgpt/
Dewu Zheng, Yanlin Wang, Ensheng Shi, Hongyu Zhang, Zibin Zheng, 24 Dec 2024, How Well Do LLMs Generate Code for Different Application Domains? Benchmark and Evaluation, https://arxiv.org/abs/2412.18573
Aman, May 14, 2024, Near-Instant Full-File Edits, Cursor, https://cursor.sh/blog/instant-apply (A type of speculative decoding for code editing called "speculative edits.")
Lucas Mearian, 03 Apr 2024 Just how good is AI-assisted code generation? Computer World, https://www.computerworld.com/article/2077802/just-how-good-is-ai-assisted-code-generation.html (Notes issues with code quality, security, and reuse.)
Yao Wan, Yang He, Zhangqian Bi, Jianguo Zhang, Hongyu Zhang, Yulei Sui, Guandong Xu, Hai Jin, Philip S. Yu, 30 Dec 2023, Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit, https://arxiv.org/abs/2401.00288 Code: https://xcodemind.github.io/
Xuanle Zhao, Xianzhen Luo, Qi Shi, Chi Chen, Shuo Wang, Wanxiang Che, Zhiyuan Liu, Maosong Sun, 11 Jan 2025, ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation, https://arxiv.org/abs/2501.06598
Tari Ibaba, Jan 2025, This new IDE just destroyed VS Code and Copilot without even trying, https://medium.com/coding-beauty/windsurf-ide-0678288ce0a4
Sida Peng, Eirini Kalliamvakou, Peter Cihon, Mert Demirer, 13 Feb 2023, The Impact of AI on Developer Productivity: Evidence from GitHub Copilot, https://arxiv.org/abs/2302.06590
Paul Sawers, February 6, 2025, GitHub Copilot brings mockups to life by generating code from images, https://techcrunch.com/2025/02/06/github-copilot-brings-mockups-to-life-by-generating-code-from-images/
Daniel Delaney, Feb 2025, Chat is a bad UI pattern for development tools, https://danieldelaney.net/chat/
Dacheng Li, Shiyi Cao, Chengkun Cao, Xiuyu Li, Shangyin Tan, Kurt Keutzer, Jiarong Xing, Joseph E. Gonzalez, Ion Stoica, 20 Feb 2025, S*: Test Time Scaling for Code Generation, https://arxiv.org/abs/2502.14382 https://github.com/NovaSky-AI/SkyThought
David Gewirtz, Feb. 25, 2025, Google just made AI coding assistance free for everyone - with very generous limits,] https://www.zdnet.com/article/google-just-made-ai-coding-assistance-free-for-everyone-with-very-generous-limits/
Qianhui Zhao, Li Zhang, Fang Liu, Xiaoli Lian, Qiaoyuanhe Meng, Ziqian Jiao, Zetong Zhou, Borui Zhang, Runlin Guo, Jia Li, 24 Feb 2025, CodeSwift: Accelerating LLM Inference for Efficient Code Generation, https://arxiv.org/abs/2502.17139 (Using draft sequences from a datastore of code, to achieve parallel inference, similar to prompt looking decoding or retrieval lookup decoding.)
alexp, February 19, 2025, Vibe Coding and the Future of Software Engineering, https://alexp.pl/2025/02/19/vibe-coding.html
Kate Rooney, Mar 15 2025, Y Combinator startups are fastest growing, most profitable in fund history because of AI, https://www.cnbc.com/2025/03/15/y-combinator-startups-are-fastest-growing-in-fund-history-because-of-ai.html
David Gewirtz, March 18, 2025, What is AI vibe coding? It's all the rage but it's not for everyone - here's why: Caution: Experience required. Vibe coding feels like magic, until your AI assistant starts overwriting your work, https://www.zdnet.com/article/what-is-ai-vibe-coding-its-all-the-rage-but-its-not-for-everyone-heres-why/
Bill Doerrfeld, Mar 17, 2025, Why AI-generated code isn’t good enough (and how it will get better), https://www.infoworld.com/article/3844363/why-ai-generated-code-isnt-good-enough-and-how-it-will-get-better.html

Code Checker Applications

Aman, May 14, 2024, Near-Instant Full-File Edits, Cursor, https://cursor.sh/blog/instant-apply (A type of speculative decoding for code editing called "speculative edits.")
Ansong Ni, Miltiadis Allamanis, Arman Cohan, Yinlin Deng, Kensen Shi, Charles Sutton, Pengcheng Yin, 23 Apr 2024, NExT: Teaching Large Language Models to Reason about Code Execution, https://arxiv.org/abs/2404.14662
David Spuler, March 2024, Chapter 40. Reliability, Generative AI in C++: Coding Transformers and LLMs, https://www.amazon.com/dp/B0CXJKCWX9
Yingbing Huang, Lily Jiaxin Wan, Hanchen Ye, Manvi Jha, Jinghua Wang, Yuhong Li, Xiaofan Zhang, Deming Chen, 16 Jun 2024, New Solutions on LLM Acceleration, Optimization, and Application, https://arxiv.org/abs/2406.10903 (A survey of inference optimization methods and further analysis of Medusa-type speculative decoding and KV cache compression. Also explores hardware co-design, ML compilers and LLM-assisted code debugging.)
Nat McAleese, Rai (Michael Pokorny), Evgenia Nitishinskaya, Jan Leike, Juan Felipe Cerón Uribe, Maja Trebacz, 2024, LMCritics Help Catch LLM Bugs, https://cdn.openai.com/llm-critics-help-catch-llm-bugs-paper.pdf
Patrick J. Chapman, Cindy Rubio-González, and Aditya V. Thakur. 2024. Interleaving Static Analysis and LLM Prompting. In Proceedings of the 13th ACM SIGPLAN International Workshop on the State Of the Art in Program Analysis (SOAP 2024). Association for Computing Machinery, New York, NY, USA, 9–17. https://doi.org/10.1145/3652588.3663317 https://dl.acm.org/doi/abs/10.1145/3652588.3663317
Junwei Liu, Yixuan Chen, Mingwei Liu, Xin Peng, Yiling Lou, 14 Jun 2024, STALL+: Boosting LLM-based Repository-level Code Completion with Static Analysis, https://arxiv.org/abs/2406.10018
Shaojian Qiu, Huihao Huang, Jianxiang Luo, Yingjie Kuang, Haoyu Luo, 11 Feb 2024, BAFLineDP: Code Bilinear Attention Fusion Framework for Line-Level Defect Prediction, https://arxiv.org/pdf/2402.07132
Pragmatic Coders, Sep 2024, Best AI tools for developers in 2024: AI-powered coding, https://medium.com/@pragmaticcoders/best-ai-tools-for-developers-in-2024-ai-powered-coding-32e31dff6024
Tom Ganz, April 2024, Software Defect Localization Using Explainable Deep Learning, Master's Thesis, Master of Science, der Technischen Universität Berlin, https://api-depositonce.tu-berlin.de/server/api/core/bitstreams/308879e0-b14b-4baf-a0c3-19067184ef50/content (AI-based security vulnerability code checker.)
Francisco Ribeiro, José Nuno Castro de Macedo, Kanae Tsushima, Rui Abreu, João Saraiva, 2023, GPT-3-Powered Type Error Debugging: Investigating the Use of Large Language Models for Code Repair, SLE 2023: Proceedings of the 16th ACM SIGPLAN International Conference on Software Language Engineering, October 2023, Pages 111–124, https://doi.org/10.1145/3623476.3623522 (Code corrections are a type of GEC.)
Jiawei Guo, Ziming Li, Xueling Liu, Kaijing Ma, Tianyu Zheng, Zhouliang Yu, Ding Pan, Yizhi LI, Ruibo Liu, Yue Wang, Shuyue Guo, Xingwei Qu, Xiang Yue, Ge Zhang, Wenhu Chen, Jie Fu, 4 Apr 2024, CodeEditorBench: Evaluating Code Editing Capability of Large Language Models, https://arxiv.org/abs/2404.03543
David Spuler, June 2024, Aussie AI, Optimizing On-Device Transformer Inference for Source Code Checking: IP Australia, https://ipsearch.ipaustralia.gov.au/patents/2024901675
Ulyana Piterbarg, Lerrel Pinto, Rob Fergus, 3 Oct 2024, Training Language Models on Synthetic Edit Sequences Improves Code Synthesis, https://arxiv.org/abs/2410.02749
Albin Johansson, Carl Holmberg, Francisco Gomes De Oliveira Neto, and Philipp Leitner. 2024. The Impact of Compiler Warnings on Code Quality in C++ Projects. In Proceedings of the 32nd IEEE/ACM International Conference on Program Comprehension (ICPC '24). Association for Computing Machinery, New York, NY, USA, 270–279. https://doi.org/10.1145/3643916.3644410 https://dl.acm.org/doi/abs/10.1145/3643916.3644410 (Using compiler warnings correlations with higher quality metrics.)
Fang Liu, Zhenwei Liu, Qianhui Zhao, Jing Jiang, Li Zhang, Zian Sun, Ge Li, Zhongqi Li, and Yuchi Ma. 2024. FastFixer: An Efficient and Effective Approach for Repairing Programming Assignments. In Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering (ASE '24). Association for Computing Machinery, New York, NY, USA, 669–680. https://doi.org/10.1145/3691620.3695062 https://dl.acm.org/doi/abs/10.1145/3691620.3695062
Andrea Lepori, Alexandru Calotoiu, and Torsten Hoefler. 2024. Iterating Pointers: Enabling Static Analysis for Loop-based Pointers. ACM Trans. Archit. Code Optim. Just Accepted (October 2024). https://doi.org/10.1145/3701993 https://dl.acm.org/doi/pdf/10.1145/3701993
A Hück, T Ziegler, S Schwitanski, J Jenke, C Bischof, Nov 2024, Compiler-Aided Correctness Checking of CUDA-Aware MPI Applications, https://conferences.computer.org/sc-wpub/pdfs/SC-W2024-6oZmigAQfgJ1GhPL0yE3pS/555400a204/555400a204.pdf
Zeyu Chen, Daiping Liu, Jidong Xiao, and Haining Wang. 2023. All Use-After-Free Vulnerabilities Are Not Created Equal: An Empirical Study on Their Characteristics and Detectability. In Proceedings of the 26th International Symposium on Research in Attacks, Intrusions and Defenses (RAID '23). Association for Computing Machinery, New York, NY, USA, 623–638. https://doi.org/10.1145/3607199.3607229 https://dl.acm.org/doi/10.1145/3607199.3607229 https://vtechworks.lib.vt.edu/bitstream/handle/10919/116595/3607199.3607229.pdf
B. Gui, W. Song, H. Xiong and J. Huang, "Automated Use-After-Free Detection and Exploit Mitigation: How Far Have We Gone?," in IEEE Transactions on Software Engineering, vol. 48, no. 11, pp. 4569-4589, 1 Nov. 2022, doi: 10.1109/TSE.2021.3121994. https://ieeexplore.ieee.org/document/9583875
H. Wei, L. Chen, X. Nie, Z. Zhang, Y. Zhang and G. Shi, "An Efficient Metric-Based Approach for Static Use-After-Free Detection," 2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), Melbourne, Australia, 2022, pp. 58-65, doi: 10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00015. https://ieeexplore.ieee.org/document/10070682

User Interface (UI) Issues for AI Apps

Li Zhang, Shihe Wang, Xianqing Jia, Zhihan Zheng, Yunhe Yan, Longxi Gao, Yuanchun Li, Mengwei Xu, 12 Apr 2024, LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Automation Task Evaluation, https://arxiv.org/abs/2404.16054
Jiachen Liu, Zhiyu Wu, Jae-Won Chung, Fan Lai, Myungjin Lee, Mosharaf Chowdhury, 25 Apr 2024, Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services, https://arxiv.org/abs/2404.16283 (Scheduling GPU activity for multiple queries to ensure good UI experience for text-streaming outputs like chatbots.)
NLUX: The 𝗣𝗼𝘄𝗲𝗿𝗳𝘂𝗹 Conversational AI JavaScript Library, https://github.com/nlkitai/nlux
Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia, 7 Dec 2023, Prompt Highlighter: Interactive Control for Multi-Modal LLMs, https://arxiv.org/abs/2312.04302 Code: https://github.com/dvlab-research/Prompt-Highlighter/ (Allows users to highlight part of their prompt for more specificity.)
Michael Nuñez, June 21, 2024, Why Anthropic’s Artifacts may be this year’s most important AI feature: Unveiling the interface battle, https://venturebeat.com/ai/why-anthropics-artifacts-may-be-this-years-most-important-ai-feature-unveiling-the-interface-battle/
Paul DelSignore, Jul 5, 2024, From AI Models to Products: The Shift in AI Strategy: Why Model Performance No Longer Matters, https://generativeai.pub/from-ai-models-to-products-the-shift-in-ai-strategy-b377aeee3948
Vince Lam, Mar 12, 2024, 50+ Open-Source Options for Running LLMs Locally, https://medium.com/thedeephub/50-open-source-options-for-running-llms-locally-db1ec6f5a54f
Ethan Mollick, Aug 01, 2024, On speaking to AI: Voice changes a lot of things, https://www.oneusefulthing.org/p/on-speaking-to-ai
Arvind Narayanan and Sayash Kapoor, Aug 19, 2024, AI companies are pivoting from creating gods to building products. Good. Turning models into products runs into five challenges, https://www.aisnakeoil.com/p/ai-companies-are-pivoting-from-creating
Lance Whitney, Aug. 28, 2024, Why Claude's Artifacts is the coolest feature I've seen in generative AI so far, https://www.zdnet.com/article/why-claudes-artifacts-is-the-coolest-feature-ive-seen-in-generative-ai-so-far/
Kevin Lin, Sumant Guha, Joe Spaniac, Andy Zheng, 13 Nov 2020 (v3), Nifty Web Apps: Build a Web App for Any Text-Based Programming Assignment, https://arxiv.org/abs/2010.04671
Songqin Nong, Jiali Zhu, Rui Wu, Jiongchao Jin, Shuo Shan, Xiutian Huang, Wenhao Xu, 7 Aug 2024 (v2), MobileFlow: A Multimodal LLM For Mobile GUI Agent, https://arxiv.org/abs/2407.04346
Dongping Chen, Yue Huang, Siyuan Wu, Jingyu Tang, Liuyi Chen, Yilin Bai, Zhigang He, Chenlong Wang, Huichi Zhou, Yiqiang Li, Tianshuo Zhou, Yue Yu, Chujie Gao, Qihui Zhang, Yi Gui, Zhen Li, Yao Wan, Pan Zhou, Jianfeng Gao, Lichao Sun, 16 Jun 2024, GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents, https://arxiv.org/abs/2406.10819 https://gui-world.github.io/
Kristian Kolthoff, Felix Kretzer, Christian Bartelt, Alexander Maedche, Simone Paolo Ponzetto, 12 Jun 2024, Interlinking User Stories and GUI Prototyping: A Semi-Automatic LLM-based Approach, https://arxiv.org/abs/2406.08120
Abdur Rahman, Rajat Chawla, Muskaan Kumar, Arkajit Datta, Adarsh Jha, Mukunda NS, Ishaan Bhola, 21 Jul 2024 (v2), V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM, https://arxiv.org/abs/2405.15341
Danyang Zhang, Zhennan Shen, Rui Xie, Situo Zhang, Tianbao Xie, Zihan Zhao, Siyuan Chen, Lu Chen, Hongshen Xu, Ruisheng Cao, Kai Yu, 13 Jun 2024 (v4), Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction, https://arxiv.org/abs/2305.08144
Quanfeng Lu, Wenqi Shao, Zitao Liu, Fanqing Meng, Boxuan Li, Botong Chen, Siyuan Huang, Kaipeng Zhang, Yu Qiao, Ping Luo, 12 Jun 2024, GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices, https://arxiv.org/abs/2406.08451 https://github.com/OpenGVLab/GUI-Odyssey
Shengcheng Yu, Chunrong Fang, Ziyuan Tuo, Quanjun Zhang, Chunyang Chen, Zhenyu Chen, Zhendong Su, 20 Oct 2023, Vision-Based Mobile App GUI Testing: A Survey, https://arxiv.org/abs/2310.13518
Jieshan Chen, Chunyang Chen, Zhenchang Xing, Xiwei Xu, Liming Zhu, Guoqiang Li, Jinshui Wang, 2 Jul 2020 (v2), Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning, https://arxiv.org/abs/2003.00380
Carlos Bernal-Cardenas, Kevin Moran, Michele Tufano, Zichang Liu, Linyong Nan, Zhehan Shi, Denys Poshyvanyk, 3 Jan 2019, Guigle: A GUI Search Engine for Android Apps, https://arxiv.org/abs/1901.00891
Yijie Guo, Zhenhan Huang, Ruhan Wang, Zhihao Yao, Tianyu Yu, Zhiling Xu, Xinyu Zhao, Xueqing Li, Haipeng Mi, 24 Jul 2024, AI-Gadget Kit: Integrating Swarm User Interfaces with LLM-driven Agents for Rich Tabletop Game Applications, https://arxiv.org/abs/2407.17086
Harry Li, Gabriel Appleby, Ashley Suh, 7 Jun 2024, LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering, https://arxiv.org/abs/2406.06621
William Seymour, Emilee Rader, 23 May 2024, Speculating About Multi-user Conversational Interfaces and LLMs: What If Chatting Wasn't So Lonely? https://arxiv.org/abs/2405.14390
Daniel Chin, Yuxuan Wang, Gus Xia, 19 May 2024, Human-Centered LLM-Agent User Interface: A Position Paper, https://arxiv.org/abs/2405.13050
Yaroslav Zharov, Yury Khudyakov, Evgeniia Fedotova, Evgeny Grigorenko, Egor Bogomolov, 18 Feb 2024, Tool-Augmented LLMs as a Universal Interface for IDEs, https://arxiv.org/abs/2402.11635
Syed Mekael Wasti, Ken Q. Pu, Ali Neshati, 16 Apr 2024 (v2), Large Language User Interfaces: Voice Interactive User Interfaces powered by LLMs, https://arxiv.org/abs/2402.07938
Qirui Huang, Min Lu, Joel Lanir, Dani Lischinski, Daniel Cohen-Or, Hui Huang, 24 Jan 2024, GraphiMind: LLM-centric Interface for Information Graphics Design, https://arxiv.org/abs/2401.13245
Yue Jiang, Changkong Zhou, Vikas Garg, Antti Oulasvirta, 21 Apr 2024, Graph4GUI: Graph Neural Networks for Representing Graphical User Interfaces, https://arxiv.org/abs/2404.13521
Daniel Buschek, 27 May 2024, Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools, https://arxiv.org/abs/2405.17217
Abdallah Namoun, Ahmed Alrehaili, Zaib Un Nisa, Hani Almoamari, Ali Tufail, 5 May 2024, Predicting the usability of mobile applications using AI tools: the rise of large user interface models, opportunities, and challenges, https://arxiv.org/abs/2405.03716
Zijian Ding, 2 May 2024 (v2), Towards Intent-based User Interfaces: Charting the Design Space of Intent-AI Interactions Across Task Types, https://arxiv.org/abs/2404.18196
Patrick Ebel, 16 Feb 2024, Generative AI and Attentive User Interfaces: Five Strategies to Enhance Take-Over Quality in Automated Driving, https://arxiv.org/abs/2402.10664
Advait Sarkar, 1 Nov 2023, Will Code Remain a Relevant User Interface for End-User Programming with Generative AI Models? https://arxiv.org/abs/2311.00382
Alex Renda, Harrison Goldstein, Sarah Bird, Chris Quirk, Adrian Sampson, 14 Sep 2017, Abstractions for AI-Based User Interfaces and Systems, https://arxiv.org/abs/1709.04991
Thomas Mildner, Orla Cooney, Anna-Maria Meck, Marion Bartl, Gian-Luca Savino, Philip R. Doyle, Diego Garaialde, Leigh Clark, John Sloan, Nina Wenig, Rainer Malaka, Jasmin Niess, 26 Jan 2024, Listening to the Voices: Describing Ethical Caveats of Conversational User Interfaces According to Experts and Frequent Users, Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11--16, 2024, Honolulu, HI, USA, https://arxiv.org/abs/2401.14746 https://doi.org/https://doi.org/10.1145/3613904.3642542
Andreas Liesenfeld, Alianda Lopez, Mark Dingemanse, 28 Jul 2023, The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems, https://arxiv.org/abs/2307.15493
William Seymour, Xiao Zhan, Mark Cote, Jose Such, 8 Jun 2023, Who are CUIs Really For? Representation and Accessibility in the Conversational User Interface Literature, https://arxiv.org/abs/2306.05228
Open WebUI, 2024, Open WebUI (Formerly Ollama WebUI), https://github.com/open-webui/open-webui
Xhoni Shollaj, 2024, Awesome LLM WebUIs, https://github.com/JShollaj/Awesome-LLM-Web-UI
Sujeet Kumar, May 20, 2024, 14 Best Software for Running local LLM, https://scifilogic.com/interface-for-running-local-llm/
Mauro Sicard, Miguel Joya, LanguageGUI is the UI Kit for LLMs, 2024, https://languagegui.com/
Reddit, 2024, New Open Source Framework and No-Code GUI for Fine-Tuning LLMs: H2O LLM Studio, https://www.reddit.com/r/LocalLLaMA/comments/12yc8op/new_open_source_framework_and_nocode_gui_for/
LLM-UI, 2024, The React library for LLMs, https://llm-ui.com/
Reddit, 2024, LLM Web-UI recommendations, https://www.reddit.com/r/LocalLLaMA/comments/1847qt6/llm_webui_recommendations/
Yuzhe Cai, Shaoguang Mao, Wenshan Wu, Zehua Wang, Yaobo Liang, Tao Ge, Chenfei Wu, Wang You, Ting Song, Yan Xia, Jonathan Tien, Nan Duan, Furu Wei, 1 Apr 2024 (v3), Low-code LLM: Graphical User Interface over Large Language Models, https://arxiv.org/abs/2304.08103 https://github.com/chenfei-wu/TaskMatrix/tree/main/LowCodeLLM https://www.youtube.com/watch?v=jb2C1vaeO3E
Ramalingame, Hari, May 2024, Deployable Web GUI for LLM Applications, Thesis, Arizona State University, https://keep.lib.asu.edu/items/192554
by Jarrett Yeo and Tammy Lim , 12 DEC 2023, Create a web UI to interact with LLMs using Amazon SageMaker JumpStart, https://aws.amazon.com/blogs/machine-learning/create-a-web-ui-to-interact-with-llms-using-amazon-sagemaker-jumpstart/
Difei Gao, Lei Ji, Zechen Bai, Mingyu Ouyang, Peiran Li, Dongxing Mao, Qinchen Wu, Weichen Zhang, Peiyi Wang, Xiangwu Guo, Hengxu Wang, Luowei Zhou, Mike Zheng Shou; 2024, AssistGUI: Task-Oriented PC Graphical User Interface Automation, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 13289-13298, https://openaccess.thecvf.com/content/CVPR2024/html/Gao_AssistGUI_Task-Oriented_PC_Graphical_User_Interface_Automation_CVPR_2024_paper.html https://openaccess.thecvf.com/content/CVPR2024/papers/Gao_AssistGUI_Task-Oriented_PC_Graphical_User_Interface_Automation_CVPR_2024_paper.pdf https://openaccess.thecvf.com/content/CVPR2024/supplemental/Gao_AssistGUI_Task-Oriented_PC_CVPR_2024_supplemental.pdf
Jie Gao, Simret Araya Gebreegziabher, Kenny Tsu Wei Choo, Toby Jia-Jun Li, Simon Tangi Perrault, Thomas W. Malone, 30 Mar 2024, A Taxonomy for Human-LLM Interaction Modes: An Initial Exploration, https://arxiv.org/abs/2404.00405 https://dl.acm.org/doi/abs/10.1145/3613905.3650786
Prakash Joshi Pax, Aug 26, 2024, Fabric: The Best AI Tool That Nobody is Talking About. An open-source AI tool to automate every day tasks https://beingpax.medium.com/why-fabric-ai-can-change-the-way-you-use-ai-973e725354da
Nick: The AI Guru, Aug 15, 2024, Why Perplexity AI Has Been a Game Changer For Me, https://medium.com/@nickm9/why-perplexity-ai-has-been-a-game-changer-for-me-b38976bdc1b4
Michal Malewicz, Sep 3, 2024, Ugly websites sell better. Web design is getting out of hand again. https://michalmalewicz.medium.com/ugly-websites-sell-better-0b0354ebff10
Yicheng Fu, Raviteja Anantha, Prabal Vashisht, Jianpeng Cheng, Etai Littwin, 6 Sep 2024, UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity, https://www.arxiv.org/abs/2409.04081
Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
Simon Willison, Sep 2024, How streaming LLM APIs work, https://til.simonwillison.net/llms/streaming-llm-apis
Ellie Ko, Sep 25, 2024, A Survey of Python Frameworks, https://ploomber.io/blog/survey-python-frameworks/
Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
Mareike Hartmann, Alexander Koller, 27 Sep 2024, A Survey on Complex Tasks for Goal-Directed Interactive Agents, https://arxiv.org/abs/2409.18538 https://coli-saar.github.io/interactive-agents
Emilia David, October 3, 2024, OpenAI launches ChatGPT Canvas, challenging Claude Artifacts, https://venturebeat.com/ai/openai-launches-chatgpt-canvas-challenging-claude-artifacts/
Sabrina Ortiz, Oct. 7, 2024, I test ChatGPT features for a living, and this new one really did supercharge my productivity. If you use OpenAI's generative AI tool to co-edit code or text, Canvas will take your work to a whole new level, https://www.zdnet.com/article/i-test-chatgpt-features-for-a-living-and-this-new-one-really-did-supercharge-my-productivity/
Emilia David, October 17, 2024, Google launches NotebookLM Business to make enterprise AI audio, text, https://venturebeat.com/ai/googles-notebooklm-will-expand-to-business-use-cases-soon/
David Gewirtz, Oct. 25, 2024, I wrote half this article on Apple Watch, thanks to this under-the-radar iOS 18 feature: Here's how to transform your writing workflow and turn your Apple Watch into a productivity powerhouse, https://www.zdnet.com/article/i-wrote-half-this-article-on-apple-watch-thanks-to-this-under-the-radar-ios-18-feature/
LangChain, Jul 26, 2024, UX for Agents, Part 1: Chat, https://blog.langchain.dev/ux-for-agents-part-1-chat-2/
LangChain, Aug 2, 2024, UX for Agents, Part 2: Ambient, https://blog.langchain.dev/ux-for-agents-part-2-ambient/
LangChain, Aug 10, 2024, UX for Agents, Part 3: Spreadsheet, Generative, and Collaborative UI/UX, https://blog.langchain.dev/ux-for-agents-part-3/
Lance Whitney, Oct. 30, 2024, Apple Watch lets you translate your conversations in real-time. Here's how: WatchOS 11's Translate app lets you have a live conversation in two languages with another person - right from your wrist, https://www.zdnet.com/article/apple-watch-lets-you-translate-your-conversations-in-real-time-heres-how/
Julia Winn, Oct 2024, The AI Productivity Paradox: Why Aren’t More Workers Using ChatGPT? The real barrier isn’t technical skills — it’s time to think. https://towardsdatascience.com/the-ai-productivity-paradox-why-arent-more-workers-using-chatgpt-a1dfe96a9460
Lance Whitney, Oct. 31, 2024, Claude AI adds desktop apps and dictation mode – here's how to use them, https://www.zdnet.com/article/claude-ai-adds-desktop-apps-and-dictation-mode-heres-how-to-use-them/
K. Balázs Neszlényi, A. Milos and A. Kiss, "AssistantGPT: Enhancing User Interaction with LLM Integration," 2024 IEEE 22nd Jubilee International Symposium on Intelligent Systems and Informatics (SISY), Pula, Croatia, 2024, pp. 000619-000624, doi: 10.1109/SISY62279.2024.10737548. https://ieeexplore.ieee.org/abstract/document/10737548
OpenAI, October 3, 2024, Introducing canvas: A new way of working with ChatGPT to write and code, https://openai.com/index/introducing-canvas/
Emilia David, November 14, 2024, OpenAI launches ChatGPT desktop integrations, rivaling Copilot, https://venturebeat.com/ai/openai-launches-chatgpt-desktop-integrations-rivaling-copilot/
swyx, Sep 2024, What Works in AI UX (lightning talk + Q&A), https://www.youtube.com/watch?v=PkHjoihjo6U
swyx & Alessio, Maggie Appleton, Linus Lee, and Geoffrey Litt, Apr 27, 2023, It's Time To Build AI | UX. Bridging the Capability Overhang from Generative AI to Generative UI, https://www.latent.space/p/build-ai-ux
Akash Bajwa, Nov 18, 2024, Opinionated AI Products: Strong Technologies Forms Beliefs, https://akashbajwa.substack.com/p/opinionated-ai-products
Jared Spataro, November 19, 2024, Introducing Copilot Actions, new agents, and tools to empower IT teams, https://www.microsoft.com/en-us/microsoft-365/blog/2024/11/19/introducing-copilot-actions-new-agents-and-tools-to-empower-it-teams/ ("Copilot is the UI for AI")
Tiernan Ray, Nov. 21, 2024 , Even Nvidia's CEO is obsessed with Google's NotebookLM AI tool, https://www.zdnet.com/article/even-nvidias-ceo-is-obsessed-with-googles-notebooklm-ai-tool/
Ethan Mollick, Nov 24, 2024, Getting started with AI: Good enough prompting. Don't make this hard. https://www.oneusefulthing.org/p/getting-started-with-ai-good-enough
Charlie Guo, Nov 15, 2024, The Chatbot Trap. Why AI products really need some better UX. https://www.ignorance.ai/p/the-chatbot-trap
Christian Swinehart, Dec 2024, Skia-Canvas: A GPU-accelerated 2D graphics environment for Node.js, https://github.com/samizdatco/skia-canvas
Charles Rollet, December 4, 2024, Key leaders behind Google’s viral NotebookLM are leaving to create their own startup, https://techcrunch.com/2024/12/04/key-leaders-behind-googles-viral-notebooklm-are-leaving-to-create-their-own-startup/ ("As the frontier models and their capabilities continue to grow, thoughtful products are required to make the benefits of this technology accessible, useful, and obvious to everyday people — so our team is going to be focused on building a user-first AI product...the team wanted to create something that leverages the latest AI models to build something useful to regular people.")
Ian Drosos, Jack Williams, Advait Sarkar, Nicholas Wilson, 3 Dec 2024, Dynamic Prompt Middleware: Contextual Prompt Refinement Controls for Comprehension Tasks, https://arxiv.org/abs/2412.02357
Emilia David, December 10, 2024, OpenAI expands ChatGPT Canvas to all users, https://venturebeat.com/ai/openai-expands-chatgpt-canvas-to-all-users/
Anshul Ramachandran, Jul 08, 2023, How to Make AI UX Your Moat. Design great AI Products that go beyond "just LLM Wrappers": make AI more present, more practical, and then more powerful. https://www.latent.space/p/ai-ux-moat
Sabrina Ortiz, Dec. 13, 2024, ChatGPT finally gets easier to organize on the 7th day of OpenAI, https://www.zdnet.com/article/chatgpt-finally-gets-easier-to-organize-on-the-7th-day-of-openai/
Maxwell Zeff, November 20, 2024, Current AI scaling laws are showing diminishing returns, forcing AI labs to change course, https://techcrunch.com/2024/11/20/ai-scaling-laws-are-showing-diminishing-returns-forcing-ai-labs-to-change-course/ ("at least 10 to 20x gains in model performance ...intelligent prompting, UX decisions, and passing context at the right time into the models...")
Google, Dec 2024, Welcome to Project IDX, a new web-based development workspace from Google. IDX is designed to make it faster and easier to build, ship, and manage full-stack, multiplatform apps from the comfort of your browser. https://idx.google.com/
Avi Siegel, Dec 2024, Features shouldn’t feel like features: Why (and how) to craft product experiences that feel inevitable, https://uxdesign.cc/features-shouldnt-feel-like-features-fba44644f961
Kartik Hosanagar, Daehwan Ahn, 14 Dec 2024, Designing Human and Generative AI Collaboration, https://arxiv.org/abs/2412.14199
Google, Dec 2024, multimodal-live-api-web-console: A react-based starter app for using the Multimodal Live API over websockets with Gemini, https://github.com/google-gemini/multimodal-live-api-web-console
Will Whitney, Dec 2024, Computing inside an AI: What would it mean to treat AI as a tool instead of a person? https://willwhitney.com/computing-inside-ai.html
Latent Space, Jan 05, 2025, AI Engineering for Art — with comfyanonymous, of ComfyUI, Using models for "Art Engineering", building hard to use UIs, and how image generation is moving from text boxes to DAGs https://www.latent.space/p/comfyui
comfyanonymous, Jan 2025, ComfyUI: The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface, https://github.com/comfyanonymous/ComfyUI
Boqiang Liang, Jan 2025, SaaS is Dead, Says Microsoft CEO, https://medium.com/@lbq999/saas-is-dead-says-microsoft-ceo-a8cff2a516c4
Ori Ziv, Jan 2025, How AI Agents Will Disrupt SaaS in 2025, https://medium.com/@oriziv4/how-ai-agents-will-disrupt-saas-in-2025-7567d793ca68
Johan Uddståhl, Jan 2, 2025, …when all we ever needed was a text box, or how 2025 will be back to basics for the web, https://medium.com/@baktakt/when-all-we-ever-needed-was-a-text-box-c672c52a0dca
Tari Ibaba, Jan 2025, AI is killing apps, https://medium.com/coding-beauty/ai-is-killing-apps-868a7b59fafe
James Currier, Jan 2025, Consumer is Back – And Why It’s Been So Hard Since 2014, https://www.nfx.com/post/consumer-is-back
Akash Bajwa, Feb 03, 2025, Forward Deployed Engineers: A Means To An End For AI Startups: Capturing Business Logic And Expert Reasoning, https://akashbajwa.substack.com/p/forward-deployed-engineers-a-means (" AI truly is a new way of computing, and that means the better analogies are to computing itself. Transformers are the transistor, and mainframes are today’s models. The GUI is, arguably, still TBD.")
Olivia Moore, Feb 20025, AI Voice Agent Update - 2025, A16Z, https://a16z.com/ai-voice-agents-2025-update/ https://gamma.app/docs/a16z-AI-Voice-Update-2025--ttkorld8iy6wfnj?mode=doc (Thesis that voice will be the primary AI interface for consumers.)
Sharon Goldman, December 13, 2023, Lightning AI debuts ‘iPhone approach’ to new AI dev platform, https://venturebeat.com/ai/lightning-ai-debuts-iphone-approach-to-new-ai-dev-platform/
Daniel Delaney, Feb 2025, Chat is a bad UI pattern for development tools, https://danieldelaney.net/chat/
Jack Wallen, Feb. 6, 2025, I tried to replace my desktop with a phone for work - 5 frustrating lessons I learned As phones continue to win the consumer war against desktops and laptops, those who swear by our PCs will never give in to the lure of mobile-only. Here's why. https://www.zdnet.com/article/i-tried-to-replace-my-desktop-with-a-phone-for-work-5-frustrating-lessons-i-learned/
Alexander Deplov, Feb 12, 2025, How I Automated My Computer Routine With macOS Folder Actions, https://interfacecraft.online/posts/blog/2025/how-i-automated-my-computer-life-with-macos-folder-actions/
M.G. Siegler, Feb 14, 2025, The Great AI UI Unification. ChatGPT starts cleaning up the cruft..., https://spyglass.org/chatgpt-ai-ui/
Rex Woodbury, Feb 20, 2025, How Consumer Psychology Informs AI Product Design: The IKEA Effect, the Paradox of Choice, and AI's Interface Problem, https://www.digitalnative.tech/p/how-consumer-psychology-informs-ai
Yaakov Carno, Feb 24, 2025, The surprising patterns behind viral AI products: A deep dive into Bolt, Cursor, Granola, PhotoRoom, Replit and more, https://open.substack.com/pub/kylepoyar/p/ai-ux-patterns (The "surprising pattern" in successful AI products is that they all have a slick UI.)
Tetiana Sydorenko, Feb 2025, AI is reshaping UI — have you noticed the biggest change yet? https://uxdesign.cc/ai-is-reshaping-ui-have-you-noticed-the-biggest-change-yet-ee80efcbf8a5
Andrew Zuo, March 2025, Developers Are Keeping The Best AI Interface To Themselves, https://andrewzuo.com/developers-are-keeping-the-best-ai-interface-to-themselves-f558261ee109
Leixian Shen, Haotian Li, Yifang Wang, Xing Xie, Huamin Qu, 4 Mar 2025, Prompting Generative AI with Interaction-Augmented Instructions, https://arxiv.org/abs/2503.02874
Dom Couldwell, Mar 10, 2025, Building generative AI? Get ready for generative UI,https://www.infoworld.com/article/3834886/building-generative-ai-get-ready-for-generative-ui.html
Anthropic, Mar 2025, Text editor tool, https://docs.anthropic.com/en/docs/build-with-claude/tool-use/text-editor-tool
Dave Citron, Mar 18, 2025, New ways to collaborate and get creative with Gemini: Explore Gemini's latest features: Canvas, a new interactive space for refining your documents and code and Audio Overview, which transform your files into engaging podcast-style discussions, Google Blog, https://blog.google/products/gemini/gemini-collaboration-features/
RM Amin, OH Kühle, D Buschek, A Butz, 2025, Composable Prompting Workspaces for Creative Writing: Exploration and Iteration Using Dynamic Widgets, https://www.medien.ifi.lmu.de/pubdb/publications/pub/amin2025chi/amin2025chi.pdf
Nikunj Kothari. Mar 21, 2025, Beyond Chat: The New Patterns of AI Interfaces,https://writing.nikunjk.com/p/beyond-chat

Workflow

Research paper on workflow interfaces for AI applications:

Microsoft, Dec 2024, Power Automate: A comprehensive, end-to-end cloud automation platform powered by low code and AI. https://www.microsoft.com/en-us/power-platform/products/power-automate
Orlando Marquez Ayala, Patrice Béchard, 29 Nov 2024, Generating a Low-code Complete Workflow via Task Decomposition and RAG, https://arxiv.org/abs/2412.00239
Laura Minkova, Jessica López Espejel, Taki Eddine Toufik Djaidja, Walid Dahhane, El Hassane Ettifouri, 4 Dec 2024, From Words to Workflows: Automating Business Processes, https://arxiv.org/abs/2412.03446
Isaac Sacolick, Jul 29, 2024, How to choose the right low-code, no-code, or process automation platform, https://www.infoworld.com/article/3476848/how-to-choose-the-right-low-code-no-code-or-process-automation-platform.html
Wes Brewer, Ana Gainaru, Frédéric Suter, Feiyi Wang, Murali Emani, Shantenu Jha, 20 Jun 2024, AI-coupled HPC Workflow Applications, Middleware and Performance, (Examines integrations of various workflows into LLMs.) https://arxiv.org/abs/2406.14315
Vishal Rajput, Apr 11, 2024, What’s next for AI: AI agentic workflows? https://medium.com/aiguys/next-for-llms-and-rag-ai-agentic-workflows-1869ba0a6796
Ben Sherry, August 15, 2024, The 3 Top AI Use Cases, According to Inc.5000 CEOs, https://www.inc-aus.com/ben-sherry/3-ways-inc-5000-companies-are-using-ai.html (Workflow automation, content creation, and "marketing" are the three use cases at over 50% penetration for businesses using AI.)
Lakshmi narayana .U, Jul 28, 2024, STORM: Stanford’s Revolutionary Research Tool Harnessing the Power of Agents and Agentic Workflows, https://blog.stackademic.com/storm-stanfords-revolutionary-research-tool-harnessing-the-power-of-agents-and-agentic-workflows-a2fa0e1a7fe3
Hao Wu, Yue Yu, and Junxiao Deng, Shadi Ibrahim, Inria; Song Wu and Hao Fan, Ziyue Cheng, Hai Jin, Huazhong, 2024, StreamBox: A Lightweight GPU SandBox for Serverless Inference Workflow, Usenix 2024, https://www.usenix.org/conference/atc24/presentation/wu-hao PDF: https://www.usenix.org/system/files/atc24-wu-hao.pdf
Ozgur Ozan Kilic, Tianle Wang, Matteo Turilli, Mikhail Titov, Andre Merzky, Line Pouchard, Shantenu Jha, 26 Mar 2024, Workflow Mini-Apps: Portable, Scalable, Tunable & Faithful Representations of Scientific Workflows, https://arxiv.org/abs/2403.18073
Zelong Li, Shuyuan Xu, Kai Mei, Wenyue Hua, Balaji Rama, Om Raheja, Hao Wang, He Zhu, Yongfeng Zhang, 1 Jul 2024, AutoFlow: Automated Workflow Generation for Large Language Model Agents, https://arxiv.org/abs/2407.12821 https://github.com/agiresearch/AutoFlow
Lukas Teufelberger, Xintong Liu, Zhipeng Li, Max Moebus, Christian Holz, 31 Jul 2024, LLM-for-X: Application-agnostic Integration of Large Language Models to Support Personal Writing Workflows, https://arxiv.org/abs/2407.21593
Emilia David, September 10, 2024, ServiceNow introduces a library of enterprise AI agents you can customize to fit your workflow, https://venturebeat.com/ai/servicenow-introduces-a-library-of-enterprise-ai-agents-you-can-customize-to-fit-your-workflow/
Rinon Gal, Adi Haviv, Yuval Alaluf, Amit H. Bermano, Daniel Cohen-Or, Gal Chechik, 2 Oct 2024, ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation, https://arxiv.org/abs/2410.01731 https://comfygen-paper.github.io/
David Gewirtz, Oct. 25, 2024, I wrote half this article on Apple Watch, thanks to this under-the-radar iOS 18 feature: Here's how to transform your writing workflow and turn your Apple Watch into a productivity powerhouse, https://www.zdnet.com/article/i-wrote-half-this-article-on-apple-watch-thanks-to-this-under-the-radar-ios-18-feature/
Arun Shankar, Oct 2024, Designing Cognitive Architectures: Agentic Workflow Patterns from Scratch, https://medium.com/google-cloud/designing-cognitive-architectures-agentic-workflow-patterns-from-scratch-63baa74c54bc
AI Agent Workflows: A Complete Guide on Whether to Build With LangGraph or LangChain, Sandi Besen, Oct 2024, https://towardsdatascience.com/ai-agent-workflows-a-complete-guide-on-whether-to-build-with-langgraph-or-langchain-117025509fa0
Anita Kirkovska, David Vargas, Jul 11, 2024, Agentic Workflows in 2024: The ultimate guide, https://www.vellum.ai/blog/agentic-workflows-emerging-architectures-and-design-patterns
Shuofei Qiao, Runnan Fang, Zhisong Qiu, Xiaobin Wang, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen, 10 Oct 2024, Benchmarking Agentic Workflow Generation, https://arxiv.org/abs/2410.07869
A. Singh, A. Ehtesham, S. Kumar and T. T. Khoei, "Enhancing AI Systems with Agentic Workflows Patterns in Large Language Model," 2024 IEEE World AI IoT Congress (AIIoT), Seattle, WA, USA, 2024, pp. 527-532, doi: 10.1109/AIIoT61789.2024.10578990. https://ieeexplore.ieee.org/abstract/document/10578990
Chawla, Chhavi; Chatterjee, Siddharth; Gadadinni, Sanketh Siddanna; Verma, Pulkit; Banerjee, Sourav, 2024, Agentic AI: The building blocks of sophisticated AI business applications, Journal of AI, Robotics & Workplace Automation, Volume 3 / Number 3 / Summer 2024, pp. 1-15(15), Henry Stewart Publications, DOI: https://doi.org/10.69554/XEHZ1946 https://www.ingentaconnect.com/content/hsp/airwa/2024/00000003/00000003/art00001
Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu, Fengwei Teng, Xionghui Chen, Jiaqi Chen, Mingchen Zhuge, Xin Cheng, Sirui Hong, Jinlin Wang, Bingnan Zheng, Bang Liu, Yuyu Luo, Chenglin Wu, 14 Oct 2024, AFlow: Automating Agentic Workflow Generation, https://arxiv.org/abs/2410.10762 https://github.com/geekan/MetaGPT
Amy Nichol Smith, Lauren Holznienkemper, Aug 25, 2024, Best Workflow Apps, https://www.forbes.com/advisor/business/software/best-workflow-app/
Kyle Wiggers, March 17, 2025, OpenAI to start testing ChatGPT connectors for Google Drive and Slack, https://techcrunch.com/2025/03/17/openai-to-start-testing-chatgpt-connectors-for-google-drive-and-slack/

Consoles

Anthropic, 21 May 2024, Generate better prompts in the developer console, https://www.anthropic.com/news/prompt-generator
Michael Nuñez, September 10, 2024, Is Anthropic’s new ‘Workspaces’ feature the future of enterprise AI management? https://venturebeat.com/ai/is-anthropics-new-workspaces-feature-the-future-of-enterprise-ai-management/
Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
Ellie Ko, Sep 25, 2024, A Survey of Python Frameworks, https://ploomber.io/blog/survey-python-frameworks/
Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
Emilia David, October 3, 2024, OpenAI launches ChatGPT Canvas, challenging Claude Artifacts, https://venturebeat.com/ai/openai-launches-chatgpt-canvas-challenging-claude-artifacts/
Sabrina Ortiz, Oct. 7, 2024, I test ChatGPT features for a living, and this new one really did supercharge my productivity. If you use OpenAI's generative AI tool to co-edit code or text, Canvas will take your work to a whole new level, https://www.zdnet.com/article/i-test-chatgpt-features-for-a-living-and-this-new-one-really-did-supercharge-my-productivity/
Emilia David, October 17, 2024, Google launches NotebookLM Business to make enterprise AI audio, text, https://venturebeat.com/ai/googles-notebooklm-will-expand-to-business-use-cases-soon/
Jason Perlow, Nov. 8, 2024, How to manage Bluesky, Mastodon, and Threads all from one free app Openvibe simplifies social media management with unified timelines, cross-posting, and customizable feeds for easier navigation of the digital landscape. Here's why you should try it. https://www.zdnet.com/article/how-to-manage-bluesky-mastodon-and-threads-all-from-one-free-app/
OpenAI, October 3, 2024, Introducing canvas: A new way of working with ChatGPT to write and code, https://openai.com/index/introducing-canvas/
swyx & Alessio, Maggie Appleton, Linus Lee, and Geoffrey Litt, Apr 27, 2023, It's Time To Build AI | UX. Bridging the Capability Overhang from Generative AI to Generative UI, https://www.latent.space/p/build-ai-ux
Jared Spataro, November 19, 2024, Introducing Copilot Actions, new agents, and tools to empower IT teams, https://www.microsoft.com/en-us/microsoft-365/blog/2024/11/19/introducing-copilot-actions-new-agents-and-tools-to-empower-it-teams/ ("Copilot is the UI for AI")
Microsoft, Dec 2024, Power Automate: A comprehensive, end-to-end cloud automation platform powered by low code and AI. https://www.microsoft.com/en-us/power-platform/products/power-automate
Emilia David, December 10, 2024, OpenAI expands ChatGPT Canvas to all users, https://venturebeat.com/ai/openai-expands-chatgpt-canvas-to-all-users/
Anshul Ramachandran, Jul 08, 2023, How to Make AI UX Your Moat. Design great AI Products that go beyond "just LLM Wrappers": make AI more present, more practical, and then more powerful. https://www.latent.space/p/ai-ux-moat
Sabrina Ortiz, Dec. 13, 2024, ChatGPT finally gets easier to organize on the 7th day of OpenAI, https://www.zdnet.com/article/chatgpt-finally-gets-easier-to-organize-on-the-7th-day-of-openai/
Google, Dec 2024, multimodal-live-api-web-console: A react-based starter app for using the Multimodal Live API over websockets with Gemini, https://github.com/google-gemini/multimodal-live-api-web-console
Latent Space, Jan 05, 2025, AI Engineering for Art — with comfyanonymous, of ComfyUI, Using models for "Art Engineering", building hard to use UIs, and how image generation is moving from text boxes to DAGs https://www.latent.space/p/comfyui
comfyanonymous, Jan 2025, ComfyUI: The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface, https://github.com/comfyanonymous/ComfyUI
Sharon Goldman, December 13, 2023, Lightning AI debuts ‘iPhone approach’ to new AI dev platform, https://venturebeat.com/ai/lightning-ai-debuts-iphone-approach-to-new-ai-dev-platform/
Anthropic, 7 Mar 2025, Get to production faster with the upgraded Anthropic Console, https://www.anthropic.com/news/upgraded-anthropic-console

Declarative Programming

Declarative programming is the method of creating apps by defining what to do, rather than how to do it. The language to define a declarative app is more like a configuration file, rather than a procedural programming language like C++.

Research on declarative programming issues:

S Madden, M Cafarella, M Franklin, T Kraska, 2024, Databases Unbound: Querying All of the World’s Bytes with AI, https://www.vldb.org/pvldb/vol17/p4546-madden.pdf
Mandana Vaziri, Louis Mandel, Claudio Spiess, Martin Hirzel, 24 Oct 2024, PDL: A Declarative Prompt Programming Language, https://arxiv.org/abs/2410.19135
Manpreet Singh, Oct 31, 2024, Let's Simplifying How We Talk to AI Using Prompt Declaration Language (PDL), https://pub.towardsai.net/lets-simplifying-how-we-talk-to-ai-using-prompt-declaration-language-pdl-b1824c4de833
Saksham Goel, October 29, 2024, Build LLM/RAG pipelines with YAML templates by Pathway, https://pathway.com/blog/llm-yaml-templates
Ignacio de Gregorio Noblejas, December 15, 2024, The AI Trillion-Dollar Product, https://thewhitebox.beehiiv.com/p/the-ai-trillion-dollar-product
C Liu, M Russo, M Cafarella, L Cao, PB Chen, Z Chen, Jan 2025, Palimpzest: Optimizing AI-Powered Analytics with Declarative Query Processing, https://vldb.org/cidrdb/papers/2025/p12-liu.pdf

Script Languages

L. Zheng, L. Yin, Z. Xie, J. Huang, C. Sun, C. H. Yu, S. Cao, C. Kozyrakis, I. Stoica, J. E. Gonzalez et al., Dec 2023, Efficiently programming large language models using SGLang, arXiv preprint arXiv:2312.07104, 2023, https://arxiv.org/abs/2312.07104 (Uses a radix attention method, a trie or prefix tree, for KV caching.)
Hongzheng Chen, Niansong Zhang, Shaojie Xiang, Zhichen Zeng, Mengjia Dai, Zhiru Zhang, 7 Apr 2024, Allo: A Programming Model for Composable Accelerator Design, https://arxiv.org/abs/2404.04815
Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, Christopher Potts, 5 Oct 2023, DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines, https://arxiv.org/abs/2310.03714 Code: https://github.com/stanfordnlp/dspy
Honghua Dong, Qidong Su, Yubo Gao, Zhaoyu Li, Yangjun Ruan, Gennady Pekhimenko, Chris J. Maddison, Xujie Si, 19 Jun 2024, APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts, https://arxiv.org/abs/2406.13161 Code: https://github.com/appl-team/appl (A Python-like script language for prompt engineering integration into applications and agents.)
Till Döhmen, 2024/10/17, Introducing the prompt() Function: Use the Power of LLMs with SQL! https://motherduck.com/blog/sql-llm-prompt-function-gpt-models/
Mandana Vaziri, Louis Mandel, Claudio Spiess, Martin Hirzel, 24 Oct 2024, PDL: A Declarative Prompt Programming Language, https://arxiv.org/abs/2410.19135
Saksham Goel, October 29, 2024, Build LLM/RAG pipelines with YAML templates by Pathway, https://pathway.com/blog/llm-yaml-templates
Yuka Ikarashi, Kevin Qian, Samir Droubi, Alex Reinking, Gilbert Bernstein, Jonathan Ragan-Kelley, 14 Nov 2024 (v2), Exo 2: Growing a Scheduling Language, https://arxiv.org/abs/2411.07211

API Architectures

Kyle Wiggers, September 16, 2024, Runway announces an API for its video-generating AI models, https://techcrunch.com/2024/09/16/runway-announces-an-api-for-its-video-generating-models/
Mistral, Sep 2024, AI in abundance. Introducing a free API, improved pricing across the board, a new enterprise-grade Mistral Small, and free vision capabilities on le Chat. https://mistral.ai/news/september-24-release/
Luma Labs, Sep 2024, Creative Intelligence platform for magical AI products, https://lumalabs.ai/dream-machine/api (API to access video models.)
Simon Willison, Sep 2024, How streaming LLM APIs work, https://til.simonwillison.net/llms/streaming-llm-apis
Carl Franzen, September 27, Cohere updates APIs to make it easier for devs to switch from other models, https://venturebeat.com/ai/cohere-updates-apis-to-make-it-easier-for-devs-to-switch-from-other-models/
Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
Michael Nuñez, September 25, 2024, AI for all: Meta’s ‘Llama Stack’ promises to simplify enterprise adoption, https://venturebeat.com/ai/ai-for-all-meta-llama-stack-promises-to-simplify-enterprise-ai-adoption/
Kyle Wiggers, October 3, 2024, Black Forest Labs, the startup behind Grok’s image generator, releases an API, https://techcrunch.com/2024/10/03/black-forest-labs-the-startup-behind-groks-image-generator-releases-an-api/
Kyle Wiggers, October 21, 2024, xAI, Elon Musk’s AI startup, launches an API, https://techcrunch.com/2024/10/21/xai-elon-musks-ai-startup-launches-an-api/
X AI, November 4, 2024 API Public Beta, https://x.ai/blog/api
Gemini is now accessible from the OpenAI Library NOV 08, 2024 Logan Kilpatrick, https://developers.googleblog.com/en/gemini-is-now-accessible-from-the-openai-library/
Kwindla Hultman Kramer and swyx & Alessio, Nov 22, 2024, OpenAI Realtime API: The Missing Manual, Latent Space, https://www.latent.space/p/realtime-api
Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su, 22 Feb 2024, Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments, https://arxiv.org/abs/2402.14672
Andrew Ng, Nov 2024, Simple, unified interface to multiple Generative AI providers, https://github.com/andrewyng/aisuite
Asif Razzaq, November 29, 2024, Andrew Ng’s Team Releases ‘aisuite’: A New Open Source Python Library for Generative AI, https://www.marktechpost.com/2024/11/29/andrew-ngs-team-releases-aisuite-a-new-open-source-python-library-for-generative-ai/
Paul Krill Dec 05, 2024, OpenAI unveils API for tracking OpenAI API usage, costs, https://www.infoworld.com/article/3618202/openai-unveils-api-for-tracking-openai-api-usage-costs.html
Outlore, Dec 14, 2024, Reflections on building with Model Context Protocol (MCP), https://outlore.dev/blog/model-context-protocol/
Google, Dec 2024, multimodal-live-api-web-console: A react-based starter app for using the Multimodal Live API over websockets with Gemini, https://github.com/google-gemini/multimodal-live-api-web-console
Anthropic, 24 Jan 2025, Introducing Citations on the Anthropic API, https://www.anthropic.com/news/introducing-citations-api
OpenVINO™ toolkit, Nov 22, 2024, How to generate images locally on AI PC with OpenVINO GenAI API, https://medium.com/openvino-toolkit/how-to-generate-images-locally-on-ai-pc-with-openvino-genai-api-220d08370958
Anirban Ghoshal, 06 Feb 2025, NetSuite adds new AI capabilities to improve enterprise workflows, https://www.cio.com/article/3818405/netsuite-adds-new-ai-capabilities-to-improve-enterprise-workflows.html
Mandar Karhade, Feb 2025, Tired of LLM Chaos? LiteLLM Should Be Your Default. Stop juggling multiple LLM APIs and their “standards”. https://pub.towardsai.net/tired-of-llm-chaos-litellm-should-be-your-default-e04730b3c33c
Reuters, February 26, 2025, DeepSeek cuts off-peak pricing for developers by up to 75%, https://www.reuters.com/technology/chinas-deepseek-cuts-off-peak-pricing-by-up-75-2025-02-26/
Alex Fazio, Feb 2025, How to Build an LLM Chat App: The New Litmus Test for Junior Devs, https://x.com/alxfazio/status/1893242657331101976 (How to build a wrapper chat app that scales by taking care of message queueing, with RabbitMQ or Kafka API rate limits, history database management, in-memory caching with Redis, load balancing, and other real-world deployment issues.)
Chaoyun Zhang, Shilin He, Liqun Li, Si Qin, Yu Kang, Qingwei Lin, Dongmei Zhang, 14 Mar 2025, API Agents vs. GUI Agents: Divergence and Convergence, https://arxiv.org/abs/2503.11069

Plugins

Reyna Abhyankar, Zijian He, Vikranth Srivatsa, Hao Zhang, Yiying Zhang, 2024, INFERCEPT: Efficient Intercept Support for Augmented Large Language Model Inference, https://openreview.net/pdf?id=wDDGQabYPQ
Zile Qiao, Wei Ye, Yong Jiang, Tong Mo, Pengjun Xie, Weiping Li, Fei Huang, Shikun Zhang, 12 Jun 2024, Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling, https://arxiv.org/abs/2406.08116
Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman, 1 Jun 2022 (v3), WebGPT: Browser-assisted question-answering with human feedback, https://arxiv.org/abs/2112.09332
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
Sreedevi Gogusetty, Dec 6, 2024, From RAG to TAG: Leveraging the Power of Table-Augmented Generation (TAG): A Leap Beyond Retrieval-Augmented Generation (RAG), https://ai.plainenglish.io/from-rag-to-tag-leveraging-the-power-of-table-augmented-generation-tag-a-leap-beyond-54d1cfadb994 (TAG for augmenting LLMs with queries from database tables, similar to data source plugins.)
Kyoungmin Kim, Anastasia Ailamaki, 23 Dec 2024, Trustworthy and Efficient LLMs Meet Databases, https://arxiv.org/abs/2412.18022
Connor Shorten, Charles Pierse, Thomas Benjamin Smith, Karel D'Oosterlinck, Tuana Celik, Erika Cardenas, Leonie Monigatti, Mohd Shukri Hasan, Edward Schmuhl, Daniel Williams, Aravind Kesiraju, Bob van Luijt, 23 Jan 2025, Querying Databases with Function Calling, https://arxiv.org/abs/2502.00032
Dr. Ashish Bamania, Feb 2025, The Open Source “Agentic Reasoning” Beats Google Gemini Deep Research. A deep dive into how the “Agentic Reasoning” framework works and the techniques behind it that make it outperform the most advanced reasoning LLMs today. https://levelup.gitconnected.com/the-open-source-agentic-reasoning-beats-google-gemini-deep-research-8ed8d9d07176
Minhua Lin, Hui Liu, Xianfeng Tang, Jingying Zeng, Zhenwei Dai, Chen Luo, Zheng Li, Xiang Zhang, Qi He, Suhang Wang, 26 Feb 2025 (v2), How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities, https://arxiv.org/abs/2502.18387
Krish Arvapally, Mar 2025, The End of AI Scraping? A Better Way to Unlock Data at the Point of Inference with RAG & MCP, https://medium.com/@arvapallykrish/the-end-of-ai-scraping-a-better-way-to-unlock-data-at-the-point-of-inference-with-rag-mcp-6cbb141a5765
Kyle Wiggers, March 17, 2025, OpenAI to start testing ChatGPT connectors for Google Drive and Slack, https://techcrunch.com/2025/03/17/openai-to-start-testing-chatgpt-connectors-for-google-drive-and-slack/

Custom AI Apps

Gino Zambe, Feb 1, 2024, Was The GPT store a failure? https://medium.com/@ginozambe/was-the-gpt-store-a-failure-d2a2379fdfc1
OpenAI, November 6, 2023 Introducing GPTs, OpenAI Blog, https://openai.com/blog/introducing-gpts
Lance Whitney, June 12, 2024, Microsoft scraps Copilot Pro GPT Builder after just 3 months - how to save your work, https://www.zdnet.com/article/microsoft-scraps-copilot-pro-gpt-builder-after-just-3-months-how-to-save-your-work/
Reuters, July 30, 2024, Meta to let users to create custom AI characters, https://www.reuters.com/technology/artificial-intelligence/meta-let-users-create-custom-ai-characters-2024-07-29/
Lucas Mearian, 27 Aug 2024, BCG execs: AI across the company increased productivity, ‘employee joy’, https://www.computerworld.com/article/3491334/bcg-execs-ai-across-the-company-increased-productivity-employee-joy.html
Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu, 8 May 2024 (v2), Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security, https://arxiv.org/abs/2401.05459 https://github.com/MobileLLM/Personal_LLM_Agents_Survey
Emilia David, August 30, 2024, OpenAI gives developers more control over AI assistants, https://venturebeat.com/ai/openai-gives-developers-more-control-over-ai-assistants/
Henrique Centieiro & Bee Lee, Aug 2024, Build Your Own Money-Making Personal AI Bot: An Easy Step-by-Step Guide to Creating and Monetizing Your Personal AI Bot on Poe, https://medium.com/limitless-investor/build-your-own-money-making-personal-ai-bot-9810e3175699
OpenAI, January 10, 2024, Introducing the GPT Store, https://openai.com/index/introducing-the-gpt-store/
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
Xinyi Hou, Yanjie Zhao, Haoyu Wang, 3 Aug 2024, Voices from the Frontier: A Comprehensive Analysis of the OpenAI Developer Forum, https://arxiv.org/abs/2408.01687
OpenAI, 2024, GPT Builder: What is the GPT Builder for in ChatGPT and why did we make it? https://help.openai.com/en/articles/8770868-gpt-builder
Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, Yong Liu, 12 Aug 2024 (v2), An Empirical Study on Challenges for LLM Developers, https://arxiv.org/abs/2408.05002
Nick: The AI Guru, Aug 15, 2024, Why Perplexity AI Has Been a Game Changer For Me, https://medium.com/@nickm9/why-perplexity-ai-has-been-a-game-changer-for-me-b38976bdc1b4
https://levelup.gitconnected.com/zero-to-hero-crafting-a-custom-gpt-e2ef22653b1f
Tiernan Ray, Sept. 4, 2024, Google's Gems are a gentle introduction to AI prompt engineering: Google's pre-built Gems offer prompt examples you can modify to get started with your own custom bot, https://www.zdnet.com/article/googles-gems-are-a-gentle-introduction-to-ai-prompt-engineering/
Chaojun Xiao, Zhengyan Zhang, Chenyang Song, Dazhi Jiang, Feng Yao, Xu Han, Xiaozhi Wang, Shuo Wang, Yufei Huang, Guanyu Lin, Yingfa Chen, Weilin Zhao, Yuge Tu, Zexuan Zhong, Ao Zhang, Chenglei Si, Khai Hao Moo, Chenyang Zhao, Huimin Chen, Yankai Lin, Zhiyuan Liu, Jingbo Shang, Maosong Sun, Sep 2024, Configurable Foundation Models: Building LLMs from a Modular Perspective, https://arxiv.org/pdf/2409.02877
Emilia David, September 10, 2024, ServiceNow introduces a library of enterprise AI agents you can customize to fit your workflow, https://venturebeat.com/ai/servicenow-introduces-a-library-of-enterprise-ai-agents-you-can-customize-to-fit-your-workflow/
Kyle Wiggers, February 25, 2025, Quora’s Poe now lets users create and share custom AI-powered apps, https://techcrunch.com/2025/02/25/quoras-poe-now-lets-users-create-and-share-custom-ai-powered-apps/

No Code/Low Code for AI Apps

Writer, Aug 2024 (accessed), Writer AI Studio: The fastest way to build AI apps, https://writer.com/product/ai-studio/
Isaac Sacolick, How to choose the right low-code, no-code, or process automation platform, Jul 29, 2024, https://www.infoworld.com/article/3476848/how-to-choose-the-right-low-code-no-code-or-process-automation-platform.html
Rebekah Carter, 2023, Gartner Magic Quadrant for Enterprise Low-Code Application Platforms 2023, https://www.cxtoday.com/loyalty-management/gartner-magic-quadrant-for-enterprise-low-code-application-platforms-2023/
Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi, 9 Aug 2024, AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems, https://arxiv.org/abs/2408.15247
Yanxi Chen, Yaliang Li, Bolin Ding, Jingren Zhou, 20 Jul 2024, On the Design and Analysis of LLM-Based Algorithms, https://arxiv.org/abs/2407.14788 https://github.com/modelscope/agentscope/tree/main/examples/paper_llm_based_algorithm
Reddit, 2024, New Open Source Framework and No-Code GUI for Fine-Tuning LLMs: H2O LLM Studio, https://www.reddit.com/r/LocalLLaMA/comments/12yc8op/new_open_source_framework_and_nocode_gui_for/
Yuzhe Cai, Shaoguang Mao, Wenshan Wu, Zehua Wang, Yaobo Liang, Tao Ge, Chenfei Wu, Wang You, Ting Song, Yan Xia, Jonathan Tien, Nan Duan, Furu Wei, 1 Apr 2024 (v3), Low-code LLM: Graphical User Interface over Large Language Models, https://arxiv.org/abs/2304.08103 https://github.com/chenfei-wu/TaskMatrix/tree/main/LowCodeLLM https://www.youtube.com/watch?v=jb2C1vaeO3E
Zelong Li, Shuyuan Xu, Kai Mei, Wenyue Hua, Balaji Rama, Om Raheja, Hao Wang, He Zhu, Yongfeng Zhang, 1 Jul 2024, AutoFlow: Automated Workflow Generation for Large Language Model Agents, https://arxiv.org/abs/2407.12821 https://github.com/agiresearch/AutoFlow
Xin Pang, Zhucong Li, Jiaxiang Chen, Yuan Cheng, Yinghui Xu, Yuan Qi, 7 Apr 2024, AI2Apps: A Visual IDE for Building LLM-based AI Agent Applications, https://arxiv.org/abs/2404.04902
Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxiao Dong, Ming Ding, Jie Tang; 2024, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 14281-14290, https://arxiv.org/abs/2312.08914 https://openaccess.thecvf.com/content/CVPR2024/html/Hong_CogAgent_A_Visual_Language_Model_for_GUI_Agents_CVPR_2024_paper.html https://openaccess.thecvf.com/content/CVPR2024/papers/Hong_CogAgent_A_Visual_Language_Model_for_GUI_Agents_CVPR_2024_paper.pdf
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
S Madden, M Cafarella, M Franklin, T Kraska, 2024, Databases Unbound: Querying All of the World’s Bytes with AI, https://www.vldb.org/pvldb/vol17/p4546-madden.pdf
Google, Sep 2024, Supercharge your work with no-code. AppSheet helps you build powerful applications and automations that boost productivity. No coding required., https://about.appsheet.com/home/ (Google AppSheet no code platform.)
Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
Shubham Sharma, October 8, 2024, Databricks now lets developers create AI apps in 5 minutes: Here’s how, https://venturebeat.com/data-infrastructure/databricks-now-lets-developers-create-ai-apps-in-5-minutes-heres-how/
Dr. Marcel Müller, Oct 18, 2024, No-Code Generative AI: How Companies Can Build Without Data Scientists, https://medium.com/deep-tech-innovation/no-code-generative-ai-how-companies-can-build-without-data-scientists-7e5ca851f2ba
Mandana Vaziri, Louis Mandel, Claudio Spiess, Martin Hirzel, 24 Oct 2024, PDL: A Declarative Prompt Programming Language, https://arxiv.org/abs/2410.19135
Saksham Goel, October 29, 2024, Build LLM/RAG pipelines with YAML templates by Pathway, https://pathway.com/blog/llm-yaml-templates
Microsoft, Dec 2024, Power Automate: A comprehensive, end-to-end cloud automation platform powered by low code and AI. https://www.microsoft.com/en-us/power-platform/products/power-automate
Orlando Marquez Ayala, Patrice Béchard, 29 Nov 2024, Generating a Low-code Complete Workflow via Task Decomposition and RAG, https://arxiv.org/abs/2412.00239
Iván Alfonso, Aaron Conrardy, Jordi Cabot, 6 Dec 2024, Towards the interoperability of low-code platforms, https://arxiv.org/abs/2412.05075
Ignacio de Gregorio Noblejas, December 15, 2024, The AI Trillion-Dollar Product, https://thewhitebox.beehiiv.com/p/the-ai-trillion-dollar-product
Latent Space, Jan 05, 2025, AI Engineering for Art — with comfyanonymous, of ComfyUI, Using models for "Art Engineering", building hard to use UIs, and how image generation is moving from text boxes to DAGs https://www.latent.space/p/comfyui
comfyanonymous, Jan 2025, ComfyUI: The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface, https://github.com/comfyanonymous/ComfyUI
Dawei Gao, Zitao Li, Xuchen Pan, Weirui Kuang, Zhijian Ma, Bingchen Qian, Fei Wei, Wenhao Zhang, Yuexiang Xie, Daoyuan Chen, Liuyi Yao, Hongyi Peng, Zeyu Zhang, Lin Zhu, Chen Cheng, Hongzhu Shi, Yaliang Li, Bolin Ding, Jingren Zhou, 20 May 2024 (v2), AgentScope: A Flexible yet Robust Multi-Agent Platform, https://arxiv.org/abs/2402.14034 https://github.com/modelscope/agentscope
Joe McKendrick, Feb. 20, 2025, Brace yourself: The era of 'citizen developers' creating apps is here, thanks to AI, https://www.zdnet.com/article/brace-yourself-the-era-of-citizen-developers-creating-apps-is-here-thanks-to-ai/
Craig Le Clair, Oct 23 2024, Predictions 2025: GenAI, Citizen Developers, And Caution Influence Automation, https://www.forrester.com/blogs/predictions-2025-automation/

Miniapps

Kevin Lin, Sumant Guha, Joe Spaniac, Andy Zheng, 13 Nov 2020 (v3), Nifty Web Apps: Build a Web App for Any Text-Based Programming Assignment, https://arxiv.org/abs/2010.04671
Yuyang Han, Xu Ji, Zhiqiang Wang, Jianyi Zhang, 19 Nov 2023, Systematic Analysis of Security and Vulnerabilities in Miniapps, https://arxiv.org/abs/2311.11382
Shenao Wang, Yuekang Li, Kailong Wang, Yi Liu, Hui Li, Yang Liu, Haoyu Wang, 16 Jan 2024 (v2), MiniScope: Automated UI Exploration and Privacy Inconsistency Detection of MiniApps via Two-phase Iterative Hybrid Analysis, https://arxiv.org/abs/2401.03218
Chao Wang, Yue Zhang, Zhiqiang Lin, 13 Jun 2023, Uncovering and Exploiting Hidden APIs in Mobile Super Apps, https://arxiv.org/abs/2306.08134
Yuqing Yang, Chao Wang, Yue Zhang, Zhiqiang Lin, 13 Jun 2023, SoK: Decoding the Super App Enigma: The Security Mechanisms, Threats, and Trade-offs in OS-alike Apps, https://arxiv.org/abs/2306.07495
Ozgur Ozan Kilic, Tianle Wang, Matteo Turilli, Mikhail Titov, Andre Merzky, Line Pouchard, Shantenu Jha, 26 Mar 2024, Workflow Mini-Apps: Portable, Scalable, Tunable & Faithful Representations of Scientific Workflows, https://arxiv.org/abs/2403.18073
Liming Jiang, 12 Feb 2024, Utilizing Large LanguageModels to Detect Privacy Leaks in Mini-App Code, https://arxiv.org/abs/2402.07367
Yin Wang, Ming Fan, Junfeng Liu, Junjie Tao, Wuxia Jin, Qi Xiong, Yuhao Liu, Qinghua Zheng, Ting Liu, 27 Feb 2023, Do as You Say: Consistency Detection of Data Practice in Program Code and Privacy Policy in Mini-App, https://arxiv.org/abs/2302.13860
Thomas Steiner, 2024, What are mini apps? https://web.dev/articles/mini-apps/mini-app-about
Boxo, 2024, What is a Miniapp? A New Era for Apps, https://www.boxo.io/blog/what-is-a-miniapp
Electrode Native, 2024, What is a MiniApp, https://native.electrode.io/introduction/what-is-ern/what-is-a-miniapp
W3C, 2024, MiniApps Working Group, https://www.w3.org/2021/miniapps/
GMO Research, 22 March, 2023, The Rise of Super Apps , https://gmo-research.ai/en/news-events/articles/rise-super-apps
Grand View Research, 2023, Super Apps Market Size, Share & Trends Analysis Report By Platform (iOS, Android), By Device (Smartphone, Tablets), By Application, By End-user, By Region, And Segment Forecasts, 2023 - 2030, Report ID: GVR-4-68040-036-1, https://www.grandviewresearch.com/industry-analysis/super-apps-market-report
Lee Ying Shan, Nov 18 2024, Tencent challenges Amazon and Microsoft’s cloud dominance by tapping into its WeChat ecosystem, CNBC, https://www.cnbc.com/2024/11/18/tencent-is-contesting-microsoft-googles-cloud-dominance-with-wechat.html
Nicolás Cerdeira, December 17, 2024, The Rise of Mini Tools: Why AI-powered tools are the new go-to growth strategy. https://newsletter.failory.com/p/the-rise-of-mini-tools-
Nicolás Cerdeira, April 02, 2024, The Day Wise Created 250K Variants of a Calculator. How Wise used Programmatic SEO to obtain over 40.7M users/mo, https://newsletter.failory.com/p/day-wise-created-250k-variants-calculator
Tari Ibaba, Jan 2025, AI is killing apps, https://medium.com/coding-beauty/ai-is-killing-apps-868a7b59fafe
Jeff Huang, Jan 21 2025, Why U.S. tech companies struggle to replicate China’s WeChat ‘super app’ model, https://www.cnbc.com/2025/01/21/why-us-companies-struggle-to-replicate-chinas-wechat-super-app-.html https://www.cnbc.com/video/2025/01/21/why-the-us-doesnt-have-super-apps.html

Tabular Data Applications

Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang, Ziqing Hu, Yanjun Qi, Scott Nickleach, Diego Socolinsky, Srinivasan Sengamedu, Christos Faloutsos, 1 Mar 2024 (v2), Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey, https://arxiv.org/abs/2402.17944
Weijia Wang, 2023, Efficient and Explainable Machine Learning Ph.D. thesis, University of California San Diego, https://escholarship.org/content/qt9q52g27p/qt9q52g27p_noSplash_70dba1eae3531240d1fec8e0cdaf1be2.pdf (Processing of tabular data is a weakness of GenAI models, and this thesis examines various issues of tabular data and rules-based processing.)
David Bonet, Daniel Mas Montserrat, Xavier Giró-i-Nieto, Alexander G. Ioannidis, HyperFast: Instant Classification for Tabular Data, 2023, NeurIPS 2023, https://openreview.net/pdf?id=VRBhaU8IDz
Irwin Deng, Kushagra Dixit, Vivek Gupta, Dan Roth, 22 Jul 2024, Enhancing Temporal Understanding in LLMs for Semi-structured Tables, https://arxiv.org/abs/2407.16030
Liang, X., Hu, R., Liu, Y., Zhu, K. (2024). Open-Domain Question Answering over Tables with Large Language Models. In: Huang, DS., Pan, Y., Guo, J. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science, vol 14873. Springer, Singapore. https://doi.org/10.1007/978-981-97-5615-5_28 https://link.springer.com/chapter/10.1007/978-981-97-5615-5_28
Xianjie Wu, Jian Yang, Linzheng Chai, Ge Zhang, Jiaheng Liu, Xinrun Du, Di Liang, Daixin Shu, Xianfu Cheng, Tianzhen Sun, Guanglin Niu, Tongliang Li, Zhoujun Li, 17 Aug 2024, TableBench: A Comprehensive and Complex Benchmark for Table Question Answering, https://www.arxiv.org/abs/2408.09174
Asim Biswal, Liana Patel, Siddarth Jha, Amog Kamsetty, Shu Liu, Joseph E. Gonzalez, Carlos Guestrin, Matei Zaharia, 27 Aug 2024, Text2SQL is Not Enough: Unifying AI and Databases with TAG, https://arxiv.org/abs/2408.14717 https://github.com/TAG-Research/TAG-Bench
Shubham Sharma, September 2, 2024, Table-augmented generation shows promise for complex dataset querying, outperforms text-to-SQL, https://venturebeat.com/data-infrastructure/table-augmented-generation-shows-promise-for-complex-dataset-querying-outperforms-text-to-sql/
S Madden, M Cafarella, M Franklin, T Kraska, 2024, Databases Unbound: Querying All of the World’s Bytes with AI, https://www.vldb.org/pvldb/vol17/p4546-madden.pdf
Shubham Sharma, September 12, 2024, Google’s DataGemma AI is a statistics wizard, https://venturebeat.com/ai/datagemma-googles-open-ai-models-mitigate-hallucination-on-statistical-queries/
David Gewirtz, Sept. 16, 2024, Why natural language AI scripting in Microsoft Excel could be a game changer. What if you could run advanced Excel analyses with no coding skills? Here's how Microsoft's Copilot in Excel could use Python to allow you to do just that, https://www.zdnet.com/article/why-natural-language-ai-scripting-in-microsoft-excel-could-be-a-game-changer/
Xinyuan Lu, Liangming Pan, Yubo Ma, Preslav Nakov, Min-Yen Kan, 18 Sep 2024, TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning, https://arxiv.org/abs/2409.11724 https://github.com/XinyuanLu00/TART
Yuzhang Tian, Jianbo Zhao, Haoyu Dong, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang, 12 Jul 2024, SpreadsheetLLM: Encoding Spreadsheets for Large Language Models, https://arxiv.org/abs/2407.09025
Mukul Singh, Gust Verbruggen, Vu Le, and Sumit Gulwani. 2024. Tabularis Revilio: Converting Text to Tables. In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM '24). Association for Computing Machinery, New York, NY, USA, 4056–4060. https://doi.org/10.1145/3627673.3680000 https://dl.acm.org/doi/abs/10.1145/3627673.3680000
LangChain, Aug 10, 2024, UX for Agents, Part 3: Spreadsheet, Generative, and Collaborative UI/UX, https://blog.langchain.dev/ux-for-agents-part-3/
Deyi Ji, Lanyun Zhu, Siqi Gao, Peng Xu, Hongtao Lu, Jieping Ye, Feng Zhao, 13 Nov 2024, Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding, https://arxiv.org/abs/2411.08516
Qwen: An Yang, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chengyuan Li, Dayiheng Liu, Fei Huang, Haoran Wei, Huan Lin, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Yang, Jiaxi Yang, Jingren Zhou, Junyang Lin, Kai Dang, Keming Lu, Keqin Bao, Kexin Yang, Le Yu, Mei Li, Mingfeng Xue, Pei Zhang, Qin Zhu, Rui Men, Runji Lin, Tianhao Li, Tingyu Xia, Xingzhang Ren, Xuancheng Ren, Yang Fan, Yang Su, Yichang Zhang, Yu Wan, Yuqiong Liu, Zeyu Cui, Zhenru Zhang, Zihan Qiu (additional authors not shown), 19 Dec 2024, Qwen2.5 Technical Report, https://arxiv.org/abs/2412.15115
Xiaoqiang Kang, Zimu Wang, Xiaobo Jin, Wei Wang, Kaizhu Huang, Qiufeng Wang, 20 Dec 2024, Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation, https://arxiv.org/abs/2412.15594 https://github.com/Jason8Kang/TELL
Zipeng Qiu, You Peng, Guangxin He, Binhang Yuan, Chen Wang, 29 Nov 2024, TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension, https://arxiv.org/abs/2411.19504
Mayi Xu, Yunfeng Ning, Yongqi Li, Jianhao Chen, Jintao Wen, Yao Xiao, Shen Zhou, Birong Pan, Zepeng Bao, Xin Miao, Hankun Kang, Ke Sun, Tieyun Qian, 2 Jan 2025, Reasoning based on symbolic and parametric knowledge bases: a survey, https://arxiv.org/abs/2501.01030 (Extensive survey of reasoning from CoT to knowledge graphs to table-based reasoning.)
FZ Subah, Oct 2025, Mitigating and Assessing Bias and Fairness in Large Language Model-Generated Synthetic Tabular Data, Masters Thesis, Department of Engineering, University of Cambridge, https://www.mlmi.eng.cam.ac.uk/files/2023-2024/fzs21_mitigating_2024.pdf
G Wang, S Zhang, T Zhan, Z Shen, J Li, X Hu, X Sun, Jan 2025, Unlocking the Mysteries of OpenAI o1: A Survey of the Reasoning Abilities of Large Language Models, https://openreview.net/pdf?id=J0ADLa2rNp
Connor Shorten, Charles Pierse, Thomas Benjamin Smith, Karel D'Oosterlinck, Tuana Celik, Erika Cardenas, Leonie Monigatti, Mohd Shukri Hasan, Edward Schmuhl, Daniel Williams, Aravind Kesiraju, Bob van Luijt, 23 Jan 2025, Querying Databases with Function Calling, https://arxiv.org/abs/2502.00032

Microsoft Excel

Use of Microsoft Excel with AI:

Yuzhang Tian, Jianbo Zhao, Haoyu Dong, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang, 12 Jul 2024, SpreadsheetLLM: Encoding Spreadsheets for Large Language Models, https://arxiv.org/abs/2407.09025
David Gewirtz, Sept. 16, 2024, Why natural language AI scripting in Microsoft Excel could be a game changer. What if you could run advanced Excel analyses with no coding skills? Here's how Microsoft's Copilot in Excel could use Python to allow you to do just that, https://www.zdnet.com/article/why-natural-language-ai-scripting-in-microsoft-excel-could-be-a-game-changer/
Microsoft, Aug 22 2023, Announcing Python in Excel: Combining the power of Python and the flexibility of Excel, https://techcommunity.microsoft.com/t5/excel-blog/announcing-python-in-excel-combining-the-power-of-python-and-the/ba-p/3893439
Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
Cristian Challu, Oct 07, 2024, 5 ways companies can use time-series forecasting, https://www.infoworld.com/article/3543468/5-ways-companies-can-use-time-series-forecasting.html
LangChain, Aug 10, 2024, UX for Agents, Part 3: Spreadsheet, Generative, and Collaborative UI/UX, https://blog.langchain.dev/ux-for-agents-part-3/
Péter Harang, Dec 21, 2024, Building a Transformer in Excel. Pico-scale reference implementation of everyone’s favourite LLM architecture, for demostration purposes, https://medium.com/@harangpeter/building-a-transformer-in-excel-467a4a27608d

Copilot Apps

Research on "copilot" types of AI applications:

Chris Parnin, Gustavo Soares, Rahul Pandita, Sumit Gulwani, Jessica Rich, Austin Z. Henley, 21 Dec 2023, Building Your Own Product Copilot: Challenges, Opportunities, and Needs, https://arxiv.org/abs/2312.14231
Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
Jeremy Kahn, September 17, 2024, Microsoft introduces AI agents and updates to Copilot 365 apps as the war to make AI more useful intensifies, https://fortune.com/2024/09/16/microsoft-launches-ai-agents-updates-to-copilot-365-apps/
Tanay Jaipuria, Nov 12, 2024, Big Tech x Generative AI Q3 '24 Update (Part 2), How Meta and Microsoft's Generative AI investments are going so far, https://www.tanayj.com/p/big-tech-x-generative-ai-q3-24-update
Jason Redmond, Jan 2025, Microsoft CEO Nadella forms new AI group to build and run apps for customers. Microsoft hired DeepMind co-founder Mustafa Suleyman to lead Copilot AI initiatives last year. https://www.nbcnews.com/business/business-news/microsoft-ceo-nadella-forms-new-ai-group-build-run-apps-customers-rcna187506

AI Operating System

An AI operating system, or AI OS, is the idea of building an entire system on AI components. This is a generalization beyond just an AI framework or AI platform.

Research on an AI OS:

Simeon Emanuilov, Apr 4, 2024 LLM agent operating system (AIOS) and the future of LLM-powered agents, https://medium.com/@simeon.emanuilov/llm-agent-operating-system-aios-and-the-future-of-llm-powered-agents-3d08b4e91c34 https://unfoldai.com/aios-llm-powered-agents/
Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, Joseph E. Gonzalez, 12 Feb 2024 (v2), MemGPT: Towards LLMs as Operating Systems, https://arxiv.org/abs/2310.08560 https://memgpt.ai/
Sean Michael Kerner, September 25, 2024, How Intuit plans to use agentic AI to automate complex business tasks, https://venturebeat.com/ai/how-intuit-plans-to-use-agentic-ai-to-automate-complex-business-tasks/
Nicholas Grous, Andrew Kim, June 04, 2024, Generative AI: A New Consumer Operating System, https://www.ark-invest.com/articles/analyst-research/generative-ai-a-new-consumer-operating-system
Ignacio de Gregorio Noblejas, December 15, 2024, The AI Trillion-Dollar Product, https://thewhitebox.beehiiv.com/p/the-ai-trillion-dollar-product

Security Credential Management

Security credential management is an important part of productionizing AI apps. This includes both user login passwords and the security keys of commercial APIs.

Papers on security credentials:

Jason Koebler, June 26, 2024, Researchers Prove Rabbit AI Breach By Sending Email to Us as Admin, https://www.404media.co/researchers-prove-rabbit-ai-breach-by-sending-email-to-us-as-admin/ (Rabbit's API security credentials were hard-coded into the device.)
Google, 2024, Authentication with OAuth quickstart, https://ai.google.dev/gemini-api/docs/oauth
Sunil Kumar Dash, November 25, 2024 AgentAuth: Seamless Authentication for AI Agents with 250+ Tools https://composio.dev/blog/agentauth-seamless-authentication-for-ai-agents-with-250-tools/