Aussie AI
Applications of Generative AI
-
Last Updated 12 December, 2024
-
by David Spuler, Ph.D.
Apps Built on AI
- Timo Lehto, June 2024, Developing LLM-powered Applications Using Modern Frameworks, Bachelor’s Thesis, Information and Communications Technology, Jamk University of Applied Sciences, Finland, June 2024, 53 pages., https://www.theseus.fi/bitstream/handle/10024/862271/Lehto_Timo.pdf?sequence=2 (Building LLM-based applications in RAG architecture using LangChain.)
- Evelyn Cheng Apr 17, 2024 Baidu releases new AI tools to promote application development, https://www.cnbc.com/2024/04/18/baidu-releases-new-ai-tools-to-promote-application-development.html
- David Cahn, Sep 20, 2023, AI’s $200B Question: GPU capacity is getting overbuilt. Long-term, this is good. Short-term, things could get messy, https://www.sequoiacap.com/article/follow-the-gpus-perspective/
- Kirill Kolodiazhnyi, May 15, 2020, Hands-On Machine Learning with C++: Build, train, and deploy end-to-end machine learning and deep learning pipelines, https://www.amazon.com/Hands-Machine-Learning-end-end/dp/1789955335/
- Prakash Joshi Pax, Aug 26, 2024, Fabric: The Best AI Tool That Nobody is Talking About. An open-source AI tool to automate every day tasks https://beingpax.medium.com/why-fabric-ai-can-change-the-way-you-use-ai-973e725354da
- Andrew Ng, Sep 2024, X post, https://x.com/AndrewYNg/status/1829190549842321758 (Dropping token prices for LLMs means developers can focus on the app layer.)
- Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
- Sonya Huang, Pat Grady, and o1, Sequoia, October 9, 2024 Generative AI’s Act o1, https://www.sequoiacap.com/article/generative-ais-act-o1/
Building Applications for Generative AI
Research on building Gen AI apps:
- Metin Karatas, June 25, 2024, Developing AI Applications: An Introduction (New Edition), Rheinwerk Computing; New edition, https://www.amazon.com/Developing-AI-Applications-Metin-Karatas/dp/1493226010/
- Mistral AI Team, Aug 7, 2024, Build, tweak, repeat: Making it easier to develop and share generative AI applications, https://mistral.ai/news/build-tweak-repeat/
- Yorick Sens, Henriette Knopp, Sven Peldszus, Thorsten Berger, 12 Aug 2024, A Large-Scale Study of Model Integration in ML-Enabled Software Systems, https://arxiv.org/abs/2408.06226
- Google, 2024, L’Oréal: Launching Gen AI as a Service in 3 months with Cloud Run and LangChain, https://services.google.com/fh/files/misc/google_loreal_with_langchain_case_study.pdf
- Raymond Lo, Jul 10, 2024, How to Build Faster GenAI Apps with Fewer Lines of Code using OpenVINO™ GenAI API, https://medium.com/openvino-toolkit/how-to-build-faster-genai-apps-with-fewer-lines-of-code-using-openvino-genai-api-5dd5fcabea17
- Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi, 9 Aug 2024, AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems, https://arxiv.org/abs/2408.15247
- Abhimanyu Bambhaniya, Ritik Raj, Geonhwa Jeong, Souvik Kundu, Sudarshan Srinivasan, Midhilesh Elavazhagan, Madhu Kumar, Tushar Krishna, 3 Jun 2024, Demystifying Platform Requirements for Diverse LLM Inference Use Cases, https://arxiv.org/abs/2406.01698 Code: https://github.com/abhibambhaniya/GenZ-LLM-Analyzer (Analysis of cost of serving LLMs, including separate profiles of prefill versus decoding phases, and the cost of extra prompt processing in RAG architectures with prepended information.)
- Timo Lehto, June 2024, Developing LLM-powered Applications Using Modern Frameworks, Bachelor’s Thesis, Information and Communications Technology, Jamk University of Applied Sciences, Finland, June 2024, 53 pages., https://www.theseus.fi/bitstream/handle/10024/862271/Lehto_Timo.pdf?sequence=2 (Building LLM-based applications in RAG architecture using LangChain.)
- Fareed Khan, March 2024, BasicLINGUA: LLM Based NLP Library, https://github.com/FareedKhan-dev/basiclingua-LLM-Based-NLP
- Eugene Yan, Bryan Bischof, Charles Frye, Hamel Husain, Jason Liu and Shreya Shankar, May 28, 2024, What We Learned from a Year of Building with LLMs (Part I), https://www.oreilly.com/radar/what-we-learned-from-a-year-of-building-with-llms-part-i/
- Dell Technologies, May 20, 2024, Dell Technologies Expands Dell AI Factory with NVIDIA to Turbocharge AI Adoption, PR Newswire, https://www.prnewswire.com/news-releases/dell-technologies-expands-dell-ai-factory-with-nvidia-to-turbocharge-ai-adoption-302150245.html
- JH Jones, May 2024, A Quantitative Comparison of Pre-Trained Model Registries to Traditional Software Package Registries, Masters Thesis, Electrical and Computer Engineering, Purdue University, https://hammer.purdue.edu/articles/thesis/A_Quantitative_Comparison_of_Pre-Trained_Model_Registries_to_Traditional_Software_Package_Registries/25686447/1 PDF: https://hammer.purdue.edu/ndownloader/files/46096152
- Evelyn Cheng Apr 17, 2024 Baidu releases new AI tools to promote application development, https://www.cnbc.com/2024/04/18/baidu-releases-new-ai-tools-to-promote-application-development.html
- Priyank Rathod, May 21, 2024, Efficient Usage of RAG Systems in the World of LLMs, https://www.techrxiv.org/doi/full/10.36227/techrxiv.171625877.73379410/v1
- Kirill Kolodiazhnyi, May 15, 2020, Hands-On Machine Learning with C++: Build, train, and deploy end-to-end machine learning and deep learning pipelines, https://www.amazon.com/Hands-Machine-Learning-end-end/dp/1789955335/
- Mozilla, June 3, 2024, Announcing Mozilla Builders: 2024 Accelerator Theme: Local AI, https://future.mozilla.org/builders/blog/announcing-mozilla-builders/
- June 2024 (accessed), R2R: The ultimate open-source RAG framework, https://github.com/SciPhi-AI/R2R
- Hesam Sheikh, Jun 1, 2024, Towards AI Build Blog Writer and Researcher AI Agents with Ollama (100% local): Creating AI agents with Crewai and using Ollama to run them 100% locally in 5 very easy steps!, https://pub.towardsai.net/build-your-first-ai-agent-in-5-easy-steps-100-local-2fb771438a8f
- Simeon Emanuilov, Apr 4, 2024 LLM agent operating system (AIOS) and the future of LLM-powered agents, https://medium.com/@simeon.emanuilov/llm-agent-operating-system-aios-and-the-future-of-llm-powered-agents-3d08b4e91c34 https://unfoldai.com/aios-llm-powered-agents/
- Grant Gross, 13 Jun 2024, IT leaders go small for purpose-built AI, https://www.cio.com/article/2139985/it-leaders-go-small-for-purpose-built-ai.html
- Will Larson, April 8, 2024, Notes on how to use LLMs in your product. https://lethain.com/mental-model-for-how-to-use-llms-in-products/
- Matt Murphy, Tim Tully, Grace Ge, Derek Xiao, Katie Keller, January 18, 2024, The Modern AI Stack: Design Principles for the Future of Enterprise AI Architectures, https://menlovc.com/perspective/the-modern-ai-stack-design-principles-for-the-future-of-enterprise-ai-architectures/?tpcc=NL_Marketing
- NLUX: The 𝗣𝗼𝘄𝗲𝗿𝗳𝘂𝗹 Conversational AI JavaScript Library, https://github.com/nlkitai/nlux
- Jesse Clayton, Kedar Potdar and Annamalai Chockalingam, Jun 02, 2024, Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs, NVIDIA Technical Blog, https://developer.nvidia.com/blog/streamline-ai-powered-app-development-with-nvidia-rtx-ai-toolkit-for-windows-rtx-pcs/
- John Borthwick, May 28, 2024, Announcing AI Camp: Native Applications, https://render.betaworks.com/announcing-ai-camp-native-applications-e1358061c601
- Julian Yip, Apr 2, 2024, Build Autonomous AI Agents with Function Calling: Transform your chatbot into an agent that can interact with external APIs, https://towardsdatascience.com/build-autonomous-ai-agents-with-function-calling-0bb483753975 (Implement agents via models that output a JSON object that describes the API to call and the parmaeters to send.)
- Benedict Evans, 2024, Building AI products, https://www.ben-evans.com/benedictevans/2024/6/8/building-ai-products
- David Spuler, March 2024, Generative AI in C++: Coding Transformers and LLMs, https://www.amazon.com/dp/B0CXJKCWX9
- Ben Auffarth, Dec 22, 2023 Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT and other LLMs,https://www.amazon.com/Generative-AI-LangChain-language-ChatGPT/dp/1835083463/
- Olivier Caelen and Marie-Alice Blete, Oct 3, 2023 Developing Apps with GPT-4 and ChatGPT: Build Intelligent Chatbots, Content Generators, and More, https://www.amazon.com/Developing-Apps-GPT-4-ChatGPT-Intelligent/dp/1098152484/
- Douglas C. Youvan , June 15, 2024, Developing and Deploying AI Applications on NVIDIA Jetson Orin NX: A Comprehensive Guide, https://www.researchgate.net/profile/Douglas-Youvan/publication/381434888_Developing_and_Deploying_AI_Applications_on_NVIDIA_Jetson_Orin_NX_A_Comprehensive_Guide/links/666d7390de777205a32fceb6/Developing-and-Deploying-AI-Applications-on-NVIDIA-Jetson-Orin-NX-A-Comprehensive-Guide.pdf
- Lak Lakshmanan, March 7, 2024, Building an AI Assistant with DSPy: A way to program and tune prompt-agnostic LLM agent pipelines, https://towardsdatascience.com/building-an-ai-assistant-with-dspy-2e1e749a1a95
- Michael Lin, June 2024, How to Successfully Manage AI Software Projects: The 4 Phases of AI Projects I Shared at VixulCon https://medium.com/@_michaellin/how-to-successfully-manage-ai-software-projects-a8344b5b76a9
- Fabian Both, June 2024, why we no longer use LangChain for building our AI agents , https://www.octomind.dev/blog/why-we-no-longer-use-langchain-for-building-our-ai-agents (Replaces LangChain with their own more-focused internal tool sets.)
- Charles Lamanna, March 28, 2023, Companies innovate with low-code and fusion development, Microsoft, https://www.microsoft.com/en-us/industry/microsoft-in-business/business-transformation/2023/03/28/companies-innovate-with-low-code-and-fusion-development/ (States that 750 million new apps are required in the next two years, but there are only 4 million developers.)
- McKinsey & Company, June 14, 2024, Scott Johnston on designing and building scalable platforms, https://www.mckinsey.com/industries/technology-media-and-telecommunications/our-insights/scott-johnston-on-designing-and-building-scalable-platforms (Docker CEO states that 750 million new apps are required.)
- Valentina Alto, May 2024, Building LLM Powered Applications: Create intelligent apps and agents with large language models, Packt Publishing, https://www.amazon.com/Building-LLM-Apps-Intelligent-Language/dp/1835462316/
- Irene Weber, 13 Jun 2024, Large Language Models as Software Components: A Taxonomy for LLM-Integrated Applications, https://arxiv.org/abs/2406.10300
- Aarushi Kansal, Building Generative AI-Powered Apps: A Hands-on Guide for Developers, Apress, https://www.amazon.com/Building-Generative-AI-Powered-Apps-Hands-ebook/dp/B0CTXXP1S4/
- Louis-François Bouchard, Louie Peters, May 2024, Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG, https://www.amazon.com/Building-LLMs-Production-Reliability-Fine-Tuning/dp/B0D4FFPFW8/
- Kristian McCann July 15, 2024, AWS Unveils AI Service That Makes Enterprise Apps in Minutes, https://aimagazine.com/articles/aws-unveils-ai-service-that-builds-enterprise-apps-in-minute (Low-code enterprise AI app builder from AWS.)
- Gene Rapoport, Sanjin Bicanic, Jue Wang, Richard Lichtenstein, Arjun Dutt, June 20, 2024, AI Survey: Four Themes Emerging: If 2023 was about experimentation, 2024 is all about results. Bain & Company, https://www.bain.com/insights/ai-survey-four-themes-emerging/ (Bain reports that use cases have been broadly successful in the use cases of sales, sales operations, software development, marketing, customer service, and customer onboarding, but less successful in HR, operations and legal. Interestingly, the main reason for AI project failures was that it couldn't perform the necessary task.)
- Chip Huyen, Jul 25, 2024, Building A Generative AI Platform, https://huyenchip.com/2024/07/25/genai-platform.html
- Juan Pablo Bottaro, April 25, 2024, Musings on building a Generative AI product, https://www.linkedin.com/blog/engineering/generative-ai/musings-on-building-a-generative-ai-product?_l=en_US
- Writer, Aug 2024 (accessed), Writer AI Studio: The fastest way to build AI apps, https://writer.com/product/ai-studio/
- OpenAI, Aug 2024 (accessed), .NET library, https://platform.openai.com/docs/libraries/dotnet-library https://github.com/openai/openai-dotnet
- Travis Wilson, Jun 07 2024, Azure OpenAI Service expands .NET SDK support, https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-openai-service-expands-net-sdk-support/ba-p/4162940
- Makhkamova, Ozoda, and Doohyun Kim. 2021. "A Conversation History-Based Q&A Cache Mechanism for Multi-Layered Chatbot Services" Applied Sciences 11, no. 21: 9981. https://doi.org/10.3390/app11219981 https://www.mdpi.com/2076-3417/11/21/9981 https://www.mdpi.com/2076-3417/11/21/9981/pdf
- Yanxi Chen, Yaliang Li, Bolin Ding, Jingren Zhou, 20 Jul 2024, On the Design and Analysis of LLM-Based Algorithms, https://arxiv.org/abs/2407.14788 https://github.com/modelscope/agentscope/tree/main/examples/paper_llm_based_algorithm
- Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
- Xinyi Hou, Yanjie Zhao, Haoyu Wang, 3 Aug 2024, Voices from the Frontier: A Comprehensive Analysis of the OpenAI Developer Forum, https://arxiv.org/abs/2408.01687
- Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, Yong Liu, 12 Aug 2024 (v2), An Empirical Study on Challenges for LLM Developers, https://arxiv.org/abs/2408.05002
- Chaojun Xiao, Zhengyan Zhang, Chenyang Song, Dazhi Jiang, Feng Yao, Xu Han, Xiaozhi Wang, Shuo Wang, Yufei Huang, Guanyu Lin, Yingfa Chen, Weilin Zhao, Yuge Tu, Zexuan Zhong, Ao Zhang, Chenglei Si, Khai Hao Moo, Chenyang Zhao, Huimin Chen, Yankai Lin, Zhiyuan Liu, Jingbo Shang, Maosong Sun, Sep 2024, Configurable Foundation Models: Building LLMs from a Modular Perspective, https://arxiv.org/pdf/2409.02877
- Lior Solomon, Sep 2024, Gen AI testing strategies and tools, https://medium.com/ai-in-grc/gen-ai-testing-strategies-and-tools-257383e5cbfb
- Carl Franzen, September 13, 2024, What OpenAI’s new o1-preview and o1-mini models mean for developers, https://venturebeat.com/programming-development/what-openais-new-o1-preview-and-o1-mini-models-mean-for-developers/
- Ellie Ko, Sep 25, 2024, A Survey of Python Frameworks, https://ploomber.io/blog/survey-python-frameworks/
- Timothy Mugayi, Sep 2024, LLM Practical Ideas to Build Your Next AI-Powered Application: Realistic Use Cases to Unleash the Power of AI in Your Next Project, https://levelup.gitconnected.com/llm-practical-ideas-to-build-your-next-ai-powered-application-9379feba6cbc
- Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
- Kif Leswing, Fri, Oct 4 2024, As Apple enters AI race, iPhone maker turns to its army of developers for an edge, https://www.cnbc.com/2024/10/04/apple-is-turning-to-its-army-of-developers-for-an-edge-in-the-ai-race.html
- Nicola Sessions, Oct 15, 2024, DataStax Announces New AI Development Platform, Built with NVIDIA AI, https://developer.nvidia.com/blog/datastax-announces-new-ai-development-platform-built-with-nvidia-ai/
- Anurag Guda and Shruthii Sathyanarayanan, Oct 16, 2024, Simplify AI Application Development with NVIDIA Cloud Native Stack, https://developer.nvidia.com/blog/simplify-ai-application-development-with-nvidia-cloud-native-stack/
- Sid Chatterjee, Matt Silverlock, Celso Martinho, 2024-10-24, Build durable applications on Cloudflare Workers: you write the Workflows, we take care of the rest, https://blog.cloudflare.com/building-workflows-durable-execution-on-workers/
- LangChain, Nov 7, 2024. SCIPE - Systematic Chain Improvement and Problem Evaluation, https://blog.langchain.dev/scipe-systematic-chain-improvement-and-problem-evaluation/ https://github.com/garg-ankush/scipe/tree/main
- Lak Lakshmanan, Oct 4, 2024, How to Choose the Architecture for Your GenAI Application. A framework to select the simplest, fastest, cheapest architecture that will balance LLMs’ creativity and risk, https://towardsdatascience.com/how-to-choose-the-architecture-for-your-genai-application-6053e862c457
- Siyun Zhao, Yuqing Yang, Zilong Wang, Zhiyuan He, Luna K. Qiu, Lili Qiu, 23 Sep 2024, Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely, https://arxiv.org/abs/2409.14924
- Dhavalkumar Patel, Ganesh Raut, Satya Narayan Cheetirala, Girish N Nadkarni, Robert Freeman, Benjamin S. Glicksberg, Eyal Klang, Prem Timsina, 8 Dec 2024, Cloud Platforms for Developing Generative AI Solutions: A Scoping Review of Tools and Services, https://arxiv.org/abs/2412.06044
Inference Frameworks
Research papers include:
- Yiheng Liu, Hao He, Tianle Han, Xu Zhang, Mengyuan Liu, Jiaming Tian, Yutong Zhang, Jiaqi Wang, Xiaohui Gao, Tianyang Zhong, Yi Pan, Shaochen Xu, Zihao Wu, Zhengliang Liu, Xin Zhang, Shu Zhang, Xintao Hu, Tuo Zhang, Ning Qiang, Tianming Liu, Bao Ge, Jan 2024, Understanding LLMs: A Comprehensive Overview from Training to Inference https://arxiv.org/abs/2401.02038
- MLC team. 2023. MLC-LLM. https://github.com/mlc-ai/mlc-llm
- tinygrad. 2023. Tinygrad. https://github.com/tinygrad/tinygrad
- Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph E. Gonzalez, Hao Zhang, Ion Stoica, Oct 2023, Efficient Memory Management for Large Language Model Serving with PagedAttention, SOSP ’23, October 23–26, 2023, Koblenz, Germany, https://dl.acm.org/doi/pdf/10.1145/3600006.3613165 (The original Paged Attention and vLLM paper, focusing on optimizing memory size of the KV cache using methods similar to operating-system memory paging.)
- Vince Lam, Mar 12, 2024, 50+ Open-Source Options for Running LLMs Locally, https://medium.com/thedeephub/50-open-source-options-for-running-llms-locally-db1ec6f5a54f
- Jason Perlow, Aug. 6, 2024, How to run dozens of AI models on your Mac or PC - no third-party cloud needed, https://www.zdnet.com/article/how-to-run-dozens-of-ai-models-on-your-mac-or-pc-no-third-party-cloud-needed/
- Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng, 6 Jun 2024 (v2), SGLang: Efficient Execution of Structured Language Model Programs, https://arxiv.org/abs/2312.07104 https://github.com/sgl-project/sglang
- The SGLang Team, Jul 25, 2024, Achieving Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, vLLM), https://lmsys.org/blog/2024-07-25-sglang-llama3/
- Anna Popovych, Sofiya Merenych, February 16, 2024, Top AI Frameworks in 2024: Comparison of Artificial Intelligence Frameworks, https://clockwise.software/blog/artificial-intelligence-framework/
- Hugging Face, 2024, Text Generation Inference, https://huggingface.co/docs/text-generation-inference/index
- ZML, Sep 2024, ZML: High performance AI inference stack. Built for productionl https://docs.zml.ai/ https://github.com/zml/zml?tab=readme-ov-file
- Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Hongyi Jin, Tianqi Chen, Zhihao Jia, 23 Dec 2023, Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems, https://arxiv.org/abs/2312.15234
- Ruihao Gong, Yifu Ding, Zining Wang, Chengtao Lv, Xingyu Zheng, Jinyang Du, Haotong Qin, Jinyang Guo, Michele Magno, Xianglong Liu, 25 Sep 2024, A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms, https://arxiv.org/abs/2409.16694
- Sebastian Petrus, Sep 4, 2024, Top 10 RAG Frameworks Github Repos 2024, https://sebastian-petrus.medium.com/top-10-rag-frameworks-github-repos-2024-12b2a81f4a49
- Rick Zhou, Larme Zhao, Bo Jiang, and Sean Sheng, June 5, 2024, Benchmarking LLM Inference Backends: vLLM, LMDeploy, MLC-LLM, TensorRT-LLM, and TGI, https://www.bentoml.com/blog/benchmarking-llm-inference-backends
Orchestration Frameworks
Research papers include:
- Konstantinos Papaioannou, Thaleia Dimitra Doudali, April 2024, The Importance of Workload Choice in Evaluating LLM Inference Systems, EuroMLSys '24: Proceedings of the 4th Workshop on Machine Learning and Systems, April 2024, Pages 39–46, https://doi.org/10.1145/3642970.3655823 https://dl.acm.org/doi/abs/10.1145/3642970.3655823
- Jacob Robbins, January 4, 2024, Why generative AI orchestration startups are poised for growth in 2024, Pitch Book, https://pitchbook.com/news/articles/generative-ai-orchestration-startups-venture-capital-unicorns
- Xin Tan, Yimin Jiang, Yitao Yang, Hong Xu, 29 Jun 2024, Teola: Towards End-to-End Optimization of LLM-based Applications, https://arxiv.org/abs/2407.00326
- Chip Huyen, Jul 25, 2024, Building A Generative AI Platform, https://huyenchip.com/2024/07/25/genai-platform.html
- Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng, 6 Jun 2024 (v2), SGLang: Efficient Execution of Structured Language Model Programs, https://arxiv.org/abs/2312.07104 https://github.com/sgl-project/sglang
- The SGLang Team, Jul 25, 2024, Achieving Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, vLLM), https://lmsys.org/blog/2024-07-25-sglang-llama3/
- An Efficient Network Orchestrator for Distributed Compound Language Model Systems Muhammad Shahir Abdurrahman, Stanford University, Stanford, California, USA, https://www.scs.stanford.edu/24sp-cs244b/projects/An_Efficient_Network_Orchestrator_for_Distributed_Compound_Language_Model_Systems.pdf
- Melissa Malec, June 5, 2024, AI Orchestration Explained: The What, Why & How for 2024, https://hatchworks.com/blog/gen-ai/ai-orchestration/
- Manish Kochar, May 19, 2024, Compounding GenAI Success: Why Orchestration is the Key to Mastering Generative AI, https://medium.com/@mkochar/compounding-genai-success-why-orchestration-is-the-key-to-mastering-generative-ai-543a2952acfe
- Carl Franzen, August 23, 2024, Grok-2 gets a speed bump after developers rewrite code in three days, https://venturebeat.com/ai/grok-2-gets-a-speed-bump-after-developers-rewrite-code-in-three-days/ (Inference speed improvement by rewriting using the SGLang orchestration framework.)
- Gary Grossman, September 8, 2024, AI orchestration: Crafting harmony or creating dependency? https://venturebeat.com/ai/ai-orchestration-crafting-harmony-or-creating-dependency/
- A. R. Ali, K. Kumar, M. A. Siddiqui and M. Zahid, 2024, An Open-source Cross-Industry and Cloud-agnostic Generative AI Platform, 2024 International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan, 2024, pp. 1-10, doi: 10.1109/IJCNN60899.2024.10650688, https://ieeexplore.ieee.org/abstract/document/10650688
- LiLMod, Aug 27, 2024, Haystack: the new LLM framework that is shaking its competitors, https://ai.plainenglish.io/haystack-the-new-llm-framework-that-is-shaking-its-competitors-1a083a153fd9
- Yiyuan He, Minxian Xu, Jingfeng Wu, Wanyi Zheng, Kejiang Ye, Chengzhong Xu, 24 Sep 2024 (v2), UELLM: A Unified and Efficient Approach for LLM Inference Serving, https://arxiv.org/abs/2409.14961
- Michael Nuñez, September 25, 2024, AI for all: Meta’s ‘Llama Stack’ promises to simplify enterprise adoption, https://venturebeat.com/ai/ai-for-all-meta-llama-stack-promises-to-simplify-enterprise-ai-adoption/
- Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
- Kabir Nagrecha, Oct 2024, Thesis, Orchestration Systems to Support Deep Learning at Scale Doctor of Philosophy, Computer Science, University of California San Diego, https://escholarship.org/content/qt3pp6k1p4/qt3pp6k1p4_noSplash_457f4c7c0435172a3d0a17428455894c.pdf (Pipeline and data parallelism systems.)
- Emilia David, November 19, 2024, Orchestrator agents: Integration, human interaction, and enterprise knowledge at the core, https://venturebeat.com/ai/orchestrator-agents-integration-human-interaction-and-enterprise-knowledge-at-the-core/
Wrap Architectures for Gen AI Applications
The simplest architectures for AI applications are those that simply "wrap" around LLMs, whether it is commercial LLMs like GPT, or open source LLMs like Mistral or Llama.
- A16Z, April 2nd, 2024 (accessed), AI Getting Started https://github.com/a16z-infra/ai-getting-started (Javascript wrapper kits for several commercial AI APIs.)
- Ben Auffarth, Dec 22, 2023 Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT and other LLMs,https://www.amazon.com/Generative-AI-LangChain-language-ChatGPT/dp/1835083463/
- Thiyagarajan Maruthavan (Rajan), Apr 12, 2024, So what if it is a thin wrapper on OpenAI? https://medium.com/@mtrajan/so-what-if-it-is-a-thin-wrapper-on-openai-274dd005b6d3
- Adva Nakash Peleg, May 30, 2024, An LLM Journey: From POC to Production, https://medium.com/cyberark-engineering/an-llm-journey-from-poc-to-production-6c5ec6a172fb
- Apurv Sibal, February 26, 2025, Hands-On Prompt Engineering: Learning to Program ChatGPT Using OpenAI APIs, Wiley, https://www.amazon.com/Hands-Prompt-Engineering-Learning-Program/dp/1394210760/
- Dr Kris Jamsa, Dec 2023, OpenAI and ChatGPT Programming: Using Python to Unlock OpenAI and ChatGPT, https://www.amazon.com/OpenAI-ChatGPT-Programming-Python-Unlock/dp/B0CQK41P6B/
- Cuantum Technologies, May 2023, ChatGPT API Bible: Mastering Python Programming for Conversational AI: Build Intelligent Chatbots and AI Applications with ChatGPT API and Python (Mastering AI and Python), https://www.amazon.com/ChatGPT-API-Bible-Conversational-Applications/dp/B0C47NWRT7/
- Mike Gold, October 6, 2023, Crafting Applications with ChatGPT API: Using Python, Green Belt Book LLC, https://www.amazon.com/Crafting-Applications-ChatGPT-API-Python-ebook/dp/B0CHJX36X3/
- Henry Habib, Paul Siegel, March 12, 2024, OpenAI API Cookbook: Build intelligent applications including chatbots, virtual assistants, and content generators, Packt Publishing, https://www.amazon.com/OpenAI-API-Cookbook-intelligent-applications-ebook/dp/B0CT8W7B79/
- Olivier Caelen, Marie-Alice Blete, August 13, 2024, Developing Apps with GPT-4 and ChatGPT: Build Intelligent Chatbots, Content Generators, and More, 2nd edition, O'Reilly Media; https://www.amazon.com/Developing-Apps-GPT-4-ChatGPT-Intelligent/dp/1098168100/
- Michael J. Lever, Aug 2024, AI or API? | Chatbot cuckoos are bloating tech OpenAI wrappers are becoming a shortcut for start-ups, but are they sustainable? https://medium.com/future-ux/ai-or-api-chatbot-cuckoos-are-bloating-tech-d6b8d8255279
- Yorick Sens, Henriette Knopp, Sven Peldszus, Thorsten Berger, 12 Aug 2024, A Large-Scale Study of Model Integration in ML-Enabled Software Systems, https://arxiv.org/abs/2408.06226
- Raymond Lo, Jul 10, 2024, How to Build Faster GenAI Apps with Fewer Lines of Code using OpenVINO™ GenAI API, https://medium.com/openvino-toolkit/how-to-build-faster-genai-apps-with-fewer-lines-of-code-using-openvino-genai-api-5dd5fcabea17
- Rachel Curry, Aug 28 2024, Why companies including JPMorgan and Walmart are opting for internal gen AI assistants after initially restricting usage, https://www.cnbc.com/2024/08/28/why-jpmorgan-and-walmart-are-opting-for-internal-gen-ai-assistants.html
- Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
- Xinyi Hou, Yanjie Zhao, Haoyu Wang, 3 Aug 2024, Voices from the Frontier: A Comprehensive Analysis of the OpenAI Developer Forum, https://arxiv.org/abs/2408.01687
- Chandra Irugalbandara, Ashish Mahendra, Roland Daynauth, Tharuka Kasthuri Arachchige, Jayanaka Dantanarayana, Krisztian Flautner, Lingjia Tang, Yiping Kang, Jason Mars, 16 Apr 2024 (v3), Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production, https://arxiv.org/abs/2312.14972
- Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, Yong Liu, 12 Aug 2024 (v2), An Empirical Study on Challenges for LLM Developers, https://arxiv.org/abs/2408.05002
- Dennis Rall, Bernhard Bauer, Thomas Fraunholz, 8 Nov 2023, Towards Democratizing AI: A Comparative Analysis of AI as a Service Platforms and the Open Space for Machine Learning Approach, https://arxiv.org/abs/2311.04518
- David Spuler, March 2024, API Wrapper Architecture Optimizations, in Generative AI in C++, https://www.aussieai.com/book/ch7-api-wrapper-optimizations
- Andrew Zuo, Sep 2024, Don’t Judge An LLM Only By The Web App, https://andrewzuo.com/dont-judge-an-llm-only-by-the-web-app-0a47d29390c3
- Emilia David, September 3, 2024, Anthropic to release system prompts for Artifacts, latest Claude family prompts found incomplete, https://venturebeat.com/ai/anthropic-to-release-system-prompts-for-artifacts-latest-claude-family-prompts-found-incomplete/
- Emilia David, August 27, 2024, Anthropic releases AI model system prompts, winning praise for transparency, https://venturebeat.com/ai/anthropic-releases-ai-model-system-prompts-winning-praise-for-transparency/
- Gian Segato, September 2024, The dawn of a new startup era, https://giansegato.com/essays/dawn-new-startup-era
- Kris Ograbek, Aug 30, 2024, 6 Hard-learned Lessons from My First Project as a Freelance AI Engineer, https://ai.gopubby.com/6-hard-learned-lessons-from-my-first-project-as-a-freelance-ai-engineer-9519e6edee90
- Asankhaya Sharma (codelion), Sep 2024, Optillm: Optimizing inference proxy for LLMs, https://github.com/codelion/optillm
- Xiaoxia Liu, Jingyi Wang, Jun Sun, Xiaohan Yuan, Guoliang Dong, Peng Di, Wenhai Wang, Dongxia Wang, 21 Nov 2023, Prompting Frameworks for Large Language Models: A Survey, https://arxiv.org/abs/2311.12785
- Carl Franzen, September 13, 2024, What OpenAI’s new o1-preview and o1-mini models mean for developers, https://venturebeat.com/programming-development/what-openais-new-o1-preview-and-o1-mini-models-mean-for-developers/
- Sascha Heyer, Sep 2024, RAG API: 30 lines of code is all you need for RAG. The easiest way to get started with RAG. https://medium.com/google-cloud/google-cloud-rag-api-c7e3c9931b3e
- Kyle Wiggers, September 16, 2024, Runway announces an API for its video-generating AI models, https://techcrunch.com/2024/09/16/runway-announces-an-api-for-its-video-generating-models/
- Kyle Wiggers, October 21, 2024, xAI, Elon Musk’s AI startup, launches an API, https://techcrunch.com/2024/10/21/xai-elon-musks-ai-startup-launches-an-api/
- Quang H. Nguyen, Duy C. Hoang, Juliette Decugis, Saurav Manchanda, Nitesh V. Chawla, Khoa D. Doan, 24 Jul 2024 (v2), MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs, https://arxiv.org/abs/2407.10834
- K. Balázs Neszlényi, A. Milos and A. Kiss, "AssistantGPT: Enhancing User Interaction with LLM Integration," 2024 IEEE 22nd Jubilee International Symposium on Intelligent Systems and Informatics (SISY), Pula, Croatia, 2024, pp. 000619-000624, doi: 10.1109/SISY62279.2024.10737548. https://ieeexplore.ieee.org/abstract/document/10737548
- swyx, Sep 2024, What Works in AI UX (lightning talk + Q&A), https://www.youtube.com/watch?v=PkHjoihjo6U
- Latent Space, Nov 2024, Why GPT Wrappers Are Good, Actually, https://www.latent.space/p/gpt-wrappers
- Tegan Jones, 22 November, 2024, Neural Notes: Stop building AI startups with “the same crap” as everyone else. In this edition: warnings for startups relying too heavily on generic AI models and how AI has changed the relationship between VCs and founders. https://www.smartcompany.com.au/artificial-intelligence/neural-notes-stop-building-ai-startups-same-crap-everyone-else/
- Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su, 22 Feb 2024, Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments, https://arxiv.org/abs/2402.14672
- Narcisa Guran, Florian Knauf, Man Ngo, Stefan Petrescu, Jan S. Rellermeyer, 21 Nov 2024, Towards a Middleware for Large Language Models, https://arxiv.org/abs/2411.14513
- Andrew Ng, Nov 2024, Simple, unified interface to multiple Generative AI providers, https://github.com/andrewyng/aisuite
- Angular Ventures, December 03, 2024, Engines or plastics? How we talk about LLMs and how we use them. The Angle Issue #249, https://newsletter.angularventures.com/p/engines-or-plastics-how-we-talk-about-llms-and-how-we-use-them
OpenAI API Applications
One particular type of "wrap" AI application is to use the OpenAI API (e.g. for ChatGPT).
- Dr Kris Jamsa, Dec 2023, OpenAI and ChatGPT Programming: Using Python to Unlock OpenAI and ChatGPT, https://www.amazon.com/OpenAI-ChatGPT-Programming-Python-Unlock/dp/B0CQK41P6B/
- Cuantum Technologies, May 2023, ChatGPT API Bible: Mastering Python Programming for Conversational AI: Build Intelligent Chatbots and AI Applications with ChatGPT API and Python (Mastering AI and Python), https://www.amazon.com/ChatGPT-API-Bible-Conversational-Applications/dp/B0C47NWRT7/
- Mike Gold, October 6, 2023, Crafting Applications with ChatGPT API: Using Python, Green Belt Book LLC, https://www.amazon.com/Crafting-Applications-ChatGPT-API-Python-ebook/dp/B0CHJX36X3/
- Henry Habib, Paul Siegel, March 12, 2024, OpenAI API Cookbook: Build intelligent applications including chatbots, virtual assistants, and content generators, Packt Publishing, https://www.amazon.com/OpenAI-API-Cookbook-intelligent-applications-ebook/dp/B0CT8W7B79/
- Olivier Caelen, Marie-Alice Blete, August 13, 2024, Developing Apps with GPT-4 and ChatGPT: Build Intelligent Chatbots, Content Generators, and More, 2nd edition, O'Reilly Media; https://www.amazon.com/Developing-Apps-GPT-4-ChatGPT-Intelligent/dp/1098168100/
Batch API for Inference
- Michael Nuñez, October 8, 2024, Anthropic challenges OpenAI with affordable batch processing, https://venturebeat.com/ai/anthropic-challenges-openai-with-affordable-batch-processing/
- Microsoft Nov 2024, Getting started with Azure OpenAI global batch deployments, https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/batch
- OpenAI, Nov 2024, Batch API FAQ. Batch API endpoint for asynchronous batch processing, https://help.openai.com/en/articles/9197833-batch-api-faq
- Anthropic, 9 Oct 2024, Introducing the Message Batches API, https://www.anthropic.com/news/message-batches-api
- Katia Gil Guzman Apr 24, 2024, Batch processing with the Batch API, https://cookbook.openai.com/examples/batch_processing
- Lunary, Oct 22, 2024, Using the Batch API with Azure OpenAI, https://lunary.ai/blog/batch-api-azure-openai
- Sukalp Tripathi, Sep 8, 2024, Batch API: OpenAI, https://sukalp.medium.com/batch-api-openai-831a0b09690c
- Google, Nov 2024, Get batch predictions for Gemini, https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/batch-prediction-api
- Google, Nov 2024, Send a batch process documents request, https://cloud.google.com/document-ai/docs/samples/documentai-batch-process-document
- Gibion AI, Jan 15, 2024, Efficient Batch Processing with LangChain and OpenAI: Overcoming RateLimitError, https://medium.com/@hey_16878/efficient-batch-processing-with-langchain-and-openai-overcoming-ratelimiterror-daa9de4bbd8b
- Bingli Liao, Danilo Vasconcellos Vargas, 13 Jul 2024, Beyond KV Caching: Shared Attention for Efficient LLMs, https://arxiv.org/abs/2407.12866 (Layerwise weight sharing in attention.)
Application Layer
The "application layer" is the whole range of applications that can be built on top of generative AI and its LLMs as building blocks. Research includes:
- Ashu Garg, Oct 25, 2024, Why OpenAI’s $157B valuation misreads AI’s future, https://foundationcapital.com/why-openais-157b-valuation-misreads-ais-future/ (Bullish on the "application layer" saying "The top of the stack is where I see the most promise. ...the most valuable companies of the AI era don’t exist yet."... "The cloud era created over 20 application companies with $1B+ revenue. In AI, we believe this number could exceed 100.")
- Akash Bajwa, Nov 18, 2024, Opinionated AI Products: Strong Technologies Forms Beliefs, https://akashbajwa.substack.com/p/opinionated-ai-products
- Meno Ventures, Nov 2024, 2024: The State of Generative AI in the Enterprise: The enterprise AI landscape is being rewritten in real time, https://menlovc.com/2024-the-state-of-generative-ai-in-the-enterprise/
- Tegan Jones, 22 November, 2024, Neural Notes: Stop building AI startups with “the same crap” as everyone else. In this edition: warnings for startups relying too heavily on generic AI models and how AI has changed the relationship between VCs and founders. https://www.smartcompany.com.au/artificial-intelligence/neural-notes-stop-building-ai-startups-same-crap-everyone-else/
- Angular Ventures, December 03, 2024, Engines or plastics? How we talk about LLMs and how we use them. The Angle Issue #249, https://newsletter.angularventures.com/p/engines-or-plastics-how-we-talk-about-llms-and-how-we-use-them
- Charles Rollet, December 4, 2024, Key leaders behind Google’s viral NotebookLM are leaving to create their own startup, https://techcrunch.com/2024/12/04/key-leaders-behind-googles-viral-notebooklm-are-leaving-to-create-their-own-startup/ ("As the frontier models and their capabilities continue to grow, thoughtful products are required to make the benefits of this technology accessible, useful, and obvious to everyday people — so our team is going to be focused on building a user-first AI product...the team wanted to create something that leverages the latest AI models to build something useful to regular people.")
- Leah Hodgson, December 7, 2024, Where are all the consumer AI startups—and why aren’t VCs funding them? https://pitchbook.com/news/articles/where-are-all-the-consumer-ai-startups-and-why-arent-vcs-funding-them ("...consumer AI market by 2032 will be twice the size of the enterprise market for AI."..."According to Zion Market Research, the market size for consumer AI is predicted to grow to around $1.3 trillion by 2032. For enterprise, it is estimated to reach only around $560 billion by the same year, according to Precedence research.")
Code Generation Applications of Generative AI
- Hadi Ghaemi, Zakieh Alizadehsani, Amin Shahraki, Juan M. Corchado, June 2024, Transformers in source code generation: A comprehensive survey, Journal of Systems Architecture, 103193, https://www.sciencedirect.com/science/article/abs/pii/S1383762124001309
- Franklin Huang, May 17, 2024, Machine Learning Systems with Reduced Memory Requirements, Masters of Science, Electrical Engineering and Computer Sciences, University of California, Berkeley, Technical Report No. UCB/EECS-2024-120 http://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-120.html https://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-120.pdf Code: https://github.com/hongyihuang/spec-mcts/blob/main/triton (Broad paper that examines a lot of different optimizations that reduce memory costs, including quantization, kernel fusion, sparsity, MatMul optimizations, KV cache compression, and various other methods.)
- Lianghong Guo, Yanlin Wang, Ensheng Shi, Wanjun Zhong, Hongyu Zhang, Jiachi Chen, Ruikai Zhang, Yuchi Ma, Zibin Zheng, 29 Jul 2024, When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention, https://arxiv.org/abs/2407.20042 Code: https://github.com/DeepSoftwareAnalytics/CodeFast
- AIM, 2024, Mistral AI Unveils Mistral Large 2, Beats Llama 3.1 on Code and Math, https://analyticsindiamag.com/ai-news-updates/mistral-ai-unveils-mistral-large-2-beats-llama-3-1-on-code-and-math/
- Kevin Zhang, Jun 26, 2024, Investing in the Age of Generative AI, https://eastwind.substack.com/p/investing-in-the-age-of-generative
- by Nicholas Carlini, 2024-08-01, How I Use "AI", https://nicholas.carlini.com/writing/2024/how-i-use-ai.html (Generative AI and LLM use cases are "unglamorous" but useful to software developers.)
- Haolin Jin, Linghan Huang, Haipeng Cai, Jun Yan, Bo Li, Huaming Chen, 5 Aug 2024, From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future, https://arxiv.org/abs/2408.02479
- Grant Gross, 30 Aug 2024, Agentic AI: Decisive, operational AI arrives in business, https://www.cio.com/article/3496519/agentic-ai-decisive-operational-ai-arrives-in-business.html
- Hao Zhou, Chengming Hu, Ye Yuan, Yufei Cui, Yili Jin, Can Chen, Haolun Wu, Dun Yuan, Li Jiang, Di Wu, Xue Liu, Charlie Zhang, Xianbin Wang, Jiangchuan Liu, 17 May 2024, Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities, https://arxiv.org/abs/2405.10825
- Yaroslav Zharov, Yury Khudyakov, Evgeniia Fedotova, Evgeny Grigorenko, Egor Bogomolov, 18 Feb 2024, Tool-Augmented LLMs as a Universal Interface for IDEs, https://arxiv.org/abs/2402.11635
- Advait Sarkar, 1 Nov 2023, Will Code Remain a Relevant User Interface for End-User Programming with Generative AI Models? https://arxiv.org/abs/2311.00382
- Liwenhan Xie, Chengbo Zheng, Haijun Xia, Huamin Qu, Chen Zhu-Tian, 3 Aug 2024, WaitGPT: Monitoring and Steering Conversational LLM Agent in Data Analysis with On-the-Fly Code Visualization, https://arxiv.org/abs/2408.01703
- Madhumita Murgia, August 23 2024, AI-powered coding pulls in almost $1bn of funding to claim ‘killer app’ status, https://www.ft.com/content/4868bd38-613c-4fa9-ba9d-1ed8fa8a40c8
- Hesam Sheikh, Aug 2024, The Smarter Way of Using AI in Programming, https://towardsdatascience.com/the-smarter-way-of-using-ai-in-programming-0492ac610385
- Pragmatic Coders, Sep 2024, Best AI tools for developers in 2024: AI-powered coding, https://medium.com/@pragmaticcoders/best-ai-tools-for-developers-in-2024-ai-powered-coding-32e31dff6024
- Zheyuan (Kevin) Cui, Mert Demirer, Sonia Jaffe, Leon Musolff, Sida Peng, Tobias Salz, September 03, 2024, The Effects of Generative AI on High Skilled Work: Evidence from Three Field Experiments with Software Developers, https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4945566 https://papers.ssrn.com/sol3/Delivery.cfm/4945566.pdf?abstractid=4945566&mirid=1
- Asif Razzaq, September 5, 2024, Yi-Coder Released by 01.AI: A Powerful Small-Scale Code LLM Series, Delivering Exceptional Performance in Code Generation, Editing, and Long-Context Comprehension, https://www.marktechpost.com/2024/09/05/yi-coder-released-by-01-ai-a-powerful-small-scale-code-llm-series-delivering-exceptional-performance-in-code-generation-editing-and-long-context-comprehension/
- OpenAI, September 12, 2024, Learning to Reason with LLMs, https://openai.com/index/learning-to-reason-with-llms/
- Grant Gross, 12 Sep 2024, AI coding assistants wave goodbye to junior developers, https://www.cio.com/article/3509174/ai-coding-assistants-wave-goodbye-to-junior-developers.html
- Evan Wang, Federico Cassano, Catherine Wu, Yunfeng Bai, Will Song, Vaskar Nath, Ziwen Han, Sean Hendryx, Summer Yue, Hugh Zhang, 5 Sep 2024, Planning In Natural Language Improves LLM Search For Code Generation, https://arxiv.org/abs/2409.03733
- Michael Nuñez, September 19, 2024, Microsoft’s GRIN-MoE AI model takes on coding and math, beating competitors in key benchmarks, https://venturebeat.com/ai/microsofts-grin-moe-ai-model-takes-on-coding-and-math-beating-competitors-in-key-benchmarks/
- Yanxian Huang, Wanjun Zhong, Ensheng Shi, Min Yang, Jiachi Chen, Hui Li, Yuchi Ma, Qianxiang Wang, Zibin Zheng, Yanlin Wang, 13 Sep 2024, Agents in Software Engineering: Survey, Landscape, and Vision, https://arxiv.org/abs/2409.09030 https://github.com/DeepSoftwareAnalytics/Awesome-Agent4SE
- Grant Gross, 26 Sep 2024, Devs gaining little (if anything) from AI coding assistants, https://www.cio.com/article/3540579/devs-gaining-little-if-anything-from-ai-coding-assistants.html
- Ulyana Piterbarg, Lerrel Pinto, Rob Fergus, 3 Oct 2024, Training Language Models on Synthetic Edit Sequences Improves Code Synthesis, https://arxiv.org/abs/2410.02749
- https://www.cio.com/article/3567138/ai-native-software-engineering-may-be-closer-than-developers-think.html
- C Thiede, M Taeumel, L Böhme, R Hirschfeld, 2024, Talking to Objects in Natural Language: Toward Semantic Tools for Exploratory Programming, Onward! ’24, October 23–25, 2024, Pasadena, CA, USA, https://dl.acm.org/doi/pdf/10.1145/3689492.3690049
- Aki Ranin, Sep 2, 2024, The Code Canaries Are Singing — Our Path Toward AGI: How the fate of human software developers reveals our path toward AGI, https://akiranin.medium.com/the-code-canaries-are-singing-our-path-toward-agi-6c234cae0189
- Jose Yapur, 29 OCT 2024, Introducing the next-level of AI-powered workflows with Amazon Q Developer inline chat, https://aws.amazon.com/blogs/devops/amazon-q-developer-inline-chat/
- GitHub, Oct 2024, Bringing developer choice to Copilot with Anthropic’s Claude 3.5 Sonnet, Google’s Gemini 1.5 Pro, and OpenAI’s o1-preview, https://github.blog/news-insights/product-news/bringing-developer-choice-to-copilot/
- John Wang, Oct 2024, How we saved hundreds of engineering hours by writing tests with LLMs, https://www.assembled.com/blog/how-we-saved-hundreds-of-engineering-hours-by-writing-tests-with-llms
- Shihan Dou, Jiazheng Zhang, Jianxiang Zang, Yunbo Tao, Haoxiang Jia, Shichun Liu, Yuming Yang, Shenxi Wu, Shaoqing Zhang, Muling Wu, Changze Lv, Limao Xiong, Wenyu Zhan, Lin Zhang, Rongxiang Weng, Jingang Wang, Xunliang Cai, Yueming Wu, Ming Wen, Rui Zheng, Tao Ji, Yixin Cao, Tao Gui, Xipeng Qiu, Qi Zhang, Xuanjing Huang, 30 Oct 2024, Multi-Programming Language Sandbox for LLMs, https://arxiv.org/abs/2410.23074
- David Gewirtz, September 27, 2024, The best AI for coding, and a bunch that failed miserably, https://www.zdnet.com/article/the-best-ai-for-coding/
- Jason Perlow, Nov. 6, 2024, The best open-source AI models: All your free-to-use options explained: Here are the best open-source and free-to-use AI models for text, images, and audio, organized by type, application, and licensing considerations. https://www.zdnet.com/article/the-best-open-source-ai-models-all-your-free-to-use-options-explained/
- Fali Wang, Zhiwei Zhang, Xianren Zhang, Zongyu Wu, Tzuhao Mo, Qiuhao Lu, Wanjing Wang, Rui Li, Junjie Xu, Xianfeng Tang, Qi He, Yao Ma, Ming Huang, Suhang Wang, 4 Nov 2024, A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness, https://arxiv.org/abs/2411.03350
- Qwen Team, November 12, 2024, Qwen2.5-Coder Series: Powerful, Diverse, Practical, https://qwenlm.github.io/blog/qwen2.5-coder-family/
- Evan Doyle, Nov 14, 2024, AI Makes Tech Debt More Expensive, https://www.gauge.sh/blog/ai-makes-tech-debt-more-expensive
- Haoxiang Zhang, Shi Chang, Arthur Leung, Kishanthan Thangarajah, Boyuan Chen, Hanan Lutfiyya, Ahmed E. Hassan, 14 Nov 2024, Software Performance Engineering for Foundation Model-Powered Software (FMware), https://arxiv.org/abs/2411.09580
- Josh Fruhlinger, Dec 02, 2024, Refactoring AI code: The good, the bad, and the weird, https://www.infoworld.com/article/3610521/refactoring-ai-code-the-good-the-bad-and-the-weird.html
- Joe McKendrick, Nov. 27, 2024, Gen AI gives software developers surge in productivity - but it's not for everyone, https://www.zdnet.com/article/gen-ai-gives-software-developers-surge-in-productivity-but-its-not-for-everyone/
- Cory Hymel, Dec 02, 2024, 5 ways AI will change the software development life cycle, https://www.infoworld.com/article/3609988/5-ways-ai-will-change-the-software-development-life-cycle.html
- Paul Heltzel, 03 Dec 2024, 5 dead-end IT skills — and how to avoid becoming obsolete, https://www.cio.com/article/188985/6-dead-end-it-skills-and-how-to-avoid-becoming-obsolete.html
Code Checker Applications
- Aman, May 14, 2024, Near-Instant Full-File Edits, Cursor, https://cursor.sh/blog/instant-apply (A type of speculative decoding for code editing called "speculative edits.")
- Ansong Ni, Miltiadis Allamanis, Arman Cohan, Yinlin Deng, Kensen Shi, Charles Sutton, Pengcheng Yin, 23 Apr 2024, NExT: Teaching Large Language Models to Reason about Code Execution, https://arxiv.org/abs/2404.14662
- David Spuler, March 2024, Chapter 40. Reliability, Generative AI in C++: Coding Transformers and LLMs, https://www.amazon.com/dp/B0CXJKCWX9
- Yingbing Huang, Lily Jiaxin Wan, Hanchen Ye, Manvi Jha, Jinghua Wang, Yuhong Li, Xiaofan Zhang, Deming Chen, 16 Jun 2024, New Solutions on LLM Acceleration, Optimization, and Application, https://arxiv.org/abs/2406.10903 (A survey of inference optimization methods and further analysis of Medusa-type speculative decoding and KV cache compression. Also explores hardware co-design, ML compilers and LLM-assisted code debugging.)
- Nat McAleese, Rai (Michael Pokorny), Evgenia Nitishinskaya, Jan Leike, Juan Felipe Cerón Uribe, Maja Trebacz, 2024, LMCritics Help Catch LLM Bugs, https://cdn.openai.com/llm-critics-help-catch-llm-bugs-paper.pdf
- Patrick J. Chapman, Cindy Rubio-González, and Aditya V. Thakur. 2024. Interleaving Static Analysis and LLM Prompting. In Proceedings of the 13th ACM SIGPLAN International Workshop on the State Of the Art in Program Analysis (SOAP 2024). Association for Computing Machinery, New York, NY, USA, 9–17. https://doi.org/10.1145/3652588.3663317 https://dl.acm.org/doi/abs/10.1145/3652588.3663317
- Junwei Liu, Yixuan Chen, Mingwei Liu, Xin Peng, Yiling Lou, 14 Jun 2024, STALL+: Boosting LLM-based Repository-level Code Completion with Static Analysis, https://arxiv.org/abs/2406.10018
- Shaojian Qiu, Huihao Huang, Jianxiang Luo, Yingjie Kuang, Haoyu Luo, 11 Feb 2024, BAFLineDP: Code Bilinear Attention Fusion Framework for Line-Level Defect Prediction, https://arxiv.org/pdf/2402.07132
- Pragmatic Coders, Sep 2024, Best AI tools for developers in 2024: AI-powered coding, https://medium.com/@pragmaticcoders/best-ai-tools-for-developers-in-2024-ai-powered-coding-32e31dff6024
- Tom Ganz, April 2024, Software Defect Localization Using Explainable Deep Learning, Master's Thesis, Master of Science, der Technischen Universität Berlin, https://api-depositonce.tu-berlin.de/server/api/core/bitstreams/308879e0-b14b-4baf-a0c3-19067184ef50/content (AI-based security vulnerability code checker.)
- Francisco Ribeiro, José Nuno Castro de Macedo, Kanae Tsushima, Rui Abreu, João Saraiva, 2023, GPT-3-Powered Type Error Debugging: Investigating the Use of Large Language Models for Code Repair, SLE 2023: Proceedings of the 16th ACM SIGPLAN International Conference on Software Language Engineering, October 2023, Pages 111–124, https://doi.org/10.1145/3623476.3623522 (Code corrections are a type of GEC.)
- Jiawei Guo, Ziming Li, Xueling Liu, Kaijing Ma, Tianyu Zheng, Zhouliang Yu, Ding Pan, Yizhi LI, Ruibo Liu, Yue Wang, Shuyue Guo, Xingwei Qu, Xiang Yue, Ge Zhang, Wenhu Chen, Jie Fu, 4 Apr 2024, CodeEditorBench: Evaluating Code Editing Capability of Large Language Models, https://arxiv.org/abs/2404.03543
- David Spuler, June 2024, Aussie AI, Optimizing On-Device Transformer Inference for Source Code Checking: IP Australia, https://ipsearch.ipaustralia.gov.au/patents/2024901675
- Ulyana Piterbarg, Lerrel Pinto, Rob Fergus, 3 Oct 2024, Training Language Models on Synthetic Edit Sequences Improves Code Synthesis, https://arxiv.org/abs/2410.02749
- Albin Johansson, Carl Holmberg, Francisco Gomes De Oliveira Neto, and Philipp Leitner. 2024. The Impact of Compiler Warnings on Code Quality in C++ Projects. In Proceedings of the 32nd IEEE/ACM International Conference on Program Comprehension (ICPC '24). Association for Computing Machinery, New York, NY, USA, 270–279. https://doi.org/10.1145/3643916.3644410 https://dl.acm.org/doi/abs/10.1145/3643916.3644410 (Using compiler warnings correlations with higher quality metrics.)
- Fang Liu, Zhenwei Liu, Qianhui Zhao, Jing Jiang, Li Zhang, Zian Sun, Ge Li, Zhongqi Li, and Yuchi Ma. 2024. FastFixer: An Efficient and Effective Approach for Repairing Programming Assignments. In Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering (ASE '24). Association for Computing Machinery, New York, NY, USA, 669–680. https://doi.org/10.1145/3691620.3695062 https://dl.acm.org/doi/abs/10.1145/3691620.3695062
- Andrea Lepori, Alexandru Calotoiu, and Torsten Hoefler. 2024. Iterating Pointers: Enabling Static Analysis for Loop-based Pointers. ACM Trans. Archit. Code Optim. Just Accepted (October 2024). https://doi.org/10.1145/3701993 https://dl.acm.org/doi/pdf/10.1145/3701993
- A Hück, T Ziegler, S Schwitanski, J Jenke, C Bischof, Nov 2024, Compiler-Aided Correctness Checking of CUDA-Aware MPI Applications, https://conferences.computer.org/sc-wpub/pdfs/SC-W2024-6oZmigAQfgJ1GhPL0yE3pS/555400a204/555400a204.pdf
- Zeyu Chen, Daiping Liu, Jidong Xiao, and Haining Wang. 2023. All Use-After-Free Vulnerabilities Are Not Created Equal: An Empirical Study on Their Characteristics and Detectability. In Proceedings of the 26th International Symposium on Research in Attacks, Intrusions and Defenses (RAID '23). Association for Computing Machinery, New York, NY, USA, 623–638. https://doi.org/10.1145/3607199.3607229 https://dl.acm.org/doi/10.1145/3607199.3607229 https://vtechworks.lib.vt.edu/bitstream/handle/10919/116595/3607199.3607229.pdf
- B. Gui, W. Song, H. Xiong and J. Huang, "Automated Use-After-Free Detection and Exploit Mitigation: How Far Have We Gone?," in IEEE Transactions on Software Engineering, vol. 48, no. 11, pp. 4569-4589, 1 Nov. 2022, doi: 10.1109/TSE.2021.3121994. https://ieeexplore.ieee.org/document/9583875
- H. Wei, L. Chen, X. Nie, Z. Zhang, Y. Zhang and G. Shi, "An Efficient Metric-Based Approach for Static Use-After-Free Detection," 2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), Melbourne, Australia, 2022, pp. 58-65, doi: 10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00015. https://ieeexplore.ieee.org/document/10070682
User Interface (UI) Issues for AI Apps
- Li Zhang, Shihe Wang, Xianqing Jia, Zhihan Zheng, Yunhe Yan, Longxi Gao, Yuanchun Li, Mengwei Xu, 12 Apr 2024, LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Automation Task Evaluation, https://arxiv.org/abs/2404.16054
- Jiachen Liu, Zhiyu Wu, Jae-Won Chung, Fan Lai, Myungjin Lee, Mosharaf Chowdhury, 25 Apr 2024, Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services, https://arxiv.org/abs/2404.16283 (Scheduling GPU activity for multiple queries to ensure good UI experience for text-streaming outputs like chatbots.)
- NLUX: The 𝗣𝗼𝘄𝗲𝗿𝗳𝘂𝗹 Conversational AI JavaScript Library, https://github.com/nlkitai/nlux
- Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia, 7 Dec 2023, Prompt Highlighter: Interactive Control for Multi-Modal LLMs, https://arxiv.org/abs/2312.04302 Code: https://github.com/dvlab-research/Prompt-Highlighter/ (Allows users to highlight part of their prompt for more specificity.)
- Michael Nuñez, June 21, 2024, Why Anthropic’s Artifacts may be this year’s most important AI feature: Unveiling the interface battle, https://venturebeat.com/ai/why-anthropics-artifacts-may-be-this-years-most-important-ai-feature-unveiling-the-interface-battle/
- Paul DelSignore, Jul 5, 2024, From AI Models to Products: The Shift in AI Strategy: Why Model Performance No Longer Matters, https://generativeai.pub/from-ai-models-to-products-the-shift-in-ai-strategy-b377aeee3948
- Vince Lam, Mar 12, 2024, 50+ Open-Source Options for Running LLMs Locally, https://medium.com/thedeephub/50-open-source-options-for-running-llms-locally-db1ec6f5a54f
- Ethan Mollick, Aug 01, 2024, On speaking to AI: Voice changes a lot of things, https://www.oneusefulthing.org/p/on-speaking-to-ai
- Arvind Narayanan and Sayash Kapoor, Aug 19, 2024, AI companies are pivoting from creating gods to building products. Good. Turning models into products runs into five challenges, https://www.aisnakeoil.com/p/ai-companies-are-pivoting-from-creating
- Lance Whitney, Aug. 28, 2024, Why Claude's Artifacts is the coolest feature I've seen in generative AI so far, https://www.zdnet.com/article/why-claudes-artifacts-is-the-coolest-feature-ive-seen-in-generative-ai-so-far/
- Kevin Lin, Sumant Guha, Joe Spaniac, Andy Zheng, 13 Nov 2020 (v3), Nifty Web Apps: Build a Web App for Any Text-Based Programming Assignment, https://arxiv.org/abs/2010.04671
- Songqin Nong, Jiali Zhu, Rui Wu, Jiongchao Jin, Shuo Shan, Xiutian Huang, Wenhao Xu, 7 Aug 2024 (v2), MobileFlow: A Multimodal LLM For Mobile GUI Agent, https://arxiv.org/abs/2407.04346
- Dongping Chen, Yue Huang, Siyuan Wu, Jingyu Tang, Liuyi Chen, Yilin Bai, Zhigang He, Chenlong Wang, Huichi Zhou, Yiqiang Li, Tianshuo Zhou, Yue Yu, Chujie Gao, Qihui Zhang, Yi Gui, Zhen Li, Yao Wan, Pan Zhou, Jianfeng Gao, Lichao Sun, 16 Jun 2024, GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents, https://arxiv.org/abs/2406.10819 https://gui-world.github.io/
- Kristian Kolthoff, Felix Kretzer, Christian Bartelt, Alexander Maedche, Simone Paolo Ponzetto, 12 Jun 2024, Interlinking User Stories and GUI Prototyping: A Semi-Automatic LLM-based Approach, https://arxiv.org/abs/2406.08120
- Abdur Rahman, Rajat Chawla, Muskaan Kumar, Arkajit Datta, Adarsh Jha, Mukunda NS, Ishaan Bhola, 21 Jul 2024 (v2), V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM, https://arxiv.org/abs/2405.15341
- Danyang Zhang, Zhennan Shen, Rui Xie, Situo Zhang, Tianbao Xie, Zihan Zhao, Siyuan Chen, Lu Chen, Hongshen Xu, Ruisheng Cao, Kai Yu, 13 Jun 2024 (v4), Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction, https://arxiv.org/abs/2305.08144
- Quanfeng Lu, Wenqi Shao, Zitao Liu, Fanqing Meng, Boxuan Li, Botong Chen, Siyuan Huang, Kaipeng Zhang, Yu Qiao, Ping Luo, 12 Jun 2024, GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices, https://arxiv.org/abs/2406.08451 https://github.com/OpenGVLab/GUI-Odyssey
- Shengcheng Yu, Chunrong Fang, Ziyuan Tuo, Quanjun Zhang, Chunyang Chen, Zhenyu Chen, Zhendong Su, 20 Oct 2023, Vision-Based Mobile App GUI Testing: A Survey, https://arxiv.org/abs/2310.13518
- Jieshan Chen, Chunyang Chen, Zhenchang Xing, Xiwei Xu, Liming Zhu, Guoqiang Li, Jinshui Wang, 2 Jul 2020 (v2), Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning, https://arxiv.org/abs/2003.00380
- Carlos Bernal-Cardenas, Kevin Moran, Michele Tufano, Zichang Liu, Linyong Nan, Zhehan Shi, Denys Poshyvanyk, 3 Jan 2019, Guigle: A GUI Search Engine for Android Apps, https://arxiv.org/abs/1901.00891
- Yijie Guo, Zhenhan Huang, Ruhan Wang, Zhihao Yao, Tianyu Yu, Zhiling Xu, Xinyu Zhao, Xueqing Li, Haipeng Mi, 24 Jul 2024, AI-Gadget Kit: Integrating Swarm User Interfaces with LLM-driven Agents for Rich Tabletop Game Applications, https://arxiv.org/abs/2407.17086
- Harry Li, Gabriel Appleby, Ashley Suh, 7 Jun 2024, LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering, https://arxiv.org/abs/2406.06621
- William Seymour, Emilee Rader, 23 May 2024, Speculating About Multi-user Conversational Interfaces and LLMs: What If Chatting Wasn't So Lonely? https://arxiv.org/abs/2405.14390
- Daniel Chin, Yuxuan Wang, Gus Xia, 19 May 2024, Human-Centered LLM-Agent User Interface: A Position Paper, https://arxiv.org/abs/2405.13050
- Yaroslav Zharov, Yury Khudyakov, Evgeniia Fedotova, Evgeny Grigorenko, Egor Bogomolov, 18 Feb 2024, Tool-Augmented LLMs as a Universal Interface for IDEs, https://arxiv.org/abs/2402.11635
- Syed Mekael Wasti, Ken Q. Pu, Ali Neshati, 16 Apr 2024 (v2), Large Language User Interfaces: Voice Interactive User Interfaces powered by LLMs, https://arxiv.org/abs/2402.07938
- Qirui Huang, Min Lu, Joel Lanir, Dani Lischinski, Daniel Cohen-Or, Hui Huang, 24 Jan 2024, GraphiMind: LLM-centric Interface for Information Graphics Design, https://arxiv.org/abs/2401.13245
- Yue Jiang, Changkong Zhou, Vikas Garg, Antti Oulasvirta, 21 Apr 2024, Graph4GUI: Graph Neural Networks for Representing Graphical User Interfaces, https://arxiv.org/abs/2404.13521
- Daniel Buschek, 27 May 2024, Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools, https://arxiv.org/abs/2405.17217
- Abdallah Namoun, Ahmed Alrehaili, Zaib Un Nisa, Hani Almoamari, Ali Tufail, 5 May 2024, Predicting the usability of mobile applications using AI tools: the rise of large user interface models, opportunities, and challenges, https://arxiv.org/abs/2405.03716
- Zijian Ding, 2 May 2024 (v2), Towards Intent-based User Interfaces: Charting the Design Space of Intent-AI Interactions Across Task Types, https://arxiv.org/abs/2404.18196
- Patrick Ebel, 16 Feb 2024, Generative AI and Attentive User Interfaces: Five Strategies to Enhance Take-Over Quality in Automated Driving, https://arxiv.org/abs/2402.10664
- Advait Sarkar, 1 Nov 2023, Will Code Remain a Relevant User Interface for End-User Programming with Generative AI Models? https://arxiv.org/abs/2311.00382
- Alex Renda, Harrison Goldstein, Sarah Bird, Chris Quirk, Adrian Sampson, 14 Sep 2017, Abstractions for AI-Based User Interfaces and Systems, https://arxiv.org/abs/1709.04991
- Thomas Mildner, Orla Cooney, Anna-Maria Meck, Marion Bartl, Gian-Luca Savino, Philip R. Doyle, Diego Garaialde, Leigh Clark, John Sloan, Nina Wenig, Rainer Malaka, Jasmin Niess, 26 Jan 2024, Listening to the Voices: Describing Ethical Caveats of Conversational User Interfaces According to Experts and Frequent Users, Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11--16, 2024, Honolulu, HI, USA, https://arxiv.org/abs/2401.14746 https://doi.org/https://doi.org/10.1145/3613904.3642542
- Andreas Liesenfeld, Alianda Lopez, Mark Dingemanse, 28 Jul 2023, The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems, https://arxiv.org/abs/2307.15493
- William Seymour, Xiao Zhan, Mark Cote, Jose Such, 8 Jun 2023, Who are CUIs Really For? Representation and Accessibility in the Conversational User Interface Literature, https://arxiv.org/abs/2306.05228
- Open WebUI, 2024, Open WebUI (Formerly Ollama WebUI), https://github.com/open-webui/open-webui
- Xhoni Shollaj, 2024, Awesome LLM WebUIs, https://github.com/JShollaj/Awesome-LLM-Web-UI
- Sujeet Kumar, May 20, 2024, 14 Best Software for Running local LLM, https://scifilogic.com/interface-for-running-local-llm/
- Mauro Sicard, Miguel Joya, LanguageGUI is the UI Kit for LLMs, 2024, https://languagegui.com/
- Reddit, 2024, New Open Source Framework and No-Code GUI for Fine-Tuning LLMs: H2O LLM Studio, https://www.reddit.com/r/LocalLLaMA/comments/12yc8op/new_open_source_framework_and_nocode_gui_for/
- LLM-UI, 2024, The React library for LLMs, https://llm-ui.com/
- Reddit, 2024, LLM Web-UI recommendations, https://www.reddit.com/r/LocalLLaMA/comments/1847qt6/llm_webui_recommendations/
- Yuzhe Cai, Shaoguang Mao, Wenshan Wu, Zehua Wang, Yaobo Liang, Tao Ge, Chenfei Wu, Wang You, Ting Song, Yan Xia, Jonathan Tien, Nan Duan, Furu Wei, 1 Apr 2024 (v3), Low-code LLM: Graphical User Interface over Large Language Models, https://arxiv.org/abs/2304.08103 https://github.com/chenfei-wu/TaskMatrix/tree/main/LowCodeLLM https://www.youtube.com/watch?v=jb2C1vaeO3E
- Ramalingame, Hari, May 2024, Deployable Web GUI for LLM Applications, Thesis, Arizona State University, https://keep.lib.asu.edu/items/192554
- by Jarrett Yeo and Tammy Lim , 12 DEC 2023, Create a web UI to interact with LLMs using Amazon SageMaker JumpStart, https://aws.amazon.com/blogs/machine-learning/create-a-web-ui-to-interact-with-llms-using-amazon-sagemaker-jumpstart/
- Difei Gao, Lei Ji, Zechen Bai, Mingyu Ouyang, Peiran Li, Dongxing Mao, Qinchen Wu, Weichen Zhang, Peiyi Wang, Xiangwu Guo, Hengxu Wang, Luowei Zhou, Mike Zheng Shou; 2024, AssistGUI: Task-Oriented PC Graphical User Interface Automation, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 13289-13298, https://openaccess.thecvf.com/content/CVPR2024/html/Gao_AssistGUI_Task-Oriented_PC_Graphical_User_Interface_Automation_CVPR_2024_paper.html https://openaccess.thecvf.com/content/CVPR2024/papers/Gao_AssistGUI_Task-Oriented_PC_Graphical_User_Interface_Automation_CVPR_2024_paper.pdf https://openaccess.thecvf.com/content/CVPR2024/supplemental/Gao_AssistGUI_Task-Oriented_PC_CVPR_2024_supplemental.pdf
- Jie Gao, Simret Araya Gebreegziabher, Kenny Tsu Wei Choo, Toby Jia-Jun Li, Simon Tangi Perrault, Thomas W. Malone, 30 Mar 2024, A Taxonomy for Human-LLM Interaction Modes: An Initial Exploration, https://arxiv.org/abs/2404.00405 https://dl.acm.org/doi/abs/10.1145/3613905.3650786
- Prakash Joshi Pax, Aug 26, 2024, Fabric: The Best AI Tool That Nobody is Talking About. An open-source AI tool to automate every day tasks https://beingpax.medium.com/why-fabric-ai-can-change-the-way-you-use-ai-973e725354da
- Nick: The AI Guru, Aug 15, 2024, Why Perplexity AI Has Been a Game Changer For Me, https://medium.com/@nickm9/why-perplexity-ai-has-been-a-game-changer-for-me-b38976bdc1b4
- Michal Malewicz, Sep 3, 2024, Ugly websites sell better. Web design is getting out of hand again. https://michalmalewicz.medium.com/ugly-websites-sell-better-0b0354ebff10
- Yicheng Fu, Raviteja Anantha, Prabal Vashisht, Jianpeng Cheng, Etai Littwin, 6 Sep 2024, UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity, https://www.arxiv.org/abs/2409.04081
- Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
- Simon Willison, Sep 2024, How streaming LLM APIs work, https://til.simonwillison.net/llms/streaming-llm-apis
- Ellie Ko, Sep 25, 2024, A Survey of Python Frameworks, https://ploomber.io/blog/survey-python-frameworks/
- Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
- Mareike Hartmann, Alexander Koller, 27 Sep 2024, A Survey on Complex Tasks for Goal-Directed Interactive Agents, https://arxiv.org/abs/2409.18538 https://coli-saar.github.io/interactive-agents
- Emilia David, October 3, 2024, OpenAI launches ChatGPT Canvas, challenging Claude Artifacts, https://venturebeat.com/ai/openai-launches-chatgpt-canvas-challenging-claude-artifacts/
- Sabrina Ortiz, Oct. 7, 2024, I test ChatGPT features for a living, and this new one really did supercharge my productivity. If you use OpenAI's generative AI tool to co-edit code or text, Canvas will take your work to a whole new level, https://www.zdnet.com/article/i-test-chatgpt-features-for-a-living-and-this-new-one-really-did-supercharge-my-productivity/
- Emilia David, October 17, 2024, Google launches NotebookLM Business to make enterprise AI audio, text, https://venturebeat.com/ai/googles-notebooklm-will-expand-to-business-use-cases-soon/
- David Gewirtz, Oct. 25, 2024, I wrote half this article on Apple Watch, thanks to this under-the-radar iOS 18 feature: Here's how to transform your writing workflow and turn your Apple Watch into a productivity powerhouse, https://www.zdnet.com/article/i-wrote-half-this-article-on-apple-watch-thanks-to-this-under-the-radar-ios-18-feature/
- LangChain, Jul 26, 2024, UX for Agents, Part 1: Chat, https://blog.langchain.dev/ux-for-agents-part-1-chat-2/
- LangChain, Aug 2, 2024, UX for Agents, Part 2: Ambient, https://blog.langchain.dev/ux-for-agents-part-2-ambient/
- LangChain, Aug 10, 2024, UX for Agents, Part 3: Spreadsheet, Generative, and Collaborative UI/UX, https://blog.langchain.dev/ux-for-agents-part-3/
- Lance Whitney, Oct. 30, 2024, Apple Watch lets you translate your conversations in real-time. Here's how: WatchOS 11's Translate app lets you have a live conversation in two languages with another person - right from your wrist, https://www.zdnet.com/article/apple-watch-lets-you-translate-your-conversations-in-real-time-heres-how/
- Julia Winn, Oct 2024, The AI Productivity Paradox: Why Aren’t More Workers Using ChatGPT? The real barrier isn’t technical skills — it’s time to think. https://towardsdatascience.com/the-ai-productivity-paradox-why-arent-more-workers-using-chatgpt-a1dfe96a9460
- Lance Whitney, Oct. 31, 2024, Claude AI adds desktop apps and dictation mode – here's how to use them, https://www.zdnet.com/article/claude-ai-adds-desktop-apps-and-dictation-mode-heres-how-to-use-them/
- K. Balázs Neszlényi, A. Milos and A. Kiss, "AssistantGPT: Enhancing User Interaction with LLM Integration," 2024 IEEE 22nd Jubilee International Symposium on Intelligent Systems and Informatics (SISY), Pula, Croatia, 2024, pp. 000619-000624, doi: 10.1109/SISY62279.2024.10737548. https://ieeexplore.ieee.org/abstract/document/10737548
- OpenAI, October 3, 2024, Introducing canvas: A new way of working with ChatGPT to write and code, https://openai.com/index/introducing-canvas/
- Emilia David, November 14, 2024, OpenAI launches ChatGPT desktop integrations, rivaling Copilot, https://venturebeat.com/ai/openai-launches-chatgpt-desktop-integrations-rivaling-copilot/
- swyx, Sep 2024, What Works in AI UX (lightning talk + Q&A), https://www.youtube.com/watch?v=PkHjoihjo6U
- swyx & Alessio, Maggie Appleton, Linus Lee, and Geoffrey Litt, Apr 27, 2023, It's Time To Build AI | UX. Bridging the Capability Overhang from Generative AI to Generative UI, https://www.latent.space/p/build-ai-ux
- Akash Bajwa, Nov 18, 2024, Opinionated AI Products: Strong Technologies Forms Beliefs, https://akashbajwa.substack.com/p/opinionated-ai-products
- Jared Spataro, November 19, 2024, Introducing Copilot Actions, new agents, and tools to empower IT teams, https://www.microsoft.com/en-us/microsoft-365/blog/2024/11/19/introducing-copilot-actions-new-agents-and-tools-to-empower-it-teams/ ("Copilot is the UI for AI")
- Tiernan Ray, Nov. 21, 2024 , Even Nvidia's CEO is obsessed with Google's NotebookLM AI tool, https://www.zdnet.com/article/even-nvidias-ceo-is-obsessed-with-googles-notebooklm-ai-tool/
- Ethan Mollick, Nov 24, 2024, Getting started with AI: Good enough prompting. Don't make this hard. https://www.oneusefulthing.org/p/getting-started-with-ai-good-enough
- Charlie Guo, Nov 15, 2024, The Chatbot Trap. Why AI products really need some better UX. https://www.ignorance.ai/p/the-chatbot-trap
- Christian Swinehart, Dec 2024, Skia-Canvas: A GPU-accelerated 2D graphics environment for Node.js, https://github.com/samizdatco/skia-canvas
- Charles Rollet, December 4, 2024, Key leaders behind Google’s viral NotebookLM are leaving to create their own startup, https://techcrunch.com/2024/12/04/key-leaders-behind-googles-viral-notebooklm-are-leaving-to-create-their-own-startup/ ("As the frontier models and their capabilities continue to grow, thoughtful products are required to make the benefits of this technology accessible, useful, and obvious to everyday people — so our team is going to be focused on building a user-first AI product...the team wanted to create something that leverages the latest AI models to build something useful to regular people.")
- Ian Drosos, Jack Williams, Advait Sarkar, Nicholas Wilson, 3 Dec 2024, Dynamic Prompt Middleware: Contextual Prompt Refinement Controls for Comprehension Tasks, https://arxiv.org/abs/2412.02357
- Emilia David, December 10, 2024, OpenAI expands ChatGPT Canvas to all users, https://venturebeat.com/ai/openai-expands-chatgpt-canvas-to-all-users/
Consoles
- Anthropic, 21 May 2024, Generate better prompts in the developer console, https://www.anthropic.com/news/prompt-generator
- Michael Nuñez, September 10, 2024, Is Anthropic’s new ‘Workspaces’ feature the future of enterprise AI management? https://venturebeat.com/ai/is-anthropics-new-workspaces-feature-the-future-of-enterprise-ai-management/
- Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
- Ellie Ko, Sep 25, 2024, A Survey of Python Frameworks, https://ploomber.io/blog/survey-python-frameworks/
- Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
- Emilia David, October 3, 2024, OpenAI launches ChatGPT Canvas, challenging Claude Artifacts, https://venturebeat.com/ai/openai-launches-chatgpt-canvas-challenging-claude-artifacts/
- Sabrina Ortiz, Oct. 7, 2024, I test ChatGPT features for a living, and this new one really did supercharge my productivity. If you use OpenAI's generative AI tool to co-edit code or text, Canvas will take your work to a whole new level, https://www.zdnet.com/article/i-test-chatgpt-features-for-a-living-and-this-new-one-really-did-supercharge-my-productivity/
- Emilia David, October 17, 2024, Google launches NotebookLM Business to make enterprise AI audio, text, https://venturebeat.com/ai/googles-notebooklm-will-expand-to-business-use-cases-soon/
- Jason Perlow, Nov. 8, 2024, How to manage Bluesky, Mastodon, and Threads all from one free app Openvibe simplifies social media management with unified timelines, cross-posting, and customizable feeds for easier navigation of the digital landscape. Here's why you should try it. https://www.zdnet.com/article/how-to-manage-bluesky-mastodon-and-threads-all-from-one-free-app/
- OpenAI, October 3, 2024, Introducing canvas: A new way of working with ChatGPT to write and code, https://openai.com/index/introducing-canvas/
- swyx & Alessio, Maggie Appleton, Linus Lee, and Geoffrey Litt, Apr 27, 2023, It's Time To Build AI | UX. Bridging the Capability Overhang from Generative AI to Generative UI, https://www.latent.space/p/build-ai-ux
- Jared Spataro, November 19, 2024, Introducing Copilot Actions, new agents, and tools to empower IT teams, https://www.microsoft.com/en-us/microsoft-365/blog/2024/11/19/introducing-copilot-actions-new-agents-and-tools-to-empower-it-teams/ ("Copilot is the UI for AI")
- Microsoft, Dec 2024, Power Automate: A comprehensive, end-to-end cloud automation platform powered by low code and AI. https://www.microsoft.com/en-us/power-platform/products/power-automate
- Emilia David, December 10, 2024, OpenAI expands ChatGPT Canvas to all users, https://venturebeat.com/ai/openai-expands-chatgpt-canvas-to-all-users/
Script Languages
- L. Zheng, L. Yin, Z. Xie, J. Huang, C. Sun, C. H. Yu, S. Cao, C. Kozyrakis, I. Stoica, J. E. Gonzalez et al., Dec 2023, Efficiently programming large language models using SGLang, arXiv preprint arXiv:2312.07104, 2023, https://arxiv.org/abs/2312.07104 (Uses a radix attention method, a trie or prefix tree, for KV caching.)
- Hongzheng Chen, Niansong Zhang, Shaojie Xiang, Zhichen Zeng, Mengjia Dai, Zhiru Zhang, 7 Apr 2024, Allo: A Programming Model for Composable Accelerator Design, https://arxiv.org/abs/2404.04815
- Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, Christopher Potts, 5 Oct 2023, DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines, https://arxiv.org/abs/2310.03714 Code: https://github.com/stanfordnlp/dspy
- Honghua Dong, Qidong Su, Yubo Gao, Zhaoyu Li, Yangjun Ruan, Gennady Pekhimenko, Chris J. Maddison, Xujie Si, 19 Jun 2024, APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts, https://arxiv.org/abs/2406.13161 Code: https://github.com/appl-team/appl (A Python-like script language for prompt engineering integration into applications and agents.)
- Till Döhmen, 2024/10/17, Introducing the prompt() Function: Use the Power of LLMs with SQL! https://motherduck.com/blog/sql-llm-prompt-function-gpt-models/
- Mandana Vaziri, Louis Mandel, Claudio Spiess, Martin Hirzel, 24 Oct 2024, PDL: A Declarative Prompt Programming Language, https://arxiv.org/abs/2410.19135
- Saksham Goel, October 29, 2024, Build LLM/RAG pipelines with YAML templates by Pathway, https://pathway.com/blog/llm-yaml-templates
- Yuka Ikarashi, Kevin Qian, Samir Droubi, Alex Reinking, Gilbert Bernstein, Jonathan Ragan-Kelley, 14 Nov 2024 (v2), Exo 2: Growing a Scheduling Language, https://arxiv.org/abs/2411.07211
API Architectures
- Kyle Wiggers, September 16, 2024, Runway announces an API for its video-generating AI models, https://techcrunch.com/2024/09/16/runway-announces-an-api-for-its-video-generating-models/
- Mistral, Sep 2024, AI in abundance. Introducing a free API, improved pricing across the board, a new enterprise-grade Mistral Small, and free vision capabilities on le Chat. https://mistral.ai/news/september-24-release/
- Luma Labs, Sep 2024, Creative Intelligence platform for magical AI products, https://lumalabs.ai/dream-machine/api (API to access video models.)
- Simon Willison, Sep 2024, How streaming LLM APIs work, https://til.simonwillison.net/llms/streaming-llm-apis
- Carl Franzen, September 27, Cohere updates APIs to make it easier for devs to switch from other models, https://venturebeat.com/ai/cohere-updates-apis-to-make-it-easier-for-devs-to-switch-from-other-models/
- Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
- Michael Nuñez, September 25, 2024, AI for all: Meta’s ‘Llama Stack’ promises to simplify enterprise adoption, https://venturebeat.com/ai/ai-for-all-meta-llama-stack-promises-to-simplify-enterprise-ai-adoption/
- Kyle Wiggers, October 3, 2024, Black Forest Labs, the startup behind Grok’s image generator, releases an API, https://techcrunch.com/2024/10/03/black-forest-labs-the-startup-behind-groks-image-generator-releases-an-api/
- Kyle Wiggers, October 21, 2024, xAI, Elon Musk’s AI startup, launches an API, https://techcrunch.com/2024/10/21/xai-elon-musks-ai-startup-launches-an-api/
- X AI, November 4, 2024 API Public Beta, https://x.ai/blog/api
- Gemini is now accessible from the OpenAI Library NOV 08, 2024 Logan Kilpatrick, https://developers.googleblog.com/en/gemini-is-now-accessible-from-the-openai-library/
- Kwindla Hultman Kramer and swyx & Alessio, Nov 22, 2024, OpenAI Realtime API: The Missing Manual, Latent Space, https://www.latent.space/p/realtime-api
- Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su, 22 Feb 2024, Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments, https://arxiv.org/abs/2402.14672
- Andrew Ng, Nov 2024, Simple, unified interface to multiple Generative AI providers, https://github.com/andrewyng/aisuite
- Asif Razzaq, November 29, 2024, Andrew Ng’s Team Releases ‘aisuite’: A New Open Source Python Library for Generative AI, https://www.marktechpost.com/2024/11/29/andrew-ngs-team-releases-aisuite-a-new-open-source-python-library-for-generative-ai/
- Paul Krill Dec 05, 2024, OpenAI unveils API for tracking OpenAI API usage, costs, https://www.infoworld.com/article/3618202/openai-unveils-api-for-tracking-openai-api-usage-costs.html
Plugins
- Reyna Abhyankar, Zijian He, Vikranth Srivatsa, Hao Zhang, Yiying Zhang, 2024, INFERCEPT: Efficient Intercept Support for Augmented Large Language Model Inference, https://openreview.net/pdf?id=wDDGQabYPQ
- Zile Qiao, Wei Ye, Yong Jiang, Tong Mo, Pengjun Xie, Weiping Li, Fei Huang, Shikun Zhang, 12 Jun 2024, Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling, https://arxiv.org/abs/2406.08116
- Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman, 1 Jun 2022 (v3), WebGPT: Browser-assisted question-answering with human feedback, https://arxiv.org/abs/2112.09332
- Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
Custom AI Apps
- Gino Zambe, Feb 1, 2024, Was The GPT store a failure? https://medium.com/@ginozambe/was-the-gpt-store-a-failure-d2a2379fdfc1
- OpenAI, November 6, 2023 Introducing GPTs, OpenAI Blog, https://openai.com/blog/introducing-gpts
- Lance Whitney, June 12, 2024, Microsoft scraps Copilot Pro GPT Builder after just 3 months - how to save your work, https://www.zdnet.com/article/microsoft-scraps-copilot-pro-gpt-builder-after-just-3-months-how-to-save-your-work/
- Reuters, July 30, 2024, Meta to let users to create custom AI characters, https://www.reuters.com/technology/artificial-intelligence/meta-let-users-create-custom-ai-characters-2024-07-29/
- Lucas Mearian, 27 Aug 2024, BCG execs: AI across the company increased productivity, ‘employee joy’, https://www.computerworld.com/article/3491334/bcg-execs-ai-across-the-company-increased-productivity-employee-joy.html
- Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu, 8 May 2024 (v2), Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security, https://arxiv.org/abs/2401.05459 https://github.com/MobileLLM/Personal_LLM_Agents_Survey
- Emilia David, August 30, 2024, OpenAI gives developers more control over AI assistants, https://venturebeat.com/ai/openai-gives-developers-more-control-over-ai-assistants/
- Henrique Centieiro & Bee Lee, Aug 2024, Build Your Own Money-Making Personal AI Bot: An Easy Step-by-Step Guide to Creating and Monetizing Your Personal AI Bot on Poe, https://medium.com/limitless-investor/build-your-own-money-making-personal-ai-bot-9810e3175699
- OpenAI, January 10, 2024, Introducing the GPT Store, https://openai.com/index/introducing-the-gpt-store/
- Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
- Xinyi Hou, Yanjie Zhao, Haoyu Wang, 3 Aug 2024, Voices from the Frontier: A Comprehensive Analysis of the OpenAI Developer Forum, https://arxiv.org/abs/2408.01687
- OpenAI, 2024, GPT Builder: What is the GPT Builder for in ChatGPT and why did we make it? https://help.openai.com/en/articles/8770868-gpt-builder
- Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, Yong Liu, 12 Aug 2024 (v2), An Empirical Study on Challenges for LLM Developers, https://arxiv.org/abs/2408.05002
- Nick: The AI Guru, Aug 15, 2024, Why Perplexity AI Has Been a Game Changer For Me, https://medium.com/@nickm9/why-perplexity-ai-has-been-a-game-changer-for-me-b38976bdc1b4
- https://levelup.gitconnected.com/zero-to-hero-crafting-a-custom-gpt-e2ef22653b1f
- Tiernan Ray, Sept. 4, 2024, Google's Gems are a gentle introduction to AI prompt engineering: Google's pre-built Gems offer prompt examples you can modify to get started with your own custom bot, https://www.zdnet.com/article/googles-gems-are-a-gentle-introduction-to-ai-prompt-engineering/
- Chaojun Xiao, Zhengyan Zhang, Chenyang Song, Dazhi Jiang, Feng Yao, Xu Han, Xiaozhi Wang, Shuo Wang, Yufei Huang, Guanyu Lin, Yingfa Chen, Weilin Zhao, Yuge Tu, Zexuan Zhong, Ao Zhang, Chenglei Si, Khai Hao Moo, Chenyang Zhao, Huimin Chen, Yankai Lin, Zhiyuan Liu, Jingbo Shang, Maosong Sun, Sep 2024, Configurable Foundation Models: Building LLMs from a Modular Perspective, https://arxiv.org/pdf/2409.02877
- Emilia David, September 10, 2024, ServiceNow introduces a library of enterprise AI agents you can customize to fit your workflow, https://venturebeat.com/ai/servicenow-introduces-a-library-of-enterprise-ai-agents-you-can-customize-to-fit-your-workflow/
No Code/Low Code for AI Apps
- Writer, Aug 2024 (accessed), Writer AI Studio: The fastest way to build AI apps, https://writer.com/product/ai-studio/
- Isaac Sacolick, How to choose the right low-code, no-code, or process automation platform, Jul 29, 2024, https://www.infoworld.com/article/3476848/how-to-choose-the-right-low-code-no-code-or-process-automation-platform.html
- Rebekah Carter, 2023, Gartner Magic Quadrant for Enterprise Low-Code Application Platforms 2023, https://www.cxtoday.com/loyalty-management/gartner-magic-quadrant-for-enterprise-low-code-application-platforms-2023/
- Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi, 9 Aug 2024, AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems, https://arxiv.org/abs/2408.15247
- Yanxi Chen, Yaliang Li, Bolin Ding, Jingren Zhou, 20 Jul 2024, On the Design and Analysis of LLM-Based Algorithms, https://arxiv.org/abs/2407.14788 https://github.com/modelscope/agentscope/tree/main/examples/paper_llm_based_algorithm
- Reddit, 2024, New Open Source Framework and No-Code GUI for Fine-Tuning LLMs: H2O LLM Studio, https://www.reddit.com/r/LocalLLaMA/comments/12yc8op/new_open_source_framework_and_nocode_gui_for/
- Yuzhe Cai, Shaoguang Mao, Wenshan Wu, Zehua Wang, Yaobo Liang, Tao Ge, Chenfei Wu, Wang You, Ting Song, Yan Xia, Jonathan Tien, Nan Duan, Furu Wei, 1 Apr 2024 (v3), Low-code LLM: Graphical User Interface over Large Language Models, https://arxiv.org/abs/2304.08103 https://github.com/chenfei-wu/TaskMatrix/tree/main/LowCodeLLM https://www.youtube.com/watch?v=jb2C1vaeO3E
- Zelong Li, Shuyuan Xu, Kai Mei, Wenyue Hua, Balaji Rama, Om Raheja, Hao Wang, He Zhu, Yongfeng Zhang, 1 Jul 2024, AutoFlow: Automated Workflow Generation for Large Language Model Agents, https://arxiv.org/abs/2407.12821 https://github.com/agiresearch/AutoFlow
- Xin Pang, Zhucong Li, Jiaxiang Chen, Yuan Cheng, Yinghui Xu, Yuan Qi, 7 Apr 2024, AI2Apps: A Visual IDE for Building LLM-based AI Agent Applications, https://arxiv.org/abs/2404.04902
- Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxiao Dong, Ming Ding, Jie Tang; 2024, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 14281-14290, https://arxiv.org/abs/2312.08914 https://openaccess.thecvf.com/content/CVPR2024/html/Hong_CogAgent_A_Visual_Language_Model_for_GUI_Agents_CVPR_2024_paper.html https://openaccess.thecvf.com/content/CVPR2024/papers/Hong_CogAgent_A_Visual_Language_Model_for_GUI_Agents_CVPR_2024_paper.pdf
- Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
- S Madden, M Cafarella, M Franklin, T Kraska, 2024, Databases Unbound: Querying All of the World’s Bytes with AI, https://www.vldb.org/pvldb/vol17/p4546-madden.pdf
- Google, Sep 2024, Supercharge your work with no-code. AppSheet helps you build powerful applications and automations that boost productivity. No coding required., https://about.appsheet.com/home/ (Google AppSheet no code platform.)
- Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
- Shubham Sharma, October 8, 2024, Databricks now lets developers create AI apps in 5 minutes: Here’s how, https://venturebeat.com/data-infrastructure/databricks-now-lets-developers-create-ai-apps-in-5-minutes-heres-how/
- Dr. Marcel Müller, Oct 18, 2024, No-Code Generative AI: How Companies Can Build Without Data Scientists, https://medium.com/deep-tech-innovation/no-code-generative-ai-how-companies-can-build-without-data-scientists-7e5ca851f2ba
- Mandana Vaziri, Louis Mandel, Claudio Spiess, Martin Hirzel, 24 Oct 2024, PDL: A Declarative Prompt Programming Language, https://arxiv.org/abs/2410.19135
- Saksham Goel, October 29, 2024, Build LLM/RAG pipelines with YAML templates by Pathway, https://pathway.com/blog/llm-yaml-templates
- Microsoft, Dec 2024, Power Automate: A comprehensive, end-to-end cloud automation platform powered by low code and AI. https://www.microsoft.com/en-us/power-platform/products/power-automate
- Orlando Marquez Ayala, Patrice Béchard, 29 Nov 2024, Generating a Low-code Complete Workflow via Task Decomposition and RAG, https://arxiv.org/abs/2412.00239
- Iván Alfonso, Aaron Conrardy, Jordi Cabot, 6 Dec 2024, Towards the interoperability of low-code platforms, https://arxiv.org/abs/2412.05075
Miniapps
- Kevin Lin, Sumant Guha, Joe Spaniac, Andy Zheng, 13 Nov 2020 (v3), Nifty Web Apps: Build a Web App for Any Text-Based Programming Assignment, https://arxiv.org/abs/2010.04671
- Yuyang Han, Xu Ji, Zhiqiang Wang, Jianyi Zhang, 19 Nov 2023, Systematic Analysis of Security and Vulnerabilities in Miniapps, https://arxiv.org/abs/2311.11382
- Shenao Wang, Yuekang Li, Kailong Wang, Yi Liu, Hui Li, Yang Liu, Haoyu Wang, 16 Jan 2024 (v2), MiniScope: Automated UI Exploration and Privacy Inconsistency Detection of MiniApps via Two-phase Iterative Hybrid Analysis, https://arxiv.org/abs/2401.03218
- Chao Wang, Yue Zhang, Zhiqiang Lin, 13 Jun 2023, Uncovering and Exploiting Hidden APIs in Mobile Super Apps, https://arxiv.org/abs/2306.08134
- Yuqing Yang, Chao Wang, Yue Zhang, Zhiqiang Lin, 13 Jun 2023, SoK: Decoding the Super App Enigma: The Security Mechanisms, Threats, and Trade-offs in OS-alike Apps, https://arxiv.org/abs/2306.07495
- Ozgur Ozan Kilic, Tianle Wang, Matteo Turilli, Mikhail Titov, Andre Merzky, Line Pouchard, Shantenu Jha, 26 Mar 2024, Workflow Mini-Apps: Portable, Scalable, Tunable & Faithful Representations of Scientific Workflows, https://arxiv.org/abs/2403.18073
- Liming Jiang, 12 Feb 2024, Utilizing Large LanguageModels to Detect Privacy Leaks in Mini-App Code, https://arxiv.org/abs/2402.07367
- Yin Wang, Ming Fan, Junfeng Liu, Junjie Tao, Wuxia Jin, Qi Xiong, Yuhao Liu, Qinghua Zheng, Ting Liu, 27 Feb 2023, Do as You Say: Consistency Detection of Data Practice in Program Code and Privacy Policy in Mini-App, https://arxiv.org/abs/2302.13860
- Thomas Steiner, 2024, What are mini apps? https://web.dev/articles/mini-apps/mini-app-about
- Boxo, 2024, What is a Miniapp? A New Era for Apps, https://www.boxo.io/blog/what-is-a-miniapp
- Electrode Native, 2024, What is a MiniApp, https://native.electrode.io/introduction/what-is-ern/what-is-a-miniapp
- W3C, 2024, MiniApps Working Group, https://www.w3.org/2021/miniapps/
- GMO Research, 22 March, 2023, The Rise of Super Apps , https://gmo-research.ai/en/news-events/articles/rise-super-apps
- Grand View Research, 2023, Super Apps Market Size, Share & Trends Analysis Report By Platform (iOS, Android), By Device (Smartphone, Tablets), By Application, By End-user, By Region, And Segment Forecasts, 2023 - 2030, Report ID: GVR-4-68040-036-1, https://www.grandviewresearch.com/industry-analysis/super-apps-market-report
- Lee Ying Shan, Nov 18 2024, Tencent challenges Amazon and Microsoft’s cloud dominance by tapping into its WeChat ecosystem, CNBC, https://www.cnbc.com/2024/11/18/tencent-is-contesting-microsoft-googles-cloud-dominance-with-wechat.html
Tabular Data Applications
- Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang, Ziqing Hu, Yanjun Qi, Scott Nickleach, Diego Socolinsky, Srinivasan Sengamedu, Christos Faloutsos, 1 Mar 2024 (v2), Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey, https://arxiv.org/abs/2402.17944
- Weijia Wang, 2023, Efficient and Explainable Machine Learning Ph.D. thesis, University of California San Diego, https://escholarship.org/content/qt9q52g27p/qt9q52g27p_noSplash_70dba1eae3531240d1fec8e0cdaf1be2.pdf (Processing of tabular data is a weakness of GenAI models, and this thesis examines various issues of tabular data and rules-based processing.)
- David Bonet, Daniel Mas Montserrat, Xavier Giró-i-Nieto, Alexander G. Ioannidis, HyperFast: Instant Classification for Tabular Data, 2023, NeurIPS 2023, https://openreview.net/pdf?id=VRBhaU8IDz
- Irwin Deng, Kushagra Dixit, Vivek Gupta, Dan Roth, 22 Jul 2024, Enhancing Temporal Understanding in LLMs for Semi-structured Tables, https://arxiv.org/abs/2407.16030
- Liang, X., Hu, R., Liu, Y., Zhu, K. (2024). Open-Domain Question Answering over Tables with Large Language Models. In: Huang, DS., Pan, Y., Guo, J. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science, vol 14873. Springer, Singapore. https://doi.org/10.1007/978-981-97-5615-5_28 https://link.springer.com/chapter/10.1007/978-981-97-5615-5_28
- Xianjie Wu, Jian Yang, Linzheng Chai, Ge Zhang, Jiaheng Liu, Xinrun Du, Di Liang, Daixin Shu, Xianfu Cheng, Tianzhen Sun, Guanglin Niu, Tongliang Li, Zhoujun Li, 17 Aug 2024, TableBench: A Comprehensive and Complex Benchmark for Table Question Answering, https://www.arxiv.org/abs/2408.09174
- Asim Biswal, Liana Patel, Siddarth Jha, Amog Kamsetty, Shu Liu, Joseph E. Gonzalez, Carlos Guestrin, Matei Zaharia, 27 Aug 2024, Text2SQL is Not Enough: Unifying AI and Databases with TAG, https://arxiv.org/abs/2408.14717 https://github.com/TAG-Research/TAG-Bench
- Shubham Sharma, September 2, 2024, Table-augmented generation shows promise for complex dataset querying, outperforms text-to-SQL, https://venturebeat.com/data-infrastructure/table-augmented-generation-shows-promise-for-complex-dataset-querying-outperforms-text-to-sql/
- S Madden, M Cafarella, M Franklin, T Kraska, 2024, Databases Unbound: Querying All of the World’s Bytes with AI, https://www.vldb.org/pvldb/vol17/p4546-madden.pdf
- Shubham Sharma, September 12, 2024, Google’s DataGemma AI is a statistics wizard, https://venturebeat.com/ai/datagemma-googles-open-ai-models-mitigate-hallucination-on-statistical-queries/
- David Gewirtz, Sept. 16, 2024, Why natural language AI scripting in Microsoft Excel could be a game changer. What if you could run advanced Excel analyses with no coding skills? Here's how Microsoft's Copilot in Excel could use Python to allow you to do just that, https://www.zdnet.com/article/why-natural-language-ai-scripting-in-microsoft-excel-could-be-a-game-changer/
- Xinyuan Lu, Liangming Pan, Yubo Ma, Preslav Nakov, Min-Yen Kan, 18 Sep 2024, TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning, https://arxiv.org/abs/2409.11724 https://github.com/XinyuanLu00/TART
- Yuzhang Tian, Jianbo Zhao, Haoyu Dong, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang, 12 Jul 2024, SpreadsheetLLM: Encoding Spreadsheets for Large Language Models, https://arxiv.org/abs/2407.09025
- Mukul Singh, Gust Verbruggen, Vu Le, and Sumit Gulwani. 2024. Tabularis Revilio: Converting Text to Tables. In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM '24). Association for Computing Machinery, New York, NY, USA, 4056–4060. https://doi.org/10.1145/3627673.3680000 https://dl.acm.org/doi/abs/10.1145/3627673.3680000
- LangChain, Aug 10, 2024, UX for Agents, Part 3: Spreadsheet, Generative, and Collaborative UI/UX, https://blog.langchain.dev/ux-for-agents-part-3/
- Deyi Ji, Lanyun Zhu, Siqi Gao, Peng Xu, Hongtao Lu, Jieping Ye, Feng Zhao, 13 Nov 2024, Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding, https://arxiv.org/abs/2411.08516
More AI Research
Read more about: