Aussie AI

Incremental AI Algorithms

  • Last Updated 17 September, 2024
  • by David Spuler, Ph.D.

Incremental algorithms are a general class of code optimizations in which one large computation is replaced by a sequence of smaller, repeated computations. This is the opposite of "batch processing," where a big chunk of processing is done all at once.

A simple example is summing a list of numbers input by a user. You can either collect all the numbers first and then scan through them, adding them up in one pass (non-incrementally), or you can use the incremental approach, where you keep a running sum and add each new number to that sum as it is received.
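The two approaches to summation can be sketched as follows (the function and class names here are illustrative, not from any particular library):

```python
# Batch vs. incremental summation.

def batch_sum(numbers):
    # Non-incremental: collect everything first, then sum in one pass.
    total = 0
    for n in numbers:
        total += n
    return total

class RunningSum:
    # Incremental: maintain a running total, updated as each number arrives.
    def __init__(self):
        self.total = 0

    def add(self, n):
        self.total += n   # small update per new input
        return self.total

rs = RunningSum()
for n in [3, 1, 4, 1, 5]:
    rs.add(n)

assert rs.total == batch_sum([3, 1, 4, 1, 5])  # both equal 14
```

Both versions compute the same result; the difference is that the incremental version has an up-to-date answer available after every input, without re-scanning the whole list.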

AI models are, in a sense, one big incremental algorithm. During training, the weights are incrementally updated, one input item at a time. During inference, the tokens are scanned one at a time, the layers incrementally modify the logits (one layer at a time), and the autoregressive decoding phase emits one new output token at a time.
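The autoregressive decoding phase is perhaps the clearest example of this incremental structure. A minimal sketch, where `next_token` is a hypothetical stand-in for a full LLM forward pass:

```python
# Sketch of autoregressive decoding as an incremental loop.
# `next_token` is a toy placeholder for a real model's forward pass:
# here it just "predicts" the length of the sequence so far.

def next_token(tokens):
    return len(tokens)

def decode(prompt_tokens, max_new=5, eos=-1):
    tokens = list(prompt_tokens)
    for _ in range(max_new):
        tok = next_token(tokens)   # one incremental step per output token
        if tok == eos:
            break
        tokens.append(tok)         # the new token feeds the next step
    return tokens

result = decode([101, 102])  # [101, 102, 2, 3, 4, 5, 6]
```

Each iteration depends on the token produced by the previous one, which is exactly why this loop is hard to parallelize across output tokens.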

Incremental learning is a method of training or fine-tuning whereby the model learns incrementally, one piece of data at a time. It is an established machine learning technique with a substantial body of research. However, inference optimization is not a goal of incremental learning.
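A minimal sketch of the incremental-update idea, using plain per-example gradient descent on a one-parameter linear model (the data and learning rate are illustrative assumptions, not from any specific incremental learning paper):

```python
# Incremental (online) learning sketch: update the model weight one
# example at a time, rather than fitting the whole dataset at once.

def incremental_fit(data, lr=0.1, epochs=50):
    w = 0.0
    for _ in range(epochs):
        for x, y in data:               # one training example per update
            pred = w * x
            grad = 2.0 * (pred - y) * x  # gradient of squared error
            w -= lr * grad               # small incremental weight update
    return w

data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # samples from y = 2x
w = incremental_fit(data)                     # converges close to 2.0
```

Because each update uses only one example, new data can be incorporated as it arrives, without re-training from scratch.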

Incremental algorithms are not a mainstay of inference optimization; most AI optimization techniques are batch rather than incremental. One major reason is that batch algorithms are often easier to parallelize, whereas each step of an incremental computation must await the result of the prior step. However, some AI inference optimizations do make use of incremental algorithms:

Incremental Inference

Research on using incremental algorithms for LLM inference:

Incremental Algorithm Research

Research on incremental algorithms:

More AI Research

Read more about: