Aussie AI
Heuristics in AI Architectures
-
Last Updated 2 March, 2025
-
by David Spuler, Ph.D.
What are Heuristics?
These days, the term "heuristics" seems to mean "anything except LLMs" in the AI research literature. We used to have software that did things prior to ChatGPT, and amazingly, some of that stuff is still useful. For starters, heuristics are:
- Much faster than LLMs, and
- Sometimes more accurate than LLMs (e.g., no hallucinations)
There are many decades of research on software algorithms that existed long before AI was any good. And now it's come full circle, with some of the research on heuristics starting to be used with LLM architectures.
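One common pattern for combining the two is heuristic-first routing: try a cheap, deterministic rule before paying for an LLM call. The sketch below is a minimal illustration of that idea, assuming a hypothetical `call_llm` client function (shown here only as a placeholder); the specific heuristics are illustrative, not from any particular paper.

```python
import re

def heuristic_answer(query: str):
    """Try to answer with a cheap rule-based heuristic; return None if unsure."""
    # Heuristic 1: simple integer arithmetic, e.g. "what is 12 * 4"
    m = re.search(r"(-?\d+)\s*([+\-*])\s*(-?\d+)", query)
    if m:
        a, op, b = int(m.group(1)), m.group(2), int(m.group(3))
        return str({"+": a + b, "-": a - b, "*": a * b}[op])
    # Heuristic 2: exact-match FAQ lookup (deterministic, so no hallucinations)
    faq = {"what is your name?": "Aussie AI assistant"}
    return faq.get(query.strip().lower())

def call_llm(query: str) -> str:
    # Placeholder for a real LLM API call; substitute your own client here.
    raise NotImplementedError("wire up your LLM client")

def answer(query: str) -> str:
    """Heuristic-first routing: only fall back to the slow LLM when needed."""
    result = heuristic_answer(query)
    if result is not None:
        return result       # fast path: no LLM call at all
    return call_llm(query)  # slow path: defer to the model
```

Queries the heuristics can handle never touch the model, which is where the speed and accuracy advantages above come from; everything else falls through to the LLM unchanged.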
Research on Heuristics
Research papers on the use of "heuristics" in AI include:
- X Zhang, 2024, Disentangling syntactics, semantics, and pragmatics in natural language processing, Doctoral thesis, Nanyang Technological University, Singapore, https://hdl.handle.net/10356/177426 https://dr.ntu.edu.sg/bitstream/10356/177426/2/Final%20Thesis%20for%20DRNTU.pdf
- Jindřich Libovický, Jindřich Helcl, Marek Tlustý, Ondřej Bojar, and Pavel Pecina. 2016. CUNI system for WMT16 automatic post-editing and multimodal translation tasks. In Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers, pages 646–654, Berlin, Germany. https://arxiv.org/abs/1606.07481 (Post-editing of machine translation.)
- M Sponner, B Waschneck, A Kumar, 2024, Adapting Neural Networks at Runtime: Current Trends in At-Runtime Optimizations for Deep Learning, ACM Computing Surveys, PDF: https://dl.acm.org/doi/pdf/10.1145/3657283 (Survey of various adaptive inference optimization techniques with much focus on image and video processing optimization for LLMs.)
- Camilo Chacón Sartori, Christian Blum, Filippo Bistaffa, Guillem Rodríguez Corominas, 28 May 2024, Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach, https://arxiv.org/abs/2405.18272
- Mingjie Sun, Xinlei Chen, J. Zico Kolter, Zhuang Liu, 27 Feb 2024, Massive Activations in Large Language Models, https://arxiv.org/abs/2402.17762 (Examines the range of values of activations, focused on very large outlier values, in models such as LLaMA2-7B, LLaMA2-13B, and Mixtral-8x7B.)
- K. Liao, Y. Zhang, X. Ren, Q. Su, X. Sun, and B. He, “A global past-future early exit method for accelerating inference of pre-trained language models,” in Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021, pp. 2013–2023. https://aclanthology.org/2021.naacl-main.162/
- Ke Hong, Guohao Dai, Jiaming Xu, Qiuli Mao, Xiuhong Li, Jun Liu, Kangdi Chen, Yuhan Dong, Yu Wang, 2024, FlashDecoding++: Faster Large Language Model Inference with Asynchronization, Flat GEMM Optimization, and Heuristics, Part of Proceedings of Machine Learning and Systems 6 (MLSys 2024) Conference, PDF: https://proceedings.mlsys.org/paper_files/paper/2024/file/5321b1dabcd2be188d796c21b733e8c7-Paper-Conference.pdf (Next generation of Flash Decoding, with improved asynchronous parallelism of Softmax in both prefill and decoding phases, heuristic dataflow management algorithms, and enhanced GEMM during the decoding phase.)
- Fei Liu, Xialiang Tong, Mingxuan Yuan, Xi Lin, Fu Luo, Zhenkun Wang, Zhichao Lu, Qingfu Zhang, 1 Jun 2024 (v3), Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model, https://arxiv.org/abs/2401.02051 (Using an LLM to find heuristics, rather than the other way around.)
- David Spuler, June 2024, Aussie AI, Heuristic Optimization of Transformer On-Device Inference: IP Australia, https://ipsearch.ipaustralia.gov.au/patents/2024901670
- Cal Paterson, April 2021, We were promised Strong AI, but instead we got metadata analysis, https://calpaterson.com/metadata.html
- Peng Jiang and Xiaodong Cai, 12 Sep 2024, A Survey of Semantic Parsing Techniques, Symmetry 2024, 16(9), 1201; https://doi.org/10.3390/sym16091201 https://www.mdpi.com/2073-8994/16/9/1201 PDF: https://www.mdpi.com/2073-8994/16/9/1201/pdf?version=1726149198
- Anees Ahmed, Dec 2024, Heuristics in AI: The Secret Ingredient to Solving Complex Problems Quickly, https://aistacked.com/heuristics-in-artificial-intelligence/
- Geeks for Geeks, June 2024, Heuristic Function in AI, https://www.geeksforgeeks.org/heuristic-function-in-ai/
- Jiejun Tan, Zhicheng Dou, Yutao Zhu, Peidong Guo, Kun Fang, Ji-Rong Wen, 30 May 2024 (v3), Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs, https://arxiv.org/abs/2402.12052 https://github.com/plageon/SlimPLM
- Damien de Mijolla, Wen Yang, Philippa Duckett, Christopher Frye, Mark Worrall, 8 Dec 2024, Language hooks: a modular framework for augmenting LLM reasoning that decouples tool usage from the model and its prompt, https://arxiv.org/abs/2412.05967
- Vincent-Pierre Berges, Barlas Oguz, December 12, 2024, Memory Layers at Scale, Meta, https://ai.meta.com/research/publications/memory-layers-at-scale/ https://github.com/facebookresearch/memory (Augmentation of an LLM with an additional key-value associative memory, by replacing some FFNs with a "memory layer".)
- LCM team, Loïc Barrault, Paul-Ambroise Duquenne, Maha Elbayad, Artyom Kozhevnikov, Belen Alastruey, Pierre Andrews, Mariano Coria, Guillaume Couairon, Marta R. Costa-jussà, David Dale, Hady Elsahar, Kevin Heffernan, João Maria Janeiro, Tuan Tran, Christophe Ropers, Eduardo Sánchez, Robin San Roman, Alexandre Mourachko, Safiyyah Saleem, Holger Schwenk, 15 Dec 2024 (v2), Large Concept Models: Language Modeling in a Sentence Representation Space, https://arxiv.org/abs/2412.08821 https://github.com/facebookresearch/large_concept_model (Model operates at the sentence concept level, using SONAR sentence embeddings.)
- Zijie Chen, Zhanchao Zhou, Yu Lu, Renjun Xu, Lili Pan, Zhenzhong Lan, 30 Dec 2024, UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design, https://arxiv.org/abs/2412.20694
- Haoran Wang, Kai Shu, Jan 2025, Make Every Token Count: A Systematic Survey on Decoding Methods for Foundation Models, https://www.researchgate.net/profile/Haoran-Wang-96/publication/387703971_Make_Every_Token_Count_A_Systematic_Survey_on_Decoding_Methods_for_Foundation_Models/links/67784c8ce74ca64e1f49eb15/Make-Every-Token-Count-A-Systematic-Survey-on-Decoding-Methods-for-Foundation-Models.pdf https://github.com/wang2226/Awesome-LLM-Decoding
- A. Mishra, S. Kirmani and K. Madduri, "Fast Sentence Classification using Word Co-occurrence Graphs*," 2024 IEEE International Conference on Big Data (BigData), Washington, DC, USA, 2024, pp. 620-629, doi: 10.1109/BigData62323.2024.10825869. https://ieeexplore.ieee.org/abstract/document/10825869
- Wendi Cui, Jiaxin Zhang, Zhuohang Li, Hao Sun, Damien Lopez, Kamalika Das, Bradley A. Malin, Sricharan Kumar, 26 Feb 2025, Automatic Prompt Optimization via Heuristic Search: A Survey, https://arxiv.org/abs/2502.18746 (Survey of auto prompting, from basic LLM enhancements to some methods quite similar to RALM and TALM.)