Aussie AI Blog
-
by David Spuler, Ph.D.
Latest Blog Articles
- C++ Low Latency Book
- False Sharing and Cache Line Sizes in Multithreading
- Overview of C++ Multithreading Optimizations
- 100 C++ Memory Safety Techniques: a response to C++ memory safety attacks
- What's Hot in LLM Inference Optimization in 2025?
- AI Research by Country
- What's New in Speculative Decoding?
Most Popular
February 2025 Blog Articles
- DeepSeek is Good for NVIDIA and the AI Industry
- DeepSeek Upends Progress in Reasoning and AGI
- Low Latency Programming
January 2025 Blog Articles
- Debugging OpenAI Node.js API Wrappers
- Chain-of-Thought Efficiency Optimization
- Reasoning Decoding Algorithms
- Reasoning Inference Optimization
New Aussie AI Book Releases
- Generative AI Applications: Planning, Design, and Implementation
- CUDA C++ Optimization
- Debugging CUDA C++ Kernels
- Safe C++ Standard and Memory Safety Book
- Generative AI in C++: Coding Transformers and LLMs
December 2024 Blog Articles
- AI Hitting the Wall?
- Reasoning is the New AI Middleware
- Humans are the Top Layer of the AI Stack
- The AI Application Layer
- Consumer vs Enterprise AI
November 2024 Blog Articles
- DIY Preventive C++ Memory Safety
- Canary Values & Redzones for Memory-Safe C++
- User-After-Free Memory Errors in C++
- Array Bounds Violations and Memory Safe C++
- Poisoning Memory Blocks for Safer C++
- Uninitialized Memory Safety in C++
- DIY Memory Safety in C++
October 2024 Blog Articles
- CUDA C++ Floating Point Exceptions
- Memory Safe C++ Library Functions
- Smart Stack Buffers for Memory Safe C++
- Safe C++ Text Buffers with snprintf
- Weight Clustering Needs a Refresh
- Generalizing Prefix KV Caching to RAG Chunks
- RAG Optimization via Caching
- CUDA Memory Coalescing Optimizations
- CUDA GPU Thread Divergence
September 2024 Blog Articles
- Deciding on Your AI Business Project
- Planning Your AI Business Project
- CUDA Basic C++ Programming Mistakes
- 500+ Techniques for LLM Inference Optimization
August 2024 Blog Articles
- State-of-the-Art LLM Backends
- Hot Inference Optimization Techniques
- Inference Optimization Research Ideas
- Sequential Speculative Decoding
- Generative AI Textbook Free Online
More AI Research Topics
Read more about: