Aussie AI

Vocabulary Expansion

  • Last Updated 11 June, 2025
  • by David Spuler, Ph.D.

Vocabulary expansion, also called vocabulary extension, increases the size of an LLM's vocabulary. The model then has more distinct tokens, so individual tokens can encode more specific states or outputs.
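One concrete consequence of having more distinct tokens is a larger token embedding table, which has one row of model-dimension width per vocabulary entry. The following is only a rough sketch of that arithmetic: the function name `embedding_parameters`, the vocabulary sizes, and the model dimension of 4096 are illustrative assumptions, not figures from any particular model.

```python
def embedding_parameters(vocab_size, d_model, tied_output=True):
    """Count parameters in the token embedding table.

    If the output (unembedding) layer does not share weights with the
    input embedding, it contributes a second table of the same size.
    """
    table = vocab_size * d_model
    return table if tied_output else 2 * table


D_MODEL = 4096          # hypothetical model dimension
BASE_VOCAB = 32_000     # hypothetical original vocabulary size
EXPANDED_VOCAB = 64_000 # hypothetical expanded vocabulary size

before = embedding_parameters(BASE_VOCAB, D_MODEL)
after = embedding_parameters(EXPANDED_VOCAB, D_MODEL)
untied_after = embedding_parameters(EXPANDED_VOCAB, D_MODEL, tied_output=False)

print(f"embedding parameters before expansion: {before:,}")
print(f"embedding parameters after expansion:  {after:,}")
print(f"after expansion, untied output layer:  {untied_after:,}")
```

Under these assumed sizes, doubling the vocabulary doubles the embedding parameters, so any gain from a larger vocabulary has to be weighed against this extra memory.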

Vocabulary expansion can improve accuracy, for example in languages with a much larger range of words and symbols (e.g., languages that need Unicode or a double-byte character set (DBCS)). A larger vocabulary can also improve efficiency, because an input sequence may be encoded in fewer tokens, which makes the technique similar to token merging.
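The efficiency effect can be seen with a toy tokenizer. The sketch below uses a simple greedy longest-match scheme, which is an assumption made for illustration and not how any specific LLM tokenizer works; the vocabularies and the `tokenize` helper are likewise hypothetical.

```python
def tokenize(text, vocab):
    """Greedy longest-match tokenization over a set of token strings."""
    max_len = max(len(tok) for tok in vocab)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until the vocabulary matches.
        for length in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + length]
            if piece in vocab:
                tokens.append(piece)
                i += length
                break
        else:
            raise ValueError(f"no token covers {text[i]!r}")
    return tokens


# Hypothetical base vocabulary: single lowercase letters plus space.
base_vocab = set("abcdefghijklmnopqrstuvwxyz ")
# Hypothetical expanded vocabulary: the same, plus a few multi-character tokens.
expanded_vocab = base_vocab | {"token", "merg", "ing"}

text = "token merging"
base_tokens = tokenize(text, base_vocab)
expanded_tokens = tokenize(text, expanded_vocab)

print(len(base_tokens), base_tokens)
print(len(expanded_tokens), expanded_tokens)
```

With the base vocabulary, every character becomes its own token. With the expanded vocabulary, the same text is covered by a handful of longer tokens, so the model processes a shorter sequence.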

Most research on vocabulary expansion comes from Neural Machine Translation (NMT), the research area concerned with foreign language translation. This work predates much of the LLM research and often uses non-LLM types of AI models. Hence, more research is needed on vocabulary extension for LLMs.

Related areas of LLM inference optimization include:

Research on Vocabulary Expansion

Research papers on increasing the size of the LLM vocabulary in tokenization:

