Aussie AI

Shortlisting

  • Last Updated 7 December, 2024
  • by David Spuler, Ph.D.

Shortlisting is a type of vocabulary trimming for reducing the size of the token vocabulary in LLMs. This reduces the size of the vocabulary, thereby reducing both the computation cost and the memory size of model weights.

Shortlisting, also called lexical shortlisting, has been examined mostly in the research on Neural Machine Translation (NMT). Hence, there is a need for more research on LLM shortlisting of the vocabulary.

Related areas of LLM inference optimization include:

Research on Shortlisting

Research papers on lexical shortlisting in LLMs:

More AI Research

Read more about: