Aussie AI

Filter Pruning

  • Last Updated 12 December, 2024
  • by David Spuler, Ph.D.

Filter pruning is a type of LLM inference optimization, primarily in relation to images, that reduces calculations along the width dimension of models. It is primarily related to CNNs, and is analogous to attention head pruning in Transformer architectures.

Research on Filter Pruning

Research papers on filter pruning include:

More Research on Pruning Types

More AI Research

Read more about: