Aussie AI

Channel Pruning

  • Last Updated 12 December, 2024
  • by David Spuler, Ph.D.

Channel pruning is a type of LLM inference optimization that reduces calculations along the width dimension of models. It is primarily related to CNNs, and is analogous to attention head pruning in Transformer architectures.

Research on Channel Pruning

Research papers on channel pruning include:

More Research on Pruning Types

More AI Research

Read more about: