Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior

Large pretrained transformer models have revolutionized modern AI applications with their state-of-the-art performance in natural language processing (NLP). However, their substantial parameter count poses challenges for real-world deployment. To address this,... ...