Convolution Filter Compression via Sparse Linear Combinations of Quantized Basis

Artificial Intelligence

Signal Processing

Computer Vision And Pattern Recognition

Authors

Weichao Lan,Yiu‐ming Cheung

Liang Lan,Juyong Jiang,Zhikai Hu

+3 authors

,Liuqin Liang

Journal

IEEE Transactions on Neural Networks and Learning Systems

Published

Jan 1, 2024

DOI

10.1109/tnnls.2024.3457943

Save

Tip

Document

Submit new version

Download

Flag content

Tip

Save

Document

Submit new version

Download

Flag content

Abstract

Convolutional neural networks (CNNs) have achieved significant performance on various real-life tasks. However, the large number of parameters in convolutional layers requires huge storage and computation resources, making it challenging to deploy CNNs on memory-constrained embedded devices. In this article, we propose a novel compression method that generates the convolution filters in each layer using a set of learnable low-dimensional quantized filter bases. The proposed method reconstructs the convolution filters by stacking the linear combinations of these filter bases. By using quantized values in weights, the compact filters can be represented using fewer bits so that the network can be highly compressed. Furthermore, we explore the sparsity of coefficients through L

Paper PDF

This PDF hasn't been uploaded yet.

Do not upload any copyrighted content to the site, only open-access content.

Scan to connect with one of our mobile apps

Coinbase Wallet app

Connect with your self-custody wallet

Coinbase app

Connect with your Coinbase account

Open Coinbase Wallet app
Tap Scan

Or try the Coinbase Wallet browser extension

Connect with dapps with just one click on your desktop browser
Add an additional layer of security by using a supported Ledger hardware wallet