Overview

Recent research on dynamic convolution shows a substantial performance boost for efficient CNNs, owing to the adaptive aggregation of K static convolution kernels. However, dynamic convolution has two limitations: (a) it increases the number of convolutional weights by a factor of K, and (b) the joint optimization of dynamic attention and static convolution kernels is challenging. In this project, we revisit it from the new perspective of matrix decomposition and reveal that the key issue is that dynamic convolution applies dynamic attention over channel groups after projecting into a higher-dimensional latent space. To address this issue, we propose dynamic channel fusion to replace dynamic attention over channel groups. Dynamic channel fusion not only enables a significant dimension reduction of the latent space, but also mitigates the joint optimization difficulty. As a result, our method is easier to train and requires significantly fewer parameters without sacrificing accuracy.
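
To make the aggregation step concrete, here is a minimal sketch of how a vanilla dynamic convolution might combine K static kernels using input-dependent attention. It uses 1x1 kernels written as plain nested lists. The helper names (softmax, aggregate_kernels, apply_1x1), the toy kernels, and the hand-set attention logits are illustrative assumptions and are not taken from the project's repository.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def aggregate_kernels(kernels, attention):
    """Weighted sum of K static kernels, each a C_out x C_in matrix (1x1 conv)."""
    rows, cols = len(kernels[0]), len(kernels[0][0])
    return [
        [sum(a * k[i][j] for a, k in zip(attention, kernels)) for j in range(cols)]
        for i in range(rows)
    ]

def apply_1x1(kernel, x):
    """Apply a 1x1 convolution kernel to one pixel's channel vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in kernel]

# Toy setup (hypothetical values): K = 2 static kernels, C = 2 channels.
kernels = [
    [[1.0, 0.0], [0.0, 1.0]],  # W_1: identity
    [[0.0, 1.0], [1.0, 0.0]],  # W_2: channel swap
]

# In a real layer the logits would come from a small attention branch
# applied to the input; here they are fixed by hand (assumption).
attention = softmax([0.0, 0.0])
dynamic_kernel = aggregate_kernels(kernels, attention)
y = apply_1x1(dynamic_kernel, [2.0, 4.0])

# The number of static weights grows linearly with K.
num_static_weights = len(kernels) * len(kernels[0]) * len(kernels[0][0])
```

With equal attention weights the aggregated kernel is the average of the two static kernels, and the stored weight count is K times that of a single kernel, which is limitation (a) above.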

Published in International Conference on Learning Representations (ICLR), 2021.

Arxiv

Repository

Bibtex

Models

Architecture: Dynamic convolution decomposition on tensor.

Architecture: Dynamic convolution decomposition embedded in the main network branch.

Highlights

Highly efficient dynamic operation in a low-dimensional feature space.

Benefit

A more compact model with fewer parameters.
Improved convergence speed and recognition accuracy.
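
As a rough illustration of why a smaller latent space can give a more compact model, the sketch below compares static weight counts for a single 1x1 layer. It assumes the decomposed kernel takes the form W(x) = Λ(x) W0 + P Φ(x) Qᵀ, with a C x C static kernel W0, two C x L static projections P and Q, and dynamically generated Λ(x) and Φ(x). That form, the function names, and the sizes C, K, and L are assumptions made for illustration and are not stated on this page. Parameters of the branch that produces the dynamic terms are ignored.

```python
def vanilla_dynamic_weights(channels, num_kernels):
    """Static weights for K full C x C kernels (1x1 conv)."""
    return num_kernels * channels * channels

def decomposed_weights(channels, latent_dim):
    """Static weights for one C x C kernel plus two C x L projections.

    Assumes the form W(x) = Lambda(x) W0 + P Phi(x) Q^T (see lead-in);
    the dynamic terms Lambda(x) and Phi(x) are generated per input and
    are not counted here.
    """
    return channels * channels + 2 * channels * latent_dim

# Hypothetical layer sizes chosen only for the arithmetic.
C, K, L = 64, 4, 8

vanilla = vanilla_dynamic_weights(C, K)   # 4 * 64 * 64
decomposed = decomposed_weights(C, L)     # 64 * 64 + 2 * 64 * 8
ratio = decomposed / vanilla

print(f"vanilla: {vanilla}, decomposed: {decomposed}, ratio: {ratio:.4f}")
```

Under these assumed sizes the decomposed layer stores 5,120 static weights against 16,384 for the K-kernel version. The figures are arithmetic on hypothetical dimensions, not results reported by the authors.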

Revisiting Dynamic Convolution Via Matrix Decomposition

Yunsheng Li¹

Yinpeng Chen²

Xiyang Dai²

Mengchen Liu²

Dongdong Chen²

Lu Yuan²

Zicheng Liu²

Nuno Vasconcelos¹

UC San Diego¹, Microsoft²

Authors

Yunsheng Li

UC San Diego

Nuno Vasconcelos

UC San Diego