IremulbNet: Rethinking the inverted residual architecture for image recognition

Neural Netw. 2024 Apr:172:106140. doi: 10.1016/j.neunet.2024.106140. Epub 2024 Jan 23.

Abstract

An increasing need of running Convolutional Neural Network (CNN) models on mobile devices encourages the studies on efficient and lightweight neural network model. In this paper, an Inverse Residual Multi-Branch Network named IremulbNet is proposed to solve the problem of insufficient classification accuracy in existing lightweight network models. The core module of this model is to reconstruct an inverse residual structure, in which a special feature fusion method, multi-branch feature extraction, and depthwise separable convolution techniques are used to improve the classification accuracy. The performance of model is tested using image databases. Experimental results show that for the fine-grained image dataset Imagenet-woof, IremulbNet achieved 10.9%, 12.2%, and 15.3% higher accuracy than that of MobileNet V3, ShuffleNet V2, and PeleeNet, respectively. Moreover, it can reduce inference time (GPU) about 42.09% and 75.56% compared to classic ResNet50 and DenseNet121.

Keywords: Image recognition; Inverse residual structure; Lightweight model; Mish activation function; Multi-branch structure.

MeSH terms

  • Databases, Factual
  • Neural Networks, Computer*
  • Recognition, Psychology*