Bilinear sparse coding for invariant vision

Neural Comput. 2005 Jan;17(1):47-73. doi: 10.1162/0899766052530893.

Abstract

Recent algorithms for sparse coding and independent component analysis (ICA) have demonstrated how localized features can be learned from natural images. However, these approaches do not take image transformations into account. We describe an unsupervised algorithm for learning both localized features and their transformations directly from images using a sparse bilinear generative model. We show that from an arbitrary set of natural images, the algorithm produces oriented basis filters that can simultaneously represent features in an image and their transformations. The learned generative model can be used to translate features to different locations, thereby reducing the need to learn the same feature at multiple locations, a limitation of previous approaches to sparse coding and ICA. Our results suggest that by explicitly modeling the interaction between local image features and their transformations, the sparse bilinear approach can provide a basis for achieving transformation-invariant vision.
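As a rough illustration of the kind of sparse bilinear generative model the abstract refers to, the sketch below expresses an image patch as a sum of basis vectors weighted jointly by feature coefficients and transformation coefficients. It is a minimal NumPy sketch under common assumptions for bilinear models; the variable names, dimensions, and the L1 sparseness penalty are illustrative choices, not the paper's implementation.

    import numpy as np

    rng = np.random.default_rng(0)

    P = 64    # pixels per image patch (e.g. an 8x8 patch, flattened)
    M = 32    # number of feature coefficients x_i
    N = 10    # number of transformation coefficients y_j

    # Bilinear basis: one P-dimensional basis vector w_ij per (feature, transformation) pair.
    W = rng.standard_normal((P, M, N)) * 0.1

    def reconstruct(W, x, y):
        """Bilinear reconstruction: I_hat = sum_{i,j} w_ij * x_i * y_j."""
        return np.einsum('pmn,m,n->p', W, x, y)

    def sparse_cost(I, W, x, y, lam=0.1):
        """Squared reconstruction error plus sparseness penalties on x and y."""
        residual = I - reconstruct(W, x, y)
        return 0.5 * np.sum(residual**2) + lam * (np.sum(np.abs(x)) + np.sum(np.abs(y)))

    # Example: score a random patch against one active feature in one transformation state.
    I = rng.standard_normal(P)
    x = np.zeros(M); x[3] = 1.0    # a single active feature
    y = np.zeros(N); y[0] = 1.0    # one transformation state
    print(sparse_cost(I, W, x, y))

In a model of this form, holding y fixed recovers an ordinary linear sparse-coding model, while varying y re-renders the same active features under a different transformation (for example, a translation). This is the sense in which the abstract's bilinear factorization reduces the need to learn the same feature separately at multiple locations.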

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Image Processing, Computer-Assisted* / methods
  • Linear Models*
  • Pattern Recognition, Visual*
  • Signal Processing, Computer-Assisted