Deep Sparse Coding for Invariant Multimodal Halle Berry Neurons

Edward Kim; Darryl Hannan; Garrett Kenyon

doi:10.1109/CVPR.2018.00122

Conference proceeding

Open access

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), pp 1111-1120

01 Jan 2018

DOI: https://doi.org/10.1109/CVPR.2018.00122

Files and links (1)

url

https://arxiv.org/abs/1711.07998

Submitted; arXiv.org - Non-exclusive license to distribute, Open

Abstract

Computer Science, Artificial Intelligence

Science & Technology

Computer Science

Technology

Deep feed-forward convolutional neural networks (CNNs) have become ubiquitous in virtually all machine learning and computer vision challenges; however, advancements in CNNs have arguably reached an engineering saturation point where incremental novelty results in minor performance gains. Although there is evidence that object classification has reached human levels on narrowly defined tasks, for general applications, the biological visual system is far superior to that of any computer. Research reveals that feed-forward deep neural networks are missing numerous components that are critical in mammalian vision. The brain does not work solely in a feed-forward fashion; rather, all of the neurons are in competition with each other, integrating information in a bottom-up and top-down fashion and incorporating expectation and feedback in the modeling process. Furthermore, our visual cortex works in tandem with our parietal lobe, integrating sensory information from various modalities. In our work, we sought to improve upon the standard feed-forward deep learning model by augmenting it with the biologically inspired concepts of sparsity, top-down feedback, and lateral inhibition. We define our model as a sparse coding problem using hierarchical layers. We solve the sparse coding problem with an additional top-down feedback error driving the dynamics of the neural network. While building and observing the behavior of our model, we were fascinated that multimodal, invariant neurons naturally emerged that mimicked the "Halle Berry neurons" found in the human brain. These neurons, trained in our sparse model, learned to respond to high-level concepts from multiple modalities, which is not the case with a standard feed-forward autoencoder. Furthermore, our sparse representation of multimodal signals demonstrates qualitative and quantitative superiority to the standard feed-forward joint embedding in common vision and machine learning tasks.
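The three ingredients named in the abstract (sparsity, lateral inhibition, and a top-down feedback error) can be illustrated with a small sketch. The abstract does not state which solver or energy function the authors use, so everything here is an assumption: the two-layer energy, the locally-competitive-style update, the random (not learned) dictionaries, and all names (`infer`, `energy`, `random_dictionary`, `lam`, `beta`, `step`) are hypothetical and chosen for illustration only. Plain Python is used so the sketch has no dependencies.

```python
import math
import random


def soft_threshold(u, lam):
    """Shrink potentials toward zero; anything within +/- lam becomes 0 (sparsity)."""
    return [math.copysign(max(abs(v) - lam, 0.0), v) for v in u]


def dot(p, q):
    return sum(pi * qi for pi, qi in zip(p, q))


def synthesize(atoms, coeffs):
    """Reconstruction: sum over j of coeffs[j] * atoms[j]."""
    dim = len(atoms[0])
    return [sum(c * atom[i] for c, atom in zip(coeffs, atoms)) for i in range(dim)]


def random_dictionary(n_atoms, dim, rng):
    """Hypothetical stand-in for a learned dictionary: random unit-norm atoms."""
    atoms = []
    for _ in range(n_atoms):
        v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        norm = math.sqrt(dot(v, v))
        atoms.append([vi / norm for vi in v])
    return atoms


def energy(x, d1, d2, a1, a2, lam, beta):
    """Assumed objective: input error + weighted top-down error + L1 penalties."""
    r0 = [xi - yi for xi, yi in zip(x, synthesize(d1, a1))]   # input residual
    r1 = [ai - yi for ai, yi in zip(a1, synthesize(d2, a2))]  # top-down residual
    l1 = sum(abs(v) for v in a1) + sum(abs(v) for v in a2)
    return 0.5 * dot(r0, r0) + 0.5 * beta * dot(r1, r1) + lam * l1


def infer(x, d1, d2, lam=0.1, beta=0.5, step=0.05, n_steps=400):
    """Euler-integrate membrane potentials u1, u2 for a two-layer sparse code."""
    u1 = [0.0] * len(d1)
    u2 = [0.0] * len(d2)
    for _ in range(n_steps):
        a1 = soft_threshold(u1, lam)
        a2 = soft_threshold(u2, lam)
        r0 = [xi - yi for xi, yi in zip(x, synthesize(d1, a1))]
        r1 = [ai - yi for ai, yi in zip(a1, synthesize(d2, a2))]
        # Layer 1: dot(atom, r0) is the bottom-up drive. Because r0 already has
        # the other active atoms subtracted out, units with overlapping atoms
        # suppress each other (lateral inhibition). The "- beta * e" term is the
        # top-down feedback error from layer 2.
        u1 = [u + step * (-u + a + dot(atom, r0) - beta * e)
              for u, a, atom, e in zip(u1, a1, d1, r1)]
        # Layer 2: driven by how well it explains the layer-1 activations.
        u2 = [u + step * (-u + a + beta * dot(atom, r1))
              for u, a, atom in zip(u2, a2, d2)]
    return soft_threshold(u1, lam), soft_threshold(u2, lam)


if __name__ == "__main__":
    rng = random.Random(0)
    d1 = random_dictionary(32, 16, rng)   # layer 1: 32 atoms over a 16-dim input
    d2 = random_dictionary(8, 32, rng)    # layer 2: 8 atoms over layer-1 codes
    truth = [0.0] * 32
    truth[3], truth[11], truth[20] = 1.0, -0.8, 1.2
    x = synthesize(d1, truth)
    a1, a2 = infer(x, d1, d2)
    print("active layer-1 units:", sum(1 for v in a1 if v != 0.0))
    print("energy:", energy(x, d1, d2, a1, a2, 0.1, 0.5))
```

At a fixed point of these updates, each active unit satisfies the stationarity condition of the assumed energy, so the loop acts as a simple descent on that objective. Setting `beta=0` removes the feedback term and reduces layer 1 to ordinary single-layer sparse coding, which makes the effect of the top-down error easy to compare.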

Metrics

9 Record Views

15 citations in Web of Science

18 citations in Scopus

Details

Title: Deep Sparse Coding for Invariant Multimodal Halle Berry Neurons
Creators: Edward Kim - Los Alamos National Laboratory
Darryl Hannan - Los Alamos National Laboratory
Garrett Kenyon - Los Alamos National Laboratory
Publication Details: 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), pp 1111-1120
Series: IEEE Conference on Computer Vision and Pattern Recognition
Publisher: IEEE
Number of pages: 10
Resource Type: Conference proceeding
Language: English
Academic Unit: Information Science; Computer Science
Web of Science ID: WOS:000457843601025
Scopus ID: 2-s2.0-85062840473
Other Identifier: 991021884588704721

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types: Domestic collaboration
Web of Science research areas: Computer Science, Artificial Intelligence
