SAN: SScale-space Attention Networks

Yash Garg; K. Selcuk Candan; Maria Luisa Sapino

doi:10.1109/ICDE48307.2020.00079

SAN: SScale-space Attention Networks

Yash Garg, K. Selcuk Candan, Maria Luisa Sapino

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

5 Scopus citations

Abstract

Deep neural networks (DNNs), especially convolutional neural networks (CNNs), have been effective in various data-driven applications. Yet, DNNs suffer from several major challenges; in particular, in many applications where the input data is relatively sparse, DNNs face the problems of overfitting to the input data and poor generalizability. This brings up several critical questions: "Are all inputs equally important" "Can we selectively focus on parts of the input data in a way that reduces overfitting to irrelevant observations" Recently, attention networks showed some success in helping the overall process focus onto parts of the data that carry higher importance in the current context. Yet, we note that the current attention network design approaches are not sufficiently informed about the key data characteristics in identifying salient regions in the data. We propose an innovative robust feature learning framework, scale-invariant attention networks (SAN), that identifies salient regions in the input data for the CNN to focus on. Unlike the existing attention networks, SAN concentrates attention on parts of the data where there is major change across space and scale. We argue, and experimentally show, that the salient regions identified by SAN lead to better network performance compared to state-of-the-art (attentioned and non-attentioned) approaches, including architectures such as LeNet, VGG, ResNet, and LSTM, with common benchmark datasets, MNIST, FMNIST, CIFAR10/20/100, GTSRB, ImageNet, Mocap, Aviage, and GTSDB for tasks such as image/time series classification, time series forecasting and object detection in images.

Original language	English (US)
Title of host publication	Proceedings - 2020 IEEE 36th International Conference on Data Engineering, ICDE 2020
Publisher	IEEE Computer Society
Pages	853-864
Number of pages	12
ISBN (Electronic)	9781728129037
DOIs	https://doi.org/10.1109/ICDE48307.2020.00079
State	Published - Apr 2020
Event	36th IEEE International Conference on Data Engineering, ICDE 2020 - Dallas, United States Duration: Apr 20 2020 → Apr 24 2020

Publication series

Name	Proceedings - International Conference on Data Engineering
Volume	2020-April
ISSN (Print)	1084-4627

Conference

Conference	36th IEEE International Conference on Data Engineering, ICDE 2020
Country/Territory	United States
City	Dallas
Period	4/20/20 → 4/24/20

Keywords

Attention module
Attention networks
Convolutional neural networks

ASJC Scopus subject areas

Software
Signal Processing
Information Systems

Access to Document

10.1109/ICDE48307.2020.00079

Cite this

SAN: SScale-space Attention Networks. / Garg, Yash; Candan, K. Selcuk; Sapino, Maria Luisa.
Proceedings - 2020 IEEE 36th International Conference on Data Engineering, ICDE 2020. IEEE Computer Society, 2020. p. 853-864 9101801 (Proceedings - International Conference on Data Engineering; Vol. 2020-April).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Garg, Y, Candan, KS & Sapino, ML 2020, SAN: SScale-space Attention Networks. in Proceedings - 2020 IEEE 36th International Conference on Data Engineering, ICDE 2020., 9101801, Proceedings - International Conference on Data Engineering, vol. 2020-April, IEEE Computer Society, pp. 853-864, 36th IEEE International Conference on Data Engineering, ICDE 2020, Dallas, United States, 4/20/20. https://doi.org/10.1109/ICDE48307.2020.00079

@inproceedings{4d875758534843259caa9e1d372d54f0,

title = "SAN: SScale-space Attention Networks",

abstract = "Deep neural networks (DNNs), especially convolutional neural networks (CNNs), have been effective in various data-driven applications. Yet, DNNs suffer from several major challenges; in particular, in many applications where the input data is relatively sparse, DNNs face the problems of overfitting to the input data and poor generalizability. This brings up several critical questions: {"}Are all inputs equally important{"} {"}Can we selectively focus on parts of the input data in a way that reduces overfitting to irrelevant observations{"} Recently, attention networks showed some success in helping the overall process focus onto parts of the data that carry higher importance in the current context. Yet, we note that the current attention network design approaches are not sufficiently informed about the key data characteristics in identifying salient regions in the data. We propose an innovative robust feature learning framework, scale-invariant attention networks (SAN), that identifies salient regions in the input data for the CNN to focus on. Unlike the existing attention networks, SAN concentrates attention on parts of the data where there is major change across space and scale. We argue, and experimentally show, that the salient regions identified by SAN lead to better network performance compared to state-of-the-art (attentioned and non-attentioned) approaches, including architectures such as LeNet, VGG, ResNet, and LSTM, with common benchmark datasets, MNIST, FMNIST, CIFAR10/20/100, GTSRB, ImageNet, Mocap, Aviage, and GTSDB for tasks such as image/time series classification, time series forecasting and object detection in images.",

keywords = "Attention module, Attention networks, Convolutional neural networks",

author = "Yash Garg and Candan, {K. Selcuk} and Sapino, {Maria Luisa}",

note = "Funding Information: Partially funded by: NSF #1610282 (DataStorm), #1633381 (Complex Systems), #1629888 (GEARS), #1827757 (PFI-RP), and #1909555 (pCAR) Publisher Copyright: {\textcopyright} 2020 IEEE.; 36th IEEE International Conference on Data Engineering, ICDE 2020 ; Conference date: 20-04-2020 Through 24-04-2020",

year = "2020",

month = apr,

doi = "10.1109/ICDE48307.2020.00079",

language = "English (US)",

series = "Proceedings - International Conference on Data Engineering",

publisher = "IEEE Computer Society",

pages = "853--864",

booktitle = "Proceedings - 2020 IEEE 36th International Conference on Data Engineering, ICDE 2020",

}

TY - GEN

T1 - SAN

T2 - 36th IEEE International Conference on Data Engineering, ICDE 2020

AU - Garg, Yash

AU - Candan, K. Selcuk

AU - Sapino, Maria Luisa

PY - 2020/4

Y1 - 2020/4

N2 - Deep neural networks (DNNs), especially convolutional neural networks (CNNs), have been effective in various data-driven applications. Yet, DNNs suffer from several major challenges; in particular, in many applications where the input data is relatively sparse, DNNs face the problems of overfitting to the input data and poor generalizability. This brings up several critical questions: "Are all inputs equally important" "Can we selectively focus on parts of the input data in a way that reduces overfitting to irrelevant observations" Recently, attention networks showed some success in helping the overall process focus onto parts of the data that carry higher importance in the current context. Yet, we note that the current attention network design approaches are not sufficiently informed about the key data characteristics in identifying salient regions in the data. We propose an innovative robust feature learning framework, scale-invariant attention networks (SAN), that identifies salient regions in the input data for the CNN to focus on. Unlike the existing attention networks, SAN concentrates attention on parts of the data where there is major change across space and scale. We argue, and experimentally show, that the salient regions identified by SAN lead to better network performance compared to state-of-the-art (attentioned and non-attentioned) approaches, including architectures such as LeNet, VGG, ResNet, and LSTM, with common benchmark datasets, MNIST, FMNIST, CIFAR10/20/100, GTSRB, ImageNet, Mocap, Aviage, and GTSDB for tasks such as image/time series classification, time series forecasting and object detection in images.

AB - Deep neural networks (DNNs), especially convolutional neural networks (CNNs), have been effective in various data-driven applications. Yet, DNNs suffer from several major challenges; in particular, in many applications where the input data is relatively sparse, DNNs face the problems of overfitting to the input data and poor generalizability. This brings up several critical questions: "Are all inputs equally important" "Can we selectively focus on parts of the input data in a way that reduces overfitting to irrelevant observations" Recently, attention networks showed some success in helping the overall process focus onto parts of the data that carry higher importance in the current context. Yet, we note that the current attention network design approaches are not sufficiently informed about the key data characteristics in identifying salient regions in the data. We propose an innovative robust feature learning framework, scale-invariant attention networks (SAN), that identifies salient regions in the input data for the CNN to focus on. Unlike the existing attention networks, SAN concentrates attention on parts of the data where there is major change across space and scale. We argue, and experimentally show, that the salient regions identified by SAN lead to better network performance compared to state-of-the-art (attentioned and non-attentioned) approaches, including architectures such as LeNet, VGG, ResNet, and LSTM, with common benchmark datasets, MNIST, FMNIST, CIFAR10/20/100, GTSRB, ImageNet, Mocap, Aviage, and GTSDB for tasks such as image/time series classification, time series forecasting and object detection in images.

KW - Attention module

KW - Attention networks

KW - Convolutional neural networks

UR - http://www.scopus.com/inward/record.url?scp=85085860849&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85085860849&partnerID=8YFLogxK

U2 - 10.1109/ICDE48307.2020.00079

DO - 10.1109/ICDE48307.2020.00079

M3 - Conference contribution

AN - SCOPUS:85085860849

T3 - Proceedings - International Conference on Data Engineering

SP - 853

EP - 864

BT - Proceedings - 2020 IEEE 36th International Conference on Data Engineering, ICDE 2020

PB - IEEE Computer Society

Y2 - 20 April 2020 through 24 April 2020

ER -

SAN: SScale-space Attention Networks

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this