TY - GEN
T1 - Dropout as an implicit gating mechanism for continual learning
AU - Mirzadeh, Seyed Iman
AU - Farajtabar, Mehrdad
AU - Ghasemzadeh, Hassan
N1 - Funding Information:
Authors Mirzadeh and Ghasemzadeh were supported in part under grants CNS-1750679 and CNS-1932346 from the United States National Science Foundation. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the funding organizations. The authors would like to thank the anonymous reviewers for their helpful comments.
Publisher Copyright:
© 2020 IEEE.
PY - 2020/6
Y1 - 2020/6
N2 - In recent years, neural networks have demonstrated an outstanding ability to achieve complex learning tasks across various domains. However, they suffer from the "catastrophic forgetting" problem when they face a sequence of learning tasks, where they forget the old tasks as they learn new ones. This problem is also highly related to the "stability-plasticity dilemma": the more plastic the network, the more easily it can learn new tasks, but the faster it also forgets previous ones. Conversely, a stable network cannot learn new tasks as quickly as a very plastic network, but it preserves the knowledge it has learned from previous tasks more reliably. Several solutions have been proposed to overcome the forgetting problem by making the neural network parameters more stable, and some of them have noted the significance of dropout in continual learning, but their relationship has not been sufficiently studied yet. In this paper, we investigate this relationship and show that a stable network with dropout learns a gating mechanism such that for different tasks, different paths of the network are active. Our experiments show that the stability achieved by this implicit gating plays a very critical role in leading to performance comparable to or better than other, more involved continual learning algorithms in overcoming catastrophic forgetting.
AB - In recent years, neural networks have demonstrated an outstanding ability to achieve complex learning tasks across various domains. However, they suffer from the "catastrophic forgetting" problem when they face a sequence of learning tasks, where they forget the old tasks as they learn new ones. This problem is also highly related to the "stability-plasticity dilemma": the more plastic the network, the more easily it can learn new tasks, but the faster it also forgets previous ones. Conversely, a stable network cannot learn new tasks as quickly as a very plastic network, but it preserves the knowledge it has learned from previous tasks more reliably. Several solutions have been proposed to overcome the forgetting problem by making the neural network parameters more stable, and some of them have noted the significance of dropout in continual learning, but their relationship has not been sufficiently studied yet. In this paper, we investigate this relationship and show that a stable network with dropout learns a gating mechanism such that for different tasks, different paths of the network are active. Our experiments show that the stability achieved by this implicit gating plays a very critical role in leading to performance comparable to or better than other, more involved continual learning algorithms in overcoming catastrophic forgetting.
UR - http://www.scopus.com/inward/record.url?scp=85090112058&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85090112058&partnerID=8YFLogxK
U2 - 10.1109/CVPRW50498.2020.00124
DO - 10.1109/CVPRW50498.2020.00124
M3 - Conference contribution
AN - SCOPUS:85090112058
T3 - IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
SP - 945
EP - 951
BT - Proceedings - 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2020
PB - IEEE Computer Society
T2 - 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2020
Y2 - 14 June 2020 through 19 June 2020
ER -