TY - GEN
T1 - KnowledgeNet: Disaggregated and Distributed Training and Serving of Deep Neural Networks
T2 - 2019 USENIX Conference on Operational Machine Learning, OpML 2019
AU - Biookaghazadeh, Saman
AU - Chen, Yitao
AU - Zhao, Kaiqi
AU - Zhao, Ming
PY - 2019
Y1 - 2019
N2 - Deep Neural Networks (DNNs) have a significant impact on numerous applications, such as video processing, virtual/augmented reality, and text processing. Ever-changing environments force DNN models to evolve accordingly, and the transition from the cloud-only to the edge-cloud paradigm has made deploying and training these models challenging. Addressing these challenges requires new methods and systems for continuously training and distributing models across heterogeneous environments. In this paper, we propose KnowledgeNet (KN), a new architectural technique for simple disaggregation and distribution of neural networks for both training and serving. Using KN, DNNs can be partitioned into multiple small blocks and deployed on a distributed set of computational nodes. KN also uses knowledge transfer to provide small-scale models with high accuracy in resource-limited edge scenarios. Preliminary results show that our method maintains state-of-the-art accuracy for a DNN model while disaggregating it among multiple workers, and that knowledge transfer compresses the model by 62% for deployment while preserving the same accuracy.
UR - http://www.scopus.com/inward/record.url?scp=85077010688&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85077010688&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85077010688
T3 - Proceedings of the 2019 USENIX Conference on Operational Machine Learning, OpML 2019
SP - 47
EP - 49
BT - Proceedings of the 2019 USENIX Conference on Operational Machine Learning, OpML 2019
PB - USENIX Association
Y2 - 20 May 2019
ER -