TY - GEN
T1 - Stochastic Gaussian Process Model Averaging for High-Dimensional Inputs
AU - Xuereb, Maxime
AU - Ng, Szu Hui
AU - Pedrielli, Giulia
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/12/14
Y1 - 2020/12/14
N2 - Many statistical learning methodologies lose efficiency and accuracy when applied to large, high-dimensional datasets, and this loss is exacerbated by noisy data. In this paper, we focus on Gaussian Processes (GPs), a family of non-parametric approaches used in machine learning and Bayesian Optimization. GPs scale poorly with both the size and the dimensionality of the input data. This paper presents, for the first time, the Stochastic GP Model Averaging (SGPMA) algorithm, which tackles both challenges. SGPMA uses a Bayesian approach to weight several predictors, each trained on an independent subset of the initial dataset (addressing the large-dataset issue) and defined in a low-dimensional embedding of the original space (addressing the high dimensionality). We conduct several experiments with varying input sizes and dimensionalities. The results show that our methodology is superior to naive averaging and that the choice of embedding is critical to managing the trade-off between computational cost and prediction accuracy.
AB - Many statistical learning methodologies lose efficiency and accuracy when applied to large, high-dimensional datasets, and this loss is exacerbated by noisy data. In this paper, we focus on Gaussian Processes (GPs), a family of non-parametric approaches used in machine learning and Bayesian Optimization. GPs scale poorly with both the size and the dimensionality of the input data. This paper presents, for the first time, the Stochastic GP Model Averaging (SGPMA) algorithm, which tackles both challenges. SGPMA uses a Bayesian approach to weight several predictors, each trained on an independent subset of the initial dataset (addressing the large-dataset issue) and defined in a low-dimensional embedding of the original space (addressing the high dimensionality). We conduct several experiments with varying input sizes and dimensionalities. The results show that our methodology is superior to naive averaging and that the choice of embedding is critical to managing the trade-off between computational cost and prediction accuracy.
UR - http://www.scopus.com/inward/record.url?scp=85103911514&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85103911514&partnerID=8YFLogxK
U2 - 10.1109/WSC48552.2020.9384114
DO - 10.1109/WSC48552.2020.9384114
M3 - Conference contribution
AN - SCOPUS:85103911514
T3 - Proceedings - Winter Simulation Conference
SP - 373
EP - 384
BT - Proceedings of the 2020 Winter Simulation Conference, WSC 2020
A2 - Bae, K.-H.
A2 - Feng, B.
A2 - Kim, S.
A2 - Lazarova-Molnar, S.
A2 - Zheng, Z.
A2 - Roeder, T.
A2 - Thiesing, R.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 Winter Simulation Conference, WSC 2020
Y2 - 14 December 2020 through 18 December 2020
ER -