TY - JOUR
T1 - Statistical Guarantees for Transformation Based Models with Applications to Implicit Variational Inference
AU - Plummer, Sean
AU - Zhou, Shuang
AU - Bhattacharya, Anirban
AU - Dunson, David
AU - Pati, Debdeep
N1 - Funding Information:
Pati and Bhattacharya acknowledge support from NSF DMS (1854731, 1916371). In addition, Bhattacharya acknowledges the NSF CAREER 1653404 award for supporting this project.
Publisher Copyright:
Copyright © 2021 by the author(s)
PY - 2021
Y1 - 2021
N2 - Transformation-based methods have been an attractive approach in non-parametric inference for problems such as unconditional and conditional density estimation, due to their unique hierarchical structure that models the data as a flexible transformation of a set of common latent variables. More recently, transformation-based models have been used in variational inference (VI) to construct flexible implicit families of variational distributions. However, their use in both non-parametric inference and variational inference lacks theoretical justification. We provide theoretical justification for the use of non-linear latent variable models (NL-LVMs) in non-parametric inference by showing that the support of the transformation-induced prior in the space of densities is sufficiently large in the L1 sense. We also show that, when a Gaussian process (GP) prior is placed on the transformation function, the posterior concentrates at the optimal rate up to a logarithmic factor. Adopting the flexibility demonstrated in the non-parametric setting, we use the NL-LVM to construct an implicit family of variational distributions, termed GP-IVI. We delineate sufficient conditions under which GP-IVI achieves optimal risk bounds and approximates the true posterior in the sense of the Kullback-Leibler divergence. To the best of our knowledge, this is the first work providing theoretical guarantees for implicit variational inference.
AB - Transformation-based methods have been an attractive approach in non-parametric inference for problems such as unconditional and conditional density estimation, due to their unique hierarchical structure that models the data as a flexible transformation of a set of common latent variables. More recently, transformation-based models have been used in variational inference (VI) to construct flexible implicit families of variational distributions. However, their use in both non-parametric inference and variational inference lacks theoretical justification. We provide theoretical justification for the use of non-linear latent variable models (NL-LVMs) in non-parametric inference by showing that the support of the transformation-induced prior in the space of densities is sufficiently large in the L1 sense. We also show that, when a Gaussian process (GP) prior is placed on the transformation function, the posterior concentrates at the optimal rate up to a logarithmic factor. Adopting the flexibility demonstrated in the non-parametric setting, we use the NL-LVM to construct an implicit family of variational distributions, termed GP-IVI. We delineate sufficient conditions under which GP-IVI achieves optimal risk bounds and approximates the true posterior in the sense of the Kullback-Leibler divergence. To the best of our knowledge, this is the first work providing theoretical guarantees for implicit variational inference.
UR - http://www.scopus.com/inward/record.url?scp=85161956443&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85161956443&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85161956443
SN - 2640-3498
VL - 130
SP - 2449
EP - 2457
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021
Y2 - 13 April 2021 through 15 April 2021
ER -