Interpreting denoising autoencoders with complex perturbation approach

Dharanidharan Arumugam, Ravi Kiran

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

The goal of this study is to interpret denoising autoencoders by quantifying the importance of input pixel features for image reconstruction. The importance of pixel features is evaluated using the attributions of the pixel features to the latent variables of a denoising autoencoder used for image reconstruction. Pixel attributions are computed using a highly accurate and automatable perturbation approach and are plotted as saliency maps. Saliency maps highlight the contribution of the pixels for image reconstruction. The proposed approach produces more meaningful and understandable explanations than guided backpropagation and layer wise propagation methods. Three sanity checks are introduced to verify the fidelity of the generated saliency maps and also to elucidate the influence of inputs on the latent variables. The classification accuracy of images is significantly lowered when the most important pixel regions highlighted by the saliency maps are corrupted validating the proposed approach.

Original languageEnglish (US)
Article number109212
JournalPattern Recognition
Volume136
DOIs
StatePublished - Apr 2023
Externally publishedYes

Keywords

  • Complex step derivative approximation
  • Pixel attributions
  • Saliency maps
  • Sanity checks and deep neural networks (DNNs)
  • Trustworthiness

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Interpreting denoising autoencoders with complex perturbation approach'. Together they form a unique fingerprint.

Cite this