UNLOCKING THE POWER OF VOICE FOR FINANCIAL RISK PREDICTION: A THEORY-DRIVEN DEEP LEARNING DESIGN APPROACH

Yi Yang; Yu Qin; Yangyang Fan; Zhongju Zhang

doi:10.25300/MISQ/2022/17062

Information Systems

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

Unstructured multimedia data (text and audio) provides unprecedented opportunities to derive actionable decision-making in the financial industry, in areas such as portfolio and risk management. However, due to formidable methodological challenges, the promise of business value from unstructured multimedia data has not materialized. In this study, we use a design science approach to develop DeepVoice, a novel nonverbal predictive analysis system for financial risk prediction, in the setting of quarterly earnings conference calls. DeepVoice forecasts financial risk by leveraging not only what managers say (verbal linguistic cues) but also how managers say it (vocal cues) during the earnings conference calls. The design of DeepVoice addresses several challenges associated with the analysis of nonverbal communication. We also propose a two-stage deep learning model to effectively integrate managers’ sequential vocal and verbal cues. Using a unique dataset of 6,047 earnings call samples (audio recordings and textual transcripts) of S&P 500 firms across four years, we show that DeepVoice yields remarkably lower risk forecast errors than that achieved by previous efforts. The improvement can also translate into nontrivial economic gains in options trading. The theoretical and practical implications of analyzing vocal cues are discussed.

Original language	English (US)
Pages (from-to)	63-96
Number of pages	34
Journal	MIS Quarterly: Management Information Systems
Volume	47
Issue number	1
DOIs	https://doi.org/10.25300/MISQ/2022/17062
State	Published - Mar 2023

Keywords

deep learning
design science
Financial risk
verbal cues
vocal cues
vocal-verbal integrations
voice

ASJC Scopus subject areas

Management Information Systems
Information Systems
Computer Science Applications
Information Systems and Management

Cite this

@article{fda3f5597af2449b964e5b3384f1ad00,

title = "UNLOCKING THE POWER OF VOICE FOR FINANCIAL RISK PREDICTION: A THEORY-DRIVEN DEEP LEARNING DESIGN APPROACH",

abstract = "Unstructured multimedia data (text and audio) provides unprecedented opportunities to derive actionable decision-making in the financial industry, in areas such as portfolio and risk management. However, due to formidable methodological challenges, the promise of business value from unstructured multimedia data has not materialized. In this study, we use a design science approach to develop DeepVoice, a novel nonverbal predictive analysis system for financial risk prediction, in the setting of quarterly earnings conference calls. DeepVoice forecasts financial risk by leveraging not only what managers say (verbal linguistic cues) but also how managers say it (vocal cues) during the earnings conference calls. The design of DeepVoice addresses several challenges associated with the analysis of nonverbal communication. We also propose a two-stage deep learning model to effectively integrate managers{\textquoteright} sequential vocal and verbal cues. Using a unique dataset of 6,047 earnings call samples (audio recordings and textual transcripts) of S\&P 500 firms across four years, we show that DeepVoice yields remarkably lower risk forecast errors than that achieved by previous efforts. The improvement can also translate into nontrivial economic gains in options trading. The theoretical and practical implications of analyzing vocal cues are discussed.",

keywords = "deep learning, design science, Financial risk, verbal cues, vocal cues, vocal-verbal integrations, voice",

author = "Yi Yang and Yu Qin and Yangyang Fan and Zhongju Zhang",

year = "2023",

month = mar,

doi = "10.25300/MISQ/2022/17062",

language = "English (US)",

volume = "47",

pages = "63--96",

journal = "MIS Quarterly: Management Information Systems",

issn = "0276-7783",

publisher = "Management Information Systems Research Center",

number = "1",

}

TY - JOUR

T1 - UNLOCKING THE POWER OF VOICE FOR FINANCIAL RISK PREDICTION

T2 - A THEORY-DRIVEN DEEP LEARNING DESIGN APPROACH

AU - Yang, Yi

AU - Qin, Yu

AU - Fan, Yangyang

AU - Zhang, Zhongju

PY - 2023/3

Y1 - 2023/3

AB - Unstructured multimedia data (text and audio) provides unprecedented opportunities to derive actionable decision-making in the financial industry, in areas such as portfolio and risk management. However, due to formidable methodological challenges, the promise of business value from unstructured multimedia data has not materialized. In this study, we use a design science approach to develop DeepVoice, a novel nonverbal predictive analysis system for financial risk prediction, in the setting of quarterly earnings conference calls. DeepVoice forecasts financial risk by leveraging not only what managers say (verbal linguistic cues) but also how managers say it (vocal cues) during the earnings conference calls. The design of DeepVoice addresses several challenges associated with the analysis of nonverbal communication. We also propose a two-stage deep learning model to effectively integrate managers’ sequential vocal and verbal cues. Using a unique dataset of 6,047 earnings call samples (audio recordings and textual transcripts) of S&P 500 firms across four years, we show that DeepVoice yields remarkably lower risk forecast errors than that achieved by previous efforts. The improvement can also translate into nontrivial economic gains in options trading. The theoretical and practical implications of analyzing vocal cues are discussed.

KW - deep learning

KW - design science

KW - Financial risk

KW - verbal cues

KW - vocal cues

KW - vocal-verbal integrations

KW - voice

UR - http://www.scopus.com/inward/record.url?scp=85173057990&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85173057990&partnerID=8YFLogxK

U2 - 10.25300/MISQ/2022/17062

DO - 10.25300/MISQ/2022/17062

M3 - Article

AN - SCOPUS:85173057990

SN - 0276-7783

VL - 47

SP - 63

EP - 96

JO - MIS Quarterly: Management Information Systems

JF - MIS Quarterly: Management Information Systems

IS - 1

ER -