"Let me tell you about your mental health!" Contextualized classification of reddit posts to DSM-5 for web-based intervention

Manas Gaur; Amit Sheth; Ugur Kursuncu; Raminta Daniulaityte; Jyotishman Pathak; Amanuel Alambo; Krishnaprasad Thirunarayan

doi:10.1145/3269206.3271732

"Let me tell you about your mental health!" Contextualized classification of reddit posts to DSM-5 for web-based intervention

Manas Gaur, Amit Sheth, Ugur Kursuncu, Raminta Daniulaityte, Jyotishman Pathak, Amanuel Alambo, Krishnaprasad Thirunarayan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

46 Scopus citations

Abstract

Social media platforms are increasingly being used to share and seek advice on mental health issues. In particular, Reddit users freely discuss such issues on various subreddits, whose structure and content can be leveraged to formally interpret and relate subreddits and their posts in terms of mental health diagnostic categories. There is prior research on the extraction of mental health-related information, including symptoms, diagnosis, and treatments from social media; however, our approach can additionally provide actionable information to clinicians about the mental health of a patient in diagnostic terms for web-based intervention. Specifically, we provide a detailed analysis of the nature of subreddit content from domain expert's perspective and introduce a novel approach to map each subreddit to the best matching DSM-5 (Diagnostic and Statistical Manual of Mental Disorders - 5th Edition) category using multiclass classifier. Our classification algorithm analyzes all the posts of a subreddit by adapting topic modeling and word-embedding techniques, and utilizing curated medical knowledge bases to quantify relationship to DSM-5 categories. Our semantic encoding-decoding optimization approach reduces the false-alarm-rate from 30% to 2.5% over a comparable heuristic baseline, and our mapping results have been verified by domain experts achieving a kappa score of 0.84.

Original language	English (US)
Title of host publication	CIKM 2018 - Proceedings of the 27th ACM International Conference on Information and Knowledge Management
Editors	Norman Paton, Selcuk Candan, Haixun Wang, James Allan, Rakesh Agrawal, Alexandros Labrinidis, Alfredo Cuzzocrea, Mohammed Zaki, Divesh Srivastava, Andrei Broder, Assaf Schuster
Publisher	Association for Computing Machinery
Pages	753-762
Number of pages	10
ISBN (Electronic)	9781450360142
DOIs	https://doi.org/10.1145/3269206.3271732
State	Published - Oct 17 2018
Externally published	Yes
Event	27th ACM International Conference on Information and Knowledge Management, CIKM 2018 - Torino, Italy Duration: Oct 22 2018 → Oct 26 2018

Publication series

Name	International Conference on Information and Knowledge Management, Proceedings

Other

Other	27th ACM International Conference on Information and Knowledge Management, CIKM 2018
Country/Territory	Italy
City	Torino
Period	10/22/18 → 10/26/18

Keywords

DSM-5
Drug Abuse Ontology
Medical Knowledge bases
Mental Health
Reddit
Semantic Encoding and Decoding
Semantic Social Computing

ASJC Scopus subject areas

General Decision Sciences
General Business, Management and Accounting

Access to Document

10.1145/3269206.3271732

Cite this

Gaur, M., Sheth, A., Kursuncu, U., Daniulaityte, R., Pathak, J., Alambo, A., & Thirunarayan, K. (2018). "Let me tell you about your mental health!" Contextualized classification of reddit posts to DSM-5 for web-based intervention. In N. Paton, S. Candan, H. Wang, J. Allan, R. Agrawal, A. Labrinidis, A. Cuzzocrea, M. Zaki, D. Srivastava, A. Broder, & A. Schuster (Eds.), CIKM 2018 - Proceedings of the 27th ACM International Conference on Information and Knowledge Management (pp. 753-762). (International Conference on Information and Knowledge Management, Proceedings). Association for Computing Machinery. https://doi.org/10.1145/3269206.3271732

"Let me tell you about your mental health!" Contextualized classification of reddit posts to DSM-5 for web-based intervention. / Gaur, Manas; Sheth, Amit; Kursuncu, Ugur et al.
CIKM 2018 - Proceedings of the 27th ACM International Conference on Information and Knowledge Management. ed. / Norman Paton; Selcuk Candan; Haixun Wang; James Allan; Rakesh Agrawal; Alexandros Labrinidis; Alfredo Cuzzocrea; Mohammed Zaki; Divesh Srivastava; Andrei Broder; Assaf Schuster. Association for Computing Machinery, 2018. p. 753-762 (International Conference on Information and Knowledge Management, Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Gaur, M, Sheth, A, Kursuncu, U, Daniulaityte, R, Pathak, J, Alambo, A & Thirunarayan, K 2018, "Let me tell you about your mental health!" Contextualized classification of reddit posts to DSM-5 for web-based intervention. in N Paton, S Candan, H Wang, J Allan, R Agrawal, A Labrinidis, A Cuzzocrea, M Zaki, D Srivastava, A Broder & A Schuster (eds), CIKM 2018 - Proceedings of the 27th ACM International Conference on Information and Knowledge Management. International Conference on Information and Knowledge Management, Proceedings, Association for Computing Machinery, pp. 753-762, 27th ACM International Conference on Information and Knowledge Management, CIKM 2018, Torino, Italy, 10/22/18. https://doi.org/10.1145/3269206.3271732

Gaur M, Sheth A, Kursuncu U, Daniulaityte R, Pathak J, Alambo A et al. "Let me tell you about your mental health!" Contextualized classification of reddit posts to DSM-5 for web-based intervention. In Paton N, Candan S, Wang H, Allan J, Agrawal R, Labrinidis A, Cuzzocrea A, Zaki M, Srivastava D, Broder A, Schuster A, editors, CIKM 2018 - Proceedings of the 27th ACM International Conference on Information and Knowledge Management. Association for Computing Machinery. 2018. p. 753-762. (International Conference on Information and Knowledge Management, Proceedings). doi: 10.1145/3269206.3271732

Gaur, Manas ; Sheth, Amit ; Kursuncu, Ugur et al. / "Let me tell you about your mental health!" Contextualized classification of reddit posts to DSM-5 for web-based intervention. CIKM 2018 - Proceedings of the 27th ACM International Conference on Information and Knowledge Management. editor / Norman Paton ; Selcuk Candan ; Haixun Wang ; James Allan ; Rakesh Agrawal ; Alexandros Labrinidis ; Alfredo Cuzzocrea ; Mohammed Zaki ; Divesh Srivastava ; Andrei Broder ; Assaf Schuster. Association for Computing Machinery, 2018. pp. 753-762 (International Conference on Information and Knowledge Management, Proceedings).

@inproceedings{77126bf63075489fa89d828f81822411,

title = "{"}Let me tell you about your mental health!{"} Contextualized classification of reddit posts to DSM-5 for web-based intervention",

abstract = "Social media platforms are increasingly being used to share and seek advice on mental health issues. In particular, Reddit users freely discuss such issues on various subreddits, whose structure and content can be leveraged to formally interpret and relate subreddits and their posts in terms of mental health diagnostic categories. There is prior research on the extraction of mental health-related information, including symptoms, diagnosis, and treatments from social media; however, our approach can additionally provide actionable information to clinicians about the mental health of a patient in diagnostic terms for web-based intervention. Specifically, we provide a detailed analysis of the nature of subreddit content from domain expert's perspective and introduce a novel approach to map each subreddit to the best matching DSM-5 (Diagnostic and Statistical Manual of Mental Disorders - 5th Edition) category using multiclass classifier. Our classification algorithm analyzes all the posts of a subreddit by adapting topic modeling and word-embedding techniques, and utilizing curated medical knowledge bases to quantify relationship to DSM-5 categories. Our semantic encoding-decoding optimization approach reduces the false-alarm-rate from 30% to 2.5% over a comparable heuristic baseline, and our mapping results have been verified by domain experts achieving a kappa score of 0.84.",

keywords = "DSM-5, Drug Abuse Ontology, Medical Knowledge bases, Mental Health, Reddit, Semantic Encoding and Decoding, Semantic Social Computing",

author = "Manas Gaur and Amit Sheth and Ugur Kursuncu and Raminta Daniulaityte and Jyotishman Pathak and Amanuel Alambo and Krishnaprasad Thirunarayan",

note = "Funding Information: We acknowledge partial support from the National Science Foundation (NSF) award CNS-1513721: Context-Aware Harassment Detection on Social Media{"}, National Institutes of Health (NIH) award: MH105384-01A1: Modeling Social Behavior for Healthcare Utilization in Depression{"}, and National Institute on Drug Abuse (NIDA) Grant No. 5R01DA039454-02 Trending: Social media analysis to monitor cannabis and synthetic cannabinoid use . Any opinions, conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF, NIH, or NIDA. Funding Information: We acknowledge partial support from the National Science Foundation (NSF) award CNS-1513721: “Context-Aware Harassment Detection on Social Media{"}, National Institutes of Health (NIH) award: MH105384-01A1: “Modeling Social Behavior for Healthcare Utilization in Depression{"}, and National Institute on Drug Abuse (NIDA) Grant No. 5R01DA039454-02 “Trending: Social media analysis to monitor cannabis and synthetic cannabinoid use”. Any opinions, conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF, NIH, or NIDA. Publisher Copyright: {\textcopyright} 2018 Association for Computing Machinery.; 27th ACM International Conference on Information and Knowledge Management, CIKM 2018 ; Conference date: 22-10-2018 Through 26-10-2018",

year = "2018",

month = oct,

day = "17",

doi = "10.1145/3269206.3271732",

language = "English (US)",

series = "International Conference on Information and Knowledge Management, Proceedings",

publisher = "Association for Computing Machinery",

pages = "753--762",

editor = "Norman Paton and Selcuk Candan and Haixun Wang and James Allan and Rakesh Agrawal and Alexandros Labrinidis and Alfredo Cuzzocrea and Mohammed Zaki and Divesh Srivastava and Andrei Broder and Assaf Schuster",

booktitle = "CIKM 2018 - Proceedings of the 27th ACM International Conference on Information and Knowledge Management",

}

TY - GEN

T1 - "Let me tell you about your mental health!" Contextualized classification of reddit posts to DSM-5 for web-based intervention

AU - Gaur, Manas

AU - Sheth, Amit

AU - Kursuncu, Ugur

AU - Daniulaityte, Raminta

AU - Pathak, Jyotishman

AU - Alambo, Amanuel

AU - Thirunarayan, Krishnaprasad

N1 - Funding Information: We acknowledge partial support from the National Science Foundation (NSF) award CNS-1513721: Context-Aware Harassment Detection on Social Media", National Institutes of Health (NIH) award: MH105384-01A1: Modeling Social Behavior for Healthcare Utilization in Depression", and National Institute on Drug Abuse (NIDA) Grant No. 5R01DA039454-02 Trending: Social media analysis to monitor cannabis and synthetic cannabinoid use . Any opinions, conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF, NIH, or NIDA. Funding Information: We acknowledge partial support from the National Science Foundation (NSF) award CNS-1513721: “Context-Aware Harassment Detection on Social Media", National Institutes of Health (NIH) award: MH105384-01A1: “Modeling Social Behavior for Healthcare Utilization in Depression", and National Institute on Drug Abuse (NIDA) Grant No. 5R01DA039454-02 “Trending: Social media analysis to monitor cannabis and synthetic cannabinoid use”. Any opinions, conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF, NIH, or NIDA. Publisher Copyright: © 2018 Association for Computing Machinery.

PY - 2018/10/17

Y1 - 2018/10/17

N2 - Social media platforms are increasingly being used to share and seek advice on mental health issues. In particular, Reddit users freely discuss such issues on various subreddits, whose structure and content can be leveraged to formally interpret and relate subreddits and their posts in terms of mental health diagnostic categories. There is prior research on the extraction of mental health-related information, including symptoms, diagnosis, and treatments from social media; however, our approach can additionally provide actionable information to clinicians about the mental health of a patient in diagnostic terms for web-based intervention. Specifically, we provide a detailed analysis of the nature of subreddit content from domain expert's perspective and introduce a novel approach to map each subreddit to the best matching DSM-5 (Diagnostic and Statistical Manual of Mental Disorders - 5th Edition) category using multiclass classifier. Our classification algorithm analyzes all the posts of a subreddit by adapting topic modeling and word-embedding techniques, and utilizing curated medical knowledge bases to quantify relationship to DSM-5 categories. Our semantic encoding-decoding optimization approach reduces the false-alarm-rate from 30% to 2.5% over a comparable heuristic baseline, and our mapping results have been verified by domain experts achieving a kappa score of 0.84.

AB - Social media platforms are increasingly being used to share and seek advice on mental health issues. In particular, Reddit users freely discuss such issues on various subreddits, whose structure and content can be leveraged to formally interpret and relate subreddits and their posts in terms of mental health diagnostic categories. There is prior research on the extraction of mental health-related information, including symptoms, diagnosis, and treatments from social media; however, our approach can additionally provide actionable information to clinicians about the mental health of a patient in diagnostic terms for web-based intervention. Specifically, we provide a detailed analysis of the nature of subreddit content from domain expert's perspective and introduce a novel approach to map each subreddit to the best matching DSM-5 (Diagnostic and Statistical Manual of Mental Disorders - 5th Edition) category using multiclass classifier. Our classification algorithm analyzes all the posts of a subreddit by adapting topic modeling and word-embedding techniques, and utilizing curated medical knowledge bases to quantify relationship to DSM-5 categories. Our semantic encoding-decoding optimization approach reduces the false-alarm-rate from 30% to 2.5% over a comparable heuristic baseline, and our mapping results have been verified by domain experts achieving a kappa score of 0.84.

KW - DSM-5

KW - Drug Abuse Ontology

KW - Medical Knowledge bases

KW - Mental Health

KW - Reddit

KW - Semantic Encoding and Decoding

KW - Semantic Social Computing

UR - http://www.scopus.com/inward/record.url?scp=85058009561&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85058009561&partnerID=8YFLogxK

U2 - 10.1145/3269206.3271732

DO - 10.1145/3269206.3271732

M3 - Conference contribution

AN - SCOPUS:85058009561

T3 - International Conference on Information and Knowledge Management, Proceedings

SP - 753

EP - 762

BT - CIKM 2018 - Proceedings of the 27th ACM International Conference on Information and Knowledge Management

A2 - Paton, Norman

A2 - Candan, Selcuk

A2 - Wang, Haixun

A2 - Allan, James

A2 - Agrawal, Rakesh

A2 - Labrinidis, Alexandros

A2 - Cuzzocrea, Alfredo

A2 - Zaki, Mohammed

A2 - Srivastava, Divesh

A2 - Broder, Andrei

A2 - Schuster, Assaf

PB - Association for Computing Machinery

T2 - 27th ACM International Conference on Information and Knowledge Management, CIKM 2018

Y2 - 22 October 2018 through 26 October 2018

ER -

"Let me tell you about your mental health!" Contextualized classification of reddit posts to DSM-5 for web-based intervention

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this