PEACE: Cross-Platform Hate Speech Detection - A Causality-Guided Framework

Paaras Sheth; Tharindu Kumarage; Raha Moraffah; Aman Chadha; Huan Liu

doi:10.1007/978-3-031-43412-9_33

Ira A. Fulton Schools of Engineering (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Hate speech detection refers to the task of detecting hateful content that aims to denigrate an individual or a group based on their religion, gender, sexual orientation, or other characteristics. Because platforms have different policies, different groups of people express hate in different ways. Furthermore, the lack of labeled data on some platforms makes it challenging to build hate speech detection models. To this end, we revisit whether we can learn a generalizable hate speech detection model for the cross-platform setting, where we train the model on data from one (source) platform and generalize it across multiple (target) platforms. Existing generalization models rely on linguistic cues or auxiliary information, making them biased towards certain tags or certain kinds of words (e.g., abusive words) on the source platform and thus not applicable to the target platforms. Inspired by social and psychological theories, we explore whether there exist inherent causal cues that can be leveraged to learn generalizable representations for detecting hate speech across these distribution shifts. To this end, we propose a causality-guided framework, PEACE, that identifies and leverages two intrinsic causal cues omnipresent in hateful content: the overall sentiment and the aggression in the text. We conduct extensive experiments across multiple platforms (representing the distribution shift) to show whether causal cues can help cross-platform generalization.

Original language	English (US)
Title of host publication	Machine Learning and Knowledge Discovery in Databases
Subtitle of host publication	Research Track - European Conference, ECML PKDD 2023, Proceedings
Editors	Danai Koutra, Claudia Plant, Manuel Gomez Rodriguez, Elena Baralis, Francesco Bonchi
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	559-575
Number of pages	17
ISBN (Print)	9783031434112
DOIs	https://doi.org/10.1007/978-3-031-43412-9_33
State	Published - 2023
Event	European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023, Turin, Italy (Sep 18 2023 → Sep 22 2023)

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	14169 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023
Country/Territory	Italy
City	Turin
Period	9/18/23 → 9/22/23

Keywords

Causal Inference
Generalizability
Hate-Speech Detection

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-031-43412-9_33

Cite this

Sheth, P., Kumarage, T., Moraffah, R., Chadha, A., & Liu, H. (2023). PEACE: Cross-Platform Hate Speech Detection - A Causality-Guided Framework. In D. Koutra, C. Plant, M. Gomez Rodriguez, E. Baralis, & F. Bonchi (Eds.), Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Proceedings (pp. 559-575). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 14169 LNAI). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-43412-9_33

PEACE: Cross-Platform Hate Speech Detection - A Causality-Guided Framework. / Sheth, Paaras; Kumarage, Tharindu; Moraffah, Raha et al.
Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Proceedings. ed. / Danai Koutra; Claudia Plant; Manuel Gomez Rodriguez; Elena Baralis; Francesco Bonchi. Springer Science and Business Media Deutschland GmbH, 2023. p. 559-575 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 14169 LNAI).

Sheth, P, Kumarage, T, Moraffah, R, Chadha, A & Liu, H 2023, PEACE: Cross-Platform Hate Speech Detection - A Causality-Guided Framework. in D Koutra, C Plant, M Gomez Rodriguez, E Baralis & F Bonchi (eds), Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 14169 LNAI, Springer Science and Business Media Deutschland GmbH, pp. 559-575, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023, Turin, Italy, 9/18/23. https://doi.org/10.1007/978-3-031-43412-9_33

Sheth P, Kumarage T, Moraffah R, Chadha A, Liu H. PEACE: Cross-Platform Hate Speech Detection - A Causality-Guided Framework. In Koutra D, Plant C, Gomez Rodriguez M, Baralis E, Bonchi F, editors, Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Proceedings. Springer Science and Business Media Deutschland GmbH. 2023. p. 559-575. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-43412-9_33

Sheth, Paaras ; Kumarage, Tharindu ; Moraffah, Raha et al. / PEACE : Cross-Platform Hate Speech Detection - A Causality-Guided Framework. Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Proceedings. editor / Danai Koutra ; Claudia Plant ; Manuel Gomez Rodriguez ; Elena Baralis ; Francesco Bonchi. Springer Science and Business Media Deutschland GmbH, 2023. pp. 559-575 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{8a8f3127bd0749e4a9687702a689d822,

title = "PEACE: Cross-Platform Hate Speech Detection - A Causality-Guided Framework",

abstract = "Hate speech detection refers to the task of detecting hateful content that aims to denigrate an individual or a group based on their religion, gender, sexual orientation, or other characteristics. Because platforms have different policies, different groups of people express hate in different ways. Furthermore, the lack of labeled data on some platforms makes it challenging to build hate speech detection models. To this end, we revisit whether we can learn a generalizable hate speech detection model for the cross-platform setting, where we train the model on data from one (source) platform and generalize it across multiple (target) platforms. Existing generalization models rely on linguistic cues or auxiliary information, making them biased towards certain tags or certain kinds of words (e.g., abusive words) on the source platform and thus not applicable to the target platforms. Inspired by social and psychological theories, we explore whether there exist inherent causal cues that can be leveraged to learn generalizable representations for detecting hate speech across these distribution shifts. To this end, we propose a causality-guided framework, PEACE, that identifies and leverages two intrinsic causal cues omnipresent in hateful content: the overall sentiment and the aggression in the text. We conduct extensive experiments across multiple platforms (representing the distribution shift) to show whether causal cues can help cross-platform generalization.",

keywords = "Causal Inference, Generalizability, Hate-Speech Detection",

author = "Paaras Sheth and Tharindu Kumarage and Raha Moraffah and Aman Chadha and Huan Liu",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.; European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023 ; Conference date: 18-09-2023 Through 22-09-2023",

year = "2023",

doi = "10.1007/978-3-031-43412-9_33",

language = "English (US)",

isbn = "9783031434112",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "559--575",

editor = "Danai Koutra and Claudia Plant and {Gomez Rodriguez}, Manuel and Elena Baralis and Francesco Bonchi",

booktitle = "Machine Learning and Knowledge Discovery in Databases",

address = "Germany",

}

TY - GEN

T1 - PEACE

T2 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023

AU - Sheth, Paaras

AU - Kumarage, Tharindu

AU - Moraffah, Raha

AU - Chadha, Aman

AU - Liu, Huan

PY - 2023

Y1 - 2023

N2 - Hate speech detection refers to the task of detecting hateful content that aims to denigrate an individual or a group based on their religion, gender, sexual orientation, or other characteristics. Because platforms have different policies, different groups of people express hate in different ways. Furthermore, the lack of labeled data on some platforms makes it challenging to build hate speech detection models. To this end, we revisit whether we can learn a generalizable hate speech detection model for the cross-platform setting, where we train the model on data from one (source) platform and generalize it across multiple (target) platforms. Existing generalization models rely on linguistic cues or auxiliary information, making them biased towards certain tags or certain kinds of words (e.g., abusive words) on the source platform and thus not applicable to the target platforms. Inspired by social and psychological theories, we explore whether there exist inherent causal cues that can be leveraged to learn generalizable representations for detecting hate speech across these distribution shifts. To this end, we propose a causality-guided framework, PEACE, that identifies and leverages two intrinsic causal cues omnipresent in hateful content: the overall sentiment and the aggression in the text. We conduct extensive experiments across multiple platforms (representing the distribution shift) to show whether causal cues can help cross-platform generalization.

KW - Causal Inference

KW - Generalizability

KW - Hate-Speech Detection

UR - http://www.scopus.com/inward/record.url?scp=85174444261&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85174444261&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-43412-9_33

DO - 10.1007/978-3-031-43412-9_33

M3 - Conference contribution

AN - SCOPUS:85174444261

SN - 9783031434112

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 559

EP - 575

BT - Machine Learning and Knowledge Discovery in Databases

A2 - Koutra, Danai

A2 - Plant, Claudia

A2 - Gomez Rodriguez, Manuel

A2 - Baralis, Elena

A2 - Bonchi, Francesco

PB - Springer Science and Business Media Deutschland GmbH

Y2 - 18 September 2023 through 22 September 2023

ER -
