A first step towards combating fake news over online social media

Kuai Xu; Feng Wang; Haiyan Wang; Bo Yang

doi:10.1007/978-3-319-94268-1_43

A first step towards combating fake news over online social media

Kuai Xu, Feng Wang, Haiyan Wang, Bo Yang

Mathematical and Natural Sciences, School of (SMNS)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Scopus citations

Abstract

Fake news has recently leveraged the power and scale of online social media to effectively spread misinformation which not only erodes the trust of people on traditional presses and journalisms, but also manipulates the opinions and sentiments of the public. Detecting fake news is a daunting challenge due to subtle difference between real and fake news. As a first step of fighting with fake news, this paper characterizes hundreds of popular fake and real news measured by shares, reactions, and comments on Facebook from two perspectives: Web sites and content. Our site analysis reveals that the Web sites of the fake and real news publishers exhibit diverse registration behaviors and registration timing. In addition, fake news tends to disappear from the Web after a certain amount of time. The content characterizations on the fake and real news corpus suggest that simply applying term frequency - inverse document frequency (tf-idf) and Latent Dirichlet allocation (LDA) topic modeling is inefficient in detecting fake news, while exploring document similarity with the term and word vectors is a very promising direction for predicting fake and real news. To the best of our knowledge, this is the first effort to systematically study the Web sites and content characteristics of fake and real news, which will provide key insights for effectively detecting fake news on social media.

Original language	English (US)
Title of host publication	Wireless Algorithms, Systems, and Applications - 13th International Conference, WASA 2018, Proceedings
Editors	Wei Cheng, Wei Li, Sriram Chellappan
Publisher	Springer Verlag
Pages	521-531
Number of pages	11
ISBN (Print)	9783319942674
DOIs	https://doi.org/10.1007/978-3-319-94268-1_43
State	Published - 2018
Event	13th International Conference on Wireless Algorithms, Systems, and Applications, WASA 2018 - Tianjin, China Duration: Jun 20 2018 → Jun 22 2018

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	10874 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Other

Other	13th International Conference on Wireless Algorithms, Systems, and Applications, WASA 2018
Country/Territory	China
City	Tianjin
Period	6/20/18 → 6/22/18

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-319-94268-1_43

Cite this

Xu, K., Wang, F., Wang, H., & Yang, B. (2018). A first step towards combating fake news over online social media. In W. Cheng, W. Li, & S. Chellappan (Eds.), Wireless Algorithms, Systems, and Applications - 13th International Conference, WASA 2018, Proceedings (pp. 521-531). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10874 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-94268-1_43

A first step towards combating fake news over online social media. / Xu, Kuai ; Wang, Feng ; Wang, Haiyan et al.
Wireless Algorithms, Systems, and Applications - 13th International Conference, WASA 2018, Proceedings. ed. / Wei Cheng; Wei Li; Sriram Chellappan. Springer Verlag, 2018. p. 521-531 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10874 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Xu, K , Wang, F , Wang, H & Yang, B 2018, A first step towards combating fake news over online social media. in W Cheng, W Li & S Chellappan (eds), Wireless Algorithms, Systems, and Applications - 13th International Conference, WASA 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10874 LNCS, Springer Verlag, pp. 521-531, 13th International Conference on Wireless Algorithms, Systems, and Applications, WASA 2018, Tianjin, China, 6/20/18. https://doi.org/10.1007/978-3-319-94268-1_43

Xu K , Wang F , Wang H, Yang B. A first step towards combating fake news over online social media. In Cheng W, Li W, Chellappan S, editors, Wireless Algorithms, Systems, and Applications - 13th International Conference, WASA 2018, Proceedings. Springer Verlag. 2018. p. 521-531. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-94268-1_43

Xu, Kuai ; Wang, Feng ; Wang, Haiyan et al. / A first step towards combating fake news over online social media. Wireless Algorithms, Systems, and Applications - 13th International Conference, WASA 2018, Proceedings. editor / Wei Cheng ; Wei Li ; Sriram Chellappan. Springer Verlag, 2018. pp. 521-531 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{858e72dc6bdc42dd881ca2fc25e89e97,

title = "A first step towards combating fake news over online social media",

abstract = "Fake news has recently leveraged the power and scale of online social media to effectively spread misinformation which not only erodes the trust of people on traditional presses and journalisms, but also manipulates the opinions and sentiments of the public. Detecting fake news is a daunting challenge due to subtle difference between real and fake news. As a first step of fighting with fake news, this paper characterizes hundreds of popular fake and real news measured by shares, reactions, and comments on Facebook from two perspectives: Web sites and content. Our site analysis reveals that the Web sites of the fake and real news publishers exhibit diverse registration behaviors and registration timing. In addition, fake news tends to disappear from the Web after a certain amount of time. The content characterizations on the fake and real news corpus suggest that simply applying term frequency - inverse document frequency (tf-idf) and Latent Dirichlet allocation (LDA) topic modeling is inefficient in detecting fake news, while exploring document similarity with the term and word vectors is a very promising direction for predicting fake and real news. To the best of our knowledge, this is the first effort to systematically study the Web sites and content characteristics of fake and real news, which will provide key insights for effectively detecting fake news on social media.",

author = "Kuai Xu and Feng Wang and Haiyan Wang and Bo Yang",

note = "Funding Information: Acknowledgements. This work was supported in part by National Science Foundation Algorithms for Threat Detection (ATD) Program under the grant DMS #1737861. Publisher Copyright: {\textcopyright} 2018, Springer International Publishing AG, part of Springer Nature.; 13th International Conference on Wireless Algorithms, Systems, and Applications, WASA 2018 ; Conference date: 20-06-2018 Through 22-06-2018",

year = "2018",

doi = "10.1007/978-3-319-94268-1_43",

language = "English (US)",

isbn = "9783319942674",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "521--531",

editor = "Wei Cheng and Wei Li and Sriram Chellappan",

booktitle = "Wireless Algorithms, Systems, and Applications - 13th International Conference, WASA 2018, Proceedings",

}

TY - GEN

T1 - A first step towards combating fake news over online social media

AU - Xu, Kuai

AU - Wang, Feng

AU - Wang, Haiyan

AU - Yang, Bo

N1 - Funding Information: Acknowledgements. This work was supported in part by National Science Foundation Algorithms for Threat Detection (ATD) Program under the grant DMS #1737861. Publisher Copyright: © 2018, Springer International Publishing AG, part of Springer Nature.

PY - 2018

Y1 - 2018

N2 - Fake news has recently leveraged the power and scale of online social media to effectively spread misinformation which not only erodes the trust of people on traditional presses and journalisms, but also manipulates the opinions and sentiments of the public. Detecting fake news is a daunting challenge due to subtle difference between real and fake news. As a first step of fighting with fake news, this paper characterizes hundreds of popular fake and real news measured by shares, reactions, and comments on Facebook from two perspectives: Web sites and content. Our site analysis reveals that the Web sites of the fake and real news publishers exhibit diverse registration behaviors and registration timing. In addition, fake news tends to disappear from the Web after a certain amount of time. The content characterizations on the fake and real news corpus suggest that simply applying term frequency - inverse document frequency (tf-idf) and Latent Dirichlet allocation (LDA) topic modeling is inefficient in detecting fake news, while exploring document similarity with the term and word vectors is a very promising direction for predicting fake and real news. To the best of our knowledge, this is the first effort to systematically study the Web sites and content characteristics of fake and real news, which will provide key insights for effectively detecting fake news on social media.

AB - Fake news has recently leveraged the power and scale of online social media to effectively spread misinformation which not only erodes the trust of people on traditional presses and journalisms, but also manipulates the opinions and sentiments of the public. Detecting fake news is a daunting challenge due to subtle difference between real and fake news. As a first step of fighting with fake news, this paper characterizes hundreds of popular fake and real news measured by shares, reactions, and comments on Facebook from two perspectives: Web sites and content. Our site analysis reveals that the Web sites of the fake and real news publishers exhibit diverse registration behaviors and registration timing. In addition, fake news tends to disappear from the Web after a certain amount of time. The content characterizations on the fake and real news corpus suggest that simply applying term frequency - inverse document frequency (tf-idf) and Latent Dirichlet allocation (LDA) topic modeling is inefficient in detecting fake news, while exploring document similarity with the term and word vectors is a very promising direction for predicting fake and real news. To the best of our knowledge, this is the first effort to systematically study the Web sites and content characteristics of fake and real news, which will provide key insights for effectively detecting fake news on social media.

UR - http://www.scopus.com/inward/record.url?scp=85049027465&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85049027465&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-94268-1_43

DO - 10.1007/978-3-319-94268-1_43

M3 - Conference contribution

AN - SCOPUS:85049027465

SN - 9783319942674

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 521

EP - 531

BT - Wireless Algorithms, Systems, and Applications - 13th International Conference, WASA 2018, Proceedings

A2 - Cheng, Wei

A2 - Li, Wei

A2 - Chellappan, Sriram

PB - Springer Verlag

T2 - 13th International Conference on Wireless Algorithms, Systems, and Applications, WASA 2018

Y2 - 20 June 2018 through 22 June 2018

ER -

A first step towards combating fake news over online social media

Abstract

Publication series

Other

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this