User-guided cross-domain sentiment classification

Arun Reddy Nelakurthi; Hanghang Tong; Ross Maciejewski; Nadya Bliss; Jingrui He

doi:10.1137/1.9781611974973.53

User-guided cross-domain sentiment classification

Arun Reddy Nelakurthi, Hanghang Tong, Ross Maciejewski, Nadya Bliss, Jingrui He

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Scopus citations

Abstract

Sentiment analysis has been studied for decades, and it is widely used in many real applications such as media monitoring. In sentiment analysis, when addressing the problem of limited labeled data from the target domain, transfer learning, or domain adaptation, has been successfully applied, which borrows information from a relevant source domain with abundant labeled data to improve the prediction performance in the target domain. The key to transfer learning is how to model the relatedness among different domains. For sentiment analysis, a common practice is to assume similar sentiment polarity for the common keywords shared by different domains. However, existing methods largely overlooked the human factor, i.e., the users who expressed such sentiment. In this paper, we address this problem by explicitly modeling the human factor related to sentiment classification. In particular, we assume that the content generated by the same user across different domains is biased in the same way in terms of the sentiment polarity. In other words, optimistic/pessimistic users demonstrate consistent sentiment patterns, no matter what the context is. To this end, we propose a new graph-based approach named U-Cross, which models the relatedness of different domains via both the shared users and keywords. It is non-parametric and semi-supervised in nature. Furthermore, we also study the problem of shared user selection to prevent 'negative transfer'. In the experiments, we demonstrate the effectiveness of U-Cross by comparing it with existing state-of-the-art techniques on three real data sets.

Original language	English (US)
Title of host publication	Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017
Editors	Nitesh Chawla, Wei Wang
Publisher	Society for Industrial and Applied Mathematics Publications
Pages	471-479
Number of pages	9
ISBN (Electronic)	9781611974874
DOIs	https://doi.org/10.1137/1.9781611974973.53
State	Published - 2017
Event	17th SIAM International Conference on Data Mining, SDM 2017 - Houston, United States Duration: Apr 27 2017 → Apr 29 2017

Publication series

Name	Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017

Other

Other	17th SIAM International Conference on Data Mining, SDM 2017
Country/Territory	United States
City	Houston
Period	4/27/17 → 4/29/17

Keywords

Classification
Transfer learning
User modeling

ASJC Scopus subject areas

Software
Computer Science Applications

Access to Document

10.1137/1.9781611974973.53

Cite this

Nelakurthi, A. R., Tong, H., Maciejewski, R., Bliss, N., & He, J. (2017). User-guided cross-domain sentiment classification. In N. Chawla, & W. Wang (Eds.), Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017 (pp. 471-479). (Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017). Society for Industrial and Applied Mathematics Publications. https://doi.org/10.1137/1.9781611974973.53

User-guided cross-domain sentiment classification. / Nelakurthi, Arun Reddy; Tong, Hanghang; Maciejewski, Ross et al.
Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017. ed. / Nitesh Chawla; Wei Wang. Society for Industrial and Applied Mathematics Publications, 2017. p. 471-479 (Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Nelakurthi, AR, Tong, H, Maciejewski, R , Bliss, N & He, J 2017, User-guided cross-domain sentiment classification. in N Chawla & W Wang (eds), Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017. Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017, Society for Industrial and Applied Mathematics Publications, pp. 471-479, 17th SIAM International Conference on Data Mining, SDM 2017, Houston, United States, 4/27/17. https://doi.org/10.1137/1.9781611974973.53

Nelakurthi AR, Tong H, Maciejewski R , Bliss N, He J. User-guided cross-domain sentiment classification. In Chawla N, Wang W, editors, Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017. Society for Industrial and Applied Mathematics Publications. 2017. p. 471-479. (Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017). doi: 10.1137/1.9781611974973.53

Nelakurthi, Arun Reddy ; Tong, Hanghang ; Maciejewski, Ross et al. / User-guided cross-domain sentiment classification. Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017. editor / Nitesh Chawla ; Wei Wang. Society for Industrial and Applied Mathematics Publications, 2017. pp. 471-479 (Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017).

@inproceedings{a7f7348fd19541dc87889a0ddf46c2c7,

title = "User-guided cross-domain sentiment classification",

abstract = "Sentiment analysis has been studied for decades, and it is widely used in many real applications such as media monitoring. In sentiment analysis, when addressing the problem of limited labeled data from the target domain, transfer learning, or domain adaptation, has been successfully applied, which borrows information from a relevant source domain with abundant labeled data to improve the prediction performance in the target domain. The key to transfer learning is how to model the relatedness among different domains. For sentiment analysis, a common practice is to assume similar sentiment polarity for the common keywords shared by different domains. However, existing methods largely overlooked the human factor, i.e., the users who expressed such sentiment. In this paper, we address this problem by explicitly modeling the human factor related to sentiment classification. In particular, we assume that the content generated by the same user across different domains is biased in the same way in terms of the sentiment polarity. In other words, optimistic/pessimistic users demonstrate consistent sentiment patterns, no matter what the context is. To this end, we propose a new graph-based approach named U-Cross, which models the relatedness of different domains via both the shared users and keywords. It is non-parametric and semi-supervised in nature. Furthermore, we also study the problem of shared user selection to prevent 'negative transfer'. In the experiments, we demonstrate the effectiveness of U-Cross by comparing it with existing state-of-the-art techniques on three real data sets.",

keywords = "Classification, Transfer learning, User modeling",

author = "Nelakurthi, {Arun Reddy} and Hanghang Tong and Ross Maciejewski and Nadya Bliss and Jingrui He",

note = "Publisher Copyright: Copyright {\textcopyright} by SIAM.; 17th SIAM International Conference on Data Mining, SDM 2017 ; Conference date: 27-04-2017 Through 29-04-2017",

year = "2017",

doi = "10.1137/1.9781611974973.53",

language = "English (US)",

series = "Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017",

publisher = "Society for Industrial and Applied Mathematics Publications",

pages = "471--479",

editor = "Nitesh Chawla and Wei Wang",

booktitle = "Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017",

}

TY - GEN

T1 - User-guided cross-domain sentiment classification

AU - Nelakurthi, Arun Reddy

AU - Tong, Hanghang

AU - Maciejewski, Ross

AU - Bliss, Nadya

AU - He, Jingrui

PY - 2017

Y1 - 2017

N2 - Sentiment analysis has been studied for decades, and it is widely used in many real applications such as media monitoring. In sentiment analysis, when addressing the problem of limited labeled data from the target domain, transfer learning, or domain adaptation, has been successfully applied, which borrows information from a relevant source domain with abundant labeled data to improve the prediction performance in the target domain. The key to transfer learning is how to model the relatedness among different domains. For sentiment analysis, a common practice is to assume similar sentiment polarity for the common keywords shared by different domains. However, existing methods largely overlooked the human factor, i.e., the users who expressed such sentiment. In this paper, we address this problem by explicitly modeling the human factor related to sentiment classification. In particular, we assume that the content generated by the same user across different domains is biased in the same way in terms of the sentiment polarity. In other words, optimistic/pessimistic users demonstrate consistent sentiment patterns, no matter what the context is. To this end, we propose a new graph-based approach named U-Cross, which models the relatedness of different domains via both the shared users and keywords. It is non-parametric and semi-supervised in nature. Furthermore, we also study the problem of shared user selection to prevent 'negative transfer'. In the experiments, we demonstrate the effectiveness of U-Cross by comparing it with existing state-of-the-art techniques on three real data sets.

AB - Sentiment analysis has been studied for decades, and it is widely used in many real applications such as media monitoring. In sentiment analysis, when addressing the problem of limited labeled data from the target domain, transfer learning, or domain adaptation, has been successfully applied, which borrows information from a relevant source domain with abundant labeled data to improve the prediction performance in the target domain. The key to transfer learning is how to model the relatedness among different domains. For sentiment analysis, a common practice is to assume similar sentiment polarity for the common keywords shared by different domains. However, existing methods largely overlooked the human factor, i.e., the users who expressed such sentiment. In this paper, we address this problem by explicitly modeling the human factor related to sentiment classification. In particular, we assume that the content generated by the same user across different domains is biased in the same way in terms of the sentiment polarity. In other words, optimistic/pessimistic users demonstrate consistent sentiment patterns, no matter what the context is. To this end, we propose a new graph-based approach named U-Cross, which models the relatedness of different domains via both the shared users and keywords. It is non-parametric and semi-supervised in nature. Furthermore, we also study the problem of shared user selection to prevent 'negative transfer'. In the experiments, we demonstrate the effectiveness of U-Cross by comparing it with existing state-of-the-art techniques on three real data sets.

KW - Classification

KW - Transfer learning

KW - User modeling

UR - http://www.scopus.com/inward/record.url?scp=85027876536&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85027876536&partnerID=8YFLogxK

U2 - 10.1137/1.9781611974973.53

DO - 10.1137/1.9781611974973.53

M3 - Conference contribution

AN - SCOPUS:85027876536

T3 - Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017

SP - 471

EP - 479

BT - Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017

A2 - Chawla, Nitesh

A2 - Wang, Wei

PB - Society for Industrial and Applied Mathematics Publications

T2 - 17th SIAM International Conference on Data Mining, SDM 2017

Y2 - 27 April 2017 through 29 April 2017

ER -

User-guided cross-domain sentiment classification

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this