Generalized Representation of Syntactic Structures

Reihane Boghrati, Kate M. Johnson, Morteza Dehghani

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

Analysis of language provides important insights into the underlying psychological properties of individuals and groups. While the majority of language analysis work in psychology has focused on semantics, psychological information is encoded not just in what people say, but also in how they say it. In the current work, we propose Conversation Level Syntax Similarity Metric-Group Representations (CASSIM-GR). This tool builds generalized representations of the syntactic structures of documents, allowing researchers to distinguish between people and groups based on syntactic differences. CASSIM-GR builds on the Conversation Level Syntax Similarity Metric (CASSIM) by applying spectral clustering to syntactic similarity matrices and calculating the center of each cluster of documents. The resulting cluster centroid then represents the syntactic structure of the group of documents. To examine the effectiveness of CASSIM-GR, we conduct three experiments, each on a different corpus. In each experiment, we calculate the clustering accuracy and compare our proposed technique to a bag-of-words approach. Our results provide evidence for the effectiveness of CASSIM-GR and demonstrate that combining syntactic similarity with tf-idf semantic information improves the overall accuracy of group classification.
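The step of picking a representative for each cluster can be illustrated with a small sketch. This is not the authors' implementation: it assumes a precomputed, symmetric similarity matrix and cluster labels that have already been produced by some spectral clustering step, and it substitutes a medoid (the member document with the highest mean similarity to the rest of its cluster) for the centroid the abstract describes. The function name `cluster_medoids` and the toy data are hypothetical.

```python
from collections import defaultdict


def cluster_medoids(similarity, labels):
    """Map each cluster label to the index of its medoid document.

    Assumption: `similarity` is a symmetric matrix (list of lists) of
    pairwise document similarities, and `labels[i]` is the cluster
    assigned to document i by an earlier clustering step.
    """
    # Group document indices by cluster label.
    members = defaultdict(list)
    for idx, label in enumerate(labels):
        members[label].append(idx)

    medoids = {}
    for label, idxs in members.items():

        def mean_similarity(i, idxs=idxs):
            # Average similarity of document i to the other cluster members.
            others = [j for j in idxs if j != i]
            if not others:
                return 0.0
            return sum(similarity[i][j] for j in others) / len(others)

        # The medoid is the member most similar, on average, to the rest.
        medoids[label] = max(idxs, key=mean_similarity)
    return medoids


# Toy similarity matrix for six hypothetical documents in two groups.
similarity = [
    [1.0, 0.9, 0.8, 0.1, 0.2, 0.1],
    [0.9, 1.0, 0.7, 0.2, 0.1, 0.1],
    [0.8, 0.7, 1.0, 0.1, 0.1, 0.2],
    [0.1, 0.2, 0.1, 1.0, 0.6, 0.5],
    [0.2, 0.1, 0.1, 0.6, 1.0, 0.9],
    [0.1, 0.1, 0.2, 0.5, 0.9, 1.0],
]
labels = [0, 0, 0, 1, 1, 1]  # assumed output of a clustering step

medoids = cluster_medoids(similarity, labels)
print(medoids)
```

A medoid is used here because only pairwise similarities are available in the sketch, so the representative has to be an actual document; how CASSIM-GR itself computes the cluster center is described in the paper, not in this record.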

Original language: English (US)
Title of host publication: CogSci 2017 - Proceedings of the 39th Annual Meeting of the Cognitive Science Society
Subtitle of host publication: Computational Foundations of Cognition
Publisher: The Cognitive Science Society
Pages: 1648-1653
Number of pages: 6
ISBN (Electronic): 9780991196760
State: Published - 2017
Externally published: Yes
Event: 39th Annual Meeting of the Cognitive Science Society: Computational Foundations of Cognition, CogSci 2017 - London, United Kingdom
Duration: Jul 26 2017 - Jul 29 2017

Publication series

Name: CogSci 2017 - Proceedings of the 39th Annual Meeting of the Cognitive Science Society: Computational Foundations of Cognition

Conference

Conference: 39th Annual Meeting of the Cognitive Science Society: Computational Foundations of Cognition, CogSci 2017
Country/Territory: United Kingdom
City: London
Period: 7/26/17 - 7/29/17

Keywords

  • CASSIM
  • Syntactic Similarity
  • Syntax
  • Text Classification
  • Text Clustering

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Human-Computer Interaction
  • Cognitive Neuroscience
