Accession Number : AD1018061


Title :   Using Sentence-Level Classifiers for Cross-Domain Sentiment Analysis


Descriptive Note : Technical Report


Corporate Author : DRDC Toronto Research Centre Toronto, ON Canada


Personal Author(s) : Kwantes,Peter ; Hamm,Jihn ; Dennis,Simon


Full Text : https://apps.dtic.mil/dtic/tr/fulltext/u2/1018061.pdf


Report Date : 01 Sep 2014


Pagination or Media Count : 22


Abstract : DRDC has been developing a suite of capabilities built around models of semantics and visual analytic tools for Applied Research Project (ARP) 15ah. Recently, we implemented a sentiment analyser in a document visualization tool called Handles to allow users to examine the positive and negative opinions associated with concepts. The results were unimpressive. Specifically, the system performs poorly when classifying documents from domains that differ from the training domain. In the work reported here, we explore two solutions. First, we explored whether a more fine-grained analysis of sentiment, in which the sentences of a document rather than the whole document serve as the functional unit of analysis, improves performance. Second, we increased the granularity of the classification during training from binary (positive or negative) to trinary (positive, negative, or neutral) to see if performance improved. Neither solution worked well. However, when we mixed documents from different domains together during training, performance did improve. We take the results to suggest that the best way to build a sentiment classifier that is agnostic with respect to domain is to train the classifier on examples from as many domains as possible.


Descriptors :   information operations , sociometrics , text analytics , machine learning , semantics , human emotions


Subject Categories : Information Science
      Linguistics
      Psychology
      Cybernetics


Distribution Statement : APPROVED FOR PUBLIC RELEASE