Incorporating conditional random fields and active learning to improve sentiment identification

Neural Netw. 2014 Oct;58:60-7. doi: 10.1016/j.neunet.2014.04.005. Epub 2014 May 10.

Abstract

Many machine learning, statistical, and computational linguistic methods have been developed to identify sentiment of sentences in documents, yielding promising results. However, most of state-of-the-art methods focus on individual sentences and ignore the impact of context on the meaning of a sentence. In this paper, we propose a method based on conditional random fields to incorporate sentence structure and context information in addition to syntactic information for improving sentiment identification. We also investigate how human interaction affects the accuracy of sentiment labeling using limited training data. We propose and evaluate two different active learning strategies for labeling sentiment data. Our experiments with the proposed approach demonstrate a 5%-15% improvement in accuracy on Amazon customer reviews compared to existing supervised learning and rule-based methods.

Keywords: Active learning; Conditional random fields; Customer reviews; Sentiment analysis.

MeSH terms

  • Artificial Intelligence*
  • Humans
  • Linguistics*
  • Pattern Recognition, Visual*
  • Problem-Based Learning / methods*