Dictionary-based sentiment analysis applied to a specific domain

The web and social media have been growing exponentially in recent years. We now have access to documents bearing opinions expressed on a broad range of topics. This constitutes a rich resource for natural language processing tasks, particularly for sentiment analysis. Nevertheless, sentiment analysis is usually difficult because expressed sentiments are usually topic-oriented. In this paper, we propose to automatically construct a sentiment dictionary using relevant terms obtained from web pages for a specific domain. This dictionary is initially built by querying the web with a combination of opinion terms, as well as terms of the domain. In order to select only relevant terms we apply two measures AcroDefMI3 and TrueSkill. Experiments conducted on different domains highlight that our automatic approach performs better for specific cases.

Saved in:
Bibliographic Details
Main Authors: Cruz, Laura, Ochoa, José, Roche, Mathieu, Poncelet, Pascal
Format: conference_item biblioteca
Language:eng
Published: Springer International Publishing
Subjects:C30 - Documentation et information, U30 - Méthodes de recherche, U10 - Informatique, mathématiques et statistiques, 000 - Autres thèmes,
Online Access:http://agritrop.cirad.fr/583838/
http://agritrop.cirad.fr/583838/7/ID583838.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The web and social media have been growing exponentially in recent years. We now have access to documents bearing opinions expressed on a broad range of topics. This constitutes a rich resource for natural language processing tasks, particularly for sentiment analysis. Nevertheless, sentiment analysis is usually difficult because expressed sentiments are usually topic-oriented. In this paper, we propose to automatically construct a sentiment dictionary using relevant terms obtained from web pages for a specific domain. This dictionary is initially built by querying the web with a combination of opinion terms, as well as terms of the domain. In order to select only relevant terms we apply two measures AcroDefMI3 and TrueSkill. Experiments conducted on different domains highlight that our automatic approach performs better for specific cases.