Extracting absolute spatial entities from SMS: Comparing a supervised and an unsupervised approach

More than one hundred thousand SMS messages are sent worldwide every second, and each SMS message is likely to contain lexical creativity. Recently, SMS content has been recognised to be of notable interest in many domains, such as e-commerce or psychiatry and more generally Health Informatics. But the automatic analysis of such data is difficult, particularly when dealing with information extraction. In this study, we will focus on “spatial entity recognition”, which consists of recognising countries, cities, places, bars, restaurants, cinemas, beaches, and so forth. For instance, Montpel, mtpl, mtp, and motpeliè all stand for the city of Montpellier. We will compare two different ways of tackling new forms of spatial entity recognition in SMS.

Saved in:
Bibliographic Details
Main Authors: Lopez, Cédric, Zenasni, Sarah, Kergosien, Eric, Partalas, Ioannis, Roche, Mathieu, Teisseire, Maguelonne, Panckhurst, Rachel
Format: book_section biblioteca
Language:eng
Published: Presses universitaires de Louvain
Subjects:C30 - Documentation et information, U10 - Informatique, mathématiques et statistiques, C10 - Enseignement,
Online Access:http://agritrop.cirad.fr/588683/
http://agritrop.cirad.fr/588683/7/ID588683.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:More than one hundred thousand SMS messages are sent worldwide every second, and each SMS message is likely to contain lexical creativity. Recently, SMS content has been recognised to be of notable interest in many domains, such as e-commerce or psychiatry and more generally Health Informatics. But the automatic analysis of such data is difficult, particularly when dealing with information extraction. In this study, we will focus on “spatial entity recognition”, which consists of recognising countries, cities, places, bars, restaurants, cinemas, beaches, and so forth. For instance, Montpel, mtpl, mtp, and motpeliè all stand for the city of Montpellier. We will compare two different ways of tackling new forms of spatial entity recognition in SMS.