A French text-message corpus: 88milSMS. Synthesis and usage

In this article, we first briefly summarise the sud4science project and its data collection (http://sud4science.org), the ensuing processing and analysis stages, and the resulting corpus, 88milSMS (http://88milsms.huma-num.fr), through a synthesis of quotes and references to previous articles (§ 1). Secondly, we review the state of the art of research initiatives that use 88milSMS in various domains and frameworks, which will enable future cross-disciplinary insight (§ 2). Then, we present other usages of the 88milSMS corpus that we identified through surveys (§ 3). Finally, we suggest future paths for textual data collection and analysis.

Bibliographic Details
Main Authors: Panckhurst, Rachel, Lopez, Cédric, Roche, Mathieu
Format: article biblioteca
Language: eng
Subjects: C30 - Documentation and information, U10 - Computer science, mathematics and statistics, data mining, data analysis, data processing, data collection, computer applications, text mining, http://aims.fao.org/aos/agrovoc/c_eb9cea5d, http://aims.fao.org/aos/agrovoc/c_15962, http://aims.fao.org/aos/agrovoc/c_10289, http://aims.fao.org/aos/agrovoc/c_2128, http://aims.fao.org/aos/agrovoc/c_24009, http://aims.fao.org/aos/agrovoc/c_dca12b72
Online Access: http://agritrop.cirad.fr/594953/
http://agritrop.cirad.fr/594953/1/Panckhurst_et_al._Corpus_2020.pdf