A French text-message corpus: 88milSMS. Synthesis and usage
In this article, firstly we briefly summarise the sud4science project and data collection (http://sud4science.org), ensuing processing/analysing stages, and the resulting corpus, 88milSMS (http://88milsms.huma-num.fr), through a synthesis of quotes and references to previous articles (§ 1). Secondly, we provide a state of the art on some research initiatives that use88milSMS in various domains and frameworks, which will enable future cross-disciplinary insight (§ 2). Then, we present other usages of the 88milSMS corpus we identified through surveys (§ 3). Finally, we suggest future paths for textual data collection and analysis.
Saved in:
Main Authors: | Panckhurst, Rachel, Lopez, Cédric, Roche, Mathieu |
---|---|
Format: | article biblioteca |
Language: | eng |
Subjects: | C30 - Documentation et information, U10 - Informatique, mathématiques et statistiques, fouille de données, analyse de données, traitement des données, collecte de données, application des ordinateurs, fouille de textes, http://aims.fao.org/aos/agrovoc/c_eb9cea5d, http://aims.fao.org/aos/agrovoc/c_15962, http://aims.fao.org/aos/agrovoc/c_10289, http://aims.fao.org/aos/agrovoc/c_2128, http://aims.fao.org/aos/agrovoc/c_24009, http://aims.fao.org/aos/agrovoc/c_dca12b72, |
Online Access: | http://agritrop.cirad.fr/594953/ http://agritrop.cirad.fr/594953/1/Panckhurst_et_al._Corpus_2020.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Mise en correspondance de données textuelles hétérogènes fondée sur la dimension spatiale
by: Fize, Jacques -
How to define Co-occurrence in a multidisciplinary context?
by: Roche, Mathieu -
Investigating the impact of preprocessing on document embedding: an empirical comparison
by: Yahi, Nourelhouda, et al. -
Combinaison de mesures lexicales et sémantiques pour l'extraction de données expérimentales dans des articles scientifiques
by: Lentschat, Martin, et al. -
Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications
by: Lentschat, Martin, et al.