Gemedoc: A text similarity annotation platform

We present Gemedoc, a platform for text similarity annotation based on the spatial and the thematic dimension. To this end, a two-step annotation protocol was designed to assess the similarity between two documents: (1) identification of salient features according to the two analysis dimensions; (2) similarity assessment according to a 4-degree scale. Ultimately, the labeled data retrieved from different corpora could be used as benchmark for text-mining applications.

Saved in:
Bibliographic Details
Main Authors: Fize, Jacques, Roche, Mathieu, Teisseire, Maguelonne
Format: conference_item biblioteca
Language:eng
Published: Springer
Online Access:http://agritrop.cirad.fr/588240/
http://agritrop.cirad.fr/588240/1/Fize_et_al_NLDB_2018.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!