Recommending Machine Translation Output to Translators by Estimating Translation Effort: A Case Study

In this paper we use the statistics provided by a field experiment to explore the utility of supplying machine translation suggestions in a computer-assisted translation (CAT) environment. Regression models are trained for each user in order to estimate the time to edit (TTE) for the current translation segment. We use a combination of features from the current segment and aggregated features from formerly translated segments selected with content-based filtering approaches commonly used in recommendation systems. We present and evaluate decision function heuristics to determine if machine translation output will be useful for the translator in the given segment. We find that our regression models do a reasonable job for some users in predicting TTE given only a small number of training examples; although noise in the actual TTE for seemingly similar segments yields large error margins. We propose to include the estimation of TTE in CAT recommendation systems as a well-correlated metric for translation quality.

Saved in:
Bibliographic Details
Main Authors: Mathur,Prashant, Ruiz,Nick, Federico,Marcello
Format: Digital revista
Language:English
Published: Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo 2013
Online Access:http://www.scielo.org.mx/scielo.php?script=sci_arttext&pid=S1870-90442013000100006
Tags: Add Tag
No Tags, Be the first to tag this record!