TMop is an open-source software written in Python designed for cleaning and maintaining a Translation Memory (i.e. a collection of (source, target)
segments, called Translation Units, used to aid human translators operating in a Computer-assisted Translation framework).
The goal of TMop is to identify and remove from the TM all the "bad" TUs, in which any of the two textual elements is either:
i) syntactically poor,
ii) semantically different from the other,
iii) awkward according to some formatting criteria.
TMop has been developed at Fondazione Bruno Kessler with the support of the European Association of Machine Translation (EAMT) and the European Project Modern Machine Translation (MMT). It can be downloaded as a package including: software, documentation, toy data and evaluation scripts.
Matteo Negri, Fondazione Bruno Kessler, Italy ([email protected])
Masoud Jalili Sabet, University of Teheran, Iran ([email protected])
Marco Turchi, Fondazione Bruno Kesler, Italy ([email protected])