Combinatory examples extraction for machine translation

One of the bottlenecks of example-based machine translation (EBMT) is to be able to amass automatically quantities of good examples. In our work in EBMT, we are investigating how far one can go by performing example extraction from parallel corpora using Probabilistic Translation Dictionaries to obt...

Full description

Bibliographic Details
Main Author: Simões, Alberto (author)
Other Authors: Almeida, J. J. (author)
Format: conferencePaper
Language:eng
Published: 2006
Subjects:
Online Access:http://hdl.handle.net/1822/5196
Country:Portugal
Oai:oai:repositorium.sdum.uminho.pt:1822/5196
Description
Summary:One of the bottlenecks of example-based machine translation (EBMT) is to be able to amass automatically quantities of good examples. In our work in EBMT, we are investigating how far one can go by performing example extraction from parallel corpora using Probabilistic Translation Dictionaries to obtain example segmentation points. In fact, the success of EBMT highly depends on examples quality and quantity, but also in their length. Thus, we give special importance on methods to extract different size examples from the same translation unit. With this article we show that it is possible to extract quantities for examples from parallel corpora just using probabilistic translation dictionaries extracted from the same corpora.