Bilingual Terminology Extraction based on Translation Patterns

Parallel corpora are rich sources of translation resources. This document presents a methodology for the extraction of bilingual nominals (terminology candidates) from parallel corpora, using translation patterns. The patterns proposed in this work specify the order changes that occur during transla...

Full description

Bibliographic Details
Main Author: Simões, Alberto (author)
Other Authors: Almeida, José João (author)
Format: article
Language:eng
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/10400.26/34
Country:Portugal
Oai:oai:comum.rcaap.pt:10400.26/34
Description
Summary:Parallel corpora are rich sources of translation resources. This document presents a methodology for the extraction of bilingual nominals (terminology candidates) from parallel corpora, using translation patterns. The patterns proposed in this work specify the order changes that occur during translation and that are intrinsic to the involved languages syntaxes. These patterns are described in a domain specific language named PDL (Pattern Description Language), and are extremely efficient for the detection of nominal phrases.