Tagging with Disambiguation Rules - A New Evolutionary Approach to the Part-of-Speech Tagging Problem

{In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal of part-of-speech tagging is to assign to each word of a text its part-of-speech. The task is not straightforward, because a large percentage of words has more than one possible part-of-speech, and the...

Full description

Bibliographic Details
Main Author: Silva, Ana Paula (author)
Other Authors: Silva, Arlindo (author), Rodrigues, Irene Pimenta (author)
Format: bookPart
Language:eng
Published: 2013
Subjects:
Online Access:http://hdl.handle.net/10174/7592
Country:Portugal
Oai:oai:dspace.uevora.pt:10174/7592
Description
Summary:{In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal of part-of-speech tagging is to assign to each word of a text its part-of-speech. The task is not straightforward, because a large percentage of words has more than one possible part-of-speech, and the right choice is determined by the surrounding word's part-of-speeches. This means that to solve this problem we need a method to disambiguate a word's possible tags set. Traditionally there are two groups of methods used to tackle this task. The first group is based on statistical data concerning the different context's possibilities for a word, while the second group is based on rules that capture the language properties. Normally these rules are designed by human experts. In this work we present a solution that tries to incorporate both these approaches. The proposed system is divided in two components. First, we use an evolutionary algorithm that for each part-of-speech tag of the training corpus, evolves a set of disambiguation rules. We then use a second evolutionary algorithm to solve the tagging problem. In this component the evolution is guided by the rules found earlier.