Using machine learning algorithms to identify named entities in legal documents: a preliminary approach

This paper deals with accuracy and performance of var- ious machine learning algorithms in the recognition and extraction of different types of named entities such as date, organization, reg- ulation laws and person. The experiment is based on 20 judicial decision documents from European Lex site. T...

ver descrição completa

Detalhes bibliográficos
Autor principal: Poudyal, Prakash (author)
Outros Autores: Borrego, Luís (author), Quaresma, Paulo (author)
Formato: article
Idioma:eng
Publicado em: 2012
Assuntos:
Texto completo:http://hdl.handle.net/10174/4899
País:Portugal
Oai:oai:dspace.uevora.pt:10174/4899
Descrição
Resumo:This paper deals with accuracy and performance of var- ious machine learning algorithms in the recognition and extraction of different types of named entities such as date, organization, reg- ulation laws and person. The experiment is based on 20 judicial decision documents from European Lex site. The obtained results were proposed for the selection of the best algorithm that selects appropriate maximum entities from the legal documents. To ver- ify the performance of algorithm, obtained data from the tagging entities were compared with manual work as reference.