TextCL: a Python package for NLP preprocessing tasks

Preprocessing text data sets for use in Natural Language Processing tasks is usually a time-consuming and expensive effort. Text data, normally obtained from sources such as, but not limited to, web scraping, scanned documents or PDF files, is typically unstructured and prone to artifacts and other...

ver descrição completa

Detalhes bibliográficos
Autor principal: Petukhova, Alina (author)
Outros Autores: Fachada, Nuno (author)
Formato: article
Idioma:eng
Publicado em: 2022
Assuntos:
Texto completo:http://hdl.handle.net/10437/12937
País:Portugal
Oai:oai:recil.ensinolusofona.pt:10437/12937