TextCL: a Python package for NLP preprocessing tasks

Preprocessing text data sets for use in Natural Language Processing tasks is usually a time-consuming and expensive effort. Text data, normally obtained from sources such as, but not limited to, web scraping, scanned documents or PDF files, is typically unstructured and prone to artifacts and other...

Full description

Bibliographic Details
Main Author: Petukhova, Alina (author)
Other Authors: Fachada, Nuno (author)
Format: article
Language:eng
Published: 2022
Subjects:
Online Access:http://hdl.handle.net/10437/12937
Country:Portugal
Oai:oai:recil.ensinolusofona.pt:10437/12937