TextCL: a Python package for NLP preprocessing tasks
Preprocessing text data sets for use in Natural Language Processing tasks is usually time-consuming and expensive. Text data, normally obtained through web scraping or from sources such as scanned documents or PDF files, among others, is typically unstructured and prone to artifacts and other...
Main Author: | |
---|---|
Other Authors: | |
Format: | article |
Language: | eng |
Published: | 2022 |
Subjects: | |
Online Access: | http://hdl.handle.net/10437/12937 |
Country: | Portugal |
Oai: | oai:recil.ensinolusofona.pt:10437/12937 |