An n-gram cache for large-scale parallel extraction of multiword relevant expressions with LocalMaxs
LocalMaxs extracts relevant multiword terms based on their cohesion but is computationally intensive, a critical issue for very large natural language corpora. The corpus properties concerning n-gram distribution determine the algorithm complexity and were empirically analyzed for corpora up to 982...
Autor principal: | |
---|---|
Outros Autores: | , |
Formato: | conferenceObject |
Idioma: | eng |
Publicado em: |
2019
|
Assuntos: | |
Texto completo: | http://hdl.handle.net/10400.21/9637 |
País: | Portugal |
Oai: | oai:repositorio.ipl.pt:10400.21/9637 |