An n-gram cache for large-scale parallel extraction of multiword relevant expressions with LocalMaxs

LocalMaxs extracts relevant multiword terms based on their cohesion but is computationally intensive, a critical issue for very large natural language corpora. The corpus properties concerning n-gram distribution determine the algorithm complexity and were empirically analyzed for corpora up to 982...

ver descrição completa

Detalhes bibliográficos
Autor principal: Gonçalves, Carlos (author)
Outros Autores: Silva, Joaquim F. (author), Cunha, José C. (author)
Formato: conferenceObject
Idioma:eng
Publicado em: 2019
Assuntos:
Texto completo:http://hdl.handle.net/10400.21/9637
País:Portugal
Oai:oai:repositorio.ipl.pt:10400.21/9637