Semantic Similarity Match for Data Quality

Data quality is a critical aspect of applications that support business operations. Often entities are represented more than once in data repositories. Since duplicate records do not share a common key, they are hard to detect. Duplicate detection over text is usually performed using lexical approac...

Bibliographic details
Main author: Martins, Fernando (author)
Other authors: Falcão, André (author), Couto, Francisco M. (author)
Format: report
Language: Portuguese (por)
Published: 2009
Subjects:
Full text: http://hdl.handle.net/10451/14158
Country: Portugal
OAI: oai:repositorio.ul.pt:10451/14158