Semantic Similarity Match for Data Quality

Data quality is a critical aspect of applications that support business operations. Often entities are represented more than once in data repositories. Since duplicate records do not share a common key, they are hard to detect. Duplicate detection over text is usually performed using lexical approac...

Full description

Bibliographic Details
Main Author: Martins, Fernando (author)
Other Authors: Falcão, André (author), Couto, Francisco M. (author)
Format: report
Language:por
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/10451/14158
Country:Portugal
Oai:oai:repositorio.ul.pt:10451/14158