Semantic Similarity Match for Data Quality
Data quality is a critical aspect of applications that support business operations. Entities are often represented more than once in data repositories, and because duplicate records do not share a common key, they are hard to detect. Duplicate detection over text is usually performed using lexical approac...
Main Author: | |
---|---|
Format: | report |
Language: | por |
Published: | 2009 |
Online Access: | http://hdl.handle.net/10451/14158 |
Country: | Portugal |
Oai: | oai:repositorio.ul.pt:10451/14158 |