Data Matching and Deduplication Over Big Data Using Hadoop Framework

Entity resolution is the process of matching records from more than one database that refer to the same entity. When the records come from a single database, the process is called deduplication. This article proposes a method to solve the entity resolution and deduplication problems using MapReduce on the Hadoop framework...

Bibliographic Details
Main authors: Albanese, Pablo Adrián; Ale, Juan M.
Format: Conference object
Language: English
Published: 2016
Online access: http://sedici.unlp.edu.ar/handle/10915/56751