Data Matching and Deduplication Over Big Data Using Hadoop Framework
Entity Resolution is the process of matching records from more than one database that refer to the same entity. In case of a single database the process is called deduplication. This article proposes a method to solve entity resolution and deduplication problem using MapReduce over Hadoop framework....
Guardado en:
| Autores principales: | Albanese, Pablo Adrián, Ale, Juan M. |
|---|---|
| Formato: | Objeto de conferencia |
| Lenguaje: | Inglés |
| Publicado: |
2016
|
| Materias: | |
| Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/56751 |
| Aporte de: |
Ejemplares similares
-
NoSQL en sistemas distribuidos sobre cluster Hadoop
por: Martín, Adriana Elizabeth, et al.
Publicado: (2016) -
Data stream treatment using sliding windows with MapReduce
por: Basgall, María José, et al.
Publicado: (2016) -
DOMEX: un emulador del framework MapReduce
por: Scoffield, David, et al.
Publicado: (2024) -
New technologies for big multimedia data treatment
por: Barrionuevo, Mercedes, et al.
Publicado: (2013) -
Implementing cloud-based parallel metaheuristics: an overview
por: González, Patricia, et al.
Publicado: (2018)