Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH
In this paper we propose the creation of generic LSH families for the angular distance based on Johnson-Lindenstrauss projections. We show that feature hashing is a valid J-L projection and propose two new LSH families based on feature hashing. These new LSH families are tested on both synthetic an...
Guardado en:
| Autores principales: | , |
|---|---|
| Formato: | Objeto de conferencia |
| Lenguaje: | Inglés |
| Publicado: |
2017
|
| Materias: | |
| Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/63169 http://www.clei2017-46jaiio.sadio.org.ar/sites/default/files/Mem/AGRANDA/AGRANDA-01.pdf |
| Aporte de: |
| id |
I19-R120-10915-63169 |
|---|---|
| record_format |
dspace |
| institution |
Universidad Nacional de La Plata |
| institution_str |
I-19 |
| repository_str |
R-120 |
| collection |
SEDICI (UNLP) |
| language |
Inglés |
| topic |
Ciencias Informáticas Locality Sensitive Hashing |
| spellingShingle |
Ciencias Informáticas Locality Sensitive Hashing Argerich, Luis Golmar, Natalia Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH |
| topic_facet |
Ciencias Informáticas Locality Sensitive Hashing |
| description |
In this paper we propose the creation of generic LSH families for the angular distance based on Johnson-Lindenstrauss projections. We show that feature hashing is a valid J-L projection and propose two new LSH families based on feature hashing.
These new LSH families are tested on both synthetic and real datasets with very good results and a considerable performance improvement over other LSH families. While the theoretical analysis is done for the angular distance, these families can also be used in practice for the euclidean distance with excellent results [2]. Our tests using real datasets show that the proposed LSH functions work well for the euclidean distance. |
| format |
Objeto de conferencia Objeto de conferencia |
| author |
Argerich, Luis Golmar, Natalia |
| author_facet |
Argerich, Luis Golmar, Natalia |
| author_sort |
Argerich, Luis |
| title |
Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH |
| title_short |
Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH |
| title_full |
Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH |
| title_fullStr |
Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH |
| title_full_unstemmed |
Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH |
| title_sort |
generic lsh families for the angular distance based on johnson-lindenstrauss projections and feature hashing lsh |
| publishDate |
2017 |
| url |
http://sedici.unlp.edu.ar/handle/10915/63169 http://www.clei2017-46jaiio.sadio.org.ar/sites/default/files/Mem/AGRANDA/AGRANDA-01.pdf |
| work_keys_str_mv |
AT argerichluis genericlshfamiliesfortheangulardistancebasedonjohnsonlindenstraussprojectionsandfeaturehashinglsh AT golmarnatalia genericlshfamiliesfortheangulardistancebasedonjohnsonlindenstraussprojectionsandfeaturehashinglsh |
| bdutipo_str |
Repositorios |
| _version_ |
1764820480535035905 |