Filtering useless data at the source

There are some processing environments where an application reads remote sequential files with a large number of records only to use some of them. Examples of those environments are servers, proxies, firewall and intrusion detection log analysis tools, sensor log analysis, large scientific datasets pro...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Pessolani, Pablo, Quaglia, Constanza, Nou, Ramón
Formato: Documento de conferencia publisherVersion
Lenguaje:Inglés
Inglés
Publicado: XXV Congreso Argentino de Ciencias de la Computación 2021
Materias:
NFS
Acceso en línea:http://hdl.handle.net/20.500.12272/5360
Aporte de:
id I68-R174-20.500.12272-5360
record_format dspace
institution Universidad Tecnológica Nacional
institution_str I-68
repository_str R-174
collection RIA - Repositorio Institucional Abierto (UTN)
language Inglés
Inglés
topic Logging
Network File System
NFS
spellingShingle Logging
Network File System
NFS
Pessolani, Pablo
Quaglia, Constanza
Nou, Ramón
Filtering useless data at the source
topic_facet Logging
Network File System
NFS
description There are some processing environments where an application reads remote sequential files with a large number of records only to use some of them. Examples of those environments are servers, proxies, firewall and intrusion detection log analysis tools, sensor log analysis, large scientific datasets processing, etc. To be processed, all file records must be transferred through the network, and all of them must be processed by the application. Some of the transferred records would be discarded immediately by the application because it has no interest in them, but they just consumed network bandwidth and operating system’s cache buffers. This article proposes to filter records from the source of data but without changing the application. Those records of interest will be transferred without modifications but only references to the other records will be transferred from the source to the consuming application. At the application side, the sequence of records is rebuilt, keeping the content of records of interest and filling the others with dummy values which will be discarded by the application. As the number and length of records are preserved (and therefore the file size), it is not necessary to modify the application. Once a filtering rule is applied to a file, only the useful records and references to unuseful ones will be transferred to the application side reducing network usage, transfer time, and cache utilization. A modified (but compatible) version of NFS protocol was developed as a proof of concept.
format Documento de conferencia
publisherVersion
author Pessolani, Pablo
Quaglia, Constanza
Nou, Ramón
author_facet Pessolani, Pablo
Quaglia, Constanza
Nou, Ramón
author_sort Pessolani, Pablo
title Filtering useless data at the source
title_short Filtering useless data at the source
title_full Filtering useless data at the source
title_fullStr Filtering useless data at the source
title_full_unstemmed Filtering useless data at the source
title_sort filtering useless data at the source
publisher XXV Congreso Argentino de Ciencias de la Computación
publishDate 2021
url http://hdl.handle.net/20.500.12272/5360
work_keys_str_mv AT pessolanipablo filteringuselessdataatthesource
AT quagliaconstanza filteringuselessdataatthesource
AT nouramon filteringuselessdataatthesource
bdutipo_str Repositorios
_version_ 1764820551871758340