A prototypical tool for analyzing functional dependencies induced from spreadsheets

We present an extension to the GF framework for OntologyBased Data Access with the aim of determining the functional dependencies that hold in a spreadsheet. Spreadsheets are restricted to a single table expressed as a CSV text file. An initial set of tentative functional dependencies is computed us...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Gómez, Sergio Alejandro, Fillottrani, Pablo Rubén
Formato: Objeto de conferencia
Lenguaje:Inglés
Publicado: 2023
Materias:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/164976
Aporte de:
id I19-R120-10915-164976
record_format dspace
spelling I19-R120-10915-1649762024-04-17T20:03:31Z http://sedici.unlp.edu.ar/handle/10915/164976 A prototypical tool for analyzing functional dependencies induced from spreadsheets Gómez, Sergio Alejandro Fillottrani, Pablo Rubén 2023-10 2024 2024-04-17T17:20:03Z en Ciencias Informáticas Spreadsheets TANE Functional dependencies Databases We present an extension to the GF framework for OntologyBased Data Access with the aim of determining the functional dependencies that hold in a spreadsheet. Spreadsheets are restricted to a single table expressed as a CSV text file. An initial set of tentative functional dependencies is computed using the TANE datamining algorithm. This set is then presented to the user who is used as an oracle to revise it. Given a functional dependency, the user can see the tuples from the spreadsheet justifying it. The user can revise the validity of the functional dependency with the help of our system, which will generate tuples not present in the dataset by using values already present in the table. The user can then add some of the new records to the table when he considers their feasibility and rerun the miner to see if the functional dependency still holds. We present a running example along with a downloadable JAVA-based application with source code of the miner in the C programming language and the files used in our experiments to help with the reproducibility of our results. Red de Universidades con Carreras en Informática Objeto de conferencia Objeto de conferencia http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) application/pdf 409-417
institution Universidad Nacional de La Plata
institution_str I-19
repository_str R-120
collection SEDICI (UNLP)
language Inglés
topic Ciencias Informáticas
Spreadsheets
TANE
Functional dependencies
Databases
spellingShingle Ciencias Informáticas
Spreadsheets
TANE
Functional dependencies
Databases
Gómez, Sergio Alejandro
Fillottrani, Pablo Rubén
A prototypical tool for analyzing functional dependencies induced from spreadsheets
topic_facet Ciencias Informáticas
Spreadsheets
TANE
Functional dependencies
Databases
description We present an extension to the GF framework for OntologyBased Data Access with the aim of determining the functional dependencies that hold in a spreadsheet. Spreadsheets are restricted to a single table expressed as a CSV text file. An initial set of tentative functional dependencies is computed using the TANE datamining algorithm. This set is then presented to the user who is used as an oracle to revise it. Given a functional dependency, the user can see the tuples from the spreadsheet justifying it. The user can revise the validity of the functional dependency with the help of our system, which will generate tuples not present in the dataset by using values already present in the table. The user can then add some of the new records to the table when he considers their feasibility and rerun the miner to see if the functional dependency still holds. We present a running example along with a downloadable JAVA-based application with source code of the miner in the C programming language and the files used in our experiments to help with the reproducibility of our results.
format Objeto de conferencia
Objeto de conferencia
author Gómez, Sergio Alejandro
Fillottrani, Pablo Rubén
author_facet Gómez, Sergio Alejandro
Fillottrani, Pablo Rubén
author_sort Gómez, Sergio Alejandro
title A prototypical tool for analyzing functional dependencies induced from spreadsheets
title_short A prototypical tool for analyzing functional dependencies induced from spreadsheets
title_full A prototypical tool for analyzing functional dependencies induced from spreadsheets
title_fullStr A prototypical tool for analyzing functional dependencies induced from spreadsheets
title_full_unstemmed A prototypical tool for analyzing functional dependencies induced from spreadsheets
title_sort prototypical tool for analyzing functional dependencies induced from spreadsheets
publishDate 2023
url http://sedici.unlp.edu.ar/handle/10915/164976
work_keys_str_mv AT gomezsergioalejandro aprototypicaltoolforanalyzingfunctionaldependenciesinducedfromspreadsheets
AT fillottranipabloruben aprototypicaltoolforanalyzingfunctionaldependenciesinducedfromspreadsheets
AT gomezsergioalejandro prototypicaltoolforanalyzingfunctionaldependenciesinducedfromspreadsheets
AT fillottranipabloruben prototypicaltoolforanalyzingfunctionaldependenciesinducedfromspreadsheets
_version_ 1807222954135650304