A prototypical tool for analyzing functional dependencies induced from spreadsheets
We present an extension to the GF framework for OntologyBased Data Access with the aim of determining the functional dependencies that hold in a spreadsheet. Spreadsheets are restricted to a single table expressed as a CSV text file. An initial set of tentative functional dependencies is computed us...
Guardado en:
| Autores principales: | , |
|---|---|
| Formato: | Objeto de conferencia |
| Lenguaje: | Inglés |
| Publicado: |
2023
|
| Materias: | |
| Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/164976 |
| Aporte de: |
| id |
I19-R120-10915-164976 |
|---|---|
| record_format |
dspace |
| spelling |
I19-R120-10915-1649762024-04-17T20:03:31Z http://sedici.unlp.edu.ar/handle/10915/164976 A prototypical tool for analyzing functional dependencies induced from spreadsheets Gómez, Sergio Alejandro Fillottrani, Pablo Rubén 2023-10 2024 2024-04-17T17:20:03Z en Ciencias Informáticas Spreadsheets TANE Functional dependencies Databases We present an extension to the GF framework for OntologyBased Data Access with the aim of determining the functional dependencies that hold in a spreadsheet. Spreadsheets are restricted to a single table expressed as a CSV text file. An initial set of tentative functional dependencies is computed using the TANE datamining algorithm. This set is then presented to the user who is used as an oracle to revise it. Given a functional dependency, the user can see the tuples from the spreadsheet justifying it. The user can revise the validity of the functional dependency with the help of our system, which will generate tuples not present in the dataset by using values already present in the table. The user can then add some of the new records to the table when he considers their feasibility and rerun the miner to see if the functional dependency still holds. We present a running example along with a downloadable JAVA-based application with source code of the miner in the C programming language and the files used in our experiments to help with the reproducibility of our results. Red de Universidades con Carreras en Informática Objeto de conferencia Objeto de conferencia http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) application/pdf 409-417 |
| institution |
Universidad Nacional de La Plata |
| institution_str |
I-19 |
| repository_str |
R-120 |
| collection |
SEDICI (UNLP) |
| language |
Inglés |
| topic |
Ciencias Informáticas Spreadsheets TANE Functional dependencies Databases |
| spellingShingle |
Ciencias Informáticas Spreadsheets TANE Functional dependencies Databases Gómez, Sergio Alejandro Fillottrani, Pablo Rubén A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| topic_facet |
Ciencias Informáticas Spreadsheets TANE Functional dependencies Databases |
| description |
We present an extension to the GF framework for OntologyBased Data Access with the aim of determining the functional dependencies that hold in a spreadsheet. Spreadsheets are restricted to a single table expressed as a CSV text file. An initial set of tentative functional dependencies is computed using the TANE datamining algorithm. This set is then presented to the user who is used as an oracle to revise it. Given a functional dependency, the user can see the tuples from the spreadsheet justifying it. The user can revise the validity of the functional dependency with the help of our system, which will generate tuples not present in the dataset by using values already present in the table. The user can then add some of the new records to the table when he considers their feasibility and rerun the miner to see if the functional dependency still holds. We present a running example along with a downloadable JAVA-based application with source code of the miner in the C programming language and the files used in our experiments to help with the reproducibility of our results. |
| format |
Objeto de conferencia Objeto de conferencia |
| author |
Gómez, Sergio Alejandro Fillottrani, Pablo Rubén |
| author_facet |
Gómez, Sergio Alejandro Fillottrani, Pablo Rubén |
| author_sort |
Gómez, Sergio Alejandro |
| title |
A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_short |
A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_full |
A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_fullStr |
A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_full_unstemmed |
A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_sort |
prototypical tool for analyzing functional dependencies induced from spreadsheets |
| publishDate |
2023 |
| url |
http://sedici.unlp.edu.ar/handle/10915/164976 |
| work_keys_str_mv |
AT gomezsergioalejandro aprototypicaltoolforanalyzingfunctionaldependenciesinducedfromspreadsheets AT fillottranipabloruben aprototypicaltoolforanalyzingfunctionaldependenciesinducedfromspreadsheets AT gomezsergioalejandro prototypicaltoolforanalyzingfunctionaldependenciesinducedfromspreadsheets AT fillottranipabloruben prototypicaltoolforanalyzingfunctionaldependenciesinducedfromspreadsheets |
| _version_ |
1807222954135650304 |