Data quality measurement framework
dc.contributor.author | Fereira M. | |
dc.contributor.author | Silva L.A. | |
dc.date.accessioned | 2024-03-12T23:56:32Z | |
dc.date.available | 2024-03-12T23:56:32Z | |
dc.date.issued | 2018 | |
dc.description.abstract | © 2018 IEEE. Data quality evaluation is fundamental to Knowledge Data Discovery projects. Some project frameworks, such as CRISP-DM and DAMA DMBOK, recommend preparing a Data Quality Report as a tool to describe the problems found during the data exploration phase and an approach to fixing them. However, those frameworks are very generic in their guidelines: they specify neither what exactly should be measured nor how to relate any measure to data quality. Data Profiling tools, and some ETL (Extraction, Transformation and Loading) tools as well, implement basic Statistical Description tooling, but they do not propose any general methodology for quantitatively evaluating the quality of a data set, with the possible exception of the IBM Watson Analytics tool. This article proposes a quantitative measure for data quality evaluation, based on Statistical Description tools. | |
dc.description.firstpage | 455 | |
dc.description.lastpage | 463 | |
dc.identifier.doi | 10.1109/CLEI.2018.00061 | |
dc.identifier.uri | https://dspace.mackenzie.br/handle/10899/35454 | |
dc.relation.ispartof | Proceedings - 2018 44th Latin American Computing Conference, CLEI 2018 | |
dc.rights | Restricted Access | |
dc.subject.otherlanguage | Data Mining | |
dc.subject.otherlanguage | Data Governance | |
dc.subject.otherlanguage | Data Profiling | |
dc.subject.otherlanguage | Data Quality | |
dc.subject.otherlanguage | Preprocessing | |
dc.title | Data quality measurement framework | |
dc.type | Conference paper | |
local.scopus.citations | 1 | |
local.scopus.eid | 2-s2.0-85071120655 | |
local.scopus.subject | Analytics tools | |
local.scopus.subject | Data exploration | |
local.scopus.subject | Data governances | |
local.scopus.subject | Data profiling | |
local.scopus.subject | Data quality | |
local.scopus.subject | Preprocessing | |
local.scopus.subject | Quantitative measures | |
local.scopus.subject | Statistical descriptions | |
local.scopus.updated | 2024-05-01 | |
local.scopus.url | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85071120655&origin=inward |