Data quality measurement framework

dc.contributor.authorFereira M.
dc.contributor.authorSilva L.A.
dc.date.accessioned2024-03-12T23:56:32Z
dc.date.available2024-03-12T23:56:32Z
dc.date.issued2018
dc.description.abstract© 2018 IEEE.Data Quality evaluation is a key fundamental in Knowledge Data Discovery projects. There are some project frameworks, like CRISP-DM and DAMA DMBOK, that recommend the preparation of the Data Quality Report, as a tool to describe the found problems during the data exploration phase and to describe an approach to fix those problems. However, those frameworks are very generic in their guidelines and neither tell what exactly should be measured nor how to associate any measure to the data quality. Data Profiling tools and some ETL(Extraction, Transformation and Loading) tools as well, implement some basic Statistical Description tooling, but they do not propose any general methodolgy to evaluate quantitatively the quality of a set of data, except, perhaps, in the IBM Watson Analytics tool. This article proposes a quantitative measure for data quality evaluation, based on Statistical Description tools.
dc.description.firstpage455
dc.description.lastpage463
dc.identifier.doi10.1109/CLEI.2018.00061
dc.identifier.urihttps://dspace.mackenzie.br/handle/10899/35454
dc.relation.ispartofProceedings - 2018 44th Latin American Computing Conference, CLEI 2018
dc.rightsAcesso Restrito
dc.subject.otherlanguageDat Mining
dc.subject.otherlanguageData Governance
dc.subject.otherlanguageData Profiling
dc.subject.otherlanguageData Quality
dc.subject.otherlanguagePreprocessing
dc.titleData quality measurement framework
dc.typeArtigo de evento
local.scopus.citations1
local.scopus.eid2-s2.0-85071120655
local.scopus.subjectAnalytics tools
local.scopus.subjectData exploration
local.scopus.subjectData governances
local.scopus.subjectData profiling
local.scopus.subjectData quality
local.scopus.subjectPreprocessing
local.scopus.subjectQuantitative measures
local.scopus.subjectStatistical descriptions
local.scopus.updated2024-05-01
local.scopus.urlhttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85071120655&origin=inward
Arquivos