
A Taxonomy of Data Quality Challenges in Empirical Software Engineering

Bosu, MF; MacDonell, SG
Bosu and MacDonell (2013b) ASWEC.pdf (215.8Kb)
Permanent link
http://hdl.handle.net/10292/10005
Abstract
Reliable empirical models, such as those used in software effort estimation or defect prediction, are inherently dependent on the data from which they are built. As demands for process and product improvement continue to grow, the quality of the data used in measurement and prediction systems warrants increasingly close scrutiny. In this paper we propose a taxonomy of data quality challenges in empirical software engineering, based on an extensive review of prior research. For each quality issue we consider current assessment techniques and, where available, the mechanisms proposed to address it. Our taxonomy classifies data quality issues into three broad areas: first, characteristics of data that make them unfit for modeling; second, data set characteristics that raise concerns about the suitability of applying a given model to another data set; and third, factors that prevent or limit data accessibility and trust. We identify this last area as being in particular need of further research. © 2013 IEEE.
Keywords
Accessibility; Commercial sensitivity; Data quality; Empirical software engineering; Provenance; Trustworthiness
Date
2013
Source
Proceedings of the 22nd Australian Software Engineering Conference (ASWEC 2013), Melbourne, Australia, pp. 97-106. doi: 10.1109/ASWEC.2013.21
Item Type
Journal Article
Publisher
IEEE
DOI
10.1109/ASWEC.2013.21
Publisher's Version
http://dx.doi.org/10.1109/ASWEC.2013.21
Rights Statement
Copyright © 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Hosted by Tuwhera, an initiative of the Auckland University of Technology Library

 

 
