Further Causal Search Analyses with UCC’s Effort Estimation Data

March 30, 2018 • Conference Paper

By
Anandi Hira, Barry Boehm (University of California, Los Angeles), Robert W. Stoddard, and Michael D. Konrad

This paper introduces causal discovery to software engineering research by exploring additional causal search algorithms and fast adjacency skewness and comparing their results to the traditional multi-step regression analysis.

Publisher

Acquisition Research Program of the Naval Postgraduate School

Topic or Tag

Measurement And Analysis

Abstract

Correlation does not imply causation. Though this is a well-known fact, most analyses depend on correlation as proof of relationships that are often treated as causal. Causal search, also referred to as causal discovery, involves the application of statistical methods to identify causal relationships using conditional independences (and/or other statistical relationships) within data. Though software cost estimation models use both domain knowledge and statistics, to date, there has yet to be a published report describing the evaluation of a software dataset using causal search. In a previous paper, the authors ran a PC causal search algorithm on the Unified Code Count’s (UCC’s) dataset of maintenance tasks and compared them to correlation test results. This paper builds on the previous paper to introduce causal discovery to software engineering research by exploring additional causal search algorithms (PC-Stable, fast greedy equivalent search [FGES], and fast adjacency skewness [FASK]) and comparing their results to the traditional multi-step regression analysis.

Software Engineering Institute