2010
DOI: 10.1007/978-3-642-16239-8_38
|View full text |Cite
|
Sign up to set email alerts
|

On the Problem of Attribute Selection for Software Cost Estimation: Input Backward Elimination Using Artificial Neural Networks

Abstract: Abstract. Many parameters affect the cost evolution of software projects. In the area of software cost estimation and project management the main challenge is to understand and quantify the effect of these parameters, or 'cost drivers', on the effort expended to develop software systems. This paper aims at investigating the effect of cost attributes on software development effort using empirical databases of completed projects and building Artificial Neural Network (ANN) models to predict effort. Prediction pe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0

Year Published

2012
2012
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 10 publications
0
3
0
Order By: Relevance
“…The sequential backward selection technique removes one least important feature from the subset, based on a fixed criterion in every iteration. The selection process starts with a full set of features and stops when a predefined threshold for the number of variables is reached or further removal of features does not improve the performance [12]. We used mutual information as the selection criteria and set the selection threshold to seven variables.…”
Section: Methodsmentioning
confidence: 99%
“…The sequential backward selection technique removes one least important feature from the subset, based on a fixed criterion in every iteration. The selection process starts with a full set of features and stops when a predefined threshold for the number of variables is reached or further removal of features does not improve the performance [12]. We used mutual information as the selection criteria and set the selection threshold to seven variables.…”
Section: Methodsmentioning
confidence: 99%
“…In the second procedure, the neural network model is estimated with all the selected variables as inputs, and the relative importance of every input variable is calculated. As a nonlinear model, determining the relative importance of variables is more difficult than in linear regression models, and Garson's method is used in this study, because it was proved that the neural network can identify the most influential input variables from a given variable list by Garson's method (Garson 1991), and recently, the reliability of Garson's method in the variable selection process of three-layer neural network is verified to be better than other widely used methods, such as correlation method and principal component analysis (Papatheocharous and Andreou 2010;Fischer 2015;Yousefi et al 2018;Liu et al 2018). According to Garson's method (Garson 1991), for a neural network model with N neurons in the input layer and L neurons in the hidden layer, the relative importance of the ith input variable to the kth output variable ( I ik ) can be defined as where ij is the weight of the ith neuron in the input layer and jth neuron in the hidden layer, and jk is the weight of the jth neuron in the hidden layer and kth neuron in the output layer.…”
Section: Neural Network Model Selection Processmentioning
confidence: 99%
“…After determining the input variables, output variables, and number of neurons in the hidden layer, the actual structure of the neural network model is determined. Garson's method is capable of quantifying the relative importance and select variables (Papatheocharous and Andreou 2010;Fischer 2015;Yousefi et al 2018;Liu et al 2018). Other methods such as Olden's method and SHapley Additive exPlanations (SHAP) values can quantify the intensity and direction of each input variable's contribution to each output variable (Olden and Jackson 2002;Lundberg and Lee 2017), which is helpful to understand the relationship between each input and output variable in the model.…”
Section: Parameter Analysismentioning
confidence: 99%
“…Functional Link Artificial Neural Network (FLANN) based software cost estimation (FBE), which is essentially a machine learning technique, was introduced by Rao et al (2009). Due to its conceptual simplicity and empirical competitiveness, FLANN has been extensively studied and applied (Jodpimai et al, 2010;Papatheocharous et al, 2010;Abhishek et al, 2010;Khatibi et al, 2011). FLANN is basically a flat net and the need of the hidden layer is removed and hence, the BP learning algorithm used in this network becomes very simple, originally proposed by Pao et al (1992).…”
Section: Introductionmentioning
confidence: 99%