Urinary inflammation diagnosis
By Roberto Lopez, Artelnics.
In this tutorial a classification application in medicine is solved by means of a neural network. In particular, the goal is to diagnose acute inflammations/nephritises of urinary bladder. The data for this problem has been taken from the UCI Machine Learning Repository.
The goal of this study is to obtain a model that can diagnose the disease of the acute inflammations of urinary bladder. This data set could be also used to diagnose the acute nephritises.
The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of urinary system. The basis for rules detection was Rough Sets Theory. Each instance represents an potential patient.
As the objective is to get a model that can diagnose the first of the diseases, the variable of acute nephritises diagnosis will be set as unused.
The next figure shows the data set page in Neural Designer. It contains four sections:
- Data file.
- Variables information.
- Instances information.
- Missing values information
Neural Designer shows a preview of the data file and says that the number of columns is 8 and the number of rows is 120.
The instances are divided into a training, a selection and a testing subsets. They represent 60%(72) , 20% (24) and 20% (24) of the original instances, respectively, and have been splitted at random.
The second step is to choose a network architecture to represent the classification function. For classification problems, it is composed by:
- Scaling layer.
- Neural network.
- Probabilistic layer.
The next figure shows the neural network page in Neural Designer.
The scaling layer section contains information about the method for scaling the input variables and the statistic values to be used by that method. In this example, we will use the minimum and maximum method for scaling the inputs. The mean and standard deviation would also be appropriate here.
In this case, the neural network structure has 6 inputs, 6 hidden preceptrons and 1 output. This neural network can be denoted as 6:6:1. The next image represents it.
The third step is to set the loss index, which is composed by:
- Error term.
- Regularization term.
The error term chosen for this application is the normalized squared error.
On the other hand, the regularization term is the neural parameters norm. The weight for this term is 0.001. Regularization has two effects here:
- it makes the model to be stable, without oscilations and
- it avoids saturation of the logistic activation functions.
The learning problem can be stated as to find a neural network which minimizes the loss index, i.e., a neural network that fits the data set (objective) and that does not oscillate (regularization).
The next step in solving this problem is to assign the training strategy.
The next figure shows the training strategy page in Neural Designer.
The neural network is trained in order to obtain the best possible performance.
The next table shows the training results by the quasi-Newton method. We can see that the performance and generalization performance are small and the gradient norm is almost zero.
The last step is to validate the generalization performance of the trained neural network. To validate a classification technique we need to compare the values provided by this technique to the actually observed values.
The following table contains the elements of the confusion matrix. The element (0,0) contains the true positives, the element (0,1) contains the false positives, the element (1,0) contains the false negatives, and the element (1,1) contains the true negatives for the variable diagnose. The number of correctly classified instances is 24, and the number of misclassified instances is 0.
We can also perform a ROC curve analysis. ROC curve is computed by plotting in the x-axis the 1-specificity and in the y-axis the sensitivity for different thresholds. ROC curve for a perfect classifier passes through the upper left corner, i.e., the point (0,1), which has 100% sensitivity and 100% specificity. In consequence, the closer to upper left corner ROC curve, the better the discrimination capacity. This can be also measured with the area under curve (AUC) parameter. For a perfect classifier the AUC is 1. The next figure shows the results of this analysis in this case.
The area under curve is 1. These results illustrate the good perfomance of the model.
The neural network is now ready to predict outputs for inputs that it has never seen.
The "Calculate outputs" task will diagnose inflammation of urinary bladder from the new values that we will type in the dialog. The next figure shows the dialog where the user types the input values.
Then the prediction is written in the viewer.
The "Write expression" task exports to the report the mathematical expression of the trained and tested neural network. That expression is listed below.