The Evolutionary Pre-Processor (version 3.0)
Batch Run Report

 

Batch Details
Batch run name diabetes_1
Description Diabetes 1 on Nyquist
This report file final/diabetes_1/diabetes_1_report.html
data file /home/cssip/jsherrah/EPrep3/data/diabetes_1.dat
Time of completion Tue May 12 17:50:27 1998
Duration of Batch 8 hours, 5 minutes, 42 seconds
Random Seed 18446744071562070539 (from clock)
Average generations per run 23.90
Average failed feature creations per run 1316.90
Average fitness evaluations per run 91467.70
Test Set Improvement 15.10 %
 
 

Data Partition
 
Class Training Validation Test Total
0 250 125 125 500
1 134 67 67 268
Total 384 192 192 768
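
The counts above are consistent with a stratified 50/25/25 split, in which each class contributes the same fractions to the training, validation and test sets. The report does not state how the partition was produced, so this is only a reading of the table. A minimal Python sketch that checks it (all names are illustrative and not part of EPrep):

```python
# Counts copied from the Data Partition table: class -> (training, validation, test).
partition = {
    0: (250, 125, 125),
    1: (134, 67, 67),
}


def split_fractions(counts):
    """Return each set's share of one class's samples."""
    total = sum(counts)
    return tuple(n / total for n in counts)


fractions = {cls: split_fractions(counts) for cls, counts in partition.items()}

# Column sums should reproduce the "Total" row of the table.
column_totals = tuple(sum(col) for col in zip(*partition.values()))

for cls, shares in fractions.items():
    print(cls, [f"{share:.0%}" for share in shares])
print("Totals:", column_totals, sum(column_totals))
```

Both classes come out at 50 %, 25 % and 25 %, and the column sums match the Total row (384, 192, 192, 768).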

Summary of Results

  Original Classification Errors (%)
Classifier Training Validation Test
Parallelepiped (PPD) 33.07 34.90 32.29
Min. Distance to Means (MDTM) 37.50 33.33 38.02

  EPrep Best-of-run Classification Errors (%)
Run Training Validation Test McNemar confidence # Features # Inputs # Nodes Classifier
1 27.60 22.92 23.44 1.000 3 8 78 PPD
2 23.70 21.35 22.92 1.000 4 8 109 MDTM
3 25.52 24.48 22.92 1.000 5 8 131 PPD
4 28.65 22.92 25.00 0.999 2 3 6 MDTM
5 26.82 22.92 22.92 1.000 4 8 95 PPD
6 24.48 25.00 20.83 1.000 2 3 10 MDTM
7 23.70 22.40 25.52 0.999 2 3 7 PPD
8 27.60 23.44 23.44 1.000 1 4 21 PPD
9 25.00 19.27 22.92 1.000 2 8 163 PPD
10 26.30 22.92 22.40 1.000 6 8 58 PPD
Ave. 25.94 (1.64) 22.76 (1.51) 23.23 (1.24) 1.00 (0.00) 3.10 (1.51) 6.10 (2.34) 67.80 (53.61) PPD
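
The Ave. row can be reproduced from the ten runs. The bracketed values agree with the population standard deviation (dividing by N rather than N - 1). That interpretation is inferred from the numbers and is not stated in the report. A short Python check, with illustrative names:

```python
from statistics import mean, pstdev

# Columns copied from the best-of-run table (runs 1 to 10).
runs = {
    "training":   [27.60, 23.70, 25.52, 28.65, 26.82, 24.48, 23.70, 27.60, 25.00, 26.30],
    "validation": [22.92, 21.35, 24.48, 22.92, 22.92, 25.00, 22.40, 23.44, 19.27, 22.92],
    "test":       [23.44, 22.92, 22.92, 25.00, 22.92, 20.83, 25.52, 23.44, 22.92, 22.40],
    "features":   [3, 4, 5, 2, 4, 2, 2, 1, 2, 6],
    "inputs":     [8, 8, 8, 3, 8, 3, 3, 4, 8, 8],
    "nodes":      [78, 109, 131, 6, 95, 10, 7, 21, 163, 58],
}

# (mean, population standard deviation) per column, rounded as in the report.
summary = {
    name: (round(mean(values), 2), round(pstdev(values), 2))
    for name, values in runs.items()
}

for name, (avg, std) in summary.items():
    print(f"{name:<10} {avg:.2f} ({std:.2f})")
```

Using `statistics.stdev` (the N - 1 form) instead gives slightly larger spreads, for example about 1.73 rather than 1.64 for the training column.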

  Confusion matrix for Best Ever Individual from run 9
Ground Truth Class Predicted 1 Predicted 2 Total
1 116 9 125
2 35 32 67
Total 151 41 192
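
The confusion matrix totals 192 samples, with 125 and 67 per ground-truth class, which matches the test column of the Data Partition table. Its off-diagonal entries reproduce the 22.92 % test error listed for run 9. A minimal Python sketch of that arithmetic (names are illustrative):

```python
# Rows are ground-truth classes, columns are predicted classes.
confusion = [
    [116, 9],   # ground truth 1
    [35, 32],   # ground truth 2
]

total = sum(sum(row) for row in confusion)
correct = sum(confusion[i][i] for i in range(len(confusion)))

# Overall error: share of samples that fall off the diagonal.
error_pct = 100.0 * (total - correct) / total

# Per-class error: share of each ground-truth class that was misclassified.
per_class_error_pct = [
    100.0 * (sum(row) - row[i]) / sum(row) for i, row in enumerate(confusion)
]

print(f"Overall error: {error_pct:.2f} %")
for label, err in zip((1, 2), per_class_error_pct):
    print(f"Class {label} error: {err:.2f} %")
```

The per-class figures show that most of the remaining errors come from class 2 (35 of 67 samples misclassified) rather than class 1 (9 of 125).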

Average Operator Probabilities
Operator Average Probability
Delete-Feature Mutation 0.097
Add-Feature Mutation 0.099
Hoist Mutation 0.100
Truncate Mutation 0.099
Swap Mutation 0.100
One-Symbol Mutation 0.100
All-Nodes Mutation 0.104
One-Node Mutation 0.094
Grow Mutation 0.101
High-Level Crossover 0.107
 

Number of Run Terminations attributed to each Criterion
Termination Criterion Number of Terminations
TP Criterion 0
GL Criterion 7
Max. Generations 3
Client Abort 0
Zero Validation Error 0
Total 10
 
 

Related Data Files
Filename Description
diabetes_1_gen.m Matlab plot generation function
diabetes_1_bogf_ave.{eps,gif} Best-of-generation Fitness, averaged over 10 runs
diabetes_1_bogv_ave.{eps,gif} Best-of-generation Validation Set Error, averaged over 10 runs
diabetes_1_avef_ave.{eps,gif} Average fitness, averaged over 10 runs
diabetes_1_stdf_ave.{eps,gif} Standard deviation of fitness, averaged over 10 runs
diabetes_1_nftr_ave.{eps,gif} Average number of features per individual, averaged over 10 runs
diabetes_1_nnode_ave.{eps,gif} Average number of nodes per individual, averaged over 10 runs
diabetes_1_nint_ave.{eps,gif} Average number of introns per individual, averaged over 10 runs
diabetes_1_ntrl_ave.{eps,gif} Average number of RAT trials per individual, averaged over 10 runs
diabetes_1_optimp_ave.{eps,gif} Average improvement in fitness due to optimisation, averaged over 10 runs
diabetes_1_opprob_ave.{eps,gif} Average probability of each genetic operator, averaged over 10 runs
diabetes_1_run_x.dat Binary data file containing results of run x (read by Matlab functions)
diabetes_1_bor_x.prep (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Best-of-run individual for run x
diabetes_1_run_x.corr (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Feature correlation file for run x
diabetes_1_tst_bor_x.pred (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Test set predictions for best-of-run individual from run x
diabetes_1_bogf_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Best-of-generation Fitness for run x
diabetes_1_bogv_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Best-of-generation Validation Set Error for run x
diabetes_1_avef_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Average fitness for run x
diabetes_1_stdf_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Standard deviation of fitness for run x
diabetes_1_nftr_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Average number of features per individual for run x
diabetes_1_nnode_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Average number of nodes per individual for run x
diabetes_1_nint_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Average number of introns per individual for run x
diabetes_1_ntrl_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Average number of RAT trials per individual for run x
diabetes_1_optimp_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Average improvement in fitness due to optimisation for run x
diabetes_1_opprob_x.{eps,gif} (x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10) Average probability of each genetic operator for run x
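
The per-run entries above use x as a placeholder for the run number and {eps,gif} for the available plot formats. A small Python sketch that expands such an entry into concrete file names follows. The helper `per_run_files` is hypothetical, and it assumes that x is replaced by the plain run number without zero padding, which the report does not confirm:

```python
def per_run_files(batch, tag, extensions, runs=range(1, 11)):
    """Expand a 'batch_tag_x.ext' entry into concrete file names.

    Assumption: the placeholder x becomes the unpadded run number.
    """
    return [f"{batch}_{tag}_{x}.{ext}" for x in runs for ext in extensions]


# Best-of-generation fitness plots for all ten runs, in both formats.
plots = per_run_files("diabetes_1", "bogf", ["eps", "gif"])

# Test set predictions for each best-of-run individual.
preds = per_run_files("diabetes_1", "tst_bor", ["pred"])

print(plots[:2])
print(preds[-1])
```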