The Evolutionary Pre-Processor (version 3.0)
Batch Run Report

 

Batch Details
Batch run name diabetes_2
Description Diabetes 2 on Hilbert
This report file final/diabetes_2/diabetes_2_report.html
data file /home/cssip/jsherrah/EPrep3/data/diabetes_2.dat
Time of completion Wed May 27 13:34:49 1998
Duration of Batch 3 hours, 41 minutes, 17 seconds
Random Seed 18446744072275796104 (from clock)
Average generations per run 40.90
Average failed feature creations per run 1258.70
Average fitness evalutions per run 140946.70
Test Set Improvement 9.90 %
 
 

Data Partition
 
Class Training Validation Test Total
0 250 125 125 500
1 134 67 67 268
Total 384 192 192 768

Summary of Results

  Original Classification Errors (%)
Classifier Training Validation Test
Parallelepiped(PPD) 31.51 34.38 34.38
Min. Distance to Means(MDTM) 36.72 34.90 33.85

  EPrep Best-of-run Classification Errors (%)
Run
Training
Validation
Test
McNemar confidence
# Features
# Inputs
# Nodes
Classifier
1 23.96 23.44 32.81 0.618 3 5 22 PPD
2 22.40 22.92 27.60 0.957 5 5 35 MDTM
3 22.66 23.96 28.65 0.903 4 8 52 PPD
4 24.22 22.40 23.96 0.999 4 8 48 PPD
5 23.70 23.96 29.69 0.816 1 3 5 PPD
6 22.66 22.92 28.65 0.913 3 8 29 PPD
7 23.96 22.40 38.54 0.230 4 3 11 PPD
8 24.22 22.40 23.96 0.999 3 8 109 PPD
9 22.66 22.40 24.48 0.995 1 6 21 MDTM
10 22.92 24.48 32.81 0.655 7 8 87 PPD
Ave. 23.33 (0.70 ) 23.12 (0.74 ) 29.11 (4.40 ) 0.81 (0.23 ) 3.50 (1.69 ) 6.20 (1.99 ) 41.90 (31.68 ) PPD

  Confusion matrix for Best Ever Individual from run 9
Class
Predicted
Total
1
2
Ground Truth
1
107 18 125
2
29 38 67
Total
136 56 192

Average Operator Probabilities
Operator Average Probability
Delete-Feature Mutation 0.096
Add-Feature Mutation 0.105
Hoist Mutation 0.096
Truncate Mutation 0.095
Swap Mutation 0.097
One-Symbol Mutation 0.103
All-Nodes Mutation 0.094
One-Node Mutation 0.093
Grow Mutation 0.097
High-Level Crossover 0.123
 

Number of Run Terminations attributed to each Criterion
Termination Criterion Number of Terminations
TP Criterion 0
GL Criterion 1
Max. Generations 9
Client Abort 0
Zero Validation Error 0
Total 10
 
 

Related Data Files
Description Filename
diabetes_2_gen.m Matlab plot generation function
diabetes_2_bogf_ave.{eps,gif} Best-of-generation Fitness, averaged over 10 runs
diabetes_2_bogv_ave.{eps,gif} Best-of-generation Validation Set Error, averaged over 10 runs
diabetes_2_avef_ave.{eps,gif} Average fitness, averaged over 10 runs
diabetes_2_stdf_ave.{eps,gif} Standard deviation of fitness, averaged over 10 runs
diabetes_2_nftr_ave.{eps,gif} Average number of features per individual, averaged over 10 runs
diabetes_2_nnode_ave.{eps,gif} Average number of nodes per individual, averaged over 10 runs
diabetes_2_nint_ave.{eps,gif} Average number of introns per individual, averaged over 10 runs
diabetes_2_ntrl_ave.{eps,gif} Average number of RAT trials per individual, averaged over 10 runs
diabetes_2_optimp_ave.{eps,gif} Average improvement in fitness due to optimisation, averaged over 10 runs
diabetes_2_opprob_ave.{eps,gif} Average probability of each genetic operator, averaged over 10 runs
diabetes_2_run_x.dat Binary data file containing results of run x (read by Matlab functions)
diabetes_2_bor_x.prep 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Best-of-run individual for run x
diabetes_2_run_x.corr 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Feature correlation file for run x
diabetes_2_tst_bor_x.pred 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Test set predictions for best-of-run individual from run x
diabetes_2_bogf_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Best-of-generation Fitness for run x
diabetes_2_bogv_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Best-of-generation Validation Set Error for run x
diabetes_2_avef_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Average fitness for run x
diabetes_2_stdf_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Standard deviation of fitness for run x
diabetes_2_nftr_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Average number of features per individual for run x
diabetes_2_nnode_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Average number of nodes per individual for run x
diabetes_2_nint_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Average number of introns per individual for run x
diabetes_2_ntrl_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Average number of RAT trials per individual for run x
diabetes_2_optimp_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Average improvement in fitness due to optimisation for run x
diabetes_2_opprob_x.{eps,gif} 
x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Average probability of each genetic operator for run x