The Evolutionary Pre-Processor (version 3.0)
Batch Run Report

 

Batch Details
Batch run name smoking_1
Description Smoking 1 on Atlas
This report file final/smoking_1/smoking_1_report.html
data file /home/cssip/jsherrah/EPrep3/data/smoking_1.dat
Time of completion Fri May 15 13:41:11 1998
Duration of Batch 49 hours, 7 minutes, 45 seconds
Random Seed 547392136 (from clock)
Average generations per run 13.20
Average failed feature creations per run 2719.60
Average fitness evaluations per run 65597.30
Test Set Improvement 32.87 %
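
The Test Set Improvement figure appears to equal the difference, in percentage points, between the lowest original test error reported under Summary of Results (MDTM, 64.06) and the test error of the best-ever individual (run 4, 31.19). This reading is inferred from the reported numbers and is not a documented EPrep formula; the variable names in the sketch below are hypothetical.

```python
# Assumption (inferred from the report, not documented by EPrep):
# improvement = lowest original test error - best-ever individual's test error.
original_test_errors = {"ML": 65.59, "PPD": 88.53, "MDTM": 64.06}
best_ever_test_error = 31.19  # best-of-run individual from run 4

best_original = min(original_test_errors.values())
improvement = round(best_original - best_ever_test_error, 2)

print(f"Test set improvement: {improvement:.2f} percentage points")
```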
 
 

Data Partition
 
Class Training Validation Test Total
0 75 37 39 151
1 360 179 180 719
2 992 497 496 1985
Total 1427 713 715 2855
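
As a consistency check on the partition table, the following sketch recomputes the row and column totals and the share of samples in each set. The roughly 50/25/25 split is an observation from these counts, not a stated EPrep setting, and the variable names are hypothetical.

```python
# Counts per class as (training, validation, test), copied from the table above.
partition = {
    0: (75, 37, 39),
    1: (360, 179, 180),
    2: (992, 497, 496),
}

class_totals = {cls: sum(counts) for cls, counts in partition.items()}
set_totals = [sum(col) for col in zip(*partition.values())]
grand_total = sum(set_totals)

# Percentage of all samples that falls in each set.
set_shares = [round(100 * t / grand_total, 2) for t in set_totals]

for name, total, share in zip(("Training", "Validation", "Test"), set_totals, set_shares):
    print(f"{name}: {total} samples ({share:.2f} %)")
```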

Summary of Results

  Original Classification Errors (%)
Classifier Training Validation Test
Maximum Likelihood(ML) 58.79 62.83 65.59
Parallelepiped(PPD) 88.72 87.24 88.53
Min. Distance to Means(MDTM) 60.69 62.69 64.06

  EPrep Best-of-run Classification Errors (%)
Run Training Validation Test McNemar confidence # Features # Inputs # Nodes Classifier
1 30.34 30.58 30.91 1.000 1 9 379 PPD
2 29.85 31.00 94.13 0.000 4 7 252 PPD
3 30.20 30.43 30.63 1.000 5 6 81 PPD
4 29.85 30.01 31.19 1.000 5 9 855 PPD
5 30.41 30.43 30.77 1.000 2 4 27 PPD
6 30.27 30.72 74.55 0.000 3 9 493 PPD
7 30.34 30.58 94.41 0.000 1 3 21 PPD
8 30.34 30.15 30.49 1.000 2 9 912 PPD
9 30.27 30.43 74.69 0.000 6 9 539 PPD
10 30.34 30.15 30.91 1.000 3 4 28 MDTM
Ave. 30.22 (0.19) 30.45 (0.28) 52.27 (27.00) 0.60 (0.49) 3.20 (1.66) 6.90 (2.34) 358.70 (320.21) PPD
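
The values in the "Ave." row are consistent with a plain mean over the 10 runs, with the figure in parentheses being the population standard deviation (dividing by 10 rather than 9). That interpretation is inferred from the numbers; the report does not state it. A minimal check for the Test column:

```python
from statistics import mean, pstdev

# Test-set errors (%) for runs 1 to 10, copied from the table above.
test_errors = [30.91, 94.13, 30.63, 31.19, 30.77,
               74.55, 94.41, 30.49, 74.69, 30.91]

avg = mean(test_errors)
# pstdev is the population standard deviation (denominator n, not n - 1).
spread = pstdev(test_errors)

print(f"Ave. {avg:.2f} ({spread:.2f})")
```

The large spread reflects the four runs (2, 6, 7 and 9) whose test error is far above their training and validation errors.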

  Confusion matrix for Best Ever Individual from run 4
Ground Truth (rows) vs. Predicted (columns)
Class 1 2 3 Total
1 0 0 39 39
2 2 2 176 180
3 2 4 490 496
Total 4 6 705 715
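
The confusion matrix can be tied back to the run 4 test error in the best-of-run table: the error is the fraction of test samples off the diagonal. The sketch below is an illustrative check with hypothetical variable names, not part of EPrep.

```python
# Rows are ground-truth classes 1 to 3, columns are predicted classes 1 to 3.
confusion = [
    [0, 0, 39],
    [2, 2, 176],
    [2, 4, 490],
]

total = sum(sum(row) for row in confusion)
correct = sum(confusion[i][i] for i in range(len(confusion)))
predicted_totals = [sum(col) for col in zip(*confusion)]

# Misclassified samples as a percentage of the test set.
error_pct = 100 * (total - correct) / total

print(f"Test error: {error_pct:.2f} % ({total - correct} of {total} misclassified)")
```

The result agrees with the 31.19 % test error listed for run 4.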

Average Operator Probabilities
Operator Average Probability
Delete-Feature Mutation 0.101
Add-Feature Mutation 0.101
Hoist Mutation 0.099
Truncate Mutation 0.100
Swap Mutation 0.101
One-Symbol Mutation 0.099
All-Nodes Mutation 0.099
One-Node Mutation 0.100
Grow Mutation 0.098
High-Level Crossover 0.102
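
The ten averaged operator probabilities sum to 1.000 at the reported precision, which is consistent with them forming a probability distribution over the operators, and none differs from 0.1 by more than 0.002. The sketch below only verifies that arithmetic; how EPrep adapts these probabilities is not described in this report.

```python
# Average operator probabilities, copied from the table above.
operator_probs = {
    "Delete-Feature Mutation": 0.101,
    "Add-Feature Mutation": 0.101,
    "Hoist Mutation": 0.099,
    "Truncate Mutation": 0.100,
    "Swap Mutation": 0.101,
    "One-Symbol Mutation": 0.099,
    "All-Nodes Mutation": 0.099,
    "One-Node Mutation": 0.100,
    "Grow Mutation": 0.098,
    "High-Level Crossover": 0.102,
}

prob_sum = sum(operator_probs.values())
uniform = 1 / len(operator_probs)
max_deviation = max(abs(p - uniform) for p in operator_probs.values())

print(f"Sum: {prob_sum:.3f}, largest deviation from uniform: {max_deviation:.3f}")
```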
 

Number of Run Terminations attributed to each Criterion
Termination Criterion Number of Terminations
TP Criterion 0
GL Criterion 10
Max. Generations 0
Client Abort 0
Zero Validation Error 0
Total 10
 
 

Related Data Files
Filename Description
smoking_1_gen.m Matlab plot generation function
smoking_1_bogf_ave.{eps,gif} Best-of-generation Fitness, averaged over 10 runs
smoking_1_bogv_ave.{eps,gif} Best-of-generation Validation Set Error, averaged over 10 runs
smoking_1_avef_ave.{eps,gif} Average fitness, averaged over 10 runs
smoking_1_stdf_ave.{eps,gif} Standard deviation of fitness, averaged over 10 runs
smoking_1_nftr_ave.{eps,gif} Average number of features per individual, averaged over 10 runs
smoking_1_nnode_ave.{eps,gif} Average number of nodes per individual, averaged over 10 runs
smoking_1_nint_ave.{eps,gif} Average number of introns per individual, averaged over 10 runs
smoking_1_ntrl_ave.{eps,gif} Average number of RAT trials per individual, averaged over 10 runs
smoking_1_optimp_ave.{eps,gif} Average improvement in fitness due to optimisation, averaged over 10 runs
smoking_1_opprob_ave.{eps,gif} Average probability of each genetic operator, averaged over 10 runs
smoking_1_run_x.dat Binary data file containing results of run x (read by Matlab functions)
For each file below, x = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10:
smoking_1_bor_x.prep Best-of-run individual for run x
smoking_1_run_x.corr Feature correlation file for run x
smoking_1_tst_bor_x.pred Test set predictions for best-of-run individual from run x
smoking_1_bogf_x.{eps,gif} Best-of-generation Fitness for run x
smoking_1_bogv_x.{eps,gif} Best-of-generation Validation Set Error for run x
smoking_1_avef_x.{eps,gif} Average fitness for run x
smoking_1_stdf_x.{eps,gif} Standard deviation of fitness for run x
smoking_1_nftr_x.{eps,gif} Average number of features per individual for run x
smoking_1_nnode_x.{eps,gif} Average number of nodes per individual for run x
smoking_1_nint_x.{eps,gif} Average number of introns per individual for run x
smoking_1_ntrl_x.{eps,gif} Average number of RAT trials per individual for run x
smoking_1_optimp_x.{eps,gif} Average improvement in fitness due to optimisation for run x
smoking_1_opprob_x.{eps,gif} Average probability of each genetic operator for run x