TABLE 6

Details on the MDR data modeling grid, i.e., performance of various modeling methods versus descriptors

Some combinations were not evaluated because the results obtained with CDK descriptors and the same modeling methods made it apparent that they would not match or outperform the baseline model. Note that, unlike the HLM and RRCK datasets, the MDR dataset was divided into only two bins.

	SVM	RP Forest Uni Class	RP Forest	C5.0
CDK
κ	0.31	0.36	0.33	0.62
Sensitivity	0.79	0.68	0.64	0.85
Specificity	0.52	0.67	0.70	0.77
PPV	0.68	0.73	0.73	0.83
MOE2D and SMARTS Keys	Not evaluated	Not evaluated	Not evaluated
κ				0.67
Sensitivity				0.86
Specificity				0.80
PPV				0.85 (baseline)
CDK and SMARTS Keys	Not evaluated	Not evaluated	Not evaluated
κ				0.65
Sensitivity				0.86
Specificity				0.78
PPV				0.84

PPV, positive predictive value.
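
For readers who want to reproduce metrics of this kind, the following is a minimal sketch of how κ (assumed here to be Cohen's kappa), sensitivity, specificity, and PPV can be computed from a two-bin confusion matrix using their standard definitions. The function name `binary_metrics` and the example counts are hypothetical and purely illustrative; they are not taken from the MDR dataset or from the models in the table.

```python
def binary_metrics(tp, fp, fn, tn):
    """Standard two-class metrics from confusion-matrix counts.

    tp, fp, fn, tn: true positives, false positives,
    false negatives, and true negatives.
    """
    n = tp + fp + fn + tn
    sensitivity = tp / (tp + fn)   # fraction of actual positives recovered
    specificity = tn / (tn + fp)   # fraction of actual negatives recovered
    ppv = tp / (tp + fp)           # fraction of predicted positives that are correct

    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_observed = (tp + tn) / n
    p_chance = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    kappa = (p_observed - p_chance) / (1 - p_chance)

    return {
        "kappa": kappa,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
    }


# Hypothetical counts for illustration only (not data from the study).
metrics = binary_metrics(tp=85, fp=20, fn=15, tn=80)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Because κ subtracts the agreement expected by chance, it can be much lower than sensitivity or PPV when a model favors one bin, which is one way to read the gap between κ and the other metrics in the CDK rows.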