TABLE 6

Details on the MDR data modeling grid, i.e., performance of various modeling methods versus descriptors

Some combinations were not evaluated because the results obtained with the CDK descriptors and these modeling methods made it apparent that they would not match or outperform the baseline model. Note that, unlike the HLM and RRCK datasets, the MDR dataset was divided into only two bins.

                          SVM            RP Forest Uni Class  RP Forest      C5.0
CDK
    κ                     0.31           0.36                 0.33           0.62
    Sensitivity           0.79           0.68                 0.64           0.85
    Specificity           0.52           0.67                 0.70           0.77
    PPV                   0.68           0.73                 0.73           0.83
MOE2D and SMARTS Keys     Not evaluated  Not evaluated        Not evaluated
    κ                                                                        0.67
    Sensitivity                                                              0.86
    Specificity                                                              0.80
    PPV                                                                      0.85 (baseline)
CDK and SMARTS Keys       Not evaluated  Not evaluated        Not evaluated
    κ                                                                        0.65
    Sensitivity                                                              0.86
    Specificity                                                              0.78
    PPV                                                                      0.84
  • PPV, positive predictive value.
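Because the MDR dataset was divided into two bins, each reported metric can be derived from a 2 × 2 confusion matrix. The following is a minimal sketch of those definitions, assuming that κ denotes Cohen's kappa; the class name `BinaryCounts`, the function names, and the example counts are hypothetical and are not taken from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BinaryCounts:
    """Cells of a 2 x 2 confusion matrix (hypothetical helper type)."""
    tp: int  # true positives
    fp: int  # false positives
    tn: int  # true negatives
    fn: int  # false negatives


def sensitivity(c: BinaryCounts) -> float:
    # Fraction of actual positives that were predicted positive.
    return c.tp / (c.tp + c.fn)


def specificity(c: BinaryCounts) -> float:
    # Fraction of actual negatives that were predicted negative.
    return c.tn / (c.tn + c.fp)


def ppv(c: BinaryCounts) -> float:
    # Positive predictive value: fraction of positive predictions that are correct.
    return c.tp / (c.tp + c.fp)


def cohen_kappa(c: BinaryCounts) -> float:
    # Agreement between predicted and actual classes, corrected for chance.
    n = c.tp + c.fp + c.tn + c.fn
    observed = (c.tp + c.tn) / n
    expected = (
        (c.tp + c.fp) * (c.tp + c.fn) + (c.tn + c.fn) * (c.tn + c.fp)
    ) / (n * n)
    return (observed - expected) / (1 - expected)


# Illustrative counts only; these are NOT data from the MDR dataset.
example = BinaryCounts(tp=40, fp=10, tn=40, fn=10)
print(f"kappa={cohen_kappa(example):.2f}")
print(f"sensitivity={sensitivity(example):.2f}")
print(f"specificity={specificity(example):.2f}")
print(f"PPV={ppv(example):.2f}")
```

Under these definitions, κ can be much lower than sensitivity or PPV when most of the observed agreement would also be expected by chance.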