Support vector machines for the estimation of aqueous solubility

J Chem Inf Comput Sci. 2003 Nov-Dec;43(6):1855-9. doi: 10.1021/ci034107s.

Abstract

Support vector machines (SVMs) are used to estimate the aqueous solubility of organic compounds. An SVM equipped with a Tanimoto similarity kernel estimates solubility with an accuracy comparable to that of other reported methods applied to the same data sets. Complete cross-validation on a diverse data set gave a root-mean-squared error of 0.62 and R² = 0.88. The input to the machine consists of molecular fingerprints; no physical parameters are explicitly involved in the calculations.
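As an illustration of the kernel named in the abstract, the following is a minimal sketch of a Tanimoto similarity computed on binary fingerprints, together with the kernel (Gram) matrix an SVM would consume. It is not the authors' implementation. The representation of a fingerprint as a set of on-bit indices, the function names `tanimoto` and `gram_matrix`, the toy fingerprints, and the convention for two empty fingerprints are all assumptions made for this example.

```python
from typing import FrozenSet, List, Sequence

# Assumption: a fingerprint is stored as the set of bit positions that are set to 1.
Fingerprint = FrozenSet[int]


def tanimoto(a: Fingerprint, b: Fingerprint) -> float:
    """Tanimoto similarity: shared on-bits divided by on-bits in either fingerprint."""
    union = len(a | b)
    if union == 0:
        # Convention assumed here: two all-zero fingerprints count as identical.
        return 1.0
    return len(a & b) / union


def gram_matrix(xs: Sequence[Fingerprint], ys: Sequence[Fingerprint]) -> List[List[float]]:
    """Pairwise kernel values K[i][j] = tanimoto(xs[i], ys[j])."""
    return [[tanimoto(x, y) for y in ys] for x in xs]


# Hypothetical toy fingerprints, not taken from the paper's data sets.
fps = [
    frozenset({1, 4, 7, 9}),
    frozenset({1, 4, 8}),
    frozenset({2, 3}),
]

K = gram_matrix(fps, fps)
for row in K:
    print(["%.2f" % value for value in row])
```

A matrix of this form could, for example, be handed to a support vector regression implementation that accepts a precomputed kernel, such as scikit-learn's `SVR(kernel="precomputed")`; the abstract does not state which SVM software was used.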