TY - JOUR
T1 - Design, synthesis and experimental validation of novel potential chemopreventive agents using random forest and support vector machine binary classifiers
AU - Sprague, Brienne
AU - Shi, Qian
AU - Kim, Marlene T.
AU - Zhang, Liying
AU - Sedykh, Alexander
AU - Ichiishi, Eiichiro
AU - Tokuda, Harukuni
AU - Lee, Kuo Hsiung
AU - Zhu, Hao
N1 - Funding Information:
Acknowledgments We thank Kimberlee Moran, the Director of the Center for Forensic Science Research & Education, for her help with the manuscript preparation for the entire project. We also appreciate the help of Dr. Susan Morris-Natschke, Natural Product Research Laboratory, UNC Eshelman School of Pharmacy with proofreading and editing the manuscript. This investigation was also supported in part by NIH Grant CA 177584-01 from National Cancer Institute awarded to K.H. Lee. This study was also supported in part by the Taiwan Department of Health, China Medical University Hospital Cancer Research Center of Excellence (DOH100-TD-C-111-005).
PY - 2014/6
Y1 - 2014/6
N2 - Compared to the current knowledge on cancer chemotherapeutic agents, only limited information is available on the ability of organic compounds, such as drugs and/or natural products, to prevent or delay the onset of cancer. In order to evaluate chemical chemopreventive potentials and design novel chemopreventive agents with low to no toxicity, we developed predictive computational models for chemopreventive agents in this study. First, we curated a database containing over 400 organic compounds with known chemoprevention activities. Based on this database, various random forest and support vector machine binary classifiers were developed. All of the resulting models were validated by cross validation procedures. Then, the validated models were applied to virtually screen a chemical library containing around 23,000 natural products and derivatives. We selected a list of 148 novel chemopreventive compounds based on the consensus prediction of all validated models. We further analyzed the predicted active compounds by their ease of organic synthesis. Finally, 18 compounds were synthesized and experimentally validated for their chemopreventive activity. The experimental validation results paralleled the cross validation results, demonstrating the utility of the developed models. The predictive models developed in this study can be applied to virtually screen other chemical libraries to identify novel lead compounds for the chemoprevention of cancers.
AB - Compared to the current knowledge on cancer chemotherapeutic agents, only limited information is available on the ability of organic compounds, such as drugs and/or natural products, to prevent or delay the onset of cancer. In order to evaluate chemical chemopreventive potentials and design novel chemopreventive agents with low to no toxicity, we developed predictive computational models for chemopreventive agents in this study. First, we curated a database containing over 400 organic compounds with known chemoprevention activities. Based on this database, various random forest and support vector machine binary classifiers were developed. All of the resulting models were validated by cross validation procedures. Then, the validated models were applied to virtually screen a chemical library containing around 23,000 natural products and derivatives. We selected a list of 148 novel chemopreventive compounds based on the consensus prediction of all validated models. We further analyzed the predicted active compounds by their ease of organic synthesis. Finally, 18 compounds were synthesized and experimentally validated for their chemopreventive activity. The experimental validation results paralleled the cross validation results, demonstrating the utility of the developed models. The predictive models developed in this study can be applied to virtually screen other chemical libraries to identify novel lead compounds for the chemoprevention of cancers.
UR - http://www.scopus.com/inward/record.url?scp=84902785117&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84902785117&partnerID=8YFLogxK
U2 - 10.1007/s10822-014-9748-9
DO - 10.1007/s10822-014-9748-9
M3 - Article
C2 - 24840854
AN - SCOPUS:84902785117
SN - 0920-654X
VL - 28
SP - 631
EP - 646
JO - Journal of Computer-Aided Molecular Design
JF - Journal of Computer-Aided Molecular Design
IS - 6
ER -