Accessing the high-throughput screening data landscape

Daniel P. Russo, Hao Zhu

Research output: Contribution to journalArticlepeer-review

1 Scopus citations


The progress of high-throughput screening (HTS) techniques is changing the chemical data landscape by producing massive biological data from tested compounds. Public data repositories (e.g., PubChem) receive HTS data provided by various institutes and this data pool is being updated on a daily basis. The goal of these data sharing efforts is to let users quickly obtain the biological data of target compounds. Without a universal chemical identifier, the repositories (e.g., PubChem) provide users various methods to query and retrieve chemical properties and biological data by several different chemical identifiers (e.g., SMILES, InChIKey, and IUPAC name). The major challenge for most users, especially computational modelers, is obtaining the biological data for a large dataset of compounds (e.g., thousands of drug molecules) instead of a single compound. This chapter aims to introduce the steps to access the public data repositories for target compounds with specific emphasis on the automatic data downloading for large datasets.

Original languageEnglish (US)
Pages (from-to)153-159
Number of pages7
JournalMethods in Molecular Biology
StatePublished - Jan 1 2016
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Molecular Biology
  • Genetics


Dive into the research topics of 'Accessing the high-throughput screening data landscape'. Together they form a unique fingerprint.

Cite this