Skip to the content.

PubChem consists of three inter-linked databases, Substance, Compound and BioAssay. The Substance database contains chemical information deposited by individual data contributors to PubChem, and the Compound database stores unique chemical structures extracted from the Substance database. Biological activity data of chemical substances tested in assay experiments are contained in the BioAssay database. [https://doi.org/10.1093/nar/gkv951]

Index script for PubChem BioAssays json files

Elasticsearch server settings

Since some of the PubChem BioAssay json files are large they require to change few Elasticsearch default settings to higher values:

TODO: