Accéder au contenu
Merck

Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data.

BMC medical genomics (2013-01-30)
Kristina M Hettne, André Boorsma, Dorien A M van Dartel, Jelle J Goeman, Esther de Jong, Aldert H Piersma, Rob H Stierum, Jos C Kleinjans, Jan A Kors
RÉSUMÉ

Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect.

MATÉRIAUX
Référence du produit
Marque
Description du produit

Sigma-Aldrich
Sulfate de zinc heptahydraté, ACS reagent, 99%
Sigma-Aldrich
Sulfate de zinc heptahydraté, ReagentPlus®, ≥99.0%
Sigma-Aldrich
Sulfate de zinc heptahydraté, puriss. p.a., ACS reagent, reag. ISO, reag. Ph. Eur., ≥99.5%
Sigma-Aldrich
Sulfate de zinc heptahydraté, BioReagent, suitable for cell culture
Sigma-Aldrich
Zinc sulfate solution, 0.3 N
Sigma-Aldrich
Sulfate de zinc heptahydraté, BioUltra, for molecular biology, 2.0 M in H2O
Sigma-Aldrich
Sulfate de zinc heptahydraté, ≥99.95% trace metals basis
Sigma-Aldrich
Sulfate de zinc heptahydraté, suitable for plant cell culture
Sigma-Aldrich
Sulfate de zinc heptahydraté, meets USP testing specifications