CHALLENGE: Excerpt vast amounts of chemical data for scientific database
An online database from a leading scientific publisher provides access to data about thousands of chemical substances and reactions as well as links to related information sources through an easily searchable interface. Information used in the database is culled from patents and from nearly 400 scientific journals. Each patent or journal article can contain information on up to 300 compounds.
Excerpting information about compounds from these data sources and entering it into the right fields in the database requires considerable expertise in chemistry and pharmacology. Even the most basic tasks require a master’s degree, while interpretations of more complex articles demand the expertise of a PhD.