INFORMATION EXTRACTION FROM CHEMICAL PATENTS

Authors

  • Sandra Bergmann Forschungszentrum J¨ulich GmbH, 52425 J¨ulich
  • Mathilde Romberg Forschungszentrum J¨ulich GmbH, 52425 J¨ulich

DOI:

https://doi.org/10.7494/csci.2012.13.2.21

Abstract

The development of new chemicals or pharmaceuticals is preceded by an indepth analysis of published patents in this field. This information retrieval is a costly and time inefficient step when done by a human reader, yet it is mandatory for potential success of an investment. The goal of the research project UIMA-HPC is to automate and hence speed-up the process of knowledge mining about patents. Multi-threaded analysis engines, developed according to UIMA (Unstructured Information Management Architecture) standards, process texts and images in thousands of documents in parallel. UNICORE (UNiform Interface to COmputing Resources) workflow control structures make it possible to dynamically allocate resources for every given task to gain best cpu-time/realtime ratios in an HPC environment.

Downloads

Download data is not yet available.

Downloads

Published

2012-09-23

How to Cite

Bergmann, S., & Romberg, M. (2012). INFORMATION EXTRACTION FROM CHEMICAL PATENTS. Computer Science, 13(2), 21. https://doi.org/10.7494/csci.2012.13.2.21

Issue

Section

Articles