0

Analysing Names of Organic Chemical Compounds

From Morpho-Semantics to SMILES Strings and Classes

Erschienen am 11.07.2012, 1. Auflage 2012
Bibliografische Daten
ISBN/EAN: 9783639440317
Sprache: Englisch
Umfang: 116 S.
Format (T/L/B): 0.7 x 22 x 15 cm
Einband: kartoniertes Buch

Beschreibung

Revision with unchanged content. The growing amount of data in the life sciences requires computer-aided methods to make full use of valuable resources. The identification and understanding of chemical terminology is a key to automated biochemical text processing. The authors present a linguistically motivated approach to analyse (semi-)systematic and underspecified names of organic chemical compounds. A morpho-semantic analysis is obtained via a Prolog grammar developed according to IUPAC nomenclature rules. This results in a detailed intermediate semantic representation coding the information about the compound structure which is contained in a name. The system described provides SMILES strings mapping names to their corresponding molecular structure and computes the chemical classes which the analysed term belongs to. This book gives an introduction to topics relevant for the linguistic analysis of biochemical terminology and presents a specific system to serve as a basis for future research in the BioNLP domain. It is directed towards researchers in the fields of life science, biochemistry, linguistics, and computer science dedicating their work to (semi-)automatic information processing.

Autorenportrait

Dipl.-Ling.: Studies of Computational Linguistics at the IMS, University of Stuttgart. Researcher at the European Academy Bozen/Bolzano, Italy.Gerhard Kremer, Dipl.-Ling.: Studies of Computational Linguistics at the IMS, University of Stuttgart. Researcher at the Center for Mind/Brain Sciences, University of Trento, Italy.