Structured Information Management: Processing and Retrieval


SIMPR - 2083

Keywords document handling, automatic indexing, automatic subject classification


Start Date: 01-JAN-89 / Duration: 42 months

[ contact / participants ]


Objectives and Approach

The SIMPR project developed new techniques for the management of text stored in very large information banks, such as text published on optical media. The main stages of the project were:

Results

A first prototype system for the semi-automatic indexing of technical texts was designed and built. The indexing prototype processes texts to extract "analytics", that is, terms that accurately reflect the meaning of a text. These analytics are optionally validated by the user and then stored in indexes for subsequent search against keywords specified in a request for information. The prototype incorporates advanced linguistic software, performing morphological and syntactic analysis and disambiguation of texts. The results from the project include:

The final output of the project is a set of tools for creating and managing large information banks, such as an authoring system for optical information stores. SIMPR resembles a hypertext system in its ability to link items of information at different levels, but it aids the user by establishing links automatically, according to subject analysis of texts and conformance with user-specified information models.

Exploitation

Application areas in which software could be developed to use this system and its component techniques include: authoring CD-ROMs and constructing a graphics interface to guide the reader through them; indexing and structuring large textual information banks, such as technical manuals; and the automatic indexing of documents destined for storage in technical archives.


CONTACT POINT

Mr Godfrey Smarti
Peingown
Kilmuir
UK - Isle of Skye IV51 9UB
tel: +44/47.052.243
fax: +44/47.052.303
email: 100114.243@compuserve.com

Participants

CRI A/S - DK - C
NOKIA HEAD OFFICE - SF - P
RESEARCH UNIT FOR COMPUTATIONAL
LINGUISTICS - SF - P
UNIVERSITY COLLEGE DUBLIN - IRL - P
UNIVERSIDADE CATOLICA PORTUGESA - P - P
UNIVERSITY OF STRATHCLYDE - UK - P
CAP GEMINI INT. SUPPORT BV - NL - P
DUBLIN CITY UNIVERSITY - IRL - A
TNO INSTITUTE - NL - A


TBP synopses home page TBP acronym index TBP number index
All synopses home page all acronyms index all numbers index

SIMPR - 2083, December 1993


please address enquiries to the ESPRIT Information Desk

html version of synopsis by Nick Cook