A Corpus-based evaluation of lexical components of a domain-specific text to Knowledge Mapping prototype

Shams, Rushdi and Elsayed, Adel (2008) A Corpus-based evaluation of lexical components of a domain-specific text to Knowledge Mapping prototype. ICCIT 2008. 11th International Conference on Computer and Information Technology. IEEE, pp. 242-247. ISBN 978-1-4244-2135-0 (Submitted)

Abstract

The aim of this paper is to evaluate the lexical components of a Text to Knowledge Mapping (TKM) prototype. The prototype is domain-specific; its purpose is to map instructional text onto a knowledge domain. The knowledge domain of the prototype is physics, specifically DC electrical circuits. During development, the prototype was tested with a limited data set from the domain. It has now reached a stage where it needs to be evaluated with a representative linguistic data set, called a corpus. A corpus is a collection of text drawn from typical sources that can be used as a test data set to evaluate NLP systems. As no corpus was available for the domain, we developed a representative corpus and annotated it with linguistic information. The evaluation considers one of the prototype's two main components, the lexical knowledge base. Using the corpus, the evaluation enriches lexical knowledge resources such as vocabulary and grammar structure. This enables the prototype to parse a reasonable number of the sentences in the corpus.

Item Type: Book Section
Additional Information: This is an electronic version of the paper given at the 11th International Conference on Computer and Information Technology and published in ICCIT 2008. 11th International Conference on Computer and Information Technology, December 2008, Khulna, pp. 242-247. ICCIT can be found at http://www.iccitbd.net/
Divisions: School of Creative Technologies > Games Computing and software engineering
Depositing User: Scott Wilson
Date Deposited: 26 Nov 2013 12:50
Last Modified: 14 Jan 2014 11:49
Identification Number: 10.1109/ICCITECHN.2008.4803005
URI: http://ubir.bolton.ac.uk/id/eprint/245
