Handling issues with unknown words in POS Tagging

dc.contributor.authorJayaweera, A.J.P.M.P.en_US
dc.contributor.authorDias, N.G.J.en_US
dc.date.accessioned2014-12-24T07:45:47Z
dc.date.available2014-12-24T07:45:47Z
dc.date.issued2014
dc.description.abstractAppearance of unknown words is one of the frequently occurring problems facing in part of speech tagging process, i.e., the words that appear in sentences, but are not contained within the lexicon. New words are continually coined to the language, and people will often use words that a parsing system may not expect. This problem get worse when NLP systems are used for more and more on-line computer applications. Acronyms and proper names are created very often and new nouns and verbs are adding to the language in a surprising rate. So it is impossible to train the tagger for every possible word in the language. So unknown words are non-negligible in part of speech tagging and such unknown words should be handled by further processing with exceptional mechanism, since the statistical information or rules for those words are unknown. Therefore, in order to build a complete tagger, tagger must be incurred with some knowledge of suggesting the tag for an unknown word.en_US
dc.identifier.citationAnnual Research Symposium,Faculty of Graduate Studies, University of Kelaniya, Sri Lanka; 2014 :141pen_US
dc.identifier.departmentStatistics & Computer Scienceen_US
dc.identifier.urihttp://repository.kln.ac.lk/handle/123456789/4941
dc.publisherBook of Abstracts, Annual Research Symposium 2014en_US
dc.titleHandling issues with unknown words in POS Tagging
dc.typeArticleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections