Part of speech and tagging in Computational Linguistics
Abstract
The categorization of words according to features that determine the position they occupy in the language system is a formal requirement of any grammatical description. In Computational Linguistics, tagging is the assignment of categories to portions of a text. The objective of this paper is to discuss, in the context of Computational Linguistics, the source of linguistic information in POS tagging – part of speech, in English. As we present a critical view of this process, it becomes clear that the linguist has a very relevant part to play in the elaboration theoretically sound tagsets for Natural Language Processing. We focus, in particular, three part of speech related language phenomena that have notoriously been overlooked in linguistic studies: the participle verb form, the denotative words, and the appositive.
Key words: tagset, participle, appositive, denotative words, computational linguistics, NLP.Downloads
Published
How to Cite
Issue
Section
License
I grant the journal Calidoscópio the first publication of my article, licensed under Creative Commons Attribution license (which allows sharing of work, recognition of authorship and initial publication in this journal).
I confirm that my article is not being submitted to another publication and has not been published in its entirely on another journal. I take full responsibility for its originality and I will also claim responsibility for charges from claims by third parties concerning the authorship of the article.
I also agree that the manuscript will be submitted according to the journal’s publication rules described above.