Part of speech and tagging in Computational Linguistics

Authors

  • Claudia Oliveira
  • Maria Claudia de Freitas

Abstract

The categorization of words according to features that determine the position they occupy in the language system is a formal requirement of any grammatical description. In Computational Linguistics, tagging is the assignment of categories to portions of a text. The objective of this paper is to discuss, in the context of Computational Linguistics, the source of the linguistic information used in part-of-speech (POS) tagging. As we present a critical view of this process, it becomes clear that the linguist has a very relevant part to play in the elaboration of theoretically sound tagsets for Natural Language Processing. We focus, in particular, on three part-of-speech-related phenomena that have notoriously been overlooked in linguistic studies: the participle verb form, denotative words, and the appositive.

Keywords: tagset, participle, appositive, denotative words, computational linguistics, NLP.

Published

2021-05-27

How to Cite

Oliveira, C., & Freitas, M. C. de. (2021). Part of speech and tagging in Computational Linguistics. Calidoscópio, 4(3), 179–188. Retrieved from https://revistas.unisinos.br/index.php/calidoscopio/article/view/6003