Corpus compilation: Representativeness and the CORPOBRAS

Authors

  • Lúcia Pacheco de Oliveira
  • Maria Carmelita Padua Dias

Abstract

This paper discusses an important parameter in corpus design and compilation: representativeness. This parameter is related to the need to include in corpora texts that represent several uses of the language so that comprehensive descriptions can be developed. The paper also presents a corpus of Brazilian Portuguese – CORPOBRAS – that comprises 27 discourse genres and is guided by the representativeness parameter. The paper finally lists several corpus-based studies that draw upon CORPOBRAS data.

Key words: CORPOBRAS, corpus linguistics, genre variation, representativeness, oral and written discourse.

Published

2021-05-27

How to Cite

Oliveira, L. P. de, & Dias, M. C. P. (2021). Corpus compilation: Representativeness and the CORPOBRAS. Calidoscópio, 7(3), 192–198. Retrieved from https://revistas.unisinos.br/index.php/calidoscopio/article/view/4872