Can a graded reader of authentic material be generated?

University essay from Institutionen för datavetenskap; Tekniska högskolan

Author: Kent Danielsson; [2013]


Abstract: The thesis investigates whether a graded reader for English, leveled to the CEFR levels using the English Vocabulary Profile (EVP) dictionary, can be generated from a corpus of authentic material. The approach was tested on Wikipedia and the ukWaC corpus. There were some problems in correctly matching the words in the EVP word lists with the tagged words of the corpora. The results show that it might be possible to find enough suitable texts to generate a graded reader for at least the higher CEFR levels if only lemmas are considered. If the POS tags must also match between the word list and the corpora, the errors were too large to give a conclusive answer.
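
The difference between lemma-only matching and lemma-plus-POS matching can be illustrated with a minimal sketch. It is not the thesis's implementation: the toy word list, the tag names, the rule of rating a text by its hardest matched word, and the function names `lemma_levels` and `text_level` are all assumptions made for illustration, and no real EVP data is used.

```python
# Illustrative sketch only; all data and names below are hypothetical.
CEFR_ORDER = ["A1", "A2", "B1", "B2", "C1", "C2"]

# Toy word list: (lemma, POS) -> CEFR level. Not real EVP entries.
TOY_WORD_LIST = {
    ("cat", "NOUN"): "A1",
    ("sit", "VERB"): "A1",
    ("book", "NOUN"): "A1",
    ("book", "VERB"): "B1",
}


def lemma_levels(word_list):
    """Collapse the list to lemma -> lowest level over all its POS entries."""
    levels = {}
    for (lemma, _pos), level in word_list.items():
        current = levels.get(lemma)
        if current is None or CEFR_ORDER.index(level) < CEFR_ORDER.index(current):
            levels[lemma] = level
    return levels


def text_level(tokens, word_list, use_pos):
    """Return (hardest matched level, share of tokens matched).

    tokens is a list of (lemma, pos) pairs from a tagged corpus.
    Assumption: a text is rated by its hardest matched word.
    """
    by_lemma = lemma_levels(word_list)
    matched = []
    for lemma, pos in tokens:
        level = word_list.get((lemma, pos)) if use_pos else by_lemma.get(lemma)
        if level is not None:
            matched.append(level)
    coverage = len(matched) / len(tokens) if tokens else 0.0
    hardest = max(matched, key=CEFR_ORDER.index) if matched else None
    return hardest, coverage


# Same tag scheme in corpus and word list: POS matching separates the two "book" entries.
same_tags = [("cat", "NOUN"), ("book", "VERB"), ("sit", "VERB")]
# Hypothetical tagger output "NN" that does not line up with the list's "NOUN".
other_tags = [("cat", "NN"), ("sit", "VERB")]

lemma_only = text_level(same_tags, TOY_WORD_LIST, use_pos=False)
with_pos = text_level(same_tags, TOY_WORD_LIST, use_pos=True)
mismatch = text_level(other_tags, TOY_WORD_LIST, use_pos=True)
```

In this toy setup, requiring the POS tag to match gives a finer-grained level for ambiguous lemmas such as "book", but any disagreement between the corpus tag set and the word list's labels turns into unmatched tokens, which is the kind of matching problem the abstract describes.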
