The Diorisis Ancient Greek Corpus

Published version
Peer-reviewed

Type

Article

Authors

Vatri, A 
McGillivray, Barbara (ORCID: https://orcid.org/0000-0003-3426-8200)

Abstract

The Diorisis Ancient Greek Corpus is a digital collection of ancient Greek texts (from Homer to the early fifth century AD) compiled for linguistic analyses, and specifically with the purpose of developing a computational model of semantic change in Ancient Greek. The corpus consists of 820 texts sourced from open access digital libraries. The texts have been automatically enriched with morphological information for each word. The automatic assignment of words to the correct dictionary entry (lemmatization) has been disambiguated with the implementation of a part-of-speech tagger (a computer programme that may select the part of speech to which an ambiguous word belongs).

Keywords

4704 Linguistics, 4705 Literary Studies, 43 History, Heritage and Archaeology, 46 Information and Computing Sciences, 47 Language, Communication and Culture, 4303 Historical Studies

Journal Title

Research Data Journal for the Humanities and Social Sciences

Journal ISSN

2452-3666

Volume

3

Publisher

Brill

Sponsorship

Alan Turing Institute (EP/N510129/1)