About

A corpus is a computer-held collection of texts that provides easily accessible and accurate data for researchers studying a particular language. The METU NCC Spoken Turkish Cypriot Dialect Corpus Project aims to establish a linguistically analyzed and tagged corpus of Spoken Turkish Cypriot Dialect, which will be available online to researchers around the world.

The METU NCC Spoken Turkish Cypriot Dialect Corpus Project aims to collect samples of present-day spoken language from North Cyprus in order to construct a comprehensive, valid and reliable corpus of Spoken Turkish Cypriot Dialect, long awaited by researchers in language studies. To this end, the language samples will be selected from diverse spoken texts representing a wide range of genres, registers and dialects. A demo version of the METU NCC Turkish Cypriot Corpus will soon be available online. A more comprehensive version of the Corpus is planned for January 2012. In the following years, the corpus will reach a size of 250,000 words.