• The Croatian Language Corpus (CLC; Croatian: Hrvatski jezični korpus, HJK) is a corpus of Croatian compiled at the Institute of Croatian Language and...
    5 KB (481 words) - 04:59, 10 November 2024
  • Thumbnail for Croatian language
    Croatian is the standard variety of the Serbo-Croatian language mainly used by Croats. It is the national official language and literary standard of Croatia...
    51 KB (4,838 words) - 15:28, 22 June 2025
  • The Lancaster-Oslo/Bergen (LOB) Corpus is a one-million-word collection of British English texts which was compiled in the 1970s in collaboration between...
    3 KB (230 words) - 02:09, 26 March 2025
  • Corpus linguistics is an empirical method for the study of language by way of a text corpus (plural corpora). Corpora are balanced, often stratified collections...
    20 KB (2,335 words) - 10:40, 25 June 2025
  • The Enron Corpus is a database of over 600,000 emails generated by 158 employees of the Enron Corporation in the years leading up to the company's collapse...
    7 KB (725 words) - 03:40, 16 April 2025
  • multi-million corpus of Croatian started to appear even earlier. The Croatian National Corpus is compiled from selected texts written in Croatian covering...
    4 KB (477 words) - 02:24, 9 November 2024
  • Thumbnail for Brown Corpus
    everyday language use. Compiled by Henry Kučera and W. Nelson Francis at Brown University, in Rhode Island, it is a general language corpus containing...
    11 KB (1,270 words) - 02:43, 26 March 2025
  • Corpus (OEC) is a text corpus of 21st-century English, used by the makers of the Oxford English Dictionary and by Oxford University Press' language research...
    4 KB (348 words) - 21:01, 11 January 2025
  • International Corpus is used to inform Cambridge University Press English Language Teaching publications as well as for research in corpus linguistics....
    8 KB (1,028 words) - 00:21, 18 January 2025
  • Thumbnail for Feast of Corpus Christi
    Christi (Tapetes de Corpus Christi) are made of different materials such as coffee grounds, flowers, sand, and salt. In Croatian language, there are various...
    48 KB (5,104 words) - 16:30, 12 July 2025
  • Thumbnail for Corpus separatum (Fiume)
    Corpus separatum, a Latin term meaning "separated body", refers to the status of the City of Fiume (modern Rijeka, Croatia) while given a special legal...
    19 KB (1,489 words) - 03:55, 16 April 2025
  • Thumbnail for Quranic Arabic Corpus
    part of the Arabic language computing research group within the School of Computing, supervised by Eric Atwell. The annotated corpus includes: A manually...
    7 KB (623 words) - 20:09, 21 July 2025
  • PropBank (redirect from PropBank Corpus)
    is a corpus that is annotated with verbal propositions and their arguments—a "proposition bank". Although "PropBank" refers to a specific corpus produced...
    4 KB (390 words) - 18:00, 28 June 2025
  • computational linguists whose goal was a corpus of modern (at the time of building the corpus), naturally occurring language in the form of speech and text or...
    31 KB (3,894 words) - 01:18, 14 June 2024
  • Cool, a 2019 Nigerian TV series Croatian Language Corpus, a text corpus compiled at the Institute of Croatian Language and Linguistics International Convention...
    6 KB (676 words) - 05:04, 5 July 2024
  • official languages of the ten new member states have been added to the corpus data. The latest release (2012) comprised up to 60 million words per language with...
    6 KB (800 words) - 11:02, 15 September 2022
  • The American National Corpus (ANC) is a text corpus of American English containing 22 million words of written and spoken data produced since 1990. Currently...
    5 KB (605 words) - 10:56, 26 January 2025
  • Thumbnail for Dialects of Serbo-Croatian
    The dialects of Serbo-Croatian include the vernacular forms and standardized sub-dialect forms of Serbo-Croatian as a whole or as part of its standard...
    56 KB (6,275 words) - 17:35, 18 July 2025
  • compositions) (a Croatian–Italian manuscript dictionary). 1649 – Jakov Mikalja, Blago jezika slovinskoga (Treasury of the Slavic language) (containing selected...
    4 KB (428 words) - 01:09, 10 February 2025
  • The Corpus of Contemporary American English (COCA) is a one-billion-word corpus of contemporary American English. It was created by Mark Davies, retired...
    9 KB (1,135 words) - 14:04, 24 May 2025
  • Thumbnail for Croatia
    conquest, the Croatian Parliament elected Ferdinand I of Austria to the Croatian throne. In October 1918, the State of Slovenes, Croats, and Serbs, independent...
    227 KB (20,964 words) - 21:45, 17 July 2025
  • Serbo-Croatian standards of Bosnian, Montenegrin and Serbian which liberally draw on Turkish, Latin, Greek, Russian and English loanwords. Croatian literature...
    22 KB (2,524 words) - 15:19, 21 May 2025
  • Bank of English (category Corpus linguistics stubs)
    English (BoE) is a representative subset of the 4.5 billion words COBUILD corpus, a collection of English texts. These are mainly British in origin, but...
    1 KB (153 words) - 18:12, 28 June 2025
  • Thumbnail for Demographics of Croatia
    last speaker died in 1898. Croatian replaced Latin as the official language of the Croatian government in 1847. The Croatian lect is generally viewed as...
    164 KB (9,556 words) - 13:20, 19 July 2025
  • Thumbnail for Slavomolisano
    Slavic or Molise Croatian (Croatian: Moliški hrvatski; Italian: croato molisano), is a variety of Shtokavian Croatian spoken by Italian Croats in three villages...
    30 KB (2,917 words) - 10:42, 21 June 2025
  • learner's dictionary Collins COBUILD English Language Dictionary, based on the study of the COBUILD corpus and first published in 1987. A collection of...
    2 KB (181 words) - 18:11, 28 June 2025
  • List of text corpora (category Corpus linguistics)
    RusAge: Corpus for Age-Based Text Classification Bulgarian National Corpus Macedonian Electronic Corpus Croatian Language Corpus Croatian National Corpus Slovenian...
    23 KB (2,460 words) - 20:27, 20 June 2025
  • Thumbnail for Serbian language
    the Serbo-Croatian language mainly used by Serbs. It is the official and national language of Serbia, one of the three official languages of Bosnia and...
    49 KB (4,448 words) - 15:28, 22 June 2025
  • the Croatian variety of Serbo-Croatian, which has historically been more stringent to internationalisms. Out of all four varieties of the language, Bosnian...
    24 KB (1,237 words) - 23:11, 23 March 2025
  • National Corpus of Polish (Polish : Narodowy Korpus Języka Polskiego NKJP) is the biggest and the most important corpus of the Polish language. A linguistic...
    4 KB (462 words) - 19:14, 8 July 2023