New releases of two spoken corpora are out now: ORTOFON and ORATOR. Compared to the original releases, the total amount of the language material contained in the ORTOFON v2 and ORATOR v2 corpora has more than doubled.
New releases of two spoken corpora are out now: ORTOFON and ORATOR. Compared to the original releases, the total amount of the language material contained in the ORTOFON v2 and ORATOR v2 corpora has more than doubled.
We are proud to have published the monitor ONLINE corpora that map the Czech web, i.e. internet news, discussions and social networks from 2017 until present. The ONLINE corpora are compiled in cooperation with the Dataweps company, have more than six billion tokens and feature regular daily updates!
The Word at a Glance application has been enhanced with an entirely new operation mode. It shows comparison of word profiles of two or more words in a similar manner as SyD.
An update of Treq, the online tool for looking up translation equivalents, is out! Its database has been updated to release 12 of the InterCorp parallel corpus. Furthermore, you can now also search in translations from/to Spanish (in addition to Czech and English).
Mapka is an interactive map-based application for working with spoken dialectal corpora. It features various functions including a presentation of characteristic features of Czech dialectal areas illustrated by authentic speakers’ utterances.
CNC released a web application for browsing and comparing frequency lists. The Lists app offers interactive filtering based on four types of frequency information for each unit (word form or lemma) in a selected (sub)corpus.
We cordially invite everyone to our free corpus workshop on 2nd November 2019 at the Faculty of Arts CU. For more information see the registration form (in Czech only).
Calc is a brand new corpus calculator that complements the family of CNC web applications. It is divided into a number of user-friendly modules suitable for calculating typical statistical tasks commonly encountered in corpus research.
On the occasion of 25 years since the foundation of the Institute of the Czech National Corpus, a new web application has been released. Word at a Glance presents a quick and user-friendly way to get word profiles based entirely on corpus data.
We are proud to announce that Czech National Corpus has been officially recognized as a CLARIN K-centre in the area of corpus linguistics with emphasis on the empirical research of Czech.