Public Access to Corpus SYN2010
Brief instructions
Searching the SYN2010 corpus through the web interface is simple. Enter the query into the search field, then click the Search button. After a few moments, the result appears in the form of so-called concordance lines. Each line shows the part of the text in which the searched expression occurred. The result also shows the total number of occurrences of the searched expression in the SYN2010 corpus. Only the first 50 concordance lines are displayed, however.
You can use regular expressions in the query:
- dot (.) - represents any single character,
- asterisk (*) - represents any number (that is, 0-n) of repetitions of the previous character or expression,
- plus (+) - represents 1 or more repetitions of the previous character or expression,
- question mark (?) - represents zero or one occurrence of the previous character or expression,
- interval {n,k} - represents n to k repetitions of the previous character or expression; when k is omitted ({n,}), it represents a minimum of n repetitions; when the interval is in the form of {n}, it corresponds to exactly n repetitions.
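The operators listed above can be tried out offline. The following is a minimal sketch in Python, not part of the corpus interface: it assumes that a query has to match a whole word, which Python's `re.fullmatch` imitates, and the helper name `matches` and the sample words are hypothetical. Python's regular expression dialect may differ from the corpus search engine in details (for instance, Python does not accept a space inside the interval braces).

```python
import re


def matches(pattern: str, word: str) -> bool:
    """Return True if the pattern covers the entire word (hypothetical helper)."""
    return re.fullmatch(pattern, word) is not None


# (pattern, word, expected result) for each operator from the list
examples = [
    ("l.s", "les", True),       # dot: exactly one arbitrary character
    ("l.s", "ls", False),       # dot cannot stand for "nothing"
    ("les*", "le", True),       # asterisk: zero repetitions of "s" are allowed
    ("les*", "lesss", True),    # asterisk: many repetitions as well
    ("les+", "le", False),      # plus: at least one "s" is required
    ("les+", "less", True),
    ("les?", "le", True),       # question mark: zero or one "s"
    ("les?", "less", False),
    ("a{2,3}", "aa", True),     # interval: 2 to 3 repetitions
    ("a{2,3}", "aaaa", False),
    ("a{2,}", "aaaaa", True),   # upper bound omitted: at least 2
    ("a{2}", "aaa", False),     # single number: exactly 2
]

results = [matches(pattern, word) == expected for pattern, word, expected in examples]
print(all(results))
```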
If you want to search for words starting with les, enter les.* into the search field. This query finds forms such as lesy, lesního, lesů, but also lesklé, leskly, etc.
Similarly, the query .*tel finds words ending with tel, e.g., spasitel, spisovatel, ředitel, obyvatel, etc.
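The effect of these two queries can be illustrated on a small word list. This is only a sketch under the assumption that the query is compared against whole words (imitated here with Python's `re.fullmatch`); the word list is made up for the illustration and is not taken from the corpus.

```python
import re

# Hypothetical sample of word forms; a real search runs over the whole corpus.
words = ["les", "lesy", "lesního", "lesů", "lesklé",
         "spasitel", "ředitel", "telefon"]

# les.* : "les" followed by any number of arbitrary characters
starts_with_les = [w for w in words if re.fullmatch("les.*", w)]

# .*tel : any number of arbitrary characters followed by "tel"
ends_with_tel = [w for w in words if re.fullmatch(".*tel", w)]

print(starts_with_les)
print(ends_with_tel)
```

Note that les.* also returns the bare form les, because the asterisk allows zero repetitions, while telefon is not returned by .*tel, because tel is not at the end of the word.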
More examples:
| what to search for | query text |
| word "kdy" with a small or capital first letter | [kK]dy |
| all forms of the word "kočka" | koče?[kc].* |
| infinitives of the prefixed verbs derived from "nést" | .+nést |
| variants of the interjection "ratata" of different length | ra(ta)+ |
| spelling doublet: "diskuze" spelled also with "s" | disku(s|z)e or disku[sz]e |
| all morphological variants of the word "smích" (excluding forms derived from "Smíchov" and "smíchat") | [Ss]mích[^oaá].*|[Ss]mích |
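The queries in the table can be checked against a few word forms before they are entered into the search field. The sketch below again assumes whole-word matching, imitated with Python's `re.fullmatch`; the test words and the helper `check` are hypothetical, and the corpus search engine may differ from Python in details.

```python
import re

# query -> (forms that should be found, forms that should not be found)
table = {
    "[kK]dy": (["kdy", "Kdy"], ["když"]),
    "koče?[kc].*": (["kočka", "kočky", "kočce", "koček"], ["kočár"]),
    ".+nést": (["přinést", "odnést"], ["nést"]),
    "ra(ta)+": (["rata", "ratata", "ratatata"], ["ra"]),
    "disku(s|z)e": (["diskuse", "diskuze"], ["diskutovat"]),
    "disku[sz]e": (["diskuse", "diskuze"], ["diskutovat"]),
    "[Ss]mích[^oaá].*|[Ss]mích": (["smích", "Smích", "smíchu", "smíchem"],
                                  ["Smíchov", "smíchat", "smíchá"]),
}


def check(query: str, found: list, not_found: list) -> bool:
    """Return True if the query matches every 'found' form and no 'not_found' form."""
    hits = all(re.fullmatch(query, w) for w in found)
    misses = not any(re.fullmatch(query, w) for w in not_found)
    return hits and misses


outcome = {query: check(query, found, not_found)
           for query, (found, not_found) in table.items()}
print(outcome)
```

In the last query, the second alternative [Ss]mích is needed because the first alternative requires at least one further character after smích.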


