What is this corpus?
KurdiLex is the first comprehensive digital language platform dedicated to Kurdish.
It combines a scientific dictionary, a living corpus, and a neologism observatory.
It addresses a critical need: the lack of digital language infrastructure for nearly 50 million speakers.
This is a Kurdish Kurmanji corpus composed of several literary and linguistic genres:
- Poetry (Helbest): Kurdish poems
- Novels (Roman): Kurdish novels and stories
- Theatre (Şano): Kurdish theatre works
- Newspaper (Rojname): Press articles
- Website (Malper): Content from Kurdish websites
- Specific Corpus: Research-specific corpora
- Traditional Songs (Dengbêj): Oral traditional Kurdish songs
- Dictionary (Ferheng): Kurdish dictionaries and lexicons
Objectives
- Preserve the Kurdish language
- Support linguistic research
- Advance Kurdish NLP technologies
- Document the richness of Kurdish literature
- Archive Dengbêj oral traditions
- Build cultural and lexicographic resources
Statistics
The corpus currently contains:
- 35 total documents
- 13 poetry documents
- 17 novels
- 0 theatre documents
- 0 newspapers
- 2 website documents
- 0 specific corpus documents
- 0 traditional songs
- 3 dictionaries
Detailed statistics
How to use?
- Type your word
- Select the genre (optional)
- Click "Search"
- View results in the table
For example, to search in the dictionary:
- Type the word "ziman"
- Select "Dictionary"
- Click "Search"
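The search steps above amount to matching a word against the documents, with the genre acting as an optional filter. The following is a minimal Python sketch of that idea, not the site's actual implementation: the `Document` class, the `search` function, and the sample entries are all hypothetical and exist only for illustration. It assumes simple whitespace tokenization and case-insensitive matching.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Document:
    """Hypothetical record for one corpus document."""
    title: str
    genre: str
    text: str


def search(docs: List[Document], word: str,
           genre: Optional[str] = None) -> List[Tuple[str, str, int]]:
    """Return (title, genre, occurrences) rows for documents containing `word`.

    If `genre` is None, every genre is searched (the optional step).
    """
    needle = word.casefold()
    rows = []
    for doc in docs:
        # Skip documents outside the selected genre, if one was chosen.
        if genre is not None and doc.genre != genre:
            continue
        # Naive whitespace tokenization; punctuation is not handled.
        count = doc.text.casefold().split().count(needle)
        if count:
            rows.append((doc.title, doc.genre, count))
    return rows


# Placeholder documents, invented for this example only.
SAMPLE = [
    Document("Sample poem", "Poetry", "ziman ziman e"),
    Document("Sample lexicon", "Dictionary", "ziman peyv ferheng"),
    Document("Sample novel", "Novels", "peyv peyv"),
]

if __name__ == "__main__":
    # Mirrors the example: type "ziman", select "Dictionary", click "Search".
    for row in search(SAMPLE, "ziman", genre="Dictionary"):
        print(row)
```

Leaving `genre` unset searches every genre, which corresponds to skipping the optional selection step.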
Dictionary usage
The dictionary section contains words and their meanings. Examples:
kurmancî: the most widely spoken Kurdish dialect
ferheng: a book that contains words and definitions
peyv: word, the basic unit of language
Contact
For suggestions or questions:
Email: mahmoudalhadji77@gmail.com
To contribute dictionary entries or suggest new words, please contact us.