Slovenian National Corpus
Slovenian National Corpus FidaPLUS is the largest and most important corpus of the Slovenian language. It is an upgrade of the FIDA corpus, which was developed between 1997 and 2000, extended with texts published up to 2006. It resulted from an applied research project of the Faculty of Arts and the Faculty of Social Sciences, both at the University of Ljubljana, and the Jožef Stefan Institute's Department of Knowledge Technologies. It consists of 621 million words/tokens gathered from selected Slovenian texts of different genres and styles, mainly from books and newspapers.

The source of this article is Wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.
 