








Turkish NLP Suite is a nonprofit organization that aims to deliver resources for Turkish NLP, including corpus, pretrained models, benchmarking resources and much more. We aim to bring excellence to the Turlish language and speech research by publishing and open-sourcig for all areas of Turkish processing including morphology, language modelling, subword modelling, corpora building and more. The organization is run by Duygu, your research scientist (originally )from Istanbul, (currently) living in Berlin and San Diego, CA.
know more






















For a long time, Turkish models climbed leaderboards written somewhere else—translated datasets, English‑first assumptions, noisy web scrapes. Useful, yes. Representative of real Turkish? Not quite. TrGLUE is our answer: a...
Read More
We proudly introduce BellaTurca, the ultimate Turkish large corpus, bringing diversity and high quality to fight the dullness and blandness in Turkish language modeling. We’re talking about 250GB of text...
Read More