Belarusian language99

Neural Networks Still Stumble on Belarusian Speech. Belarusians Want to Give Artificial Intelligence the Ideal Voice

Speech synthesis technologies are rapidly taking over the world, but synthesized Belarusian still sounds with noticeable defects. Even the most advanced models stumble on our stresses and phonetics. Belarusians have launched the Sonora project to create the first studio dataset, which should forever change the sound of digital Belarusian.

Recording studio. Illustrative photo. Photo: Freepik / DC Studio

An important technological breakthrough in Belarusian speech synthesis occurred back in the spring of 2025, thanks to Google's implementation of its new Gemini model, which learned to accurately recognize Belarusian speech (STT — Speech-to-Text), thanks to which, for example, automatic Belarusian-language subtitles finally appeared on YouTube.

Belarusians themselves contributed greatly to this through the volunteer project Donar.by, collecting thousands of hours of live voices.

Thanks to this gigantic database, today it is Google's voice that is closest to the correct sound of the Belarusian language. The model understands context well and has a huge vocabulary, leaving competitors from OpenAI or ElevenLabs far behind, whose attempts to speak Belarusian are far from natural speech.

But recognizing speech is only half the battle. When a neural network has to voice text itself (TTS — Text-to-Speech), it systematically makes mistakes in rarely used words and cannot cope with homographs — words that are spelled identically but have different meanings depending on the stress.

If, instead of the correct "sparyshámi", artificial intelligence confidently produces "spáryshami", this immediately reveals its synthetic nature to a native speaker. The native speaker may not even know the meaning of the word, or where the stress is placed, but their linguistic intuition tells them that something is wrong.

Furthermore, such errors, even if rare in Google models, do a disservice to those who are just beginning to learn the Belarusian language, reinforcing distorted pronunciation.

Add to this the problems with conveying the softness of consonants, the specific sound of "ў", and the affricates "дз" and "дж" — listening to and perceiving long texts in such a rendition is still physically difficult.

Voice from a Test Tube

The problem is not that the algorithms are not smart enough — in the case of the Belarusian language, they simply have nothing to learn from. For artificial intelligence to master correct intonation, rhythm, and stresses, audio from YouTube or podcasts, where sound quality is always varied and people's diction is imperfect, is not enough.

To create a natural synthesized voice, a special, crystal-clear studio dataset is required. This means thousands of hours of professional readings, where texts are specially constructed by linguists to cover all possible phonetic combinations and show the model how to correctly place stresses in complex contexts. Today, no such open data array exists for the Belarusian language in the world.

It is precisely this empty niche that the Sonora project intends to fill. This is a volunteer initiative driven by project manager Hanna Maklakova, linguistic engineer Uladzislau, the TuteishyGPT development team, and a number of specialists whose names are not disclosed for security reasons. Their goal is not to create a closed commercial product, but to establish a fundamental database that everyone can use.

How They Plan to Create the Ideal Voice

Currently, the team is at the fundraising stage, planning to collect 13,000 euros for the project. The largest part of the budget will go towards renting a professional studio and paying speakers with ideal pronunciation. The rest will cover sound engineers' services, the painstaking work of linguists who will prepare and annotate the text corpus, and other expenses.

The result of this work will be a completely open dataset with a public license. Based on it, the project authors plan to refine the already existing domestic BexTTS model, bringing it to a fundamentally new level.

The team is seeking direct contact with representatives from Google, OpenAI, Meta, and Speechify to offer them ready and high-quality material. In the logic of global corporations, everything is simple: if they are given a ready tool to improve a product in a local market, they gladly integrate it.

If they cannot collect the entire amount immediately, the project authors promise to start recording with the funds already in their accounts, because even a partial replenishment of the database is a practical step forward.

From Textbooks to Navigators

The presence of the Belarusian language in technologies today is a matter of its survival in principle. High-quality speech synthesis fundamentally changes the rules of the game in content creation.

This means that publishing Belarusian audiobooks or voicing long articles will no longer require huge budgets and weeks of studio work. It's an opportunity for schoolchildren and students to listen to textbooks, and for people with visual impairments or dyslexia — to get full access to Belarusian-language information. It's a basis for creating domestic voice assistants, chatbots, and navigators that will not speak to us in broken Google speech. Finally, it is a convenient tool for the enormous Belarusian diaspora that wants to preserve the linguistic environment for their children abroad.

«Nasha Niva» — the bastion of Belarus

SUPPORT US

Comments9

  • .
    19.04.2026
    1, гугл пакрысе адмяняе беларускую мову на карысць украінскай. Запыты па-беларуску ўсё часцей выдаюць украінскія спасылкі і прапановы зрабіць запыт па-украінску без памылак.
  • беларуская мадэль маўлення Bextts
    19.04.2026
    каб не пераскоквала на іншыя мовы, можна скарыстацца існуючай беларускай мадэллю

    https://huggingface.co/spaces/archivartaunik/Bextts
  • Скептык
    19.04.2026
    А нахалеру нам ідэальны штучны голас? Каб гэб'ё і ментаўё рабіла правакацыі на чысцюткай беларускай мове? Тэхнары такія тэхнары - ім абы нешта скрэацівіць, каб не адставаць ад сіліконавай даліны. а колькі шкоды гэтыя "інструменты" могуць потым нарабіць, пра гэта яны ня думаюць.

Now reading

Tamara Vinnikava sold her London house, adorned with self-portraits MANY PHOTOS 23

Tamara Vinnikava sold her London house, adorned with self-portraits MANY PHOTOS

All news →
All news

Ukraine asked Turkey to organize a summit between Zelenskyy and Putin 3

Forecasters promised wet snow in the coming days

Moscow Frightens Armenia: Through Rapprochement with the EU, Armenia Will Lose 30% of its Economy 3

Tucker Carlson Apologized for Supporting Trump 9

Max creators say 1.3 million Belarusians have registered in their messenger 7

Lukashenka instructed to tighten driver training in driving schools 16

National Bank issued a 12-sided coin, which costs 23 thousand rubles 1

"Work Disciplines". Former Inmates Sent to Collect Stones 14

Political prisoner released in late February gave birth to a child 1

больш чытаных навін
больш лайканых навін

Tamara Vinnikava sold her London house, adorned with self-portraits MANY PHOTOS 23

Tamara Vinnikava sold her London house, adorned with self-portraits MANY PHOTOS

Main
All news →

Заўвага:

 

 

 

 

Закрыць Паведаміць