All posts

SAUTI Translate: speak your language, hear theirs

Two people, two languages, one conversation. We are building real-time English–Kiswahili voice translation that works through any pair of earphones.

Imagine two people in a meeting — one speaks English, the other speaks Kiswahili. Today, they need an interpreter, or one of them has to switch to their second language. Neither is ideal.

SAUTI Translate is our answer.

What we are building

A real-time speech-to-speech translation system for English and Kiswahili. Two people wearing earphones can have a natural conversation — each speaking their own language, each hearing the other in theirs.

Why this matters

East Africa sits at the intersection of English and Kiswahili. Business meetings, healthcare consultations, government services, and customer support calls routinely cross this language boundary. Over 200 million Kiswahili speakers interact daily with English-dominant systems.

Existing translation tools require typing or waiting — neither works for natural conversation. Real-time voice translation removes the interpreter bottleneck and makes these interactions seamless.

How it works

The system chains three stages in a streaming pipeline: speech recognition captures what is said, machine translation converts it, and text-to-speech delivers it in the target language — all in under two seconds through standard earphones.

The impact

This is not just a technology demo. Real-time voice translation has direct applications in:

  • **Healthcare:** Doctors and patients communicating across the language barrier without an interpreter
  • **Business:** Cross-language meetings and negotiations happening naturally
  • **Education:** Students accessing instruction in their strongest language
  • **Government services:** Citizens engaging with officials in the language they think in

Try it

The translation pipeline is live in demo mode. Visit the [Translate playground](/translate) to try English–Kiswahili voice translation.