About the project

About OMU TTS

What is OMU TTS?

OMU TTS is a text-to-speech web application: you enter text, choose a language and voice, and download a WAV file. OMU means parrot in Japanese — a nod to voices that repeat your words clearly.

Speech is powered by the Kokoro-82M model, served through a dedicated TTS API (OMU IA TTS). This site is the public-facing studio that talks to that API securely.

Project context

OMU TTS is an individual project — not a group assignment. I work on it alone as a solopreneur: one person covering product, design, frontend, and integration with the backend API.

Who is behind it?

Ahiru-T3ch is me — my solo studio name for shipping software. When you see “by Ahiru-T3ch” in the footer, that is the same person who designs, builds, and maintains OMU TTS. There is no separate team hidden behind the brand for this product.

Questions, partnerships, or early access: reach out via ahiru-t3ch.com.

Skills & stack

This project showcases full-stack delivery: a modern web UI, server-side API routes that proxy the TTS backend (token never exposed to the browser), and containerized deployment.

Next.js & React
TypeScript
Tailwind CSS
REST API integration
Docker & Compose
UI / UX design
TTS pipelines (Kokoro)

Frontend: Next.js, React, TypeScript, Tailwind CSS · Backend bridge: Next.js Route Handlers · TTS service: OMU IA TTS (Docker) · Infra: Docker Compose, environment-based configuration.

Roadmap

Voice cloning — under construction

Custom voice cloning will be part of OMU TTS. The feature is not available in the app yet; it will ship when the backend API is ready.

Open the studio