Bring the vocal
Upload audio or paste a URL. The worker normalizes input to clean mono 24 kHz WAV.
Teravoice / Singing Voice Conversion
A clean source vocal, a clean reference singer, and a serverless 48GB GPU render path. Teravoice turns the Amphion Vevo2 FM model into a simple studio console.
Kits-style simplicity, but honest: the dashboard exposes the source-plus-reference singing conversion path we have actually deployed.
Use a reference singer take to guide timbre while the source performance stays intact.
Track status, review logs, play the result, and download the rendered WAV.
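The input normalization mentioned above (clean mono 24 kHz WAV) can be sketched as a small command builder. This is an illustration only, not the worker's actual code: it assumes ffmpeg is the conversion tool and that 16-bit PCM is the output encoding, neither of which the page states. The function name `build_normalize_command` is hypothetical.

```python
from __future__ import annotations

TARGET_SAMPLE_RATE = 24_000  # 24 kHz, as stated on the page
TARGET_CHANNELS = 1          # mono, as stated on the page


def build_normalize_command(src: str, dst: str) -> list[str]:
    """Return an ffmpeg argv that converts `src` to a mono 24 kHz WAV at `dst`.

    Assumption: ffmpeg and 16-bit PCM are illustrative choices; the real
    worker may use a different tool or sample format.
    """
    return [
        "ffmpeg",
        "-y",                            # overwrite the output file if it exists
        "-i", src,                       # input file or URL
        "-vn",                           # drop any video stream
        "-ac", str(TARGET_CHANNELS),     # downmix to one channel
        "-ar", str(TARGET_SAMPLE_RATE),  # resample to 24 kHz
        "-c:a", "pcm_s16le",             # 16-bit PCM (assumed encoding)
        dst,
    ]


if __name__ == "__main__":
    # Print the command rather than running it, so the sketch has no side effects.
    print(" ".join(build_normalize_command("take.mp3", "take_24k_mono.wav")))
```

Passing the returned list to `subprocess.run(cmd, check=True)` would perform the conversion on a machine where ffmpeg is installed.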
Bring Hindi, Tamil, Telugu, Bengali, Marathi, Punjabi, Gujarati, Kannada, Malayalam, or English vocals. Teravoice changes the singer timbre while preserving the sung performance; lyric translation is not shown because it is not live yet.
Hostinger serves the app, Cloudflare R2 stores inputs and outputs, and Vast provisions a 48GB+ GPU only when a render is requested.
POST /api/render/upload
POST /api/render
GET /api/render/{job_id}
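A backend caller might prepare requests for the routes above roughly as follows. Only the paths, the HTTP methods, and the use of bearer tokens come from this page. The base URL, the JSON field names (`source_url`, `reference_url`), and the helper names are assumptions made for illustration. The multipart upload route is left out because its form fields are not documented here.

```python
import json
from urllib.parse import quote
from urllib.request import Request

# Placeholder host; substitute the real deployment URL.
BASE_URL = "https://teravoice.example"


def _auth_headers(token: str) -> dict:
    """Bearer-token header, per the bearer-token routes described on the page."""
    return {"Authorization": f"Bearer {token}"}


def submit_render_request(token: str, source_url: str, reference_url: str) -> Request:
    """Build (but do not send) a POST /api/render request.

    Assumption: the body is JSON with `source_url` and `reference_url`
    fields; the actual payload schema is not given on the page.
    """
    body = json.dumps(
        {"source_url": source_url, "reference_url": reference_url}
    ).encode("utf-8")
    headers = {**_auth_headers(token), "Content-Type": "application/json"}
    return Request(f"{BASE_URL}/api/render", data=body, headers=headers, method="POST")


def status_request(token: str, job_id: str) -> Request:
    """Build (but do not send) a GET /api/render/{job_id} request."""
    safe_id = quote(job_id, safe="")  # percent-encode the id for use in a path
    return Request(
        f"{BASE_URL}/api/render/{safe_id}",
        headers=_auth_headers(token),
        method="GET",
    )


if __name__ == "__main__":
    req = submit_render_request(
        "TOKEN", "https://cdn.example/src.wav", "https://cdn.example/ref.wav"
    )
    print(req.get_method(), req.full_url)
```

Sending either object with `urllib.request.urlopen(req)` would issue the call. A caller would then repeat the status request until the job reports completion, in whatever response format the service actually returns.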
The product surface stays focused on the production features wired to the deployed serverless GPU stack.
Source vocal plus reference voice through Amphion Vevo2 FM-only SVC.
Signup, login, private dashboard, and user-owned render history.
Bearer-token routes for Hostinger or other backend callers.