Zero-Shot Voice Cloning
Clone any Urdu voice from just 3-10 seconds of reference audio. No speaker training required.
OmniVoiceزواہ
ZUAH · IUB FYP 2025-2026
Urdu zero-shot voice cloning and deepfake detection. Explore the project without an account, or log in for your full workspace.
About the project
Zuah combines zero-shot Urdu voice cloning with state-of-the-art deepfake detection. Urdu remains underrepresented in voice AI, while deepfakes spread misinformation in Urdu-speaking communities — Zuah addresses both challenges in one platform.
Built as an IUB FYP (2025–2026), the system runs on Google Colab GPU with a Flask REST API exposed through an ngrok tunnel — no local GPU required.
زواہ اردو آواز کی سچائی اور جعل سازی کا پتہ لگاتا ہے
Clone any voice from 3 seconds of audio
Real or fake? Know instantly.
CAPABILITIES
From zero-shot cloning to real-time deepfake detection - Zuah handles the full pipeline.
Clone any Urdu voice from just 3-10 seconds of reference audio. No speaker training required.
OmniVoiceMulti-model fusion of Wav2Vec2, AASIST, and Conformer delivers state-of-the-art anti-spoofing performance.
95.45% AccuracyPair up to 10 reference voices with 10 Urdu scripts and run all jobs sequentially with per-job progress tracking.
Up to 10x10Full right-to-left Urdu script input using Noto Nastaliq Urdu typography with RTL-aware placeholders.
BilingualAnimated Real % and Fake % bars update after each analysis, giving instant visual confidence feedback.
Live ResultsFlask + PyTorch on Google Colab GPU, tunneled via ngrok for zero-infrastructure browser access.
CUDA / ColabDEVELOPMENT PROCESS
How Zuah was designed, built, tested, and deployed