HomeDemoFAQTeam

Hear the Truth.
Speak with Intelligence.

سچ سنو۔ ذہانت سے بولو۔

Zuah is an AI-powered Urdu voice system that clones any voice from 3 seconds of audio and detects deepfakes with 95.45% accuracy - powered by Wav2Vec2, AASIST, Conformer, and OmniVoice.

0%
Detection Accuracy
0
Min. Reference Audio
0
AI Models Fused
REAL �FAKE ? CLONED95.45%

Scroll to explore

From Audio to Intelligence in 4 Steps

01

Upload Reference Audio

Drop 1-10 audio files (.wav .mp3 .flac .m4a), each 3-10 seconds.

02

Enter Your Urdu Scripts

Type Urdu text in RTL rows. Add up to 10 dynamically.

03

Generate Cloned Voices

POST /clone via ngrok - sequential jobs with progress.

04

Listen, Download, or Detect

Download WAV or send directly to the detection panel.

01

Upload Audio

Upload any file or use cloned output from the demo.

02

Analyze via /predict

Flask backend returns verdict and confidence scores.

03

View REAL/FAKE Verdict

Animated result card with shield or alert icon.

04

Review Confidence Bars

Real % and Fake % bars animate to final values.

Try Zuah Live

? Requires your ngrok backend URL to be active

? Offline

Contact: +923000781176 · muhammadzaid@gmail.com

Running on Google Colab? Paste the ngrok URL from your notebook output.

How to get your ngrok URL 
  1. Open your Zuah notebook in Google Colab
  2. Run all cells - last cell starts Flask
  3. Copy the ngrok HTTPS URL
  4. Paste above and click Contact
!ngrok config add-authtoken YOUR_TOKEN Get free ngrok token 

Clone a Voice OmniVoice

Live Recording

Record 3–10 seconds directly from your microphone.

00:00

Drop audio files here or click to browse

.wav .mp3 .flac .m4a � 3-10 seconds � Max 10 files
Reference Transcription (optional)

Detect Deepfake 95.45% Accuracy

Drop audio or click to browse

Real %0%
Fake %0%

History (last 5)

Audio Library

Original uploads, live recordings, and generated clones — saved when python local_storage_server.py runs on port 8765.

Storage offline

No saved audio yet. Generate a clone or record a sample in the demo.

Frequently Asked Questions

What is Zuah?

Zuah is an Urdu voice AI platform combining zero-shot voice cloning (OmniVoice) and deepfake detection (Wav2Vec2 + AASIST + Conformer) in a single browser-based application.

Do I need to install anything?

The frontend is a single HTML file - just open it in any modern browser. You do need a running backend (Google Colab + ngrok) to use the clone and detect features.

How accurate is the deepfake detector?

Zuah achieves 95.45% accuracy on our evaluation set, using a fusion of Wav2Vec2 waveform features, AASIST spectral-temporal graphs, and Conformer mel-spectrogram encoding.

What languages does voice cloning support?

OmniVoice is optimized for Urdu. While it may produce output for other languages, best results are with Urdu Nastaliq script input.

How short can the reference audio be?

The minimum is 3 seconds. Optimal quality is achieved with 6-8 seconds of clear, noise-free speech from the target speaker.

Does the system store my audio?

No. Audio is processed in memory on Google Colab and returned to your browser. Nothing is stored on any server.

Can I detect deepfakes without cloning first?

Yes. The detection panel accepts any audio upload independently. You can upload any audio file to check if it is real or AI-generated.

Why does the backend URL change every session?

This happens with the free ngrok plan. To get a persistent URL, create a free ngrok account and use a static domain, or use ngrok's reserved subdomain feature.

What audio formats are supported?

Upload: .wav, .mp3, .flac, .m4a for both cloning and detection. Output: synthesized voices are returned as .wav files.

Is this project open-source?

Yes. The codebase is available on GitHub (link in footer). The models (OmniVoice, etc.) are subject to their own licenses.

The Team

AY

Afifa Yaseen

MZ

Muhammad Zaid

UF

Uswah Fatima

AA

Asjad Amin