Online Tools Toolshu.com Log In Sign Up

Seed-VC Voice Conversion ICON

Seed-VC Voice Conversion

https://github.com/Plachtaa/seed-vc

automatic jump after -s...

Website Introduction

Seed-VC: Zero-shot, Real-time Voice & Singing Voice Conversion Tool 🎙️🎶

Seed-VC is an advanced Voice Conversion (VC) and Singing Voice Conversion (SVC) project. Its standout features are zero-shot learning and real-time processing, allowing it to clone a target voice from just 1 to 30 seconds of reference audio without any training.


✨ Key Features

  • Zero-shot Voice Conversion: Upload source and target reference audio for direct voice conversion.
  • Zero-shot Real-time Voice Conversion: Supports ~300ms algorithm delay + ~100ms device side delay, suitable for online meetings, gaming, and live streaming.
  • Zero-shot Singing Voice Conversion: Specifically optimized for singing, supporting higher sampling rates (44.1kHz) and high-pitched voice quality.
  • Fine-tuning Support: Allows quick fine-tuning on custom data (minimum 1 utterance per speaker, as few as 100 steps).

🤖 Model Versions

Four pretrained models are provided for different scenarios:

  • v1.0 tiny (seed-uvit-tat-xlsr-tiny): 25M params, 22050Hz, best for real-time VC.
  • v1.0 small (seed-uvit-whisper-small-wavenet): 98M params, 22050Hz, best for offline high-quality VC.
  • v1.0 base (seed-uvit-whisper-base): 200M params, 44100Hz, designed for singing voice conversion (SVC) with strong zero-shot performance.
  • v2.0 (hubert-bsqvae-small): 67M+90M params, best at suppressing source speaker traits, supports voice & accent conversion.

🛠️ Usage Methods

Multiple interfaces are available:

  • Command Line: Use inference.py or inference_v2.py for quick file conversion.
  • Web UI: Run app_vc.py (VC), app_svc.py (SVC), or the integrated app.py for browser-based operation.
  • Real-time GUI: Run real-time-gui.py for a graphical interface with adjustable parameters, suitable for virtual microphone routing.

💻 Installation & Requirements

Python 3.10 is recommended. Supports Windows, Linux, and Mac M-series chips. Install dependencies via pip to get started.


📈 Project Status

This project has 3.8k Stars and 492 Forks on GitHub, primarily written in Python (97.4%). It has an active community and is continuously updated.

发现周边 发现周边
Comment area

Loading...