Alternatives

Transformer TTS alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Transformer TTS

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

55
Quality
74
Trust
690
Stars
#1

GPA

Similarity 132Trust 88Excellent 87

[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!

866 starsMay 25, 2026 pushmedia-automationPythonVoice
$ npx skills add AutoArk/GPA
#2

Glow Tts

Similarity 131Trust 75Promising 55

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

714 starsJul 12, 2022 pushmedia-automationPythonVoice
$ npx skills add jaywalnut310/glow-tts
#3

Vits2 Pytorch

Similarity 130Trust 75Needs review 54

unofficial vits2-TTS implementation in pytorch

549 starsMar 28, 2024 pushmedia-automationPythonVoice
$ npx skills add p0p4k/vits2_pytorch
#4

Vits2

Similarity 125Trust 77Needs review 54

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

641 starsSep 11, 2023 pushmedia-automationJupyter NotebookVoice
$ npx skills add daniilrobnikov/vits2
#5

Diffwave

Similarity 123Trust 74Promising 56

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

888 starsMar 26, 2024 pushmedia-automationPythonVoice
$ npx skills add lmnt-com/diffwave
#6

Diffusers

Similarity 120Trust 98Excellent 100

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

34K starsJun 16, 2026 pushmedia-automationPythonImage Generation
$ npx skills add huggingface/diffusers
#7

E2 Tts Pytorch

Similarity 120Trust 85Strong 74

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

517 starsDec 20, 2025 pushmedia-automationPythonVoice
$ npx skills add lucidrains/e2-tts-pytorch
#8

FireRedTTS

Similarity 119Trust 81Promising 69

An Open-Sourced LLM-empowered Foundation TTS System

912 starsSep 28, 2025 pushmedia-automationPythonVoice
$ npx skills add FireRedTeam/FireRedTTS
#9

ComfyUI VibeVoice

Similarity 118Trust 83Promising 67

ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio

586 starsSep 25, 2025 pushmedia-automationPythonVoice
$ npx skills add wildminder/ComfyUI-VibeVoice
#10

Chatterbox TTS Extended

Similarity 118Trust 83Promising 67

Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.

565 starsAug 23, 2025 pushmedia-automationPythonVoice
$ npx skills add petermg/Chatterbox-TTS-Extended
#11

Vllm Omni

Similarity 117Trust 94Excellent 100

A framework for efficient model inference with omni-modality models

5.2K starsJun 16, 2026 pushmedia-automationPythonImage Generation
$ npx skills add vllm-project/vllm-omni
#12

CloneTTS

Similarity 116Trust 84Strong 81

A lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless system-wide voice cloning and high-fidelity text reading. / 运行在安卓本地的轻量级文字转语音 (TTS) 引擎,支持离线发音人提取、零门槛音色克隆与双擎系统级全局听书。

692 starsMay 18, 2026 pushmedia-automationVoiceClaude Code
$ npx skills add sipeter/CloneTTS
#13

Voicebox Pytorch

Similarity 116Trust 79Promising 55

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

691 starsOct 1, 2024 pushmedia-automationPythonVoice
$ npx skills add lucidrains/voicebox-pytorch
#14

Alexandria Audiobook

Similarity 115Trust 88Excellent 86

AI-powered multi-voice audiobook generator — LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.

682 starsJun 4, 2026 pushmedia-automationPythonVoice
$ npx skills add Finrandojin/alexandria-audiobook
#15

Pandrator

Similarity 115Trust 89Excellent 85

Turn PDFs and EPUBs into audiobooks; subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

572 starsJun 14, 2026 pushmedia-automationPythonVoice
$ npx skills add lukaszliniewicz/Pandrator
#16

F5 Tts Mlx

Similarity 115Trust 77Needs review 54

Implementation of F5-TTS in MLX

635 starsMar 19, 2025 pushmedia-automationPythonVoice
$ npx skills add lucasnewman/f5-tts-mlx

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Transformer TTS if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.