Alternatives

E2 Tts Pytorch alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

E2 Tts Pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Quality 74 | Trust 85 | Stars 517
#1

Voicebox Pytorch

Similarity 140 | Trust 79 | Promising 55

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

691 stars | Oct 1, 2024 push | media-automation | Python | Voice
$ npx skills add lucidrains/voicebox-pytorch
#2

Glow Tts

Similarity 131 | Trust 75 | Promising 55

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

714 stars | Jul 12, 2022 push | media-automation | Python | Voice
$ npx skills add jaywalnut310/glow-tts
#3

Vits2 Pytorch

Similarity 130 | Trust 75 | Needs review 54

unofficial vits2-TTS implementation in pytorch

549 stars | Mar 28, 2024 push | media-automation | Python | Voice
$ npx skills add p0p4k/vits2_pytorch
#4

LLaSA Training

Similarity 127 | Trust 80 | Strong 70

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

661 stars | Jan 21, 2026 push | media-automation | Python | Voice
$ npx skills add zhenye234/LLaSA_training
#5

GPA

Similarity 124 | Trust 88 | Excellent 87

[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!

866 stars | May 25, 2026 push | media-automation | Python | Voice
$ npx skills add AutoArk/GPA
#6

Bark Voice Cloning HuBERT Quantizer

Similarity 123 | Trust 75 | Promising 55

The code for the bark-voicecloning model. Training and inference.

711 stars | Sep 13, 2023 push | media-automation | Python | Voice
$ npx skills add gitmylo/bark-voice-cloning-HuBERT-quantizer
#7

F5 Tts Mlx

Similarity 123 | Trust 77 | Needs review 54

Implementation of F5-TTS in MLX

635 stars | Mar 19, 2025 push | media-automation | Python | Voice
$ npx skills add lucasnewman/f5-tts-mlx
#8

Orpheus Tts Local

Similarity 123 | Trust 77 | Needs review 54

Run Orpheus 3B Locally With LM Studio

543 stars | Mar 20, 2025 push | media-automation | Python | Voice
$ npx skills add isaiahbjork/orpheus-tts-local
#9

FireRed Image Edit

Similarity 121 | Trust 93 | Excellent 96

FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity consistency, and seamless multi-element fusion.

1.3K stars | Apr 3, 2026 push | media-automation | Python | Image Generation
$ npx skills add FireRedTeam/FireRed-Image-Edit
#10

Tiktok Voice

Similarity 121 | Trust 72 | Needs review 49

Simple Python script to interact with the TikTok TTS API

602 stars | Oct 12, 2024 push | media-automation | Python | Voice
$ npx skills add oscie57/tiktok-voice
#11

Pixelle Video

Similarity 119 | Trust 95 | Excellent 100

🚀 AI Fully Automated Short Video Engine

23K stars | Jun 14, 2026 push | media-automation | Python | Image Generation
$ npx skills add AIDC-AI/Pixelle-Video
#12

Local Talking Llm

Similarity 116 | Trust 84 | Strong 81

A talking LLM that runs on your own computer without needing the internet.

869 stars | Apr 4, 2026 push | media-automation | Python | Speech
$ npx skills add vndee/local-talking-llm
#13

CloneTTS

Similarity 116 | Trust 84 | Strong 81

A lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless system-wide voice cloning and high-fidelity text reading. It runs locally on Android and supports offline speaker extraction, zero-barrier voice cloning, and dual-engine system-wide audiobook listening.

692 stars | May 18, 2026 push | media-automation | Voice | Claude Code
$ npx skills add sipeter/CloneTTS
#14

Diffwave

Similarity 115 | Trust 74 | Promising 56

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

888 stars | Mar 26, 2024 push | media-automation | Python | Voice
$ npx skills add lmnt-com/diffwave
#15

Alexandria Audiobook

Similarity 115 | Trust 88 | Excellent 86

AI-powered multi-voice audiobook generator: LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.

682 stars | Jun 4, 2026 push | media-automation | Python | Voice
$ npx skills add Finrandojin/alexandria-audiobook
#16

GigaAM

Similarity 115 | Trust 84 | Strong 79

Foundational Model for Speech Recognition Tasks

619 stars | Apr 15, 2026 push | media-automation | Python | Speech
$ npx skills add salute-developers/GigaAM

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep E2 Tts Pytorch if it already passes your workflow test and repository review.
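As a rough illustration of the trust and maintenance criteria above, the sketch below shortlists a few of the listed skills. The trust scores and push dates are copied from this page; the `Skill` class, the `shortlist` helper, the 365-day freshness cutoff, and the reference date are all assumptions made for the example, not part of any official tooling. Install path and platform fit still need a manual check.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Skill:
    name: str
    trust: int
    last_push: date


# Trust score listed on this page for E2 Tts Pytorch.
CURRENT_TRUST = 85

# A handful of entries from the list above (trust score, last push date).
CANDIDATES = [
    Skill("lucidrains/voicebox-pytorch", 79, date(2024, 10, 1)),
    Skill("jaywalnut310/glow-tts", 75, date(2022, 7, 12)),
    Skill("zhenye234/LLaSA_training", 80, date(2026, 1, 21)),
    Skill("AutoArk/GPA", 88, date(2026, 5, 25)),
    Skill("Finrandojin/alexandria-audiobook", 88, date(2026, 6, 4)),
]


def shortlist(candidates, current_trust, today, max_age_days=365):
    """Hypothetical rule: keep skills with a higher trust score than the
    current one and a push within max_age_days of `today`."""
    keep = [
        s for s in candidates
        if s.trust > current_trust
        and (today - s.last_push).days <= max_age_days
    ]
    # Highest trust first; ties broken by the most recent push.
    return sorted(keep, key=lambda s: (-s.trust, -s.last_push.toordinal()))


# The reference date is an assumption chosen for the example.
picks = shortlist(CANDIDATES, CURRENT_TRUST, today=date(2026, 6, 30))
for s in picks:
    print(s.name, s.trust, s.last_push.isoformat())
```

A shortlist like this only narrows the field; each remaining repository still needs the sandbox install test and review described below.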

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.