combine free download

abogen

Generate audiobooks from EPUBs, PDFs and text with captions

...This can be very useful for accessibility, for content consumption on the go, or for users who prefer audio over reading. The repository handles common ebook formats and generates outputs that combine audio with caption metadata. By automating text-to-speech for arbitrary documents, abogen reduces the friction of producing audiobooks and could be integrated into larger workflows (e.g., batch converting a library of texts).

Downloads: 11 This Week

Last Update: 2026-02-06

Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models

...Because it’s part of the broader Qwen ecosystem, it benefits from the model’s understanding of linguistic nuances, enabling more accurate pronunciation, prosody, and contextual delivery than many traditional TTS systems. Developers can customize voice output parameters like speed, pitch, and volume, and combine the TTS stack with other AI components.

Downloads: 13 This Week

Last Update: 2026-03-17

comfyui-mixlab-nodes

Workflow and speech recognition app

...The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. For audio and speech, it provides nodes for SpeechRecognition and SpeechSynthesis, plus workflows that combine voice generation with real-time face swapping and other audio-visual effects. On the AI side, it integrates multiple LLM providers (cloud and local), supports OpenAI-compatible endpoints and Siliconflow models, and includes prompt-focused utilities for random prompt generation, Chinese prompts, and CLIP interrogation.

Downloads: 2 This Week

Last Update: 2025-11-28

Amica

Amica is an open source interface for interactive communication

Amica is an open source interface for interacting with fully animated 3D characters that combine voice chat, vision, and an emotion engine into a single experience. It lets you hold natural conversations with AI characters that can see, listen, and speak, while expressing emotional states through facial expressions and body language. Users can import VRM character models, adjust their appearance, tune the voice to match the character, and define behavior using different large language models and TTS backends. ...

Downloads: 9 This Week

Last Update: 2025-11-30

Search Results for "combine"

Showing 4 open source projects for "combine"

abogen

Qwen3-TTS

comfyui-mixlab-nodes

Amica
