Search Results for "text based" - Page 2

Showing 3623 open source projects for "text based"

1

kokoro-onnx

TTS with kokoro and onnx runtime

kokoro-onnx is a text-to-speech toolkit that wraps the Kokoro neural TTS model in an easy-to-use ONNX Runtime interface, so you can generate speech from Python with minimal setup. It focuses on running efficiently on commodity hardware, including macOS with Apple Silicon, while still delivering near real-time performance for many use cases. The project ships prebuilt model files and a simple example script, so you can go from installation to producing an audio.wav file in just a few steps....

Downloads: 199 This Week

Last Update: 2025-11-28
2

Wan2.1

Wan2.1: Open and Advanced Large-Scale Video Generative Model

Wan2.1 is a foundational open-source large-scale video generative model developed by the Wan team, providing high-quality video generation from text and images. It employs advanced diffusion-based architectures to produce coherent, temporally consistent videos with realistic motion and visual fidelity. Wan2.1 focuses on efficient video synthesis while maintaining rich semantic and aesthetic detail, enabling applications in content creation, entertainment, and research. The model supports text-to-video and image-to-video generation tasks with flexible resolution options suitable for various GPU hardware configurations. ...

1 Review

Downloads: 45 This Week

Last Update: 2026-03-05
3

DeepSeek TUI

Coding agent for DeepSeek models that runs in your terminal

DeepSeek-TUI is a terminal-based user interface designed to interact with DeepSeek language models in a lightweight and efficient way. It provides a text-based chat experience directly within the command line, making it ideal for developers who prefer minimal interfaces. The tool supports streaming responses, allowing real-time interaction with the model. It includes features for managing prompts, sessions, and conversation history within the terminal environment. ...

Downloads: 357 This Week

Last Update: 1 day ago
4

AutoCut

Cut videos with a text editor

...AutoCut supports multiple transcription backends, including Whisper and faster-whisper modes, allowing users to choose based on speed or accuracy needs. After editing the transcript text, the corresponding video clips are merged into the final output, and the tool also produces matching subtitle files. Its command-line interface can be integrated into scripts, making it suitable for automated workflows or batch processing.

Downloads: 2 This Week

Last Update: 2026-02-06
5

OneNote Md Exporter

Console app to export OneNote notebooks to Markdown formats

OneNote Md Exporter is a Windows console application that exports your OneNote notebooks to several Markdown formats. Use it to evaluate or migrate to alternative knowledge management software such as Joplin, Obsidian, or other Markdown-based tools, or to back up your OneNote notebooks in an interoperable, open file format.

Downloads: 17 This Week

Last Update: 2025-12-15
6

FastAPI

FastAPI framework, high performance, easy to learn, fast to code

FastAPI is a modern, fast (high-performance) web framework for building APIs with Python 3.6+ based on standard Python type hints. Great editor support. Completion everywhere. Less time debugging. Designed to be easy to use and learn. Less time reading docs. Minimize code duplication. Multiple features from each parameter declaration. Fewer bugs. Get production-ready code. With automatic interactive documentation. Based on (and fully compatible with) the open standards for APIs: OpenAPI...

Downloads: 49 This Week

Last Update: 2026-04-23
7

Pix2Text

Open-Source Python3 tool for recognizing layouts, tables, and math

An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported. Pix2Text (P2T) aims to be a free and open-source Python alternative to Mathpix, and it can already accomplish Mathpix's core functionality. Pix2Text (P2T) can recognize layouts, tables, images, text, and mathematical formulas, and integrate all of these contents into Markdown format. ...

Downloads: 5 This Week

Last Update: 2026-02-07
8

AvaloniaEdit

Avalonia-based text editor (port of AvalonEdit)

This project is a port of AvalonEdit, a WPF-based text editor, to Avalonia.

Downloads: 0 This Week

Last Update: 2026-02-05
9

Applio

A simple, high-quality voice conversion tool focused on ease of use

Applio is a high-quality voice conversion toolkit designed to make modern RVC/VITS-based voice cloning accessible to non-experts. It focuses strongly on ease of use: installation scripts for Windows, Linux, and macOS set up dependencies and then launch a browser-based Gradio interface. Within that interface, users can train and run voice conversion models for tasks like singing conversion, speech-to-speech transformation, and voice cloning. The project is structured to be flexible through...

Downloads: 85 This Week

Last Update: 2026-02-18
10

Helix Editor

A post-modern modal text editor

A Kakoune / Neovim inspired editor, written in Rust. The editing model is very heavily based on Kakoune.

Downloads: 7 This Week

Last Update: 2025-07-18
11

PaddleOCR-json

Offline OCR command-line program for Windows that recognizes text in images

PaddleOCR-json is an OCR engine based on the PaddleOCR project that provides a command-line interface and tools for extracting text from images and exporting results in structured JSON format. It wraps the PaddleOCR models, which are capable of detecting and recognizing text in a wide variety of languages and layouts, into a self-contained executable that can be run locally without needing a deep learning environment configured manually.

Downloads: 9 This Week

Last Update: 2026-01-15
12

EPUB to Audiobook Converter

EPUB to audiobook converter, optimized for Audiobookshelf

...A recent addition is a Gradio-based WebUI, which wraps all configuration options in a graphical interface for users who prefer not to work with the command line. The tool offers advanced options such as controlling chapter ranges, handling paragraph detection via newline modes, removing endnote markers, and using regex-based search-and-replace files to tweak pronunciations.

Downloads: 18 This Week

Last Update: 2026-02-02
13

Speech-AI-Forge

Speech-AI-Forge is a project developed around TTS generation models

Speech-AI-Forge is a full-stack project built around modern text-to-speech generation models, providing both an API server and a Gradio-based web UI for interactive use. At its core, it acts as a hub that wires together multiple speech-related capabilities, including TTS, speech-to-text and LLM-based control flows, behind a consistent interface. The system is designed to be deployed in several ways: you can try it online via hosted demos, spin it up in a one-click Colab environment, run it in Docker containers, or set it up locally with its environment preparation scripts. ...

Downloads: 2 This Week

Last Update: 2026-02-02
14

ONLYOFFICE DocumentServer

ONLYOFFICE Docs is a free collaborative online office suite

ONLYOFFICE Document Server is an open-source office suite that enables users to create, edit, and collaborate on documents, spreadsheets, and presentations in real-time via a web-based interface.

Downloads: 12 This Week

Last Update: 3 days ago
15

pywinauto

Windows GUI Automation with Python (based on text properties)

pywinauto is a set of Python modules to automate the Microsoft Windows GUI. At its simplest it allows you to send mouse and keyboard actions to Windows dialogs and controls, but it has support for more complex actions like getting text data.

Downloads: 2 This Week

Last Update: 2025-01-06
16

ChatTTS webUI & API

A simple native web interface that uses ChatTTS to synthesize text

ChatTTS-ui is a local web interface and API wrapper around the ChatTTS speech synthesis system, designed to make advanced TTS models easy to use from a browser. It runs a small backend server (Python + Torch + ffmpeg) and exposes a simple webpage where you can type text, adjust parameters, and generate audio. The project supports Chinese, English, and mixed text with digits and control symbols, making it suitable for bilingual content and numerically heavy text like announcements or prompts....

Downloads: 16 This Week

Last Update: 2025-11-28
17

Wan2.2

Wan2.2: Open and Advanced Large-Scale Video Generative Model

...The model is trained on significantly larger datasets than its predecessor, greatly enhancing motion complexity, semantic understanding, and aesthetic diversity. Wan2.2 also open-sources a 5-billion parameter high-compression VAE-based hybrid text-image-to-video (TI2V) model that supports 720P video generation at 24fps on consumer-grade GPUs like the RTX 4090. It supports multiple video generation tasks including text-to-video.

1 Review

Downloads: 88 This Week

Last Update: 2026-03-17
18

emanote

Emanate a structured view of your plain-text notes

Emanate a structured view of your plain-text notes. Create beautiful websites such as a personal webpage, blog, wiki, Zettelkasten, notebook, knowledge base, or documentation from future-proof plain-text notes and arbitrary data, with a live preview that updates in real time. Emanote is the spiritual successor to neuron, based on Ema. Emanote is written in Haskell. Thanks to Nix, this repository is pre-configured to provide a delightful development experience with full IDE support in Visual Studio Code.

Downloads: 0 This Week

Last Update: 2025-08-19
19

OmniVoice

High-Quality Voice Cloning TTS for 600+ Languages

...The system also includes advanced features like non-verbal expression tags and pronunciation overrides, enabling expressive and precise output. With support for both API-based and command-line usage, it is designed for research, production, and experimentation alike.

Downloads: 26 This Week

Last Update: 2026-04-28
20

Mozc

Mozc - a Japanese Input Method Editor designed for multiple platforms

Mozc is an open source Japanese Input Method Editor (IME) developed by Google, designed to provide Japanese text input across multiple operating systems including Android, macOS, Windows, GNU/Linux, and Chromium OS. The project originated as a subset of Google Japanese Input, released publicly under the BSD 3-Clause license for community use and development. Mozc offers core IME functionality such as text conversion, prediction, and dictionary-based input, enabling users to efficiently type and edit Japanese text. ...

Downloads: 4 This Week

Last Update: 1 day ago
21

Firebird

Firebird server, client and tools

...It has been used in production systems, under a variety of names, since 1981. To enhance Firebird's functionality, IBSurgeon has sponsored the development of, and now released for public use, the free open-source "IBSurgeon Full Text Search UDR", which performs full-text search queries within SQL and PSQL. The UDR works with Firebird 3 and 4. For Windows, there are ready-to-use binaries; for Linux, it is necessary to build the UDR. The UDR is based on the Lucene++ engine, with all the powerful features required for full-text search and very fast performance (built as a native C++ library). ...

Downloads: 7 This Week

Last Update: 2026-04-17
22

AvalonEdit

The WPF-based text editor component used in SharpDevelop

AvalonEdit is the name of the WPF-based text editor in SharpDevelop 4.x "Mirador" and beyond. It is also being used in ILSpy and many other projects. AvalonEdit is targeting net6.0-windows and net462 TFMs. Because of net6.0-windows, you must have the .NET 6.0 SDK installed on your machine. Visual Studio 2022 Community and up is required for working with the solution (global.json will select the proper SDK to use for building for you).

Downloads: 1 This Week

Last Update: 2025-04-13
23

FLUX.1

Official inference repo for FLUX.1 models

FLUX.1 repository contains inference code and tooling for the FLUX.1 text-to-image diffusion models, enabling developers and researchers to generate and edit images from natural-language prompts using open-weight versions of the model on their own hardware or within custom applications. The project is part of a larger family of FLUX models developed by Black Forest Labs, designed to produce high-quality, detailed visuals from text descriptions with competitive prompt adherence and artistic...

Downloads: 59 This Week

Last Update: 2026-01-19
24

Mermaid

Diagram and flowchart generation from text similar to markdown

Mermaid is a JavaScript-based diagram and flowchart generating tool that uses markdown-inspired text for fast and easy generation of diagrams and charts. Forget about using heavy tools to explain your code. Mermaid greatly simplifies documentation with its simple markdown-like script language, and offers a great range of diagram and chart options. The latest version of Mermaid comes with a number of bug fixes and enhancements, as well as a new diagram type, entity relationship diagrams. ...

Downloads: 84 This Week

Last Update: 2026-05-12
25

TTime

Screenshots, word marking, OCR, AI, translation software

TTime is a desktop productivity tool that combines translation, OCR, and screen capture capabilities into a unified application designed for fast and efficient text processing workflows. It allows users to translate text through multiple methods, including direct input, screenshot-based capture, and real-time word selection, making it versatile for both casual use and professional tasks. The software integrates a wide range of translation engines and OCR services, including cloud-based providers and offline options, ensuring flexibility across different environments and connectivity conditions. ...

Downloads: 2 This Week

Last Update: 2026-03-18