Search Results for "text based" - Page 2


Showing 3500 open source projects for "text based"

1

AI Runner

Offline inference engine for art, real-time voice conversations

AI Runner is an offline inference engine designed to run a collection of AI workloads on your own machine, including image generation for art, real-time voice conversations, LLM-powered chatbots and automated workflows. It is implemented as a desktop-oriented Python application and emphasizes privacy and self-hosting, allowing users to work with text-to-speech, speech-to-text, text-to-image and multimodal models without sending data to external services. At the core of its LLM stack is a mode-based architecture with specialized “modes” such as Author, Code, Research, QA and General, and a workflow manager that automatically routes user requests to the right agent based on the task. ...

Downloads: 6 This Week

Last Update: 2025-12-11
2

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server

MiniMax-MCP is the official Model Context Protocol (MCP) server for accessing MiniMax’s multimodal generative APIs from MCP-compatible clients. It acts as a bridge between tools like Claude Desktop, Cursor, Windsurf, OpenAI Agents, and the MiniMax platform, exposing capabilities such as text-to-speech, voice cloning, image generation, text-to-image, video generation, image-to-video, text-to-video, and music generation. The server is written in Python and distributed under the MIT license, with a pyproject.toml and uv-based workflow that makes installation and execution reproducible. Configuration is handled through JSON files that tell MCP clients how to launch the server (typically via uvx minimax-mcp) and which environment variables to use for the API key, host, and output directory. ...

Downloads: 2 This Week

Last Update: 16 hours ago
3

PaddleOCR-json

Offline OCR command-line program for Windows that recognizes text in images

PaddleOCR-json is an OCR engine based on the PaddleOCR project that provides a command-line interface and tools for extracting text from images and exporting results in structured JSON format. It wraps the PaddleOCR models, which are capable of detecting and recognizing text in a wide variety of languages and layouts, into a self-contained executable that can be run locally without needing a deep learning environment configured manually.

Downloads: 11 This Week

Last Update: 2026-01-15
4

RealtimeSTT

A robust, efficient, low-latency speech-to-text library

RealtimeSTT is a Python-based realtime speech-to-text engine emphasizing low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, suitable for building voice assistants, meeting transcribers, or live caption systems.

Downloads: 5 This Week

Last Update: 2026-05-10
5

ONLYOFFICE DocumentServer

ONLYOFFICE Docs is a free collaborative online office suite

ONLYOFFICE Document Server is an open-source office suite that enables users to create, edit, and collaborate on documents, spreadsheets, and presentations in real-time via a web-based interface.

Downloads: 13 This Week

Last Update: 2 days ago
6

Helix Editor

A post-modern modal text editor

A Kakoune / Neovim inspired editor, written in Rust. The editing model is very heavily based on Kakoune.

Downloads: 6 This Week

Last Update: 2025-07-18
7

Pix2Text

Open-Source Python3 tool for recognizing layouts, tables, and math

Pix2Text (P2T) is an open-source Python 3 tool that recognizes layouts, tables, images, text, and mathematical formulas in images and integrates the results into Markdown. It aims to be a free alternative to Mathpix and can already accomplish Mathpix's core functionality, converting visual content into text-based representations. 80+ languages are supported. ...

Downloads: 4 This Week

Last Update: 2026-02-07
8

canvas-editor

Canvas-based WYSIWYG rich text editor with advanced layout tools

canvas-editor is a browser-based rich text editor that renders content using HTML5 Canvas and SVG instead of traditional DOM-based approaches. It is designed to provide a WYSIWYG editing experience similar to word processors, enabling precise control over layout, rendering, and document structure. canvas-editor supports a wide range of formatting and document features, including text styling, tables, images, and embedded elements, all managed through a structured data model. ...

Downloads: 3 This Week

Last Update: 10 hours ago
9

Mozc

Mozc - a Japanese Input Method Editor designed for multiple platforms

Mozc is an open source Japanese Input Method Editor (IME) developed by Google, designed to provide Japanese text input across multiple operating systems including Android, macOS, Windows, GNU/Linux, and Chromium OS. The project originated as a subset of Google Japanese Input, released publicly under the BSD 3-Clause license for community use and development. Mozc offers core IME functionality such as text conversion, prediction, and dictionary-based input, enabling users to efficiently type and edit Japanese text. ...

Downloads: 5 This Week

Last Update: 12 hours ago
10

Speech-AI-Forge

Speech-AI-Forge is a project developed around TTS generation models

Speech-AI-Forge is a full-stack project built around modern text-to-speech generation models, providing both an API server and a Gradio-based web UI for interactive use. At its core, it acts as a hub that wires together multiple speech-related capabilities, including TTS, speech-to-text and LLM-based control flows, behind a consistent interface. The system is designed to be deployed in several ways: you can try it online via hosted demos, spin it up in a one-click Colab environment, run it in Docker containers, or set it up locally with its environment preparation scripts. ...

Downloads: 2 This Week

Last Update: 2026-02-02
11

Mermaid

Diagram and flowchart generation from text similar to markdown

Mermaid is a JavaScript-based diagram and flowchart generating tool that uses markdown-inspired text for fast and easy generation of diagrams and charts. Forget about using heavy tools to explain your code. Mermaid greatly simplifies documentation with its simple markdown-like script language, and offers a great range of diagram and chart options. The latest version of Mermaid comes with a number of bug fixes and enhancements, as well as a new diagram type, entity relationship diagrams. ...

Downloads: 96 This Week

Last Update: 2026-05-12
12

Wan2.2

Wan2.2: Open and Advanced Large-Scale Video Generative Model

...The model is trained on significantly larger datasets than its predecessor, greatly enhancing motion complexity, semantic understanding, and aesthetic diversity. Wan2.2 also open-sources a 5-billion parameter high-compression VAE-based hybrid text-image-to-video (TI2V) model that supports 720P video generation at 24fps on consumer-grade GPUs like the RTX 4090. It supports multiple video generation tasks including text-to-video.

1 Review

Downloads: 83 This Week

Last Update: 2026-03-17
13

pg_textsearch

PostgreSQL extension for BM25 relevance-ranked full-text search

...By embedding search capabilities within the database, it simplifies architecture and reduces operational complexity. The project is particularly useful for applications that require fast and accurate text retrieval. Overall, pg_textsearch extends PostgreSQL into a more powerful platform for text-based data exploration.

Downloads: 1 This Week

Last Update: 2026-05-12
14

pywinauto

Windows GUI Automation with Python (based on text properties)

pywinauto is a set of Python modules to automate the Microsoft Windows GUI. At its simplest it allows you to send mouse and keyboard actions to Windows dialogs and controls, but it has support for more complex actions like getting text data.

Downloads: 2 This Week

Last Update: 2025-01-06
15

Firebird

Firebird server, client and tools

...It has been used in production systems, under a variety of names, since 1981. To extend Firebird's functionality, IBSurgeon sponsored the development of the free, open source "IBSurgeon Full Text Search UDR" and has released it for public use; it performs full-text search queries within SQL and PSQL. The UDR works with Firebird 3 and 4. Ready-to-use binaries are available for Windows; on Linux, the UDR must be built from source. The UDR is based on the Lucene++ engine, with the features required for full-text search and very fast performance (it is built as a native C++ library). ...

Downloads: 7 This Week

Last Update: 2026-04-17
16

FLUX.1

Official inference repo for FLUX.1 models

FLUX.1 repository contains inference code and tooling for the FLUX.1 text-to-image diffusion models, enabling developers and researchers to generate and edit images from natural-language prompts using open-weight versions of the model on their own hardware or within custom applications. The project is part of a larger family of FLUX models developed by Black Forest Labs, designed to produce high-quality, detailed visuals from text descriptions with competitive prompt adherence and artistic...

Downloads: 59 This Week

Last Update: 2026-01-19
17

OmniVoice

High-Quality Voice Cloning TTS for 600+ Languages

...The system also includes advanced features like non-verbal expression tags and pronunciation overrides, enabling expressive and precise output. With support for both API-based and command-line usage, it is designed for research, production, and experimentation alike.

Downloads: 24 This Week

Last Update: 2026-04-28
18

VoxCPM2

Tokenizer-Free TTS for Multilingual Speech Generation

VoxCPM2 is an advanced open-source text-to-speech system that redefines speech synthesis by eliminating traditional tokenization and instead generating continuous speech representations through a diffusion-based autoregressive architecture. Built on top of the MiniCPM model family, it enables highly natural, expressive, and context-aware speech generation that adapts tone, emotion, and pacing directly from input text.

Downloads: 24 This Week

Last Update: 2026-04-28
19

emanote

Emanate a structured view of your plain-text notes

Emanate a structured view of your plain-text notes. Create beautiful websites, such as a personal webpage, blog, wiki, Zettelkasten, notebook, knowledge base, or documentation site, from future-proof plain-text notes and arbitrary data, with a live preview that updates in real time. Emanote is the spiritual successor to neuron, based on Ema. Emanote is written in Haskell. Thanks to Nix, this repository is pre-configured to provide a delightful development experience with full IDE support in Visual Studio Code.

Downloads: 0 This Week

Last Update: 2025-08-19
20

Swagger Editor

An editor designed for Swagger

Swagger Editor lets you edit Swagger API specifications in YAML inside your browser and preview the documentation in real time. Valid Swagger JSON descriptions can then be generated and used with the full Swagger tooling (code generation, documentation, etc.). swagger-editor is a traditional npm module intended for use in single-page applications that are capable of resolving dependencies (via Webpack, Browserify, etc.). swagger-editor-dist is a dependency-free module that includes everything...

Downloads: 24 This Week

Last Update: 2 days ago
21

GoAnime

A TUI tool to browse, stream, and download anime in PT-BR and EN

GoAnime is a command-line based anime streaming and downloading tool that provides a text-based user interface for browsing, selecting, and consuming anime content directly from the terminal. Built in Go, it emphasizes speed, simplicity, and portability across operating systems such as Windows, Linux, and macOS. The application works by scraping anime sources and presenting results in an interactive interface where users can search titles, navigate episode lists, and play content using an external media player like mpv. ...

Downloads: 30 This Week

Last Update: 2 days ago
22

ChatTTS webUI & API

A simple native web interface that uses ChatTTS to synthesize text

ChatTTS-ui is a local web interface and API wrapper around the ChatTTS speech synthesis system, designed to make advanced TTS models easy to use from a browser. It runs a small backend server (Python + Torch + ffmpeg) and exposes a simple webpage where you can type text, adjust parameters, and generate audio. The project supports Chinese, English, and mixed text with digits and control symbols, making it suitable for bilingual content and numerically heavy text like announcements or prompts....

Downloads: 13 This Week

Last Update: 2025-11-28
23

HunyuanCustom

Multimodal-Driven Architecture for Customized Video Generation

HunyuanCustom is a multimodal video customization framework by Tencent Hunyuan, aimed at generating customized videos featuring particular subjects (people, characters) under flexible conditions while maintaining subject and identity consistency. It supports conditioning via image, audio, video, and text, and can perform subject replacement in videos, generate avatars speaking given audio, or combine multiple subject images. The architecture builds on HunyuanVideo, with added modules for identity reinforcement and modality-specific condition injection. It includes a text-image fusion module based on LLaVA for improved multimodal understanding. ...

Downloads: 3 This Week

Last Update: 2025-10-15
24

Final Cut

A text-based widget toolkit

Library for creating terminal applications with text-based widgets. FINAL CUT is a C++ class library and widget toolkit with full mouse support for creating a text-based user interface. The library helps programmers develop applications for the text console. It allows multiple text windows to be handled on the screen simultaneously. The C++ class design of FINAL CUT was originally inspired by the structure of the Qt framework. ...

Downloads: 0 This Week

Last Update: 2024-07-27
25

International Components for Unicode

The home of the ICU project source code

...ICU is released under a nonrestrictive open-source license that is suitable for use with both commercial software and other open-source or free software. Convert text data to or from Unicode and nearly any other character set or encoding. ICU's conversion tables are based on charset data collected by IBM over the course of many decades and are the most complete available anywhere. Compare strings according to the conventions and standards of a particular language, region, or country. ICU's collation is based on the Unicode Collation Algorithm plus locale-specific comparison rules from the Common Locale Data Repository, a comprehensive source for this type of data.

Downloads: 15 This Week

Last Update: 2026-03-17