Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "python text" - Page 6

x

Sort By:

Relevance

Clear All Filters

OS

ChromeOS 604
Linux 603
Windows 603
More...
Mac 602
BSD 600
Desktop Operating Systems 8
Mobile Operating Systems 5
Game Consoles 2
Server Operating Systems 1

Category

Artificial Intelligence 275
Text Editors 118
Software Development 105
Internet 55
Business 37
Multimedia 37
Scientific/Engineering 26
Games 24
Communications 23
Formats and Protocols 20
Education 17
System 15
Security 9
Database 7
Desktop Environment 6
Printing 6
Productivity 2
Terminals 2
Blockchain 1
Social sciences 1

License

OSI-Approved Open Source 550
Creative Commons Attribution License 8
Public Domain 8
Other License 4

Translations

Programming Language

Status

Production/Stable 105
Beta 77
Alpha 52
Pre-Alpha 28
More...
Planning 24
Mature 16
Inactive 8

604 projects for "python text" with 1 filter applied:

ChromeOS Clear Filters & Widen Search

Auth0 B2B Essentials: SSO, MFA, and RBAC Built In
Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.

Sign Up Free
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
1

DocETL

A system for agentic LLM-powered data processing and ETL

DocETL is an open-source system designed to build and execute data processing pipelines powered by large language models, particularly for analyzing complex collections of documents and unstructured datasets. The platform allows developers and researchers to construct structured workflows that extract, transform, and organize information from sources such as reports, transcripts, legal documents, and other text-heavy data. Instead of relying on single prompts or ad-hoc scripts, DocETL...

Downloads: 6 This Week

Last Update: 2026-03-05
See Project
2

Open-LLM-VTuber

Open source AI VTuber platform with voice chat and Live2D avatars

Open-LLM-VTuber is an open source platform designed to create AI-powered VTuber characters that can interact with users through voice and animated avatars. It enables hands-free conversations with large language models by combining speech recognition, language processing, and text-to-speech synthesis into a single system. Users can speak directly to the AI character, and the system can respond with a generated voice while animating a Live2D avatar to simulate a talking virtual personality....

Downloads: 26 This Week

Last Update: 2026-03-17
See Project
3

DeepSeek-OCR 2

Visual Causal Flow

DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents...

Downloads: 7 This Week

Last Update: 2026-02-03
See Project
4

CosyVoice

Multi-lingual large voice generation model, providing inference

CosyVoice is a multilingual large voice generation model that offers a full-stack solution for training, inference, and deployment of high-quality TTS systems. The model supports multiple languages, including Chinese, English, Japanese, Korean, and a range of Chinese dialects such as Cantonese, Sichuanese, Shanghainese, Tianjinese, and Wuhanese. It is designed for zero-shot voice cloning and cross-lingual or mix-lingual scenarios, so a single reference voice can be used to synthesize speech...

Downloads: 3 This Week

Last Update: 2025-11-30
See Project
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
5

Machine learning basics

Plain python implementations of basic machine learning algorithms

...The repository includes notebooks that demonstrate classic algorithms such as linear regression, logistic regression, k-nearest neighbors, decision trees, support vector machines, and clustering techniques. Each notebook typically combines explanatory text, Python code, and visualizations to illustrate how the algorithm operates and how it can be applied to datasets.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
6

Loggifly

Get Alerts from your Docker Container Logs

LoggiFly is a lightweight, open-source monitoring tool designed to watch Docker container logs in real time and trigger alerts, notifications, or automated actions based on predefined keywords or regular expression patterns. Instead of manually scanning logs for issues or relying solely on centralized monitoring stacks, LoggiFly proactively inspects streams of container output and notifies users through services like Ntfy, Slack, Discord, Telegram, or webhooks when significant events occur....

Downloads: 10 This Week

Last Update: 2026-05-02
See Project
7

GLM-4.6V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...

Downloads: 2 This Week

Last Update: 2026-04-06
See Project
8

Book4_Power-of-Matrix

Book_4_Matrix Power | The Iris Book: From Addition, Subtraction

...The project is designed to help readers progress from basic arithmetic toward machine learning fundamentals by building a strong conceptual understanding of vectors, matrices, and their operations. It combines explanatory text, diagrams, and Python examples to bridge theory and practical computation. The material emphasizes geometric interpretation and visual reasoning, which makes abstract linear algebra topics more accessible to beginners and self-learners. The repository is continuously updated and intended to accompany the broader Visualize-ML learning ecosystem. ...

Downloads: 0 This Week

Last Update: 2026-05-01
See Project
9

ESPnet

End-to-end speech processing toolkit

ESPnet is a comprehensive end-to-end speech processing toolkit covering a wide spectrum of tasks, including automatic speech recognition (ASR), text-to-speech (TTS), speech translation (ST), speech enhancement, speaker diarization, and spoken language understanding. It uses PyTorch as its deep learning engine and adopts a Kaldi-style data processing pipeline for features, data formats, and experimental recipes. This combination allows researchers to leverage modern neural architectures while...

Downloads: 0 This Week

Last Update: 2026-04-22
See Project
Earn up to 16% annual interest with Nexo.
More flexibility. More control.

Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
10

Dia

A TTS model capable of generating ultra-realistic dialogue

Dia is a neural text-to-speech model designed specifically for generating ultra-realistic dialogue in a single pass. Instead of focusing on isolated sentences or flat narration, it is optimized for conversational audio, complete with natural turn-taking, prosody, and pacing. The model can be conditioned on a reference audio sample, allowing you to control emotion, tone, and other stylistic aspects of the speech. It can also produce nonverbal vocalizations like laughter, coughs, clearing the...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
11

MCP UI

SDK for building interactive UI components over MCP for AI tools

...It includes both client-side and server-side SDKs, making it possible to define UI elements on the backend and handle user interactions on the frontend. It supports multiple programming environments, including TypeScript, Python, and Ruby, broadening its accessibility for developers.

Downloads: 3 This Week

Last Update: 3 days ago
See Project
12

TTS WebUI

A single Gradio + React WebUI with extensions for ACE-Step

TTS-WebUI is a unified Gradio + React web interface that brings together a large ecosystem of text-to-speech, voice conversion, and audio generation models under a single UI. It supports a wide range of models such as Bark, MusicGen, Tortoise, RVC, StyleTTS2, ParlerTTS, CosyVoice, XTTSv2, Stable Audio, SeamlessM4T, and many others, exposing them as interchangeable backends for speech and music synthesis. The project provides an installer that sets up Conda, Python environments, and all necessary dependencies, so users can focus on experimenting with voices instead of managing tooling. ...

Downloads: 4 This Week

Last Update: 2026-04-30
See Project
13

MOSS-TTS-Nano

MOSS-TTS-Nano is an open-source multilingual tiny speech generation

MOSS-TTS-Nano is a lightweight text-to-speech model designed for real-time voice generation in resource-constrained environments. It is part of the broader MOSS-TTS family and focuses on delivering high-quality speech synthesis with a compact architecture. The model operates efficiently on CPU-only systems, enabling deployment without specialized hardware. It supports multilingual voice cloning and produces high-fidelity audio with low latency. The system uses an autoregressive audio...

Downloads: 9 This Week

Last Update: 7 days ago
See Project
14

Hugging Face - Speech To Speech

Open speech-to-speech models and pipelines by Hugging Face toolkit AI

This project from Hugging Face focuses on enabling direct speech-to-speech processing using modern machine learning models. It provides tools and reference implementations that allow audio input to be transformed into audio output without requiring an intermediate text representation. Hugging Face - Speech To Speech builds on recent advances in speech modeling, combining components such as speech recognition, translation, and synthesis into unified pipelines. It is designed to help...

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
15

Natural Language Toolkit

NLTK Source

The Natural Language Toolkit (NLTK) is a widely used open-source Python library designed for working with human language data and building natural language processing (NLP) applications. It provides a comprehensive suite of modules, datasets, and tutorials that support both symbolic and statistical approaches to language processing. The toolkit includes implementations of many foundational NLP algorithms and utilities, enabling developers to perform tasks such as tokenization, stemming,...

Downloads: 0 This Week

Last Update: 2026-03-24
See Project
16

LongWriter

Unleashing 10,000+ Word Generation from Long Context LLMs

LongWriter is an open-source framework and set of large language models designed to enable ultra-long text generation that can exceed 10,000 words while maintaining coherence and structure. Traditional large language models can process large inputs but often struggle to generate long outputs due to limitations in training data and alignment strategies. LongWriter addresses this challenge by introducing a specialized dataset and training approach that encourages models to produce longer...

Downloads: 1 This Week

Last Update: 2026-03-06
See Project
17

ChatTTS_colab

One-click deployment (including offline integration package)

ChatTTS_colab is a wrapper project around the ChatTTS model that focuses on “one-click” deployment, especially in Google Colab. It provides an integrated offline bundle and scripts for Windows and macOS so users can run ChatTTS locally without wrestling with complex environment setup. The repository includes Colab notebooks that launch a Gradio-based web UI and expose streaming TTS, making it possible to listen to generated audio as it is produced. A distinctive feature is the “voice gacha”...

Downloads: 2 This Week

Last Update: 2025-11-28
See Project
18

openctp

Provides CTP stock options and Zhongtai Securities XTP

openctp is a technical service platform built around the CTP trading ecosystem that provides CTP compatible interfaces for a wide range of brokerage backends and markets. Its core idea is to wrap heterogeneous stock and derivatives trading gateways such as Zhongtai XTP, Huaxin Qidian TORA, and others with CTPAPI compatible interfaces, so existing CTP programs can connect simply by swapping dynamic libraries rather than rewriting code. The project offers a comprehensive simulation environment...

Downloads: 0 This Week

Last Update: 2026-01-08
See Project
19

Universal Tool Calling Protocol (UTCP)

Official python implementation of UTCP. UTCP is an open standard

The python-utcp repository is the official Python SDK implementation of the Universal Tool Calling Protocol (UTCP). UTCP is an open, modern standard designed to let AI agents call any tool or API directly—over HTTP, CLI, WebSocket, gRPC, and more—without the overhead of extra wrapper layers or middleware. It leverages a modular, plugin-based architecture built around Pydantic models and separates the core functionality into a lightweight client and extensible protocol plugins, enabling...

Downloads: 1 This Week

Last Update: 2 days ago
See Project
20

WhisperJAV

Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

WhisperJAV is an open-source speech transcription pipeline designed specifically for generating subtitles for Japanese adult video content. The project addresses challenges that standard speech recognition models face when transcribing this type of audio, which often includes low signal-to-noise ratios and large numbers of non-verbal vocalizations. Traditional automatic speech recognition systems can misinterpret these sounds as words, leading to inaccurate transcripts. WhisperJAV introduces...

Downloads: 12 This Week

Last Update: 1 day ago
See Project
21

Claude Cookbooks

A collection of notebooks/recipes showcasing ways of using Claude

...The repository includes structured examples for integrating Claude with external tools, databases, and APIs, showcasing how to extend its functionality beyond basic text generation. It also covers advanced techniques like sub-agent orchestration, prompt optimization, and automated evaluation workflows. The content is organized into thematic sections, allowing users to explore specific capabilities or integration patterns systematically. Designed with accessibility in mind, the examples are primarily written in Python but can be adapted to other languages.

Downloads: 2 This Week

Last Update: 4 days ago
See Project
22

Claude Code Tools

Practical productivity tools for Claude Code, Codex-CLI

Claude Code Tools is an open-source collection of command-line utilities and productivity plugins designed to enhance developer workflows when using AI coding agents such as Claude Code and Codex-CLI. The project focuses on solving common problems encountered in AI-assisted development environments, including managing session history, automating terminal interactions, and maintaining context across multiple coding sessions. It includes tools that allow developers to search conversation logs...

Downloads: 17 This Week

Last Update: 2026-03-26
See Project
23

LOTUS

AI-Powered Data Processing: Use LOTUS to process all of your datasets

...The system provides a declarative programming model that allows developers to express complex AI data operations using high-level commands rather than manually orchestrating model calls. It offers a Python interface with a Pandas-like API, making it familiar for data scientists and engineers already working with data analysis libraries. The core concept of the framework is the use of semantic operators, which extend traditional relational database operations to support reasoning over text and other unstructured data. These operators allow tasks such as semantic filtering, ranking, clustering, and summarization to be expressed directly within data processing pipelines. ...

Downloads: 2 This Week

Last Update: 2026-03-06
See Project
24

MEDIUM_NoteBook

Repository containing notebooks of my posts on Medium

...The project is useful for learners who want to explore machine learning concepts interactively using Python and common data science libraries.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
25

Engram

A New Axis of Sparsity for Large Language Models

Engram is a high-performance embedding and similarity search library focused on making retrieval-augmented workflows efficient, scalable, and easy to adopt by developers building search, recommendation, or semantic matching systems. It provides utilities to generate embeddings from text or other structured data, index them using efficient approximate nearest neighbor algorithms, and perform real-time similarity queries even on large corpora. Engineered with speed and memory efficiency in...

Downloads: 0 This Week

Last Update: 2026-01-28
See Project

Previous
2
3
4
5
You're on page 6
7
8
9
10
Next

Related Searches

ocr

machine learning

dia

tts

whisperjav

claude

tesseract-ocr-w64-setup.exe

tesseract-ocr-w64-setup-5.5.0.20241111.exe

scan

portable subtitle downloader

Related Categories

Artificial Intelligence

Text Editors

Software Development

Internet

Business

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise