Search Results for "python text" - Page 30

Showing 1913 open source projects for "python text"

1

Nostalgic Photo DataBase (platform)

Active repository of JPEG photos with tags, suitable for personal needs

NPDB offers a comprehensive platform for creating and maintaining a database of both old, digitized images and new snapshots captured by smartphones. This versatile system allows users to organize and search through their collection using customizable tags, catering to images of any vintage. One of NPDB's key features is its flexible tagging system, which allows users to categorize their images using an arbitrary set of tags tailored to their preferences. This intuitive approach streamlines...

Downloads: 0 This Week

Last Update: 5 days ago
See Project
2

TensorFlow Hub

A library for transfer learning by reusing parts of TensorFlow models

TensorFlow Hub is a repository that provides a library and platform for publishing, discovering, and reusing pre-trained machine learning models built with TensorFlow. The project enables developers to integrate high-quality models into their applications without needing to train them from scratch. Through TensorFlow Hub, researchers and practitioners can share reusable model components such as image classifiers, text embedding models, and object detection networks. These models can be...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
3

pdf combiner merger converter splitter

PDF Combiner is a user-friendly, GUI-based tool built in

PDF Combiner is a user-friendly, free, open source, GUI-based tool for combining, annotating, and splitting PDF files, converting PDF to Excel, PDF to Word, and images to PDF, and zipping and unzipping files. It is easy to use, supports inserting, deleting, and processing multiple files, and lets you adjust the order of files before combining.

1 Review

Downloads: 4 This Week

Last Update: 2024-05-03
See Project
4

clavirio

Learn touch typing without leaving the terminal

A free, open-source typing tutor for the terminal. Progressive lessons, practice modes, real-time stats, and a virtual keyboard with finger hints, for QWERTY, Dvorak, and Colemak. Methodology: Research on typing skill suggests that skilled typing relies more on implicit procedural control than on explicit knowledge of key locations. The paper also suggests that the keyboard is represented in terms of its row-and-column structure, not as a memorized list of individual...

Downloads: 3 This Week

Last Update: 2026-03-28
See Project
5

Obsei

Obsei is a low-code, AI-powered automation tool

Obsei is an automated no-code/low-code AI-powered text observation and analysis framework, designed for extracting insights from unstructured text data such as social media, reviews, and logs.

Downloads: 6 This Week

Last Update: 2025-01-24
See Project
6

Ailice

AIlice is a fully autonomous, general-purpose AI agent

AIlice is an open-source autonomous AI agent framework built to function as a general-purpose assistant that can plan, decompose, and execute complex tasks through a structured multi-agent architecture. The project presents itself as a standalone assistant powered by open-source language models, with an internal design that treats user requests almost like executable programs rather than simple chat prompts. Its core IACT architecture allows the system to break large goals into smaller...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
7

Wikipedia2Vec

A tool for learning vector representations of words and entities

Wikipedia2Vec is an embedding learning tool that creates word and entity vector representations from Wikipedia, enabling NLP models to leverage structured and contextual knowledge.

Downloads: 2 This Week

Last Update: 2025-01-24
See Project
8

YAYI

Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM

YAYI is an open-source large language model project developed to provide a multilingual conversational AI system capable of performing a wide variety of natural language processing tasks. The model is trained on diverse datasets covering multiple languages and domains so that it can support applications ranging from dialogue systems to text analysis and knowledge retrieval. The architecture is based on transformer-style language models optimized for conversational understanding and...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
9

EmotiVoice

Multi-Voice and Prompt-Controlled TTS Engine

...EmotiVoice provides multiple ways to interact with it, including a web interface, a Docker image, an HTTP API (including an OpenAI-compatible TTS API), and Python scripts for batch synthesis. It also supports voice cloning with your own data, backed by recipes for popular datasets like DataBaker and LJSpeech, so you can train or adapt voices to custom personas.

Downloads: 2 This Week

Last Update: 2025-11-30
See Project
10

Transformers4Rec

Transformers4Rec is a flexible and efficient library

Transformers4Rec is an advanced recommendation system library that leverages Transformer models for sequential and session-based recommendations. The library works as a bridge between natural language processing (NLP) and recommender systems (RecSys) by integrating with one of the most popular NLP frameworks, Hugging Face Transformers (HF). Transformers4Rec makes state-of-the-art transformer architectures available for RecSys researchers and industry practitioners. Traditional recommendation...

Downloads: 6 This Week

Last Update: 2025-01-24
See Project
11

ddgr

DuckDuckGo from the terminal

ddgr is a command-line search tool that lets users perform DuckDuckGo web searches directly from their terminal, providing a privacy-focused alternative to browser-based searches without tracking or personalized profiling. It fetches search results via DuckDuckGo’s API or HTML output and presents links, snippets, and metadata in a clean terminal format, making it useful for programmers, sysadmins, and privacy advocates who prefer keyboard-driven workflows. The tool also supports options like...

Downloads: 11 This Week

Last Update: 2026-01-26
See Project
12

HelPyLite!

A Tkinter UI for Python's help() utility.

.... - Save the documentation to a text file, for future reference, notes, etc.

Downloads: 0 This Week

Last Update: 2024-03-22
See Project
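
The "save the documentation to a text file" feature in the HelPyLite! entry above can be approximated with the standard library alone. The following is a minimal sketch, not HelPyLite!'s own code: the function name `save_help` and the output path are hypothetical, and it assumes the same text that `help()` would show is what you want on disk.

```python
import pydoc
from pathlib import Path


def save_help(target, path):
    """Write the help() text for `target` to a plain-text file.

    `target` may be an object or a dotted name such as "json.dumps".
    `save_help` is a hypothetical helper, not part of HelPyLite!.
    """
    # pydoc.plaintext renders without the overstrike bold used for terminals.
    text = pydoc.render_doc(target, renderer=pydoc.plaintext)
    Path(path).write_text(text, encoding="utf-8")
    return text


if __name__ == "__main__":
    save_help("json.dumps", "json_dumps_help.txt")
```

Passing a dotted name as a string lets `pydoc` import and resolve the object itself, which mirrors typing `help("json.dumps")` at the interpreter prompt.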
13

GPT-2 Output Dataset

Dataset of GPT-2 outputs for research in detection, biases, and more

The GPT-2 Output Dataset is a large collection of model-generated text, released by OpenAI alongside the GPT-2 research paper to study the behaviors and limitations of large language models. It contains 250,000 samples of GPT-2 outputs, generated with different sampling strategies such as top-k truncation, to highlight the diversity and quality of model completions. The dataset also includes corresponding human-written text for comparison, enabling researchers to explore methods for...

Downloads: 0 This Week

Last Update: 4 days ago
See Project
14

DB-GPT-Hub

A repository that contains models, datasets, and fine-tuning

DB-GPT-Hub is an open-source repository that provides datasets, models, and training tools designed to improve large language models for database interaction tasks, particularly Text-to-SQL. The project serves as a specialized extension of the broader DB-GPT ecosystem, focusing on the preparation and evaluation of models capable of translating natural language questions into structured database queries. It offers a modular framework that supports data preparation, model fine-tuning,...

Downloads: 3 This Week

Last Update: 2026-03-06
See Project
15

Coqui TTS

A deep learning toolkit for Text-to-Speech, battle-tested in research

TTS is a library for advanced Text-to-Speech generation. It is built on the latest research and was designed to achieve the best trade-off among ease of training, speed, and quality. TTS comes with pre-trained models and tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. High-performance Deep Learning models for Text2Speech tasks. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings...

Downloads: 18 This Week

Last Update: 2023-12-12
See Project
16

pytextgame

Game development library

PyTextGame is a Python library for developing text-based (console-based) games. It contains several features that, according to the project, make game development 3x faster.

Downloads: 0 This Week

Last Update: 2024-02-26
See Project
17

text-dedup

All-in-one text de-duplication

text-dedup is a Python library that enables efficient deduplication of large text corpora by using MinHash and other probabilistic techniques to detect near-duplicate content. This is especially useful for NLP tasks where duplicated training data can skew model performance. text-dedup scales to billions of documents and offers tools for chunking, hashing, and comparing text efficiently with low memory usage.

Downloads: 0 This Week

Last Update: 2025-04-08
See Project
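
The MinHash technique named in the text-dedup entry above can be illustrated in a few lines of plain Python. This is a minimal sketch under stated assumptions, not text-dedup's API or implementation: the function names, the 3-word shingle size, and the choice of 64 seeded BLAKE2b hashes are all illustrative. Each document is reduced to a short signature, and the fraction of matching signature slots estimates the Jaccard similarity of the two shingle sets.

```python
import hashlib
import re


def shingles(text, k=3):
    """Return the set of k-word shingles in `text`, lowercased."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < k:
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def seeded_hash(seed, item):
    """64-bit hash of `item`; the seed is passed to BLAKE2b as its salt."""
    digest = hashlib.blake2b(
        item.encode("utf-8"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "little")


def minhash_signature(text, num_hashes=64, k=3):
    """Keep the minimum hash value per seeded hash function."""
    items = shingles(text, k)
    if not items:
        return [0] * num_hashes
    return [min(seeded_hash(seed, s) for s in items)
            for seed in range(num_hashes)]


def estimated_jaccard(sig_a, sig_b):
    """Fraction of equal slots, an estimate of Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


if __name__ == "__main__":
    a = "the quick brown fox jumps over the lazy dog near the river bank"
    b = "the quick brown fox jumps over the lazy dog near the river shore"
    print(estimated_jaccard(minhash_signature(a), minhash_signature(b)))
```

Comparing fixed-length signatures instead of full shingle sets is what keeps memory use low; production tools typically add locality-sensitive hashing on top so that not every pair of documents has to be compared.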
18

CLIP-as-service

Embed images and sentences into fixed-length vectors

CLIP-as-service is a low-latency, high-scalability service for embedding images and text. It can be easily integrated as a microservice into neural search solutions. Serve CLIP models with TensorRT, ONNX Runtime, and PyTorch without JIT at a reported 800 QPS. Non-blocking duplex streaming on requests and responses, designed for large data and long-running tasks. Horizontally scale multiple CLIP models up and down on a single GPU, with automatic load balancing. Easy-to-use. No learning curve, minimalist...

Downloads: 0 This Week

Last Update: 2023-12-20
See Project
19

Style Aligned

Official code for Style Aligned Image Generation via Shared Attention

StyleAligned is a diffusion-model editing technique and codebase that preserves the visual “style” of an original image while applying new semantic edits driven by text. Instead of fully re-generating an image—and risking changes to lighting, texture, or rendering choices—the method aligns internal features across denoising steps so the target edit inherits the source style. This alignment acts like a constraint on the model’s evolution, steering composition, palette, and brushwork even as...

Downloads: 0 This Week

Last Update: 2025-10-10
See Project
20

towhee

Framework that is dedicated to making neural data processing

Towhee is an open-source machine-learning pipeline that helps you encode your unstructured data into embeddings. You can use our Python API to build a prototype of your pipeline and use Towhee to automatically optimize it for production-ready environments. From images to text to 3D molecular structures, Towhee supports data transformation for nearly 20 different unstructured data modalities. We provide end-to-end pipeline optimizations, covering everything from data decoding/encoding, to model inference, making your pipeline execution 10x faster. ...

Downloads: 1 This Week

Last Update: 2023-12-05
See Project
21

TextGen

textgen, Text Generation models

Implementation of text generation models. textgen implements a variety of text generation models out of the box, including UDA, GPT2, Seq2Seq, BART, T5, and SongNet. UDA: non-core word replacement. EDA: a simple data augmentation technique (similar-word and synonym replacement, random word insertion, deletion, and replacement). This project draws on Google's UDA (non-core word replacement) algorithm and the EDA algorithm, using TF-IDF to replace some unimportant words in sentences with...

Downloads: 3 This Week

Last Update: 2023-11-02
See Project
22

Text to Waveform

Create synth presets from words

Convert words to waveforms you can load into a synthesizer oscillator to create synth presets. Have fun turning your name, your friends' names, your city name, your pet's name, your team's name into synth presets you can use to produce a track.

Downloads: 0 This Week

Last Update: 2023-12-09
See Project
23

VALL-E X

Open source implementation of Microsoft's VALL-E X zero-shot TTS model

VALL-E-X is an open-source implementation of Microsoft’s VALL-E X zero-shot text-to-speech model, focused on multilingual, cross-lingual voice cloning. It is capable of synthesizing speech in English, Chinese, and Japanese from text while mimicking the voice characteristics of a speaker given only a short 3–10 second prompt. The model attempts to match not just timbre, but also tone, pitch, emotion, and prosody of the reference audio, resulting in highly personalized output. ...

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
24

Shap-E

Generate 3D objects conditioned on text or images

The shap-e repository provides the official code and model release for Shap-E, a conditional generative model designed to produce 3D assets (implicit functions, meshes, neural radiance fields) from text or image prompts. The model is built with a two-stage architecture: first an encoder that maps existing 3D assets into parameterizations of implicit functions, and then a conditional diffusion model trained on those parameterizations to generate new assets. Because it works at the level of...

Downloads: 1 This Week

Last Update: 2025-10-02
See Project
25

MahaKurawa.My.ID URL Extractor

MahaKurawa.My.ID URL Extractor is a simple tool to extract unique URLs

MahaKurawa.My.ID URL Extractor is a simple tool for instantly extracting unique URLs from any text content. It is useful when you would rather not identify and copy-paste URLs from your content one by one.

Downloads: 0 This Week

Last Update: 2024-05-01
See Project
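
The idea behind the URL extractor entry above, pulling unique URLs out of arbitrary text, can be sketched with a regular expression. This is an illustrative approximation, not the tool's own code: the pattern, the function name `extract_unique_urls`, and the decision to handle only `http` and `https` links are assumptions.

```python
import re

# Assumed pattern: http(s) URLs up to whitespace or a common closing delimiter.
URL_PATTERN = re.compile(r"https?://[^\s<>\"')\]]+")


def extract_unique_urls(text):
    """Return URLs found in `text`, de-duplicated, in first-seen order."""
    seen = set()
    urls = []
    for match in URL_PATTERN.findall(text):
        # Drop sentence punctuation that tends to trail a pasted URL.
        url = match.rstrip(".,;:!?")
        if url and url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


if __name__ == "__main__":
    sample = "See https://example.com/a, then http://example.org/b?x=1."
    for url in extract_unique_urls(sample):
        print(url)
```

A regular expression this simple will miss bare domains and may mis-trim URLs that legitimately end in punctuation, so treat it as a starting point rather than a complete parser.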