Showing 123 open source projects for "text t"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Video-subtitle-extractor

    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitle (hardsub) from videos

    ...Support GPU acceleration, after GPU acceleration, you can get higher accuracy and faster extraction speed. (CLI version) No need for users to manually set the subtitle area, the project automatically detects the subtitle area through the text detection model. Filter the text in the non-subtitle area and remove the watermark (station logo) text.
    Downloads: 68 This Week
    Last Update:
    See Project
  • 2
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...
    Downloads: 26 This Week
    Last Update:
    See Project
  • 4
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 9 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    CadQuery

    CadQuery

    A python parametric CAD scripting framework based on OCCT

    ...Create parametric models that can be very easily customized by end users. Output high-quality CAD formats like STEP and AMF in addition to traditional STL. Provide a non-proprietary, plain text model format that can be edited and executed with only a web browser. The scripts use a standard programming language, Python, and thus can benefit from the associated infrastructure. This includes many standard libraries and IDEs. CadQuery's CAD kernel Open CASCADE Technology (OCCT) is much more powerful than the CGAL used by OpenSCAD. ...
    Downloads: 53 This Week
    Last Update:
    See Project
  • 6
    AudioNotes

    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Text to Chord

    Text to Chord

    Turn words into chords

    Convert words and sentences to 5 note chords you can use to inspire music creation. Have fun turning your name, your city name, your friends' names, your team's name, your pet's name into wild and original harmonies that go beyond serialism and classic jazz.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 8
    IOPaint

    IOPaint

    Image inpainting tool powered by SOTA AI Model

    ...Designed to be fully self-hosted and flexible, IOPaint supports a variety of underlying generators and inpaint models — from LaMa erase networks to Stable Diffusion-based replace/object generation — giving users multiple ways to refine or reconstruct images by removing unwanted elements or expanding artwork beyond its original boundaries. Its feature set includes erasing people, watermarks, or defects, adding or replacing objects, applying text-aware edits, and extending images outward (outpainting) to fill contours or expand compositions.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 9
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps).
    Downloads: 4 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    rich

    rich

    Rich is a Python library for rich text and beautiful formatting

    ...Rich can be installed in the Python REPL, so that any data structures will be pretty printed and highlighted. As you might expect, this will print "Hello World!" to the terminal. Note that unlike the builtin print function, Rich will word-wrap your text to fit within the terminal width.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    asciinema

    asciinema

    Open source terminal session recorder

    ...Forget old screen recording methods and resulting blurry videos. asciinema lets you record your terminal sessions the right way, which is right where you work, in the terminal. Recording is as easy as running one command, and since it’s purely text-based you can copy and paste any content you want, simply pause the recording! You can also easily share your recordings on the web, embed an asciicast player in your blog post, project documentation page or in your conference talk slides. See plenty of example sessions recorded with asciinema here: https://asciinema.org/
    Downloads: 8 This Week
    Last Update:
    See Project
  • 14
    Dream Textures

    Dream Textures

    Stable Diffusion built-in to Blender

    Create textures, concept art, background assets, and more with a simple text prompt. Use the 'Seamless' option to create textures that tile perfectly with no visible seam. Texture entire scenes with 'Project Dream Texture' and depth to image. Re-style animations with the Cycles render pass. Run the models on your machine to iterate without slowdowns from a service. Create textures, concept art, and more with text prompts.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15
    ChatterBot

    ChatterBot

    Machine learning, conversational dialog engine for creating chat bots

    ...Additionally, the machine-learning nature of ChatterBot allows an agent instance to improve it’s own knowledge of possible responses as it interacts with humans and other sources of informative data. An untrained instance of ChatterBot starts off with no knowledge of how to communicate. Each time a user enters a statement, the library saves the text that they entered and the text that the statement was in response to. As ChatterBot receives more input the number of responses that it can reply increase.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    Pixoo

    Pixoo

    A library to help you make the most out of your Pixoo 64

    Pixoo is a Python-based library for controlling Divoom Pixoo LED displays using Bluetooth Low Energy (BLE). It allows users to send images, animations, or text to Pixoo devices, enabling creative integrations like desktop widgets, real-time data displays, or custom artwork.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    SCAIL

    SCAIL

    Towards Studio-Grade Character Animation via In-Context Learning of 3D

    ...While specific documentation about SCAIL’s exact goals and implementation is limited from the repository context alone, the project appears to be part of a collection of machine learning and AI research tools that facilitate scalable model development, evaluation, or application workflows. Given its listing alongside other ZAI projects like speech recognition and text-to-speech systems, SCAIL likely emphasizes scalable, composable AI learning frameworks that support researchers and practitioners in experimenting with learning algorithms, datasets, and model components. The repository structure suggests a focus on flexibility and extensibility, with potential integration into other ZAI tooling for training or analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Mozc Devices

    Mozc Devices

    Circuit diagrams and firmware source code for Gboard DIY keyboards

    ...Each subproject in the repository corresponds to a unique input device prototype, including versions such as the Drum Set, Morse Code, Patapata, Magic Hand, Piropiro, Physical Flick, Puchi Puchi, Nazoru, Mageru, Yunomi, Bar, Caps, Double Sided, and Dial editions. These devices creatively reinterpret how users can interact with Japanese text input, blending humor, engineering, and physical computing. The repository serves as an archive of the schematics, firmware, and PCB designs for these inventive input mechanisms, with many projects including promotional videos and technical references.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Super Tiny Icons

    Super Tiny Icons

    Super Tiny Icons are miniscule SVG versions of your favourite website

    ...Designers and developers can embed the icons directly, recolor them via CSS, or combine them in sprites without raster assets. The repository maintains consistent viewboxes and alignment so icons sit neatly alongside text and other UI elements. It is especially useful for landing pages, status banners, and mobile experiences where every byte matters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Scribus

    Scribus

    Powerful desktop publishing software

    Scribus is an Open Source program that brings professional page layout to Linux, BSD UNIX, Solaris, OpenIndiana, GNU/Hurd, Mac OS X, OS/2 Warp 4, eComStation, and Windows desktops with a combination of press-ready output and new approaches to page design. Underneath a modern and user-friendly interface, Scribus supports professional publishing features, such as color separations, CMYK and spot colors, ICC color management, and versatile PDF creation.
    Leader badge
    Downloads: 14,450 This Week
    Last Update:
    See Project
  • 21
    AnimateDiff

    AnimateDiff

    Plug-n-play module turning text-to-image models into animation

    AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models.
    Leader badge
    Downloads: 33 This Week
    Last Update:
    See Project
  • 22
    Asymptote

    Asymptote

    2D & 3D TeX-Aware Vector Graphics Language

    Asymptote is a powerful descriptive vector graphics language for technical drawing, inspired by MetaPost but with an improved C++-like syntax. Asymptote provides for figures the same high-quality typesetting that LaTeX does for scientific text.
    Leader badge
    Downloads: 259 This Week
    Last Update:
    See Project
  • 23
    xSTUDIO

    xSTUDIO

    xSTUDIO is a high performance playback and review tool.

    xSTUDIO is a high performance playback and review tool designed by and for Visual Effects, Animation and Post Production professionals. The application can load and play large collections of media files. The efficient playback engine allows you to quickly load and play high resolution image formats with a wide range of file formats and encoding. Intuitive tools allow you to create and organise playlists and media sub-sets within playlists to build interactive review sessions, image and video...
    Leader badge
    Downloads: 20 This Week
    Last Update:
    See Project
  • 24
    SpeeD ReaD ("Speedy Read-y")

    SpeeD ReaD ("Speedy Read-y")

    SpeeD ReaD is a little program to help you read faster.

    ...SpeeD ReaD forces you to simply comprehend words by sight. Additionally, as we read blocks of text on a screen or in a book, our eyes tend to vacillate between rapid movements (saccades), as we move to the next word, and stops, as we focus on a word. By showing each word in the same area of the screen, SpeeD ReaD eliminates those unnecessary eye movements. Try SpeeD ReaD today and find out how quickly you can read.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    myplayer Free Karaoke Software

    myplayer Free Karaoke Software

    myplayer Free Karaoke & Media Player Software (Myanmar)

    ...Karaoke device (PC) doesn't have the song you want to sing, but you can sing with the karaoke song file on your phone, so it's convenient if you have your favorite song on your phone. The playlist can also be managed up and down by hand, as well as emoji images from the mobile app. Fun (text) texts can also be displayed on the TV screen (player display) and can be created as a fun activity. Football match on TV screen in retail stores. the film, It is still possible to show ads while playing songs. VOD using myIPTV feature and m3u playlist file football match You can also watch and enjoy IPTV streams such as news. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB