Showing 46 open source projects for "text based"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 1
    Video-subtitle-extractor

    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitle (hardsub) from videos

    Video hard subtitle extraction, generate srt file. There is no need to apply for a third-party API, and text recognition can be implemented locally. A deep learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files. Use local OCR recognition, no need to set up and call any API, and do not need to access online OCR services such as Baidu and Ali to complete text recognition locally. ...
    Downloads: 39 This Week
    Last Update:
    See Project
  • 2
    IOPaint

    IOPaint

    Image inpainting tool powered by SOTA AI Model

    ...Designed to be fully self-hosted and flexible, IOPaint supports a variety of underlying generators and inpaint models — from LaMa erase networks to Stable Diffusion-based replace/object generation — giving users multiple ways to refine or reconstruct images by removing unwanted elements or expanding artwork beyond its original boundaries. Its feature set includes erasing people, watermarks, or defects, adding or replacing objects, applying text-aware edits, and extending images outward (outpainting) to fill contours or expand compositions.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 3
    CadQuery

    CadQuery

    A python parametric CAD scripting framework based on OCCT

    CadQuery is an intuitive, easy-to-use Python library for building parametric 3D CAD models. It has several goals. Build models with scripts that are as close as possible to how you’d describe the object to a human, using a standard, already established programming language. Create parametric models that can be very easily customized by end users. Output high-quality CAD formats like STEP and AMF in addition to traditional STL. Provide a non-proprietary, plain text model format that can be...
    Downloads: 47 This Week
    Last Update:
    See Project
  • 4
    Pixoo

    Pixoo

    A library to help you make the most out of your Pixoo 64

    Pixoo is a Python-based library for controlling Divoom Pixoo LED displays using Bluetooth Low Energy (BLE). It allows users to send images, animations, or text to Pixoo devices, enabling creative integrations like desktop widgets, real-time data displays, or custom artwork.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    asciinema

    asciinema

    Open source terminal session recorder

    ...Forget old screen recording methods and resulting blurry videos. asciinema lets you record your terminal sessions the right way, which is right where you work, in the terminal. Recording is as easy as running one command, and since it’s purely text-based you can copy and paste any content you want, simply pause the recording! You can also easily share your recordings on the web, embed an asciicast player in your blog post, project documentation page or in your conference talk slides. See plenty of example sessions recorded with asciinema here: https://asciinema.org/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    video-use

    video-use

    Edit videos with Claude Code

    Video Use is an open-source AI-powered video editing tool that allows users to transform raw footage into polished videos using natural language commands. Designed to work with Claude Code, it automates the entire editing process—from cutting clips to rendering the final output—without requiring manual timelines or complex software interfaces. The system intelligently analyzes audio transcripts and visual cues to make precise, context-aware editing decisions. It supports a wide range of...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 9
    Windrecorder

    Windrecorder

    Windrecorder is a memory search app by records everything

    ...It captures screen content locally and builds a searchable database using OCR and image understanding, allowing users to rewind and rediscover anything they have previously seen. The system indexes only meaningful visual changes, extracting text, browser data, and contextual information to improve search accuracy and reduce storage overhead. It includes a web-based interface where users can browse timelines, analyze activity, and perform semantic queries on recorded content. The tool emphasizes privacy by running entirely offline, ensuring that all captured data remains on the user’s device without external transmission. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    anchorcastapp

    anchorcastapp

    Free AI-powered church presentation & live sermon display app

    AnchorCast is a free, open-source AI-powered church presentation desktop app for Windows, built with Electron. Features: - Live Sermon Transcription — real-time speech-to-text via Whisper AI - AI Bible Verse Detection — automatically detects and displays verses from live sermons - Song Manager — display song lyrics on projection - Media Playback — images and video on projection screen - NDI Output — stream projection over local network - Remote Control — control presentation from...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    stmani3

    stmani3

    Stereo Photo Manipulation

    A set of programs for Alignment and Rendering of still Stereo Photos (3D). This is a Python3 updated version of the old StMani
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Color to Word

    Color to Word

    Turn colors into words

    The program will turn a color into a list of 10 words, obtained according to a custom designed algorithm based on letter shape and position in the alphabet. - Click inside the frame on the left to pick a color through the color chooser window - The program will match the color with the colors corresponding to a list of all the English words contained in the file wordcolor.txt - The first 10 matches will appear in the frame on the right - Right-click - Copy to copy the word matches and the RGB values This version comes with a text file (wordcolor.txt) containing all the English words followed by Red, Green, Blue channel values for the corresponding color. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Bhakti-Apps
    This app is meant for a simplified user experience, perfect for note takers. It has a good encryption system, and the best feature is that it works on the CLI, so its compatibility is great.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Nostalgic Photo DataBase (platform)

    Nostalgic Photo DataBase (platform)

    Active repository of jpeg-photos with tags suitable for personal needs

    NPDB offers a comprehensive platform for creating and maintaining a database of both old, digitized images and new snapshots captured by smartphones. This versatile system allows users to organize and search through their collection using customizable tags, catering to images of any vintage. One of NPDB's key features is its flexible tagging system, which allows users to categorize their images using an arbitrary set of tags tailored to their preferences. This intuitive approach streamlines...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    Midi_Master_Clock

    A simple command line Midi Master Clock

    .... - MIDI Sync Commands (Stop, Song Position Pointer (0), Start) to prepare your devices. - Accurate MIDI Clock signals based on the song's BPM, keeping your gear perfectly in sync.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    Clipstitch

    Uility to make home movies from your digital camera files

    Full documentation: Download clipstitchX.Y.html To make movies from your camera (or phone) video files. FFmpeg is a professional-quality, free, open-source program for video editing, with the ability to implement a huge number of operations and handle every data format! This kind of ability comes at a cost: its commands are quite complex-looking and difficult to use and remember. Clipstitch runs as a front-end to ffmpeg so that you use only the sub-set of ffmpeg commands necessary...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    DALL-E 2 - Pytorch

    DALL-E 2 - Pytorch

    Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

    Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as the denoising network) To train DALLE-2 is a 3 step process, with the training of CLIP being the most important. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    auto-subtitle

    auto-subtitle

    Automatically generate and overlay subtitles for any video

    auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    audio-diffusion-pytorch

    audio-diffusion-pytorch

    Audio generation using diffusion models, in PyTorch

    A fully featured audio diffusion library, for PyTorch. Includes models for unconditional audio generation, text-conditional audio generation, diffusion autoencoding, upsampling, and vocoding. The provided models are waveform-based, however, the U-Net (built using a-unet), DiffusionModel, diffusion method, and diffusion samplers are both generic to any dimension and highly customizable to work on other formats. Note: no pre-trained models are provided here, this library is meant for research purposes.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    AugLy

    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate the robustness gaps of your model! ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    video-to-ascii

    video-to-ascii

    It is a simple python package to play videos in the terminal

    ...It also includes color approximation using ANSI palettes to enhance visual fidelity within terminal limitations. Designed as both a creative and technical project, it demonstrates how video data can be transformed into text-based representations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Canorus

    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond
    Downloads: 17 This Week
    Last Update:
    See Project
  • 24

    TimingDrawer

    Text based timing diagram generator

    This tool generates timing diagrams for documenting hardware design. It reads the description from a text file with a simple syntax. It generates vector graphic (EPS, SVG or EMF format). It can be used in command line mode or with a GUI. It is written in Python and works on any platform.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    MysticBBS Shoutcast Panel

    MysticBBS Shoutcast Panel

    Python Script for Mystic BBS Shoutcast Dashboard

    ... - Contains simple code to grab XML/RSS feed and display on the BBS. - Contains various functions to make it easy to display information in a correctly sized mystic bbs text based frame including titles and prompts. This script can be a skeleton to get you going in writing your own properly fit modular text based python apps for Mystic BBS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB