Showing 1027 open source projects for "text to"

  • 1
    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitles (hardsubs) from videos

    ...Supports GPU acceleration, which gives higher accuracy and faster extraction. (CLI version) Users do not need to set the subtitle area manually; the project detects it automatically using a text detection model. It filters out text in non-subtitle areas and removes watermark (station logo) text.
    Downloads: 53 This Week
  • 2
    Subtitle Edit

    The subtitle editor

    Subtitle Edit (SE) is a free, open-source subtitle editor for creating, editing, synchronizing, and converting subtitles. It supports a wide range of formats (over 300) and offers both graphical and text-based editing views. Easy insertion, deletion, and shifting of subtitle lines. Portable versions are available (.NET 4.8, 32/64-bit); it runs on Windows and, via a compatibility layer, on Linux. Active development with frequent updates and issue tracking. Plugin support and rich editing tools (e.g., translation, spellcheck, sync).
    Downloads: 252 This Week
  • 3
    Koodo Reader

    A modern ebook manager and reader with sync and backup

    ...Customize the source folder and synchronize among multiple devices using OneDrive, iCloud, Dropbox, etc. Single-column, two-column, or continuous scrolling layouts. Text-to-speech, translation, progress slider, touch screen support, batch import. Add bookmarks, notes, and highlights to your books. Adjust font size, font family, line-spacing, paragraph spacing, background color, text color, margins, and brightness. Night mode and theme color. Text highlight, underline, boldness, italics and shadow.
    Downloads: 21 This Week
  • 4
    Frescobaldi

    LilyPond sheet music text editor

    Frescobaldi is a free and open source LilyPond sheet music text editor. Designed to be powerful yet lightweight and easy to use, Frescobaldi offers a host of useful features such as a music view with advanced two-way Point & Click, MIDI capturing to enter music, a Snippet Manager and many more. Frescobaldi is named after Girolamo Frescobaldi (1583-1643), an Italian composer of keyboard music in the late Renaissance and early Baroque period.
    Downloads: 48 This Week
  • 5
    Mermaid

    Diagram and flowchart generation from text similar to markdown

    Mermaid is a JavaScript-based diagram and flowchart generating tool that uses markdown-inspired text for fast and easy generation of diagrams and charts. Forget about using heavy tools to explain your code. Mermaid greatly simplifies documentation with its simple markdown-like script language, and offers a great range of diagram and chart options. The latest version of Mermaid comes with a number of bug fixes and enhancements, as well as a new diagram type, entity relationship diagrams. ...
    Downloads: 119 This Week
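
As a rough illustration of the text-to-diagram idea, the Python sketch below builds a small flowchart definition in Mermaid's `graph` syntax from a list of edges. The helper name `to_mermaid` and the node names are hypothetical; the result is plain text that still has to be rendered by Mermaid itself.

```python
def to_mermaid(edges, direction="TD"):
    """Return Mermaid flowchart text for (source, target) edge pairs.

    `to_mermaid` is an illustrative helper, not part of Mermaid.
    """
    lines = [f"graph {direction}"]  # TD = top-down layout, LR = left-to-right
    for source, target in edges:
        lines.append(f"    {source} --> {target}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical three-step pipeline used only as sample input.
    print(to_mermaid([("Draft", "Review"), ("Review", "Publish")]))
```

Because the diagram is ordinary text, it can be generated, diffed, and versioned like any other source file.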
  • 6
    Translate-Subtitle-File

    Subtitle Creation Assistant

    ...You can configure your own API key to use your own account's free quota, such as Tencent's free translation quota of 5 million characters per month or IBM's free speech-to-text quota of 500 minutes. (tern. best The domain name has expired and I don't want to renew it.) Azure speech-to-text and the free version of DeepL currently have problems, so it is normal that they do not work; please wait for a fix in the next version. Machine translation of subtitle files: use machine translation to process files.
    Downloads: 3 This Week
  • 7
    ShareX

    Screen capture, file sharing and productivity tool

    ...ShareX offers numerous methods of capture including full screen, monitor, freehand and so much more. It also features plenty of customizable after-capture tasks and supports more than 80 different image, text, and file hosting services, as well as many URL shortening and sharing services.
    Downloads: 53 This Week
  • 8
    AudioCraft

    AudioCraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. ...
    Downloads: 2 This Week
  • 9
    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 8 This Week
  • 10
    CadQuery

    A Python parametric CAD scripting framework based on OCCT

    ...Create parametric models that can be very easily customized by end users. Output high-quality CAD formats like STEP and AMF in addition to traditional STL. Provide a non-proprietary, plain text model format that can be edited and executed with only a web browser. The scripts use a standard programming language, Python, and thus can benefit from the associated infrastructure. This includes many standard libraries and IDEs. CadQuery's CAD kernel Open CASCADE Technology (OCCT) is much more powerful than the CGAL used by OpenSCAD. ...
    Downloads: 62 This Week
  • 11
    Paint.NET

    Downloads for Paint.NET, such as installer EXEs and portable ZIPs

    Every feature and user interface element was designed to be immediately intuitive and quickly learnable without assistance. In order to handle multiple images easily, it uses a tabbed document interface. The tabs display a live thumbnail of the image instead of a text description. This makes navigation very simple and fast. Extensive work has gone into making it the fastest image editor available. Starting the app is nearly instantaneous, and every feature has been thoroughly optimized to take advantage of the latest multicore CPUs, GPUs, and NVMe SSDs. The use of DXGI Flip Model ensures low input latency and reduced power consumption. ...
    Downloads: 129 This Week
  • 12
    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...
    Downloads: 7 This Week
  • 13
    PhotoEditor

    A Photo Editor library with simple, easy support for image editing

    A Photo Editor library with simple, easy support for image editing using paint, text, filters, emoji, and stickers, in the style of stories. Draw on the image with options for brush color, size, and opacity, plus erasing and basic shapes. Apply filter effects to the image using MediaEffect. Add or edit text, with options for color and custom fonts. Add emoji with custom emoji fonts. Add images and stickers.
    Downloads: 8 This Week
  • 14
    three.js

    JavaScript 3D library

    three.js is a lightweight and easy-to-use JavaScript 3D library with its own default WebGL renderer. It works by creating a scene with a camera and a geometric cube, the latter of which is added to the scene. This will become the animated object within the scene. A WebGL renderer is created for the scene and the camera, with an added viewport to the document.body element. Finally, the cube is animated within the scene for the camera. A plethora of examples can be found in the project’s...
    Downloads: 85 This Week
  • 15
    PixelIt

    ESP8266/ESP32 and WS2812B LED Matrix based PixelArt display

    PixelIt is an open-source firmware and hardware project designed to drive LED matrix displays using the ESP32 microcontroller. It's tailored for DIY enthusiasts and makers who want to create pixel art displays, show scrolling text, and integrate data from various APIs like weather or news. The project emphasizes customization and ease of use, allowing users to control displays via a web interface or REST API.
    Downloads: 6 This Week
  • 16
    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi compresses 24 kHz audio down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80 ms, the frame size), yet performs better than existing non-streaming codecs such as SpeechTokenizer (50 Hz, 4 kbps) or SemantiCodec (50 Hz, 1.3 kbps).
    Downloads: 4 This Week
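
The codec figures quoted for Mimi are related by simple arithmetic, which the Python sketch below works through. It assumes 1 kbps means 1000 bits per second; the function names are hypothetical and this is only a consistency check on the listed numbers, not part of Moshi.

```python
# Figures quoted in the listing for the Mimi codec.
SAMPLE_RATE_HZ = 24_000  # input audio sample rate
FRAME_RATE_HZ = 12.5     # latent frames per second
BITRATE_BPS = 1_100      # 1.1 kbps, assuming 1 kbps = 1000 bit/s


def frame_duration_ms(frame_rate_hz):
    """Duration of one latent frame in milliseconds."""
    return 1000.0 / frame_rate_hz


def samples_per_frame(sample_rate_hz, frame_rate_hz):
    """Number of input audio samples summarized by one latent frame."""
    return sample_rate_hz / frame_rate_hz


def bits_per_frame(bitrate_bps, frame_rate_hz):
    """Bit budget available to encode one latent frame."""
    return bitrate_bps / frame_rate_hz


if __name__ == "__main__":
    print(frame_duration_ms(FRAME_RATE_HZ))                   # frame size in ms
    print(samples_per_frame(SAMPLE_RATE_HZ, FRAME_RATE_HZ))   # samples per frame
    print(bits_per_frame(BITRATE_BPS, FRAME_RATE_HZ))         # bits per frame
```

A 12.5 Hz frame rate gives an 80 ms frame, which matches the stated latency; each frame covers 1920 input samples and has a budget of 88 bits.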
  • 17
    rich

    Rich is a Python library for rich text and beautiful formatting

    ...Rich can be installed in the Python REPL, so that any data structures will be pretty printed and highlighted. As you might expect, this will print "Hello World!" to the terminal. Note that unlike the builtin print function, Rich will word-wrap your text to fit within the terminal width.
    Downloads: 2 This Week
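
The word-wrapping behaviour mentioned above can be approximated with the standard library alone. The sketch below is not Rich's implementation or API; `wrap_to_terminal` is a hypothetical helper that wraps text to the detected terminal width, falling back to 80 columns when no terminal is attached.

```python
import shutil
import textwrap


def wrap_to_terminal(text, width=None):
    """Word-wrap `text` so that no line exceeds the terminal width."""
    if width is None:
        # Falls back to 80 columns when the size cannot be determined.
        width = shutil.get_terminal_size(fallback=(80, 24)).columns
    return textwrap.fill(text, width=width)


if __name__ == "__main__":
    print(wrap_to_terminal("Hello World! " * 20))
```

The builtin `print` would emit the same string as one long line and leave any wrapping to the terminal, which can split words mid-character-run.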
  • 18
    Image Toolbox

    Image Toolbox is a powerful picture editor, which can crop

    Image Toolbox is a powerful picture editor, which can crop, apply filters, add some drawings, erase background, edit EXIF, or even create a PDF file.
    Downloads: 22 This Week
  • 19
    D2

    D2 is a modern diagram scripting language that turns text to diagrams

    D2 is a diagram scripting language that turns text to diagrams. It stands for Declarative Diagramming. Declarative, as in, you describe what you want diagrammed, it generates the image. As well, the functioning of the install script is described in detail to alleviate any concern of its use. We recommend using your OS's package manager directly instead for improved security but the install script is by no means insecure.
    Downloads: 5 This Week
  • 20
    Laravolt Avatar

    Plug n play avatar, turn name, email, and other strings into avatars

    Laravolt Avatar is a PHP library for generating customizable placeholder avatars using text initials. Often used in user profile systems, it creates square avatars with randomly assigned or custom background colors and font settings. The package is framework-agnostic but integrates especially well with Laravel projects.
    Downloads: 0 This Week
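
The idea of turning a name into an initials avatar can be sketched in a few lines. The example is written in Python rather than PHP and does not reproduce Laravolt Avatar's code; the function names, the palette, and the hash-based color choice are all assumptions made for illustration (the package is described as assigning colors randomly or from custom settings).

```python
import hashlib


def initials(name, max_letters=2):
    """Return the upper-cased first letters of the first few words in `name`."""
    words = name.split()
    return "".join(word[0].upper() for word in words[:max_letters])


def background_color(name, palette):
    """Pick a palette entry that stays the same for a given name (an assumption)."""
    digest = hashlib.sha256(name.encode("utf-8")).digest()
    return palette[digest[0] % len(palette)]


if __name__ == "__main__":
    palette = ["#f44336", "#2196f3", "#4caf50"]  # hypothetical colors
    user = "Ada Lovelace"                        # sample input
    print(initials(user), background_color(user, palette))
```

Deriving the color from a hash keeps a user's avatar stable between requests without storing anything, at the cost of the randomness the package description mentions.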
  • 21
    Screenity

    The most powerful screen recorder & annotation tool for Chrome

    ...Annotate your screen to give feedback, emphasize your clicks, edit your recording, and much more. Make unlimited recordings of your tab, desktop, any application, and camera. Annotate by drawing anywhere on the screen, adding text, and creating arrows. Highlight your clicks, focus on your mouse, or hide it from the recording. Individual microphone and computer audio controls, push to talk, and more. Custom countdowns, show controls only on hover, and many other customization options. Export as mp4, gif, and webm, or save the video directly to Google Drive. ...
    Downloads: 11 This Week
  • 22
    DirectX-Graphics-Samples

    Samples that demonstrate how to build graphics intensive applications

    This repo contains the DirectX 12 Graphics samples that demonstrate how to build graphics-intensive applications for Windows 10. In the Samples directory, you will find samples that attempt to break off specific features and specific usage scenarios into bite-sized chunks. For example, the ExecuteIndirect sample will show you just enough about execute indirect to get started with that feature without diving too deep into multiengine whereas the nBodyGravity sample will delve into multiengine...
    Downloads: 66 This Week
  • 23
    PhotoEditor SDK

    PhotoEditor SDK: A fully customizable photo editor for your app

    ...PESDK is a powerful and customizable mobile photo editing framework developed by IMG.LY, enabling developers to add professional-grade photo editing capabilities to their apps. The demo illustrates features such as cropping, filtering, stickers, text overlays, adjustments, and more—all powered by a clean and user-friendly interface. It supports deep customization, allowing developers to tailor the experience to their branding and user needs while maintaining native performance.
    Downloads: 4 This Week
  • 24
    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an...
    Downloads: 0 This Week
  • 25
    Radegast

    Lightweight client for connecting to Second Life and OpenSim

    ...There is a strong focus on accessibility and non-3D interaction. Given the current nature of changes in Second Life, I felt it was prudent to take on another abandoned text-focused viewer. Introducing MEGAbolt, a fork of the METAbolt viewer which was abandoned by its author almost eight years ago. Keep in mind, this is the first version and much has changed under the hood. It has been brought up to date and only supports Windows x64 currently and is built against the .NET 5 platform. Like Radegast, it utilizes the LibreMetaverse library and now shares much of the same code as Radegast does, which should make maintaining both viewers more manageable.
    Downloads: 49 This Week