Showing 3499 open source projects for "text based"

View related business solutions
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    textlint

    textlint

    The pluggable natural language linter for text and markdown

    Textlint is an extensible linting tool for text and markdown files, designed to enforce style guidelines, detect errors, and improve writing quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    HunyuanWorld 1.0

    HunyuanWorld 1.0

    Generating Immersive, Explorable, and Interactive 3D Worlds

    HunyuanWorld-1.0 is an open-source, simulation-capable 3D world generation model developed by Tencent Hunyuan that creates immersive, explorable, and interactive 3D environments from text or image inputs. It combines the strengths of video-based diversity and 3D-based geometric consistency through a novel framework using panoramic world proxies and semantically layered 3D mesh representations. This approach enables 360° immersive experiences, seamless mesh export for graphics pipelines, and disentangled object representations for enhanced interactivity. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    AutoCut

    AutoCut

    Cut videos with a text editor

    ...AutoCut supports multiple transcription backends, including Whisper and faster-whisper modes, allowing users to choose based on speed or accuracy needs. After editing the transcript text, the corresponding video clips are merged into the final output, and the tool also produces matching subtitle files. Its command-line interface can be integrated into scripts, making it suitable for automated workflows or batch processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    SimpleHTR

    SimpleHTR

    Handwritten Text Recognition (HTR) system implemented with TensorFlow

    SimpleHTR is an open-source implementation of a handwriting text recognition system based on deep learning techniques. The project focuses on converting images of handwritten text into machine-readable digital text using neural networks. The system uses a combination of convolutional neural networks and recurrent neural networks to extract visual features and model sequential character patterns in handwriting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Perlite

    Perlite

    A web-based markdown viewer optimized for Obsidian

    A web-based markdown viewer optimized for Obsidian Notes. Just put your whole Obsidian vault or markdown folder/file structure in your web directory. The page builds itself. It's an open source alternative to Obsidian Publish.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Step-Video-T2V

    Step-Video-T2V

    State-of-the-art (SoTA) text-to-video pre-trained model

    Step-Video-T2V is a state-of-the-art text-to-video foundation model developed to generate videos from natural-language prompts; its 30B-parameter architecture is designed to produce coherent, temporally extended video sequences — up to around 204 frames — based on input text. Under the hood it uses a compressed latent representation (a Video-VAE) to reduce spatial and temporal redundancy, and a denoising diffusion (or similar) process over that latent space to generate smooth, plausible motion and visuals. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Polyglot

    Polyglot

    Cross-platform AI language practice app

    Polyglot is a cross platform AI language practice application that runs as a desktop app and also offers a web version. It is built around conversational large language models and Azure based text to speech services, turning them into an interactive environment for speaking practice in multiple languages. Users can define custom AI personas, choose languages, and configure their own OpenAI and Azure keys so they retain control over which backends they use. The app supports speech recognition with quick keyboard shortcuts, allowing learners to hold down a key to speak and release it to submit for recognition and response. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    HeartMuLa

    HeartMuLa

    A Family of Open Sourced Music Foundation Models

    ...The project also includes HeartCodec, a music codec optimized for high reconstruction fidelity, enabling efficient tokenization and reconstruction workflows that are critical for training and generation pipelines. For text extraction from audio, it provides HeartTranscriptor, a Whisper-based model tuned specifically for lyrics transcription, which helps bridge generated or recorded audio back into structured text. It also introduces HeartCLAP, which aligns audio and text into a shared embedding space.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 9
    Audiblez

    Audiblez

    Generate audiobooks from e-books

    Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained on under 100 hours of audio, and supports multiple languages, including English (US/UK), Spanish, French, Hindi, Italian, Japanese, Brazilian Portuguese, and Mandarin Chinese. ...
    Downloads: 59 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Asciidoctor

    Asciidoctor

    A fast, open source text processor and publishing toolchain

    A fast text processor & publishing toolchain for converting AsciiDoc to HTML5, DocBook & more. Asciidoctor is a fast, open source, Ruby-based text processor for parsing AsciiDoc® into a document model and converting it to output formats such as HTML 5, DocBook 5, manual pages, PDF, EPUB 3, and other formats. Asciidoctor also has an ecosystem of extensions, converters, build plugins, and tools to help you author and publish content written in AsciiDoc.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    JupyterLab

    JupyterLab

    JupyterLab computational environment

    JupyterLab is the next-generation web-based user interface for Project Jupyter. Try it on Binder. JupyterLab follows the Jupyter Community Guides. JupyterLab enables you to work with documents and activities such as Jupyter notebooks, text editors, terminals, and custom components in a flexible, integrated, and extensible manner. You can arrange multiple documents and activities side by side in the work area using tabs and splitters.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 12
    FastKoko

    FastKoko

    Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model

    ...It supports multiple languages and voicepacks and allows phoneme based generation for more accurate pronunciation and prosody. The server also offers per-word timestamped captions, which makes it useful for creating subtitles or aligning audio with text. A built in web UI, API documentation, and debug endpoints for monitoring system status help users explore voices, test requests, and integrate the service into larger systems.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    mavonEditor

    mavonEditor

    A markdown editor based on Vue

    A markdown editor based on Vue that supports a variety of personalized features. The default toolbar properties are all true, You can customize the object to cover them. The language parsing files and code highlighting in Code Highlighting highlight.js will be loaded on demand. GitHub-markdown-CSS and katex will load only when mounted.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MinDoc

    MinDoc

    Document management system developed for the IT team

    MinDoc is a simple and easy-to-use document management system developed for the IT team. MinDoc's predecessor was the SmartWiki document system. SmartWiki is a document management system developed based on the PHP framework laravel. Because the deployment of PHP is too complicated for ordinary users, it was developed instead of Golang. It can facilitate user deployment and practicality. The starting point for development is that the company's IT department needs a simple and practical system...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Lesan

    Lesan

    New way to create web server and NoSQL data model

    Lesan is a multilingual text processing and translation library designed for natural language processing (NLP) applications. It provides tools for text normalization, tokenization, and translation across multiple languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Shiki

    Shiki

    A beautiful yet powerful syntax highlighter

    A beautiful syntax highlighter based on TextMate grammar, accurate and powerful.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Supertonic

    Supertonic

    Lightning-fast, on-device TTS, running natively via ONNX

    Supertonic is a lightning-fast, on-device text-to-speech system built around ONNX Runtime for maximum speed and portability. It focuses on running entirely locally, eliminating the need for cloud APIs and providing low latency and strong privacy guarantees, even on constrained devices like Raspberry Pi boards and e-readers. The core model is highly compact at around 66 million parameters, yet benchmarks show it can generate speech up to 167× faster than real time on modern consumer hardware...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18
    DramaBox

    DramaBox

    super expressive prompting model based on ltx2.3

    DramaBox is an expressive text-to-speech and voice cloning project from Resemble AI built on top of the LTX-2.3 audio branch. It generates speech from prompts that control not only the spoken text, but also speaker identity, emotion, delivery style, laughs, sighs, pauses, and transitions. Users can optionally provide a voice reference of around 10 seconds or more to clone the target timbre while still guiding performance through scene-style prompting. The project includes a warm inference...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    VNote

    VNote

    A pleasant note-taking platform in native C++

    VNote is a Qt-based, free and open-source note-taking application, focusing on Markdown now. VNote is designed to provide a pleasant note-taking platform with excellent editing experience. VNote is NOT just a simple editor for Markdown. By providing note management, VNote makes taking notes in Markdown simpler. In the future, VNote will support more formats besides Markdown. Utilizing Qt, VNote could run on Linux, Windows, and macOS.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    TRIBE v2

    TRIBE v2

    A multimodal model for brain response prediction

    TRIBE v2 is a multimodal foundation model developed by Meta AI for predicting human brain activity from naturalistic stimuli such as video, audio, and text. It is designed for in-silico neuroscience, enabling researchers to model how the brain responds to complex real-world inputs. The system integrates state-of-the-art encoders—including LLaMA for text, V-JEPA for video, and Wav2Vec-BERT for audio—into a unified Transformer architecture. This combined representation is mapped onto the...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 21
    Better BibTeX for Zotero

    Better BibTeX for Zotero

    Make Zotero effective for us LaTeX holdouts

    Better BibTeX (BBT) is a plugin for Zotero and Juris-M that makes it easier to manage bibliographic data, especially for people authoring documents using text-based toolchains (e.g. based on LaTeX / Markdown). Zotero does all its work in UTF-8 Unicode, which is absolutely the right thing to do. Unfortunately, for those shackled to BibTeX and who cannot (yet) move to BibLaTeX, unicode is a major PITA. Also, Zotero supports some simple HTML markup in your items that Bib(La)TeX won’t understand.
    Downloads: 90 This Week
    Last Update:
    See Project
  • 22
    ILSpy

    ILSpy

    .NET Decompiler with support for PDB generation, ReadyToRun, Metadata

    ILSpy is the open-source .NET assembly browser and decompiler. Visual Studio 2022 ships with decompilation support for F12 enabled by default (using our engine v7.1). In Visual Studio 2019, you have to manually enable F12 support. Go to Tools / Options / Text Editor / C# / Advanced and check Enable navigation to decompiled source. C# for Visual Studio Code ships with decompilation support as well. To enable, activate the setting "Enable Decompilation Support. ILSpy is distributed under the...
    Downloads: 403 This Week
    Last Update:
    See Project
  • 23
    React Wrap Balancer

    React Wrap Balancer

    Simple React Component That Makes Titles More Readable

    The React Wrap Balancer project is a React component that improves text readability by intelligently balancing line breaks in headings and other text elements. It addresses common layout issues where text wraps unevenly, such as leaving a single word on the last line, which can negatively impact visual design. The component dynamically adjusts how text is split across lines based on the available space, resulting in more aesthetically pleasing layouts. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Matcha-TTS

    Matcha-TTS

    A fast TTS architecture with conditional flow matching

    Matcha-TTS is a non-autoregressive neural text-to-speech architecture that uses conditional flow matching to generate speech quickly while maintaining natural quality. It models speech as an ODE-based generative process, and conditional flow matching lets it reach high-quality audio in only a few synthesis steps, which greatly reduces latency compared to score-matching diffusion approaches.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Summernote

    Summernote

    Super simple WYSIWYG editor

    ...Summernote allows you to customize the toolbar. You can compose a toolbar with pre-shipped buttons. Air-mode give an interface without the Toolbar. To reveal popover Toolbar, select a text where you want to modify. Simply turn on airMode and just focus on text. Styles change according to Bootstraps Theme. The editor uses the Bootswatch Themes based on Bootstrap 3, you can also do the same with Bootstrap 4.
    Downloads: 9 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB