Showing 4525 open source projects for "text based"

View related business solutions
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    GPT Image Playground

    GPT Image Playground

    OpenAI gpt-image-2 API

    GPT Image Playground is a web-based image generation and editing interface built around OpenAI’s GPT Image APIs and compatible providers. It gives users a clean UI for text-to-image generation, reference-image workflows, and mask-based editing. The project supports OpenAI-compatible endpoints, fal.ai, and custom HTTP providers, which makes it flexible for different image backend setups.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Watlings

    Watlings

    Learn WebAssembly by writing small programs

    Watlings is an educational platform designed to teach WebAssembly concepts through interactive, browser-based exercises that guide users step by step in learning the WebAssembly text format (WAT). Inspired by projects like Rustlings, it provides a collection of small, focused challenges that help users understand how WebAssembly works at a low level, including instructions, memory management, and control flow. Each exercise presents partially completed code that the user must fix or complete, reinforcing learning through hands-on problem solving rather than passive reading. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    latexindent.pl

    latexindent.pl

    Perl script to add indentation to LaTeX files

    Perl script to add indentation (leading horizontal space) to LaTeX files. It can modify line breaks before, during and after code blocks; it can perform text wrapping and paragraph line break removal. It can also perform string-based and regex-based substitutions/replacements. The script is customizable through its YAML interface. latexindent.exe is a standalone executable file that does not require a perl installation. A nice way to test the script is to navigate to the test-cases directory, and then run the command (on Linux/Mac -- sorry, a Windows test-case version is not available)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Anki

    Anki

    Anki is a smart spaced repetition flashcard program

    Anki is a free, open-source spaced repetition flashcard application designed for efficient long‑term memorization. It supports a wide variety of media types (text, images, audio, LaTeX), advanced scheduling algorithms (SM‑2, FSRS), and extensibility via add‑ons. It’s widely used for education, language learning, medical training, and more.
    Downloads: 21 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    OpenAI-Compatible Edge-TTS API

    OpenAI-Compatible Edge-TTS API

    Free, high-quality text-to-speech API endpoint to replace OpenAI

    OpenAI-Compatible Edge-TTS API is a local, OpenAI-compatible text-to-speech API that uses edge-tts—Microsoft Edge’s online TTS service—as the backend. The project emulates the /v1/audio/speech endpoint used by OpenAI, so any client that can talk to the OpenAI TTS API can be redirected to this service with minimal changes. It exposes parameters for input text, voice selection, audio format, and playback speed, mirroring the OpenAI interface while mapping popular OpenAI voice names to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    LlamaGen

    LlamaGen

    Autoregressive Model Beats Diffusion

    ...Instead of relying on diffusion models, the framework treats images as sequences of tokens that can be generated progressively using transformer architectures similar to those used for text generation. The project explores how scaling autoregressive models and improving image tokenization techniques can produce competitive results compared with modern diffusion-based image generators. LlamaGen provides several pre-trained models and training configurations that support both class-conditional image generation and text-conditioned image synthesis. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Tiktoken

    Tiktoken

    tiktoken is a fast BPE tokeniser for use with OpenAI's models

    tiktoken is a high-performance, tokenizer library (based on byte-pair encoding, BPE) designed for use with OpenAI’s models. It handles encoding and decoding text to token IDs efficiently, with minimal overhead. Because tokenization is a fundamental step in preparing text for models, tiktoken is optimized for speed, memory, and correctness in model contexts (e.g. matching OpenAI’s internal tokenization).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Lingua-RS

    Lingua-RS

    The most accurate natural language detection library for Rust

    Lingua-RS is a language detection library implemented in Rust, designed to accurately identify the language of given text samples. It tells you which language some text is written in. This is very useful as a preprocessing step for linguistic data in natural language processing applications such as text classification and spell checking. Other use cases, for instance, might include routing e-mails to the right geographically located customer service department, based on the e-mails' languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    The CUE Data Constraint Language

    The CUE Data Constraint Language

    The home of the CUE language. Validate and define text-based config

    CUE is an open source data constraint language which aims to simplify tasks involving defining and using data. CUE merges the notion of schema and data. The same CUE definition can simultaneously be used for validating data and act as a template to reduce boilerplate. Schema definition is enriched with fine-grained value definitions and default values. At the same time, data can be simplified by removing values implied by such detailed definitions. The merging of these two concepts enables...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    LLM-Aided OCR Project

    LLM-Aided OCR Project

    Enhances Tesseract OCR output using LLMs (local or API)

    LLM Aided OCR is an open-source system designed to improve optical character recognition accuracy by combining traditional OCR tools with large language models. The project addresses common OCR challenges such as distorted text, unusual fonts, historical documents, and complex layouts that often produce inaccurate results with standard OCR pipelines. The system first extracts raw text using OCR engines and then applies language models to analyze and correct recognition errors based on context. This AI-assisted correction process helps reconstruct missing characters, fix formatting mistakes, and produce more coherent text outputs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    RunAnywhere

    RunAnywhere

    Production ready toolkit to run AI locally

    RunAnywhere SDKs are a set of cross-platform development tools that enable applications to run artificial intelligence models directly on user devices instead of relying on cloud infrastructure. The toolkit allows developers to integrate language models, speech recognition, and voice synthesis capabilities into mobile or desktop applications while keeping all computation local. By running models entirely on device, the platform eliminates network latency and protects user data because...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    moe

    moe

    A command line based editor inspired by Vim. Written in Nim

    Moe is a lightweight, modern text editor written in Nim, designed for simplicity and efficiency while providing a minimal yet functional editing environment.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    fess

    fess

    Open source enterprise search server for websites, files, and data

    Fess is an open source enterprise search server designed to provide powerful full-text search capabilities across multiple data sources. It enables organizations to quickly deploy a scalable search environment without requiring deep knowledge of underlying search technologies. Fess is built on top of OpenSearch and offers an integrated solution for crawling, indexing, and searching documents from websites, file systems, and various data stores.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    jQuery-Marquee with CSS3 Support

    jQuery-Marquee with CSS3 Support

    jQuery plugin to scroll the text like the old traditional marquee

    A ~2KB MINIFIED + GZIPPED (minified) jQuery plugin to scroll the text like the old traditional marquee.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Automa.jl

    Automa.jl

    A julia code generator for regular expressions

    Automa is a regex-to-Julia compiler. By compiling regex to Julia code in the form of Expr objects, Automa provides facilities to create efficient and robust regex-based lexers, tokenizers and parsers using Julia's metaprogramming capabilities. You can view Automa as a regex engine that can insert arbitrary Julia code into its input-matching process, which will be executed when certain parts of the regex match an input.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Omi

    Omi

    AI that sees your screen and listens to conversations

    The Omi project is an open-source AI wearable ecosystem developed by Based Hardware that combines hardware, software, and cloud infrastructure to create a persistent “second brain” for capturing and processing real-world interactions. It is designed as a system that continuously listens to conversations and monitors screen activity, converting this input into structured data such as transcripts, summaries, and actionable insights in real time.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    ProxyPal

    ProxyPal

    A desktop app that lets you use your AI subscriptions

    ProxyPal is a desktop application that lets developers unify and manage multiple AI model subscriptions (such as Claude, ChatGPT, Gemini, and GitHub Copilot) with any coding tool by acting as a local proxy and connection manager. By wrapping a CLI-based proxy API with an intuitive graphical interface, ProxyPal bridges your paid AI services into environments that don’t natively support them, allowing tools like text editors, terminals, and IDE plugins to route requests through a centralized proxy endpoint. This enables developers to use their own AI credentials and models consistently across different clients while monitoring usage, token counts, and performance in real time. ...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 18
    OmniTools

    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    OmniTools is a self-hosted web application that bundles a large collection of everyday utilities into a single clean interface you can run on your own infrastructure. It’s designed to replace the random assortment of “free online tools” people use for quick tasks, while avoiding ads, tracking, and the need to upload sensitive files to unknown servers. A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    SLIME

    SLIME

    The Superior Lisp Interaction Mode for Emacs

    SLIME is a Emacs mode for Common Lisp development. Inspired by existing systems such Emacs Lisp and ILISP, we are working to create an environment for hacking Common Lisp in. SLIME extends Emacs with support for interactive programming in Common Lisp. The features are centered around slime mode, an Emacs minor mode that complements the standard lisp mode. While lisp-mode supports editing Lisp source files, slime-mode adds support for interacting with a running Common Lisp process for...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    ffplayout

    ffplayout

    Rust and ffmpeg based playout

    ...It also offers overlay features such as logos and text, along with logging and monitoring capabilities. Overall, it provides a robust and lightweight solution for automated media broadcasting workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    FastSD CPU

    FastSD CPU

    Fast stable diffusion on CPU and AI PC

    FastSD CPU is an optimized fork of Stable Diffusion designed to run efficiently on CPUs and devices without dedicated GPUs by leveraging Latent Consistency Models and Adversarial Diffusion Distillation techniques that accelerate inference. It focuses on bringing fast text-to-image generation to mainstream hardware like desktop CPUs, lower-end laptops, or edge devices without requiring high-end graphics processors. The repository contains multiple interfaces including a desktop GUI for simple generation, an advanced web-based UI with support for extensions like LoRA and ControlNet, and a command-line interface for scripted usage or server deployments. ...
    Downloads: 34 This Week
    Last Update:
    See Project
  • 22
    Monaco Editor

    Monaco Editor

    A browser based code editor

    ...For example, the model holds the text content, determines the language of the content, and tracks the edit history of the content. An editor is a user facing view of the model. This is what gets attached to the DOM and what your users see visually. Typical editor operations are displaying a model, managing the view state, or executing actions or commands.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 23
    cryptii

    cryptii

    Web app and framework offering modular conversion and encoding

    ...When the content gets edited inside a Viewer or when Brick settings get changed, the result propagates through the Pipe's Bricks in order and in both directions. Chain objects encapsulate UTF-8 text or binary-based content exchanged between Bricks.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    VideoChat

    VideoChat

    Real-time voice interactive digital human

    VideoChat is a real-time voice-interactive “digital human” system that combines automatic speech recognition, large language models, text-to-speech, and talking-head generation into a single conversational pipeline. It supports both pure end-to-end voice solutions based on multimodal large language models (GLM-4-Voice feeding directly into talking-head generation) and a more traditional cascaded pipeline using ASR → LLM → TTS → talking head. It is built as a Gradio Python demo, exposing a web interface where users can talk to an animated avatar that lip-syncs to synthesized speech while responding intelligently. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    SentenceTransformers

    SentenceTransformers

    Multilingual sentence & image embeddings with BERT

    SentenceTransformers is a Python framework for state-of-the-art sentence, text and image embeddings. The initial work is described in our paper Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. You can use this framework to compute sentence / text embeddings for more than 100 languages. These embeddings can then be compared e.g. with cosine-similarity to find sentences with a similar meaning. This can be useful for semantic textual similar, semantic search, or paraphrase mining. ...
    Downloads: 8 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB