Showing 2772 open source projects for "text based"

  • 1
    Puter

    The Internet OS! Free, Open-Source, and Self-Hostable

    Puter is a browser-based personal operating system and file explorer built entirely with web technologies. It mimics the experience of a traditional desktop OS, allowing users to manage files, launch apps, and interact with a graphical interface—all within a web browser. The project is designed to be both functional and nostalgic, combining the aesthetic of retro operating systems with modern development tools like React and Tailwind CSS. Puter offers a self-hostable, customizable...
    Downloads: 13 This Week
  • 2
    Kimi K2.5

    Moonshot's most powerful AI model

    Kimi K2.5 is Moonshot AI’s open-source, native multimodal agentic model built through continual pretraining on approximately 15 trillion mixed vision and text tokens. Based on a 1T-parameter Mixture-of-Experts (MoE) architecture with 32B activated parameters, it integrates advanced language reasoning with strong visual understanding. K2.5 supports both “Thinking” and “Instant” modes, enabling either deep step-by-step reasoning or low-latency responses depending on the task. Designed for agentic workflows, it features an Agent Swarm mechanism that decomposes complex problems into coordinated sub-agents executing in parallel. ...
    Downloads: 18 This Week
  • 3
    LLM-Aided OCR Project

    Enhances Tesseract OCR output using LLMs (local or API)

    LLM Aided OCR is an open-source system designed to improve optical character recognition accuracy by combining traditional OCR tools with large language models. The project addresses common OCR challenges such as distorted text, unusual fonts, historical documents, and complex layouts that often produce inaccurate results with standard OCR pipelines. The system first extracts raw text using OCR engines and then applies language models to analyze and correct recognition errors based on context. This AI-assisted correction process helps reconstruct missing characters, fix formatting mistakes, and produce more coherent text outputs. ...
    Downloads: 0 This Week
  • 4
    RunAnywhere

    Production ready toolkit to run AI locally

    RunAnywhere SDKs are a set of cross-platform development tools that enable applications to run artificial intelligence models directly on user devices instead of relying on cloud infrastructure. The toolkit allows developers to integrate language models, speech recognition, and voice synthesis capabilities into mobile or desktop applications while keeping all computation local. By running models entirely on device, the platform eliminates network latency and protects user data because...
    Downloads: 0 This Week
  • 5
    Lingua-RS

    The most accurate natural language detection library for Rust

    Lingua-RS is a language detection library implemented in Rust, designed to accurately identify the language of given text samples. It tells you which language some text is written in. This is very useful as a preprocessing step for linguistic data in natural language processing applications such as text classification and spell checking. Other use cases, for instance, might include routing e-mails to the right geographically located customer service department, based on the e-mails' languages.
    Downloads: 0 This Week
  • 6
    Hollama

    A minimal LLM chat app that runs entirely in your browser

    ...Because the application runs as a static web interface, it does not require complex backend infrastructure and can be easily deployed or self-hosted. Hollama supports both text-based and multimodal interactions, allowing users to work with models that process images as well as text. The interface includes features for editing prompts, retrying responses, copying generated code snippets, and storing conversation history locally within the browser. Mathematical expressions can be rendered using KaTeX, and Markdown formatting allows code blocks and structured outputs to appear clearly within conversations.
    Downloads: 0 This Week
  • 7
    LlamaGen

    Autoregressive Model Beats Diffusion

    ...Instead of relying on diffusion models, the framework treats images as sequences of tokens that can be generated progressively using transformer architectures similar to those used for text generation. The project explores how scaling autoregressive models and improving image tokenization techniques can produce competitive results compared with modern diffusion-based image generators. LlamaGen provides several pre-trained models and training configurations that support both class-conditional image generation and text-conditioned image synthesis. ...
    Downloads: 0 This Week
  • 8
    Tiktoken

    tiktoken is a fast BPE tokeniser for use with OpenAI's models

    tiktoken is a high-performance tokenizer library (based on byte-pair encoding, BPE) designed for use with OpenAI’s models. It efficiently encodes text into token IDs and decodes token IDs back into text, with minimal overhead. Because tokenization is a fundamental step in preparing text for models, tiktoken is optimized for speed, memory use, and correctness in model contexts (e.g. matching OpenAI’s internal tokenization).
    Downloads: 0 This Week
  • 9
    Watlings

    Learn WebAssembly by writing small programs

    Watlings is an educational platform designed to teach WebAssembly concepts through interactive, browser-based exercises that guide users step by step in learning the WebAssembly text format (WAT). Inspired by projects like Rustlings, it provides a collection of small, focused challenges that help users understand how WebAssembly works at a low level, including instructions, memory management, and control flow. Each exercise presents partially completed code that the user must fix or complete, reinforcing learning through hands-on problem solving rather than passive reading. ...
    Downloads: 0 This Week
  • 10
    jQuery-Marquee with CSS3 Support

    jQuery plugin to scroll the text like the old traditional marquee

    A jQuery plugin, about 2 KB minified and gzipped, that scrolls text like the old traditional marquee.
    Downloads: 0 This Week
  • 11
    PixelIt

    ESP8266/ESP32 and WS2812B LED Matrix based PixelArt display

    PixelIt is an open-source firmware and hardware project designed to drive WS2812B LED matrix displays using ESP8266 or ESP32 microcontrollers. It's tailored for DIY enthusiasts and makers who want to create pixel art displays, show scrolling text, and integrate data from various APIs like weather or news. The project emphasizes customization and ease of use, allowing users to control displays via a web interface or REST API.
    Downloads: 1 This Week
  • 12
    The CUE Data Constraint Language

    The home of the CUE language. Validate and define text-based config

    CUE is an open source data constraint language which aims to simplify tasks involving defining and using data. CUE merges the notion of schema and data. The same CUE definition can simultaneously be used for validating data and act as a template to reduce boilerplate. Schema definition is enriched with fine-grained value definitions and default values. At the same time, data can be simplified by removing values implied by such detailed definitions. The merging of these two concepts enables...
    Downloads: 0 This Week
  • 13
    FireRed-Image-Edit

    General-purpose image editing model that delivers high-fidelity

    FireRed-Image-Edit is an open-source general-purpose image editing model and toolset designed to deliver high-fidelity, visually coherent edits across a wide range of editing tasks, from simple object modifications to complex enhancements like restoration and style preservation. It is built on a flexible text-to-image foundation model that has been extended with training paradigms including pretraining, supervised fine-tuning, and reinforcement learning to imbue the system with strong...
    Downloads: 1 This Week
  • 14
    cloudflare-speed-cli

    CLI for internet speed test via cloudflare

    cloudflare-speed-cli is a Rust-based command-line internet speed testing tool that interfaces with Cloudflare’s public speed test endpoints, letting users measure download and upload throughput, latency, and more from a terminal. It presents results in an interactive text user interface (TUI) that charts real-time metrics as tests run, making it easy to visually inspect performance trends without leaving the console.
    Downloads: 1 This Week
  • 15
    FastSD CPU

    Fast stable diffusion on CPU and AI PC

    FastSD CPU is an optimized fork of Stable Diffusion designed to run efficiently on CPUs and devices without dedicated GPUs by leveraging Latent Consistency Models and Adversarial Diffusion Distillation techniques that accelerate inference. It focuses on bringing fast text-to-image generation to mainstream hardware like desktop CPUs, lower-end laptops, or edge devices without requiring high-end graphics processors. The repository contains multiple interfaces including a desktop GUI for simple generation, an advanced web-based UI with support for extensions like LoRA and ControlNet, and a command-line interface for scripted usage or server deployments. ...
    Downloads: 35 This Week
  • 16
    SentenceTransformers

    Multilingual sentence & image embeddings with BERT

    SentenceTransformers is a Python framework for state-of-the-art sentence, text, and image embeddings. The initial work is described in our paper Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. You can use this framework to compute sentence and text embeddings for more than 100 languages. These embeddings can then be compared, e.g. with cosine similarity, to find sentences with a similar meaning. This can be useful for semantic textual similarity, semantic search, or paraphrase mining. ...
    Downloads: 8 This Week
  • 17
    Lingua-Py

    The most accurate natural language detection library for Python

    Its task is simple: It tells you which language some text is written in. This is very useful as a preprocessing step for linguistic data in natural language processing applications such as text classification and spell checking. Other use cases, for instance, might include routing e-mails to the right geographically located customer service department, based on the e-mails' languages. Language detection is often done as part of large machine learning frameworks or natural language processing applications. ...
    Downloads: 0 This Week
  • 18
    Hurl

    Hurl, run and test HTTP requests with plain text

    Hurl is a command line tool that runs HTTP requests defined in a simple plain text format. It can chain requests, capture values and evaluate queries on headers and body responses. Hurl is very versatile: it can be used for both fetching data and testing HTTP sessions. Hurl makes it easy to work with HTML content, REST / SOAP / GraphQL APIs, or any other XML / JSON-based APIs. Hurl can run HTTP requests but can also be used to test HTTP responses.
    Downloads: 0 This Week
  • 19
    ProxyPal

    A desktop app that lets you use your AI subscriptions

    ProxyPal is a desktop application that lets developers unify and manage multiple AI model subscriptions (such as Claude, ChatGPT, Gemini, and GitHub Copilot) with any coding tool by acting as a local proxy and connection manager. By wrapping a CLI-based proxy API with an intuitive graphical interface, ProxyPal bridges your paid AI services into environments that don’t natively support them, allowing tools like text editors, terminals, and IDE plugins to route requests through a centralized proxy endpoint. This enables developers to use their own AI credentials and models consistently across different clients while monitoring usage, token counts, and performance in real time. ...
    Downloads: 24 This Week
  • 20
    VideoChat

    Real-time voice interactive digital human

    VideoChat is a real-time voice-interactive “digital human” system that combines automatic speech recognition, large language models, text-to-speech, and talking-head generation into a single conversational pipeline. It supports both pure end-to-end voice solutions based on multimodal large language models (GLM-4-Voice feeding directly into talking-head generation) and a more traditional cascaded pipeline using ASR → LLM → TTS → talking head. It is built as a Gradio Python demo, exposing a web interface where users can talk to an animated avatar that lip-syncs to synthesized speech while responding intelligently. ...
    Downloads: 1 This Week
  • 21
    VVVVVV

    The source code to VVVVVV

    From the creator of Super Hexagon: VVVVVV, the hit 2010 indie platformer, now available on Android and iOS! Winner of IndieCade's 2010 award for Most Fun/Compelling Game. There are two versions of the VVVVVV source code available: the desktop version (based on the C++ port, and currently live on Steam) and the mobile version (based on a fork of the original Flash source code, and currently live on iOS and Android). If you’re looking for the game’s text, that’s mainly (but not entirely) in the Scripts.cpp and TerminalScripts.cpp files. The functions in those files basically load data into a very simple script parser that controls cutscene logic. ...
    Downloads: 14 This Week
  • 22
    cryptii

    Web app and framework offering modular conversion and encoding

    ...When the content gets edited inside a Viewer or when Brick settings get changed, the result propagates through the Pipe's Bricks in order and in both directions. Chain objects encapsulate UTF-8 text or binary-based content exchanged between Bricks.
    Downloads: 1 This Week
  • 23
    Chroma

    A general purpose syntax highlighter in pure Go

    As Chroma has just been released, its API is still in flux. That said, the high-level interface should not change significantly. Chroma takes source code and other structured text and converts it into syntax-highlighted HTML, ANSI-coloured text, etc. Chroma is based heavily on Pygments and includes translators for Pygments lexers and styles. Supported languages include ABAP, ABNF, ActionScript, ActionScript 3, Ada, Angular2, ANTLR, ApacheConf, APL, AppleScript, Arduino, Awk, ... PacmanConf, Perl, PHP, PHTML, Pig, PkgConfig, PL/pgSQL, plaintext, Pony, PostgreSQL SQL dialect, PostScript, POVRay, PowerShell, Prolog, PromQL, Properties, Protocol Buffer, PSL, Puppet, Python 2, Python. ...
    Downloads: 0 This Week
  • 24
    Pretext

    Pure JavaScript/TypeScript library for multiline text measurement

    Pretext is an experimental project that explores new ways of structuring and presenting textual or code-based content, with a strong emphasis on clarity, composability, and developer-oriented workflows. It is designed to rethink how text and structured information are authored, processed, and rendered, often focusing on minimal abstractions and highly readable formats. The project reflects a philosophy of reducing unnecessary complexity while still enabling powerful transformations and reuse of content. ...
    Downloads: 2 This Week
  • 25
    Open Vision Agents by Stream

    Build Vision Agents quickly with any model or video provider

    Open Vision Agents by Stream is an open-source framework from Stream for building real-time, multimodal AI agents that watch, listen, and respond to live video streams. It focuses on combining video understanding models, such as YOLO- and Roboflow-based detectors, with real-time large language models like OpenAI Realtime and Gemini Live to create interactive experiences. The framework uses Stream’s ultra-low-latency edge network so agents can join sessions quickly and maintain very low audio...
    Downloads: 2 This Week