Showing 92 open source projects for "text t"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Loro

    Loro

    Make your JSON data collaborative and version-controlled with CRDTs

    ...Written in Rust, loro is designed to be compact, fast, and embeddable in a wide range of environments, from desktop to mobile to web via WebAssembly. Its architecture supports multiple data types like text, maps, and lists, and it offers automatic conflict resolution with minimal data overhead. It is ideal for real-time editors, whiteboards, and distributed systems that require reliable, peer-to-peer synchronization.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Korvus

    Korvus

    Korvus is a search SDK that unifies the entire RAG pipeline

    Korvus is an open-source retrieval-augmented generation (RAG) pipeline designed to run entirely inside PostgreSQL, allowing developers to build AI search and knowledge systems directly within a database environment. The project consolidates the typical steps of a RAG pipeline—including embedding generation, document retrieval, reranking, and text generation—into a single query executed within the Postgres ecosystem. By leveraging PostgresML and vector extensions such as pgvector, Korvus eliminates the need for external microservices typically used for AI search architectures, reducing both system complexity and latency. The architecture enables machine learning operations to occur directly in the database, minimizing data transfer between services and improving overall performance for large datasets.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Onefetch

    Onefetch

    Git repository summary on your terminal

    ...By default, the repo's information is displayed alongside the dominant language's logo, but you can further configure onefetch to instead use an image, on supported terminals, text input, or nothing at all. It automatically detects open source licenses from texts and provides the user with valuable information like code distribution, pending changes, number of dependencies (by package manager), top contributors (by number of commits), the size on disk, creation date, LOC (lines of code), etc. Onefetch can be configured via command-line flags to display exactly what you want, the way you want it to: you can customize ASCII/Text formatting, disable info lines, ignore files & directories, and output in multiple formats (JSON, Yaml), etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ffplayout

    ffplayout

    Rust and ffmpeg based playout

    ...It supports streaming output to servers or local playback environments, making it suitable for radio, TV, or online channels. ffplayout includes fallback mechanisms that replace missing content with filler clips or generated placeholders to maintain uninterrupted streams. It also offers overlay features such as logos and text, along with logging and monitoring capabilities. Overall, it provides a robust and lightweight solution for automated media broadcasting workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Add Two Lines of Code. Get Full APM. Icon
    Add Two Lines of Code. Get Full APM.

    AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

    Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
    Start Free
  • 5
    PostgresML

    PostgresML

    The GPU-powered AI application database

    PostgresML is a complete platform in a PostgreSQL extension. Build simpler, faster, and more scalable models right inside your database. Explore the SDK and test open source models in our hosted database. Combine and automate the entire workflow from embedding generation to indexing and querying for the simplest (and fastest) knowledge-based chatbot implementation. Leverage multiple types of natural language processing and machine learning models such as vector search and personalization...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Atuin

    Atuin

    Magical shell history

    Atuin is a modern shell history replacement tool and CLI utility that records all your shell commands in a SQLite database alongside contextual metadata. It offers encrypted sync across devices, full-text search, usage statistics, and a desktop application to run executable runbooks as native documents.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    ArtCraft

    ArtCraft

    Crafting engine for artists, designers, and filmmakers

    ...The project positions itself as an intentional “crafting engine” for artists, designers, and filmmakers who want deeper control over generative media pipelines. Rather than relying purely on text prompts, ArtCraft emphasizes visual manipulation, compositional control, and iterative refinement so creators can treat AI output more like a malleable creative medium. The application is built with performance and responsiveness in mind, enabling users to move between different creative canvases and asset workflows within a unified interface. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 8
    Difftastic

    Difftastic

    A structural diff that understands syntax

    Difftastic is a structural diff tool written in Rust that parses source files using syntax trees (via tree‑sitter) and produces human‑readable diffs at the expression level. It works across 30+ languages and emphasizes readability by aligning code structure rather than lines. Ideal for code review and understanding semantic changes.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Static Web Server

    Static Web Server

    A cross-platform, high-performance and asynchronous web server

    Static Web Server (or SWS abbreviated) is a tiny and fast production-ready web server suitable to serve static web files or assets. It is focused on lightness and easy-to-use principles while keeping high performance and safety powered by The Rust Programming Language. Written on top of Hyper and Tokio runtime, it provides concurrent and asynchronous networking abilities and the latest HTTP/1 - HTTP/2 implementations. Cross-platform and available for Linux, macOS, Windows, FreeBSD, NetBSD,...
    Downloads: 9 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Ollama-rs

    Ollama-rs

    A simple and easy-to-use library for interacting with the Ollama API

    ...It follows the official Ollama API closely, ensuring compatibility while offering an idiomatic Rust experience with strong typing and asynchronous execution. The library supports a wide range of operations, including text generation, chat interactions, embeddings, and model management, making it suitable for both simple applications and more advanced AI workflows. It includes streaming capabilities for real-time output, allowing developers to process responses incrementally rather than waiting for full completions. The project also introduces support for tool calling and custom function integration, enabling more dynamic and agent-like behavior within applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Extractous

    Extractous

    Fast and efficient unstructured data extraction

    Extractous is a Rust-based unstructured data extraction library focused on fast local parsing of documents and other content-heavy files. Its purpose is to extract text and metadata efficiently from formats such as PDF, Word, HTML, email archives, images, and more, without depending on external APIs or separate parsing servers. The project emphasizes performance and low memory usage, and its maintainers describe it as a local-first alternative to heavier extraction stacks. For broader format support, the system combines its Rust core with ahead-of-time compiled Apache Tika shared libraries, which allows it to extend parsing coverage while still avoiding traditional server-based overhead. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    IronClaw

    IronClaw

    IronClaw is OpenClaw inspired but focused on privacy & security

    IronClaw is a security-first, open-source personal AI assistant built in Rust and designed to keep your data fully under your control. It operates on the principle that your AI should work for you, not external vendors, ensuring all data is stored locally, encrypted, and never shared. The platform emphasizes transparency, offering auditable code with no hidden telemetry or data harvesting. IronClaw runs untrusted tools inside isolated WebAssembly (WASM) sandboxes with strict capability-based...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 13
    Tokenizers

    Tokenizers

    Fast State-of-the-Art Tokenizers optimized for Research and Production

    ...Train new vocabularies and tokenize, using today’s most used tokenizers. Extremely fast (both training and tokenization), thanks to the Rust implementation. Takes less than 20 seconds to tokenize a GB of text on a server’s CPU. Easy to use, but also extremely versatile. Designed for both research and production. Full alignment tracking. Even with destructive normalization, it’s always possible to get the part of the original sentence that corresponds to any token. Does all the pre-processing: Truncation, Padding, add the special tokens your model needs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    trieve

    trieve

    All-in-one infrastructure for search, recommendations, RAG

    Trieve is an all-in-one infrastructure for building hybrid vector search, recommendations, and RAG.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    AIChat

    AIChat

    All-in-one LLM CLI tool featuring Shell Assistant

    AIChat is a lightweight terminal-based chatbot powered by GPT models, enabling AI-driven conversations directly from the command line.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Scanopy

    Scanopy

    Clean network diagrams, One-time setup, zero upkeep

    Scanopy is a powerful multi-modal data capture and analysis toolkit that enables users to collect, process, and visualize structured and unstructured information from a variety of sources in a flexible pipeline. It is built to handle complex scanning tasks — such as OCR, document analysis, audio transcription, network data capture, and image extraction — while providing unified APIs and workflows that make managing heterogeneous data sources seamless. Developers can compose custom pipelines...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Databend

    Databend

    Cloud-native open source data warehouse for analytics and AI queries

    ...This architecture enables cost-efficient storage and elastic scaling for workloads that involve large datasets and complex queries. Databend provides a unified engine capable of handling analytics, vector search, and full-text search within a single platform. Databend supports SQL-based workflows and enables real-time data ingestion, transformation, and analysis through streaming and task orchestration features. With its cloud-native design and distributed architecture, Databend can run both as a self-hosted system or within managed environments to power data analytics, AI workloads, and large-scale data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    tree-sitter

    tree-sitter

    An incremental parsing system for programming tools

    ...It can build a concrete syntax tree for a source file and efficiently update the syntax tree as the source file is edited. General enough to parse any programming language. Fast enough to parse on every keystroke in a text editor. Robust enough to provide useful results even in the presence of syntax errors. Dependency-free so that the runtime library (which is written in pure C) can be embedded in any application. All of Tree-sitter’s parsing functionality is exposed through C APIs. Applications written in higher-level languages can use Tree-sitter via binding libraries like node-tree-sitter or the tree-sitter rust crate, which have their own documentation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    BAML

    BAML

    The AI framework that adds the engineering to prompt engineering

    BAML is an open-source framework and domain-specific language designed to bring structured engineering practices to prompt development for large language model applications. Instead of treating prompts as unstructured text, BAML introduces a schema-driven approach where prompts are defined as typed functions with explicit inputs and outputs. This design allows developers to treat language model interactions as predictable software components rather than ad-hoc prompt strings. The framework enables developers to define prompt logic in a dedicated language while integrating it into applications written in various programming languages such as Python, TypeScript, Ruby, and Go. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    ReductStore

    ReductStore

    The fastest time series object store for Edge AI

    History storage and management of images, vibration data, text, labels, and more - all in one place with the highest performance. Merge blob and time series functionalities, reducing the need for multiple databases. Customize real-time data retention policies and replication strategies. Store billions of time-stamped blobs with AI labels and access them with low latency. Outperform other databases with a customized solution for time-series object data.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    Moltis

    Moltis

    A Rust-native claw you can trust

    ...Moltis emphasizes security through sandboxed execution environments, where commands and browsing tasks run in isolated containers and require explicit approval. The platform also includes long-term memory powered by hybrid vector and full-text search, allowing the assistant to retain context across sessions. With multi-channel access such as web UI, Telegram, and API endpoints, Moltis functions as a unified automation hub intended for developers and advanced users who want full control.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    OpenAI Harmony

    OpenAI Harmony

    Renderer for the harmony response format to be used with gpt-oss

    Harmony is a response format developed by OpenAI for use with the gpt-oss model series. It defines a structured way for language models to produce outputs, including regular text, reasoning traces, tool calls, and structured data. By mimicking the OpenAI Responses API, Harmony provides developers with a familiar interface while enabling more advanced capabilities such as multiple output channels, instruction hierarchies, and tool namespaces. The format is essential for ensuring gpt-oss models operate correctly, as they are trained to rely on this structure for generating and organizing their responses. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    SQLpage

    SQLpage

    Fast SQL-only data application builder

    SQLPage is a rapid application development tool that enables developers to build data-driven web applications using only SQL queries. By writing simple .sql files, users can create interactive web pages that display data as text, lists, grids, plots, and forms, streamlining the process of developing data applications without extensive frontend coding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Quickwit

    Quickwit

    Sub-second search & analytics engine on cloud storage

    Sub-second search & analytics engine on cloud storage. Quickwit is the fastest search engine on cloud storage. Quickwit has an Elasticsearch-compatible Ingest-API to make it easier to migrate your log shippers (Vector, Fluent Bit, Syslog, ...) to Quickwit. However, we only support ES aggregation DSL, the query DSL support is planned for Q2 2023. The core difference and advantage of Quickwit are its architecture built from the ground to search on cloud storage. We optimized IO paths, revamped...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Biome

    Biome

    A toolchain for web projects, aimed to provide functionalities

    Biome formats and lints your code in a fraction of a second. Biome supports JavaScript, TypeScript, JSON, and CSS. It aims to support all main languages of modern web development. Biome has sane defaults and requires minimal configuration. Biome helps you as much as possible by displaying detailed and contextualized diagnostics. Biome unifies functionality that has previously been separate tools. Building upon a shared base allows us to provide a cohesive experience for processing code,...
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB