Showing 521 open source projects for "text based"

View related business solutions
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Youtu-GraphRAG

    Youtu-GraphRAG

    Vertically Unified Agents for Graph Retrieval-Augmented Reasoning

    Youtu-GraphRAG is a research framework developed by Tencent for performing complex reasoning using graph-based retrieval-augmented generation. The system combines knowledge graphs, retrieval mechanisms, and agent-based reasoning into a unified architecture designed to handle knowledge-intensive tasks. Instead of relying solely on text retrieval, the framework organizes information into structured graph schemas that represent entities, relationships, and attributes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    MCP Server RAG Web Browser

    MCP Server RAG Web Browser

    A MCP Server for the RAG Web Browser Actor

    The MCP Server for the RAG Web Browser Actor allows AI assistants and LLMs to perform web searches and extract information from web pages. It facilitates interaction with the web, enabling up-to-date context retrieval for AI applications. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Scriberr

    Scriberr

    Self-hosted AI audio transcription

    Scriberr is a self-hosted AI-powered transcription platform designed to convert audio and video into highly accurate text while prioritizing privacy and local processing. Unlike cloud-based transcription services, Scriberr runs entirely on the user’s machine, ensuring that sensitive recordings are never sent to third-party servers and remain fully under user control. It leverages modern speech recognition models such as Whisper and other advanced architectures to deliver precise transcripts with word-level timing and speaker identification. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    1D Visual Tokenization and Generation

    1D Visual Tokenization and Generation

    This repo contains the code for 1D tokenizer and generator

    The 1D Visual Tokenization and Generation project from ByteDance introduces a novel “one-dimensional” tokenizer designed for images: instead of representing images with large grids of 2D tokens (as in many prior generative/image-modeling systems), it compresses images into as few as 32 discrete tokens (or more, optionally) — thereby achieving a very compact, efficient representation that drastically speeds up generation and reconstruction while retaining strong fidelity. This compact...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    TEN

    TEN

    Open-source framework for conversational voice AI agents

    ...Using components like graph-based workflow design, drag-and-drop UI (via TMAN Designer), and reusable extensions such as real-time avatars, RAG (Retrieval-Augmented Generation), and image generation, TEN enables highly customizable, scalable agent development with minimal code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    opencode.nvim

    opencode.nvim

    Integrate the opencode AI assistant with Neovim

    opencode.nvim is a Neovim plugin that integrates the opencode AI coding assistant directly into the editor, enabling developers to interact with AI agents in a deeply context-aware and workflow-native way. It allows users to send prompts that automatically include relevant editor context such as the current buffer, selected text, diagnostics, and visible content, making AI interactions far more precise and useful during development. The plugin supports a prompt library system, allowing developers to reuse predefined prompts or create custom ones tailored to their workflows. It also enables direct execution of AI-driven actions, such as code modifications or command execution, while giving users full control to review, accept, or reject changes through diff-based interfaces. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    RAG Web UI

    RAG Web UI

    RAG Web UI is an intelligent dialogue system based on RAG

    RAG Web UI is an open-source intelligent dialogue system built on retrieval-augmented generation technology, designed to enable users to create AI-powered question answering systems grounded in their own knowledge bases. It combines document retrieval with large language models to provide accurate, context-aware responses based on indexed data rather than generic model knowledge. The platform supports ingestion of multiple document formats, including PDFs, Word files, Markdown, and plain text, automatically processing and vectorizing them for efficient retrieval. It features a multi-turn conversational interface that maintains context across interactions, allowing users to engage in more natural and continuous dialogues with their data. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Universal Tool Calling Protocol (UTCP)

    Universal Tool Calling Protocol (UTCP)

    Official python implementation of UTCP. UTCP is an open standard

    The python-utcp repository is the official Python SDK implementation of the Universal Tool Calling Protocol (UTCP). UTCP is an open, modern standard designed to let AI agents call any tool or API directly—over HTTP, CLI, WebSocket, gRPC, and more—without the overhead of extra wrapper layers or middleware. It leverages a modular, plugin-based architecture built around Pydantic models and separates the core functionality into a lightweight client and extensible protocol plugins, enabling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Step-Audio 2

    Step-Audio 2

    Multi-modal large language model designed for audio understanding

    Step-Audio2 is an advanced, end-to-end multimodal large language model designed for high-fidelity audio understanding and natural speech conversation: unlike many pipelines that separate speech recognition, processing, and synthesis, Step-Audio2 processes raw audio, reasons about semantic and paralinguistic content (like emotion, speaker characteristics, non-verbal cues), and can generate contextually appropriate responses — including potentially generating or transforming audio output. It integrates a latent-space audio encoder, discrete acoustic tokens, and reinforcement-learning–based training (CoT + RL) to enhance its ability to capture and reproduce voice styles, intonations, and subtle vocal cues. Moreover, Step-Audio2 supports tool-calling and retrieval-augmented generation (RAG), allowing it to access external knowledge sources or audio/text databases, thus reducing hallucinations and improving coherence in complex dialogues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    Rig

    Rig

    Rust framework for building modular and scalable LLM-powered apps

    ...Its architecture emphasizes modularity, enabling developers to integrate only the components and integrations they need for a specific application. Rig includes built-in support for agent workflows, allowing systems to perform multi-turn reasoning, tool calling, and retrieval-based tasks within structured pipelines. It also supports capabilities such as text generation, embeddings, transcription, image generation, and audio generation depending on the provider used. Developers can integrate language models into their software with minimal boilerplate while maintaining flexibility for complex AI workflows.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    KubeAI

    KubeAI

    Private Open AI on Kubernetes

    Get inferencing running on Kubernetes: LLMs, Embeddings, Speech-to-Text. KubeAI serves an OpenAI compatible HTTP API. Admins can configure ML models by using the Model Kubernetes Custom Resources. KubeAI can be thought of as a Model Operator (See Operator Pattern) that manages vLLM and Ollama servers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PPTAgent

    PPTAgent

    PPTAgent: Generating and Evaluating Presentations

    PPTAgent is a research system for generating and evaluating slide decks that goes beyond simple text-to-slides. It follows a two-stage, edit-based workflow: first it analyzes reference presentations to infer slide roles and structure, then it drafts an outline and iteratively performs editing actions to produce new slides. The project includes both the generation agent and an evaluation framework, PPTEval, to score content quality, design, and coherence.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Short Video Factory

    Short Video Factory

    AI tool for automatic batch short video creation and editing

    Short Video Factory is an open source desktop application designed to simplify the creation of short-form videos using AI-driven automation. It enables users to generate product marketing clips and general content videos by combining simple prompt-based input with pre-prepared media assets. Short Video Factory integrates multiple stages of video production, including script generation, voice synthesis, video editing, and subtitle effects, into a single streamlined workflow. By leveraging AI...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    myGPTReader

    myGPTReader

    AI Slack bot for reading, summarizing, and chatting with content

    ...It enables users to quickly understand web pages, documents, and even video content by transforming them into interactive discussions rather than static reading experiences. myGPTReader supports a wide range of file formats, including eBooks, PDFs, and text-based documents, making it flexible for both casual and professional use cases. It also integrates voice interaction capabilities, allowing users to communicate with the system verbally and even use it as a language practice assistant. In addition to content reading, myGPTReader includes built-in prompt templates that enhance conversations and help users get more structured and relevant responses. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    ShortGPT

    ShortGPT

    AI framework for automated short video creation and editing tools

    ShortGPT is an experimental AI-powered framework designed to automate the creation of short-form and long-form video content. It provides a structured system that handles multiple stages of the content creation workflow, including script generation, asset sourcing, voiceover synthesis, and video editing. ShortGPT uses large language models to generate scripts and prompts that guide the automated editing and production process. ShortGPT includes specialized content engines that manage...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Skywork-R1V4

    Skywork-R1V4

    Skywork-R1V is an advanced multimodal AI model series

    Skywork-R1V is an open-source multimodal reasoning model designed to extend the capabilities of large language models into vision-language tasks that require complex logical reasoning. The project introduces a model architecture that transfers the reasoning abilities of advanced text-based models into visual domains so the system can interpret images and perform multi-step reasoning about them. Instead of retraining both language and vision models from scratch, the framework uses a lightweight visual projection layer that connects a pretrained vision backbone with a reasoning-capable language model. This design allows the model to analyze images while maintaining strong textual reasoning performance, enabling tasks such as solving visual math problems, interpreting scientific diagrams, and answering questions about images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Sparrow

    Sparrow

    Structured data extraction and instruction calling with ML, LLM

    Sparrow is an open-source platform designed to extract structured information from documents, images, and other unstructured data sources using machine learning and large language models. The system focuses on transforming complex documents such as invoices, receipts, forms, and scanned pages into structured formats like JSON that can be processed by downstream applications. It combines several components, including OCR pipelines, vision-language models, and LLM-based reasoning modules to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    PaSa

    PaSa

    An advanced paper search agent powered by large language models

    ...The Selector then reads abstracts or full text (depending on what’s available) and decides which papers are relevant.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Reins

    Reins

    Ollama client that simplifies experimenting with LLMs

    ...It also includes features for editing and regenerating messages, enabling iterative refinement of outputs without restarting conversations. Reins extends beyond text by supporting image input and multimodal interactions, which expands its use cases beyond basic chat scenarios. Overall, it is best suited for users who want granular control over model behavior and experimentation while maintaining a clean and intuitive interface.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Banana Slides

    Banana Slides

    A native AI PPT generation application based on nano banana pro

    Banana Slides is an open-source application designed to automatically generate presentation slides using artificial intelligence. Built on top of the Nano Banana Pro framework, the software enables users to transform simple prompts or outlines into complete slide decks without manually formatting content. Instead of relying on traditional slide editing workflows, the system allows users to describe the desired presentation in natural language and have the AI generate structured slides,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    DeepTutor

    DeepTutor

    AI-Powered Personalized Learning Assistant

    ...It goes beyond simple Q&A by constructing multi-stage educational narratives, breaking down complex topics into sequenced “lesson steps,” and offering prompts, examples, and exercises that build on each other in a logical curriculum. The core architecture combines LLM-based reasoning with structured pedagogy modules so that explanations accommodate different learning styles and address misconceptions in follow-up responses. DeepTutor supports retrieval of external references, definitions, and diagrams so responses are grounded in authoritative content and not just generative text, and it includes internal checks to ensure accuracy and conceptual consistency.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    openclaw-kapso-whatsapp

    openclaw-kapso-whatsapp

    Give your OpenClaw AI agent a WhatsApp number

    openclaw-kapso-whatsapp is a plugin repository designed to extend the OpenClaw AI agent by giving it a dedicated WhatsApp phone number using the official Meta Cloud API via Kapso, enabling direct interaction through one of the most widely used messaging platforms. This integration allows the autonomous AI assistant to send and receive messages on WhatsApp, turning the agent into a real-world task performer accessible through text conversations. The plugin is built in Go and handles...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    MLJAR Studio

    MLJAR Studio

    Python package for AutoML on Tabular Data with Feature Engineering

    We are working on new way for visual programming. We developed a desktop application called MLJAR Studio. It is a notebook-based development environment with interactive code recipes and a managed Python environment. All running locally on your machine. We are waiting for your feedback. The mljar-supervised is an Automated Machine Learning Python package that works with tabular data. It is designed to save time for a data scientist. It abstracts the common way to preprocess the data,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    llama.cpp

    llama.cpp

    LLM inference in C/C++

    llama.cpp is a high-performance C and C++ project for running large language models locally and in the cloud with minimal setup. It is built around efficient inference, broad hardware support, and the GGUF model format. The project supports many model families and has become a major foundation for local AI tools, model serving, and embedded inference workflows. It provides command-line tools, a server mode with an OpenAI-compatible API style, model conversion utilities, and extensive backend...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    CodeGraph

    CodeGraph

    Pre-indexed code knowledge graph for Claude Code, Codex, Cursor

    CodeGraph is a local-first code intelligence tool that gives AI coding agents a pre-indexed understanding of a repository. Instead of forcing agents to repeatedly scan files with grep, glob, and read commands, it builds a searchable knowledge graph of symbols, relationships, call graphs, and code structure. The project uses deterministic parsing rather than LLM summaries, which makes its indexed data more predictable and grounded in the actual source tree. It is designed for tools such as...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB