Search Results for "text based" - Page 14

Sort By:

Showing 4524 open source projects for "text based"

View related business solutions

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
1

Evennia

Python MUD/MUX/MUSH/MU* development system

Evennia is a mature, open-source framework written in Python — specifically designed to build text-based, online multiplayer games such as MUDs, MUCKs, MUSHes, MUXes, and other “MU-style” virtual worlds. Rather than prescribing a rigid game structure, Evennia gives you a bare-bones but powerful foundation: default systems handle networking, database/storage, server management, user accounts, characters, rooms, items, chat channels, and basic commands — but you define the gameplay rules, content, and game logic yourself in pure Python modules. ...

Downloads: 0 This Week

Last Update: 2026-02-15
See Project
2

NVIDIA NeMo Framework

Scalable generative AI framework built for researchers and developers

NVIDIA NeMo is a scalable, cloud-native generative AI framework aimed at researchers and PyTorch developers working on large language models, multimodal models, and speech AI (ASR and TTS), with growing support for computer vision. It provides collections of domain-specific modules and reference implementations that make it easier to pre-train, fine-tune, and deploy very large models on multi-GPU and multi-node infrastructure. NeMo 2.0 introduces a Python-based configuration system,...

Downloads: 0 This Week

Last Update: 2026-04-22
See Project
3

WavTokenizer

SOTA discrete acoustic codec models with 40/75 tokens per second

WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures. The model uses a single-quantizer design together with temporal compression to...

Downloads: 2 This Week

Last Update: 2025-11-28
See Project
4

ZetaJS

JS wrapper for ZetaOffice in the browser

The zeta.js library provides the facilities to run an instance of ZetaOffice integrated into your web site, allowing you to control it with JavaScript code via the LibreOffice UNO technology. Use cases range from an in-browser office suite that looks and feels just like its desktop counterpart, to fine-tuned custom text editing and spreadsheet capabilities embedded in your website, to a headless zetajs instance that does document conversion in the background.

Downloads: 2 This Week

Last Update: 2025-05-30
See Project
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
5

InvokeAI

InvokeAI is a leading creative engine for Stable Diffusion models

InvokeAI is an implementation of Stable Diffusion, the open source text-to-image and image-to-image generator. It provides a streamlined process with various new features and options to aid the image generation process. It runs on Windows, Mac and Linux machines, and runs on GPU cards with as little as 4 GB or RAM. InvokeAI is a leading creative engine built to empower professionals and enthusiasts alike.

1 Review

Downloads: 11 This Week

Last Update: 2026-03-22
See Project
6

Betterfox

Firefox user.js for optimal privacy and security

...Betterfox recommends pairing these settings with essential extensions like ad blockers and DNS-level protections to achieve a well-rounded browsing experience. Because the preferences are text-based and version controlled, users can review and customize them to meet their own balance of privacy and convenience.

Downloads: 10 This Week

Last Update: 2026-04-21
See Project
7

ComfyUI-WanVideoWrapper

ComfyUI wrapper nodes for WanVideo and related models

...It acts as a standalone wrapper layer that allows developers and creators to integrate experimental features and models without modifying the core ComfyUI codebase. This design makes it easier to rapidly test new capabilities such as text-to-video and image-to-video generation while avoiding compatibility issues with the main framework. The project supports complex node-based pipelines where users can control sampling, conditioning, and frame continuity across generated sequences. It also enables extended video generation by linking outputs between iterations, allowing for longer and more coherent animations. ...

Downloads: 1 This Week

Last Update: 2026-05-05
See Project
8

paperless-gpt

Use LLMs and LLM Vision (OCR) to handle paperless-ngx

paperless-gpt is an AI-powered extension for document management systems that enhances the capabilities of paperless-ngx by integrating large language models and vision-based OCR to automate document processing and organization. It is designed to transform scanned or uploaded documents into structured, searchable, and intelligently categorized data without requiring manual tagging or sorting. The system uses OCR combined with LLM reasoning to extract text, classify documents, and generate metadata such as tags, titles, and categories automatically. ...

Downloads: 0 This Week

Last Update: 2026-03-19
See Project
9

NeMo Retriever Library

Document content and metadata extraction microservice

NeMo Retriever Library is a scalable microservice framework designed for extracting, structuring, and enriching content from documents to support downstream generative AI applications. It processes various document types by splitting them into components such as text, tables, charts, and images, and then applies OCR and contextual analysis to convert them into structured data formats. The system is built on NVIDIA NIM microservices, enabling high-performance parallel processing and efficient...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

LongWriter

Unleashing 10,000+ Word Generation from Long Context LLMs

...The system uses an agent-based pipeline called AgentWrite that decomposes large writing tasks into smaller subtasks, allowing the model to produce long documents section by section. Researchers also created the LongWriter-6k dataset containing thousands of examples with outputs ranging from a few thousand to tens of thousands of words.

Downloads: 1 This Week

Last Update: 2026-03-06
See Project
11

Podcastfy.ai

Transforming Multimodal Content into Captivating Multilingual Audio

Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.

Downloads: 0 This Week

Last Update: 2024-11-16
See Project
12

Asciidoctor PDF

Asciidoctor PDF: A native PDF converter for AsciiDoc

A fast text processor & publishing toolchain for converting AsciiDoc to HTML5, DocBook & more. Asciidoctor is a fast, open source, Ruby-based text processor for parsing AsciiDoc® into a document model and converting it to output formats such as HTML 5, DocBook 5, manual pages, PDF, EPUB 3, and other formats. Asciidoctor also has an ecosystem of extensions, converters, build plugins, and tools to help you author and publish content written in AsciiDoc.

Downloads: 0 This Week

Last Update: 2025-11-15
See Project
13

InkOS

Autonomous novel writing CLI AI Agent

InkOS is a multi-agent creative writing system designed to automate the production of long-form narrative content such as novels through coordinated AI workflows. The system organizes multiple specialized agents that collaborate in stages, including drafting, reviewing, editing, and refining text, with optional human checkpoints to ensure quality and coherence. Its architecture reflects a pipeline approach where each agent contributes a specific function, allowing iterative improvement...

Downloads: 9 This Week

Last Update: 4 days ago
See Project
14

AI YouTube Shorts Generator

A python tool that uses GPT-4, FFmpeg, and OpenCV

AI-YouTube-Shorts-Generator is a Python-based tool that automates the creation of short-form vertical video clips (“shorts”) from longer source videos — ideal for adapting content for platforms like YouTube Shorts, Instagram Reels, or TikTok. It analyzes input video (whether a local file or a YouTube URL), transcribes audio (with optional GPU-accelerated speech-to-text), uses an AI model to identify the most compelling or engaging segments, and then crops/resizes the video and applies subtitle overlays, producing a polished short video without manual editing. ...

Downloads: 6 This Week

Last Update: 2026-05-03
See Project
15

Bailing

Bailing is a voice dialogue robot similar to GPT-4o

Bailing is an open-source voice-dialogue assistant designed to deliver natural voice-based conversations by combining automatic speech recognition (ASR), voice activity detection (VAD), a large language model (LLM), and text-to-speech (TTS) in a single pipeline. Its goal is to offer a “voice-first” chat experience similar to what one might expect from a system like GPT-4o, but fully open and deployable by users. The project is modular: each core function — ASR, VAD, LLM, TTS — exists as a separately replaceable component, which allows flexibility in picking your preferred models depending on resources or languages. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
16

VisualGLM-6B

Chinese and English multimodal conversational language model

VisualGLM-6B is an open-source multimodal conversational language model developed by ZhipuAI that supports both images and text in Chinese and English. It builds on the ChatGLM-6B backbone, with 6.2 billion language parameters, and incorporates a BLIP2-Qformer visual module to connect vision and language. In total, the model has 7.8 billion parameters. Trained on a large bilingual dataset — including 30 million high-quality Chinese image-text pairs from CogView and 300 million English pairs...

Downloads: 0 This Week

Last Update: 5 days ago
See Project
17

PostgresML

The GPU-powered AI application database

PostgresML is a complete platform in a PostgreSQL extension. Build simpler, faster, and more scalable models right inside your database. Explore the SDK and test open source models in our hosted database. Combine and automate the entire workflow from embedding generation to indexing and querying for the simplest (and fastest) knowledge-based chatbot implementation. Leverage multiple types of natural language processing and machine learning models such as vector search and personalization...

Downloads: 0 This Week

Last Update: 2025-01-16
See Project
18

AnySoftKeyboard

Android (f/w 2.1+) on screen keyboard for multiple languages

The only Android keyboard you'll ever need. Free as in speech and Free as in beer. Android (f/w 4.0.3+, API level 15+) on screen keyboard for multiple languages.

Downloads: 4 This Week

Last Update: 2026-02-08
See Project
19

OpenAI DALL·E AsyncImage SwiftUI

OpenAI swift async text to image for SwiftUI app using OpenAI

SwiftUI views that asynchronously loads and displays an OpenAI image from open API. You just type in your idea and AI will give you an art solution. DALL-E and DALL-E 2 are deep learning models developed by OpenAI to generate digital images from natural language descriptions, called "prompts". You need to have Xcode 13 installed in order to have access to Documentation Compiler (DocC) OpenAI's text-to-image model DALL-E 2 is a recent example of diffusion models. It uses diffusion models for...

Downloads: 0 This Week

Last Update: 2025-08-14
See Project
20

Emacs Markdown Mode

Emacs Markdown Mode

markdown-mode is a major mode for editing Markdown-formatted text. The latest stable version is markdown-mode 2.5, released on Feb 12, 2022. See the release notes for details. markdown-mode is free software, licensed under the GNU GPL, version 3 or later. The primary documentation for Markdown Mode is available below, and is generated from comments in the source code. For a more in-depth treatment, the Guide to Markdown Mode for Emacs covers Markdown syntax, advanced movement and editing in...

Downloads: 0 This Week

Last Update: 2026-03-08
See Project
21

Difftastic

A structural diff that understands syntax

Difftastic is a structural diff tool written in Rust that parses source files using syntax trees (via tree‑sitter) and produces human‑readable diffs at the expression level. It works across 30+ languages and emphasizes readability by aligning code structure rather than lines. Ideal for code review and understanding semantic changes.

Downloads: 0 This Week

Last Update: 2026-04-30
See Project
22

video-use

Edit videos with Claude Code

Video Use is an open-source AI-powered video editing tool that allows users to transform raw footage into polished videos using natural language commands. Designed to work with Claude Code, it automates the entire editing process—from cutting clips to rendering the final output—without requiring manual timelines or complex software interfaces. The system intelligently analyzes audio transcripts and visual cues to make precise, context-aware editing decisions. It supports a wide range of...

Downloads: 16 This Week

Last Update: 7 days ago
See Project
23

Perplexica

Perplexica is an AI-powered answering engine.

Perplexica is a privacy-focused AI answering engine like Perplexity that you can self-host on your own hardware for private, source-cited web research. It combines live internet search results with AI models, letting you use local LLMs via Ollama or connect to providers like OpenAI, Claude, Gemini, and Groq. Powered by SearxNG, it aggregates results from multiple search engines while keeping your identity and queries private. Perplexica offers multiple search modes—Speed, Balanced, and...

Downloads: 16 This Week

Last Update: 2026-04-09
See Project
24

JabRef Bibliography Management

Graphical Java application for managing BibTeX and biblatex

JabRef is an open-source, cross-platform citation and reference management tool. Stay on top of your literature: JabRef helps you to collect and organize sources, find the paper you need and discover the latest research. JabRef is available free of charge and is actively developed. It supports you in every step of your research work.

Downloads: 3 This Week

Last Update: 2025-04-27
See Project
25

ChatGPT Exporter

Export and Share your ChatGPT conversation history

...The tool supports a wide range of output formats including plain text, HTML, Markdown, JSON, and even image-based exports, making it suitable for documentation, knowledge management, and data analysis workflows. One of its key strengths is its ability to preserve formatting such as code blocks, tables, and structured responses, ensuring that exported content remains usable and readable. It also allows exporting entire conversations or selected portions, giving users flexibility depending on their needs.

Downloads: 3 This Week

Last Update: 2026-05-12
See Project