Search Results for "natural language processing" - Page 13

Sort By:

Showing 1440 open source projects for "natural language processing"

View related business solutions

Linux Clear Filters & Widen Search

Stop Storing Third-Party Tokens in Your Database
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.

Try Auth0 for Free
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
1

WebGLM

An Efficient Web-enhanced Question Answering System

WebGLM is a web-enhanced question-answering system that combines a large language model with web search and retrieval capabilities to produce more accurate answers. The system is based on the General Language Model architecture and was designed to enable language models to interact directly with web information during the question-answering process. Instead of relying solely on knowledge stored in the model’s training data, the system retrieves relevant web content and integrates it into the...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
2

DeerFlow

Deep Research framework, combining language models with tools

DeerFlow is an open-source, community-driven “deep research” framework / multi-agent orchestration platform developed by ByteDance. It aims to combine the reasoning power of large language models (LLMs) with automated tool-use — such as web search, web crawling, Python execution, and data processing — to enable complex, end-to-end research workflows. Instead of a monolithic AI assistant, DeerFlow defines multiple specialized agents (e.g. “planner,” “searcher,” “coder,” “report generator”) that collaborate in a structured workflow, allowing tasks like literature reviews, data gathering, data analysis, code execution, and final report generation to be largely automated. ...

Downloads: 85 This Week

Last Update: 4 days ago
See Project
3

GLM-4-Voice

GLM-4-Voice | End-to-End Chinese-English Conversational Model

GLM-4-Voice is an open-source speech-enabled model from ZhipuAI, extending the GLM-4 family into the audio domain. It integrates advanced voice recognition and generation with the multimodal reasoning capabilities of GLM-4, enabling smooth natural interaction via spoken input and output. The model supports real-time speech-to-text transcription, spoken dialogue understanding, and text-to-speech synthesis, making it suitable for conversational AI, virtual assistants, and accessibility...

Downloads: 2 This Week

Last Update: 4 days ago
See Project
4

Search with Lepton

Lightweight demo to build a conversational AI search engine quickly

Search with Lepton is an open source demonstration project that shows how to build a conversational search engine using the Lepton AI framework. It combines traditional web search with large language models to provide natural language answers to user queries. It retrieves information from supported search engines and uses that context to generate responses through a retrieval-augmented generation approach. The implementation is intentionally minimal, containing fewer than 500 lines of code while still providing a complete working example of an AI-powered search system. ...

Downloads: 2 This Week

Last Update: 4 days ago
See Project
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
5

Mistral Vibe CLI

Minimal CLI coding agent by Mistral

Mistral Vibe is an AI-powered “vibe-coding” command-line interface (CLI) and coding-assistant framework built by Mistral AI to let developers write, refactor, search, and manage code through natural language and context-aware automation, rather than manual typing only. It aims to take developers out of repetitive boilerplate and let them stay “in the flow”: you can ask the tool to generate functions, refactor code, search across the codebase, manipulate files, commit changes via Git, or run commands — all from a unified CLI interface. ...

Downloads: 12 This Week

Last Update: 3 days ago
See Project
6

Flock

Flock is a workflow-based low-code platform for building chatbots

Flock is a workflow-based low-code platform designed for building AI applications such as chatbots, retrieval-augmented generation systems, and multi-agent workflows. The platform uses a visual workflow architecture where different nodes represent processing steps such as input processing, model inference, retrieval operations, and tool execution. Developers can connect these nodes to create complex pipelines that orchestrate multiple language models and external services. Built on technologies such as LangChain, LangGraph, FastAPI, and Next.js, Flock combines a modern web interface with a flexible backend capable of supporting advanced AI workflows. ...

Downloads: 1 This Week

Last Update: 2 days ago
See Project
7

ChatDBG

ChatDBG - AI-assisted debugging. Uses AI to answer 'why'

ChatDBG is an AI-assisted debugging tool that integrates large language models into standard debuggers like pdb, lldb, and gdb. It allows developers to engage in a dialog with the debugger, asking open-ended questions about their program's behavior, and provides error diagnoses and suggested fixes.

Downloads: 0 This Week

Last Update: 2025-11-05
See Project
8

MongoDB Lens

MongoDB Lens: Full Featured MCP Server for MongoDB Databases

MongoDB Lens is a local Model Context Protocol (MCP) server offering full-featured access to MongoDB databases using natural language via large language models (LLMs). It enables users to perform queries, run aggregations, optimize performance, and more through conversational interactions.

Downloads: 0 This Week

Last Update: 2025-04-23
See Project
9

TaxHacker

Self-hosted AI accounting app. LLM analyzer for receipts

TaxHacker is an open-source, self-hosted accounting application that uses artificial intelligence to automate financial record management for freelancers, independent developers, and small businesses. The system is designed to simplify bookkeeping by automatically processing financial documents such as receipts, invoices, and transaction records. It integrates large language models to analyze these documents, extract relevant financial information, and categorize expenses or income based on configurable rules. Users can deploy the application on their own infrastructure, ensuring that financial data remains private and under their control rather than being processed by external services. ...

Downloads: 4 This Week

Last Update: 2026-04-03
See Project
Compliant and Reliable File Transfers Backed by Top Security Certifications
Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.

Start Free Trial
10

Numaflow

Kubernetes-native platform to run massively parallel data/streaming

Numaflow is a Kubernetes-native tool for running massively parallel stream processing. A Numaflow Pipeline is implemented as a Kubernetes custom resource and consists of one or more source, data processing, and sink vertices. Numaflow installs in a few minutes and is easier and cheaper to use for simple data processing applications than a full-featured stream processing platform.

Downloads: 0 This Week

Last Update: 19 hours ago
See Project
11

Step-Video-T2V

State-of-the-art (SoTA) text-to-video pre-trained model

Step-Video-T2V is a state-of-the-art text-to-video foundation model developed to generate videos from natural-language prompts; its 30B-parameter architecture is designed to produce coherent, temporally extended video sequences — up to around 204 frames — based on input text. Under the hood it uses a compressed latent representation (a Video-VAE) to reduce spatial and temporal redundancy, and a denoising diffusion (or similar) process over that latent space to generate smooth, plausible motion and visuals. ...

Downloads: 6 This Week

Last Update: 2025-12-02
See Project
12

Lemon AI

Full-stack Open-source Self-Evolving General AI Agent

LemonAI is an open-source full-stack framework for building autonomous AI agents capable of performing complex tasks such as research, programming, data analysis, and document processing. The platform is designed to run primarily on local infrastructure, providing a privacy-focused alternative to cloud-dependent agent platforms. It integrates with local large language models through tools such as Ollama, vLLM, and other model runtimes while also allowing optional connections to external cloud models. The system includes a multi-agent architecture that supports planning, action execution, reflection, and memory, allowing the agent to reason through tasks and refine results iteratively. ...

Downloads: 1 This Week

Last Update: 2026-03-07
See Project
13

Open CoDesign

Open-source Claude Design alternative

Open CoDesign is an open-source, desktop AI design tool that transforms natural language prompts into fully structured design artifacts such as prototypes, slide decks, and marketing assets. It is designed as a local-first alternative to cloud-based design tools, allowing users to run everything on their own machine while bringing their own AI model and API keys. The system supports multiple model providers and integrates directly with existing developer tools, enabling seamless workflows without vendor lock-in. ...

Downloads: 238 This Week

Last Update: 2026-05-23
See Project
14

Qwen-2.5-VL

Qwen2.5-VL is the multimodal large language model series

Qwen2.5 is a series of large language models developed by the Qwen team at Alibaba Cloud, designed to enhance natural language understanding and generation across multiple languages. The models are available in various sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B parameters, catering to diverse computational requirements. Trained on a comprehensive dataset of up to 18 trillion tokens, Qwen2.5 models exhibit significant improvements in instruction following, long-text generation (exceeding 8,000 tokens), and structured data comprehension, such as tables and JSON formats. ...

Downloads: 18 This Week

Last Update: 2026-01-30
See Project
15

VibeVoice

Open-source multi-speaker long-form text-to-speech model

VibeVoice-1.5B is Microsoft’s frontier open-source text-to-speech (TTS) model designed for generating expressive, long-form, multi-speaker conversational audio such as podcasts. Unlike traditional TTS systems, it excels in scalability, speaker consistency, and natural turn-taking for up to 90 minutes of continuous speech with as many as four distinct speakers. A key innovation is its use of continuous acoustic and semantic speech tokenizers operating at an ultra-low frame rate of 7.5 Hz, enabling high audio fidelity with efficient processing of long sequences. The model integrates a Qwen2.5-based large language model with a diffusion head to produce realistic acoustic details and capture conversational context. ...

Downloads: 9 This Week

Last Update: 2026-05-06
See Project
16

Blinko

An open-source, self-hosted personal AI note tool prioritizing privacy

...It allows users to quickly jot down fleeting thoughts, draft content, and organize ideas with Markdown support, making it easy to record insights as they happen. What sets Blinko apart is its AI-enhanced retrieval — users can search their notes using natural language queries and get relevant results instantly rather than relying solely on keyword matches. Thanks to its lightweight architecture powered by Tauri and React, Blinko runs smoothly across platforms including Windows, macOS, Linux, and mobile, while remaining responsive and efficient even with large notebooks. The project emphasizes extensibility and open collaboration, offering a plugin marketplace and documentation for developers to build and share enhancements.

Downloads: 1 This Week

Last Update: 2026-04-12
See Project
17

Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models

Qwen3-TTS is an open-source text-to-speech (TTS) project built around the Qwen3 large language model family, focused on generating high-quality, natural-sounding speech from plain text input. It provides researchers and developers with tools to transform text into expressive, intelligible audio, supporting multiple languages and voice characteristics tuned for clarity and fluidity. The project includes pre-trained models and inference scripts that let users synthesize speech locally or integrate TTS into larger pipelines such as voice assistants, accessibility tools, or multimedia generation workflows. ...

Downloads: 13 This Week

Last Update: 2026-03-17
See Project
18

Step1X-Edit

A SOTA open-source image editing model

Step1X-Edit is a state-of-the-art open-source image editing model/framework that uses a multimodal large language model (LLM) together with a diffusion-based image decoder to let users edit images simply via natural-language instructions plus a reference image. You supply an existing image and a textual command — e.g. “add a ruby pendant on the girl’s neck” or “make the background a sunset over mountains” — and the model interprets the instruction, computes a latent embedding combining the image content and user intent, then decodes a new image implementing the edit. ...

Downloads: 0 This Week

Last Update: 2026-04-29
See Project
19

Browser Use

Make websites accessible for AI agents

Browser Use is an AI-powered browser automation framework designed to let agents interact with websites just like humans do. It enables developers and AI systems to perform complex online tasks such as form filling, data extraction, and navigation through natural language instructions. Built with Python and compatible with modern LLMs, it integrates seamlessly with tools like ChatBrowserUse, Google Gemini, and Anthropic models. The platform supports both open-source deployment and a fully hosted cloud version for enhanced scalability and performance. Its cloud offering includes advanced capabilities like stealth browsing, CAPTCHA solving, and proxy rotation for reliable automation. ...

Downloads: 1 This Week

Last Update: 6 days ago
See Project
20

SHAP

A game theoretic approach to explain the output of ml models

SHAP (SHapley Additive exPlanations) is a game theoretic approach to explain the output of any machine learning model. It connects optimal credit allocation with local explanations using the classic Shapley values from game theory and their related extensions. While SHAP can explain the output of any machine learning model, we have developed a high-speed exact algorithm for tree ensemble methods. Fast C++ implementations are supported for XGBoost, LightGBM, CatBoost, scikit-learn and pyspark...

Downloads: 1 This Week

Last Update: 2026-05-28
See Project
21

HarfBuzz

Open source text shaping engine

HarfBuzz is an open source text-shaping engine with a C API that turns fonts and strings of character codes into a form that is correctly arranged for the corresponding language and writing system. This is essentially the process of text shaping: translating a string of character codes into a properly arranged sequence of glyphs that can be rendered onto a screen or into final output form for inclusion in a document. This shaping depends on a number of factors: the input string, the active...

Downloads: 7 This Week

Last Update: 2026-06-02
See Project
22

Pluely

The Open Source Alternative to Cluely

Pluely is an open-source AI automation framework designed to simplify the development and deployment of AI-driven workflows across applications and services. The system focuses on orchestrating tasks performed by large language models and other AI components, allowing developers to define structured workflows where models interact with tools, APIs, and external systems. By providing a modular architecture for building AI pipelines, the platform enables developers to connect multiple processing steps such as data retrieval, prompt execution, analysis, and response generation. ...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
23

TorchAudio

Data manipulation and transformation for audio signal processing

...By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch operations which makes it easy to use and feel like a natural extension.

Downloads: 0 This Week

Last Update: 2026-02-17
See Project
24

Riemann

A network event stream processing system, in Clojure

Riemann aggregates events from your servers and applications with a powerful stream processing language. Send an email for every exception in your app. Track the latency distribution of your web app. See the top processes on any host, by memory and CPU. Combine statistics from every Riak node in your cluster and forward to Graphite. Track user activity from second to second. Riemann streams are just functions which accept an event.

Downloads: 0 This Week

Last Update: 2025-05-26
See Project
25

Passmark

The open-source Playwright library for AI browser regression testing

The Passmark project is an open-source AI-powered regression testing framework built on top of Playwright that enables developers to write end-to-end browser tests using natural language instead of traditional scripting. It is designed to simplify and accelerate testing workflows by allowing AI models to interpret human-readable instructions and translate them into executable browser actions. One of its defining features is a cache-first execution model, where AI is used initially to discover how to perform actions, and those actions are then stored and replayed at native speed in future runs. ...

Downloads: 0 This Week

Last Update: 2026-06-08
See Project