Search Results for "natural language processing" - Page 12

Showing 532 open source projects for "natural language processing"

1

LiveKit Agents

Framework for building realtime multimodal voice AI agent apps

LiveKit Agents is an open source framework designed for building realtime AI agents that can participate as programmable entities within communication sessions. It enables developers to create conversational and multimodal agents capable of processing voice, audio, and other inputs in realtime environments. These agents can join LiveKit rooms as participants and interact with users or systems through speech, text, and other modalities. LiveKit Agents provides libraries and tooling that allow developers to combine speech-to-text, large language models, and text-to-speech services to build interactive AI experiences. ...

Downloads: 4 This Week

Last Update: 4 days ago
2

Bot Framework SDK for Python

Build and connect intelligent bots that interact naturally

This repository contains code for the Python version of the Microsoft Bot Framework SDK, which is part of the Microsoft Bot Framework - a comprehensive framework for building enterprise-grade conversational AI experiences. This SDK enables developers to model conversation and build sophisticated bot applications using Python. SDKs for JavaScript and .NET are also available. The Microsoft Bot Framework provides what you need to build and connect intelligent bots that interact naturally...

Downloads: 0 This Week

Last Update: 2026-01-05
3

ESPnet

End-to-end speech processing toolkit

ESPnet is a comprehensive end-to-end speech processing toolkit covering a wide spectrum of tasks, including automatic speech recognition (ASR), text-to-speech (TTS), speech translation (ST), speech enhancement, speaker diarization, and spoken language understanding. It uses PyTorch as its deep learning engine and adopts a Kaldi-style data processing pipeline for features, data formats, and experimental recipes.

Downloads: 0 This Week

Last Update: 2026-04-22
4

AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs

AutoGPTQ is an implementation of GPTQ, a post-training quantization method for generative pre-trained transformers. It optimizes large language models (LLMs) for faster inference by reducing their computational footprint while maintaining accuracy.

Downloads: 4 This Week

Last Update: 2025-01-21
5

Memobase

Fast backend for long-term AI user memory via structured profiles

...Instead of relying purely on traditional embedding-based retrieval or RAG systems, Memobase uses profile and timeline structures to deliver memory that reflects user context efficiently and meaningfully. The system focuses on three principal performance metrics: high search performance, reduced large language model (LLM) costs through batch processing techniques, and low latency with minimal SQL operations. Memobase supports integration with existing LLM workflows via APIs and SDKs (including Python, Node, and Go), making it easy to adopt within diverse application stacks.

Downloads: 3 This Week

Last Update: 16 hours ago
6

CowAgent

AI assistant based on large models that can actively think and plan

...It supports multi-turn conversations with per-user context tracking, allowing more natural and persistent interactions across private and group chats. Developers can extend functionality through a plugin architecture and customizable rules, making it suitable for both personal assistants and enterprise automation scenarios.

Downloads: 4 This Week

Last Update: 2026-06-09
7

Matrix

Multi-Agent daTa geneRation Infra and eXperimentation framework

Matrix is a distributed, large-scale engine for multi-agent synthetic data generation and experiments: it provides the infrastructure to run thousands of “agentic” workflows concurrently (e.g. multiple LLMs interacting, reasoning, generating content, data-processing pipelines) by leveraging distributed computing (like Ray + cluster management). The idea is to treat data generation as a “data-to-data” transformation: each input item defines a task, and the runtime orchestrates asynchronous, peer-to-peer agent workflows, avoiding global synchronization bottlenecks. That design makes Matrix particularly well-suited for large-batch inference, model benchmarking, data curation, augmentation, or generation — whether for language, code, dialogue, or multimodal tasks. ...

Downloads: 4 This Week

Last Update: 2026-03-05
8

Qwen-Image-Layered

Qwen-Image-Layered: Layered Decomposition for Inherent Editability

Qwen-Image-Layered is an extension of the Qwen series of multimodal models that introduces layered image understanding, enabling the model to reason about hierarchical visual structures — such as separating foreground, background, objects, and contextual layers within an image. This architecture allows richer semantic interpretation, enabling use cases such as scene decomposition, object-level editing, layered captioning, and more fine-grained multimodal reasoning than with flat image...

Downloads: 5 This Week

Last Update: 2026-01-05
9

Claude Code Hooks Mastery

Master Claude Code Hooks

...Although the project itself doesn’t include a single coherent application, it functions as a curated collection of advanced hook examples, best practices, and coding patterns that show how to tailor Claude Code to specific use cases such as automated CI workflows, custom command triggers, and integrations with external tools. The repository is part of a larger ecosystem of Claude Code tooling that enables natural-language-driven coding tasks, and the hooks contained here help users go beyond default behaviors to solve real problems efficiently.

Downloads: 0 This Week

Last Update: 2026-02-06
10

Unredact

A simple tool for reading in poorly redacted documents

Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and...

Downloads: 9 This Week

Last Update: 2026-02-03
11

GalTransl

Automated translation solution for visual novels

GalTransl is an automated translation system specifically designed for visual novels, particularly those in the “galgame” genre, leveraging large language models to streamline and enhance the translation process. It integrates support for multiple advanced LLM providers such as GPT-4, Claude, DeepSeek, and other models, enabling high-quality, context-aware translations that go beyond traditional machine translation approaches. The platform is built to handle the unique structure of visual...

Downloads: 1 This Week

Last Update: 2026-05-22
12

HunyuanOCR

OCR expert VLM powered by Hunyuan's native multimodal architecture

HunyuanOCR is an open-source, end-to-end OCR (optical character recognition) Vision-Language Model (VLM) developed by Tencent-Hunyuan. It is designed to unify the entire OCR pipeline (detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation) into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a...

Downloads: 1 This Week

Last Update: 2026-06-02
13

CodeLlama

Inference code for CodeLlama models

...The repo documents the sizes and capabilities (e.g., 7B, 13B, 34B) and highlights features like infilling and large input context to support real IDE workflows. It targets both general software synthesis and language-specific productivity, offering strong performance among open models at release time. Typical usage includes prompt-driven generation, function or class completion, and zero-shot adherence to natural-language instructions about code changes. The ecosystem provides multiple distributions (e.g., HF format) so developers can integrate with standard toolchains and serving stacks. ...

Downloads: 4 This Week

Last Update: 2025-10-08
14

AudioMuse-AI

AudioMuse-AI is an Open Source Dockerized environment

AudioMuse-AI is an open-source system designed to automatically generate playlists and analyze music libraries using artificial intelligence and audio signal processing techniques. The platform runs locally in a Dockerized environment and performs detailed sonic analysis on audio files to understand characteristics such as tempo, mood, and acoustic similarity. By analyzing the underlying audio content rather than relying on external metadata services, the system can organize large personal...

Downloads: 2 This Week

Last Update: 2 days ago
15

Llama Cloud Services

Knowledge Agents and Management in the Cloud

Llama Cloud Services is a suite of tools designed to facilitate the integration of large language models (LLMs) into applications. It offers components for parsing, extracting, and reporting on complex documents, streamlining the process of preparing data for LLM consumption.

Downloads: 0 This Week

Last Update: 2026-02-13
16

Chinese-LLaMA-Alpaca-3

Chinese Llama-3 LLMs developed from Meta Llama 3

Chinese-LLaMA-Alpaca-3 is an open-source project that provides Mandarin-focused large language models based on Meta’s LLaMA-3 architecture, with both foundational and instruction-tuned variants to support high-quality Chinese natural language understanding and generation. It extends the original LLaMA models with expanded Chinese vocabularies and additional pretraining on Chinese corpora to improve semantic encoding and decoding specifically for Chinese text. ...

Downloads: 0 This Week

Last Update: 2026-01-15
17

Fli

Google Flights MCP and Python Library

...In addition to its CLI interface, fli includes a Model Context Protocol (MCP) server that allows AI assistants to interact with flight data using structured tools, enabling natural language queries and automation workflows.

Downloads: 1 This Week

Last Update: 2026-05-24
18

BlenderMCP

Blender Model Context Protocol Integration

BlenderMCP is a bridge that connects Blender, a 3D modeling and rendering software, with AI systems like Claude through the Model Context Protocol, enabling direct AI-driven interaction with 3D environments. It allows users to control Blender using natural language prompts, effectively turning AI into a co-creator for 3D modeling, scene construction, and asset manipulation. The system establishes a two-way communication channel between Blender and the AI, where commands can be sent and results retrieved in real time. It includes features for object manipulation, material editing, and scene inspection, giving the AI deep control over the modeling environment. ...

Downloads: 1 This Week

Last Update: 6 days ago
19

AI Data Science Team

An AI-powered data science team of agents

...It provides a modular agent framework where each agent focuses on a step in the typical data science pipeline — for example, loading data from CSV/Excel files, cleaning and wrangling messy datasets, engineering predictive features, building models with AutoML, connecting to SQL databases, and producing visual outputs — all driven by natural language or programmatic instructions. The project includes ready-to-use applications that showcase these agents in action, such as an exploratory data analysis copilot that generates reports, a pandas data analyst that combines wrangling and plotting, and SQL database agents that can query business databases and output results directly.

Downloads: 1 This Week

Last Update: 2026-01-26
20

ASSERT

Requirement-driven evaluation harness for AI agents and LLM applications

ASSERT is a requirement-driven evaluation harness for AI agents and LLM applications. It turns natural-language specifications, policies, product requirements, and launch criteria into structured tests that can be reviewed, executed, scored, and improved. The pipeline derives behavior categories, generates single-turn and multi-turn test cases, runs them against a target system, and uses an LLM judge to score conversations against the stated policies.

Downloads: 0 This Week

Last Update: 2026-06-04
21

Violin

Open-source Video Translation Skill

Violin is an open-source video translation and dubbing tool that turns existing videos into localized versions with translated voice-over and optional subtitles. It transcribes the original speech, translates the text, generates natural-sounding speech in the target language, and remuxes the new audio back into the video. The project is designed to keep the generated speech aligned with the original timing so the final result feels closer to a real dubbed video. It can be used from the command line, through a FastAPI web app, or as a Claude Code skill. Violin supports multilingual workflows and is useful for creators, educators, localization teams, and developers building automated video translation pipelines. ...

Downloads: 0 This Week

Last Update: 2026-05-19
22

Spark TTS

Spark-TTS Inference Code

Spark TTS is an open-source, PyTorch-based text-to-speech inference system that leverages large language models to produce highly natural, intelligible speech from text input. It uses an efficient single-stream architecture where speech tokens are directly reconstructed from the predictions of an LLM, removing the need for external acoustic models or complex vocoders and making the generation pipeline cleaner and faster. The project supports zero-shot voice cloning, meaning it can imitate a new speaker’s voice without dedicated training for that specific voice, and works across languages, including English and Chinese, even in cross-lingual code-switching scenarios. ...

Downloads: 0 This Week

Last Update: 2026-02-04
23

SkillForge

Ultimate meta-skill for generating best-in-class Claude Code skills

...It introduces a multi-phase architecture where every input or request is triaged intelligently, analyzed deeply through structured lenses, specified formally, synthesized with automated generation, and finally subjected to multi-agent review before being considered complete. The system includes tooling that routes natural language inputs to existing skills, augments them, or generates new ones using autonomous phases, enforcing quality, extensibility, security, and timelessness. By codifying best practices into automated workflows, SkillForge aims to raise the standard of AI skill implementations and make them more robust, reliable, and maintainable.

Downloads: 0 This Week

Last Update: 2026-02-25
24

Step-Audio

Open-source framework for intelligent speech interaction

Step-Audio is a unified, open-source framework aimed at building intelligent speech systems that combine both comprehension and generation: it integrates large language models (LLMs) with speech input/output to handle not only semantic understanding but also rich vocal characteristics like tone, style, dialect, emotion, and prosody. The design moves beyond traditional separate-component pipelines (ASR → text model → TTS), instead offering a multimodal model that ingests speech or audio and produces speech accordingly, enabling natural dialogue, voice cloning, and expressive speech synthesis. ...

Downloads: 0 This Week

Last Update: 2026-03-16
25

MindsDB

Making Enterprise Data Intelligent and Responsive for AI

MindsDB is an AI data solution that enables humans, AI, agents, and applications to query data in natural language and SQL, and get highly accurate answers across disparate data sources and types. MindsDB connects to diverse data sources and applications, and unifies petabyte-scale structured and unstructured data. Powered by an industry-first cognitive engine that can operate anywhere (on-prem, VPC, serverless), it empowers both humans and AI with highly informed decision-making capabilities. ...

Downloads: 3 This Week

Last Update: 2026-04-23