Search Results for "natural language processing" - Page 18

Showing 1440 open source projects for "natural language processing"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    Benthos

    Benthos

    Fancy stream processing made operationally mundane

    Benthos is a high performance and resilient stream processor, able to connect various sources and sinks in a range of brokering patterns and perform hydration, enrichments, transformations and filters on payloads. It comes with a powerful mapping language, is easy to deploy and monitor, and ready to drop into your pipeline either as a static binary, docker image, or serverless function, making it cloud native as heck. Delivery guarantees can be a dodgy subject. Benthos processes and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    repo2txt

    repo2txt

    Web-based tool converts GitHub repository contents

    repo2txt is an open-source developer tool that converts the contents of a code repository into a single structured text file that can be easily consumed by large language models. The tool is designed to address the challenge of analyzing entire codebases with AI assistants, where code is normally distributed across many files and directories. By collecting repository contents and formatting them into a single text document, repo2txt allows developers to feed complete projects into AI systems...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    NeMo Curator

    NeMo Curator

    Scalable data pre processing and curation toolkit for LLMs

    NeMo Curator is a Python library specifically designed for fast and scalable dataset preparation and curation for large language model (LLM) use-cases such as foundation model pretraining, domain-adaptive pretraining (DAPT), supervised fine-tuning (SFT) and paramter-efficient fine-tuning (PEFT). It greatly accelerates data curation by leveraging GPUs with Dask and RAPIDS, resulting in significant time savings. The library provides a customizable and modular interface, simplifying pipeline...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    DeepSeek-OCR

    DeepSeek-OCR

    Contexts Optical Compression

    ...It supports local deployment, enabling organizations concerned about privacy or latency to run the pipeline on-premises rather than send sensitive documents to third-party cloud services. The codebase is written in Python with a focus on modularity: you can swap preprocessing, recognition, and post-processing components as needed for custom workflows.
    Downloads: 12 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    Stagehand

    Stagehand

    An AI web browsing framework focused on simplicity and extensibility

    An AI web browsing framework focused on simplicity and extensibility. Stagehand is the AI-powered successor to Playwright, offering three simple APIs (act, extract, and observe) that provide the building blocks for natural language-driven web automation. The goal of Stagehand is to provide a lightweight, configurable framework, without overly complex abstractions, as well as modular support for different models and model providers. It's not going to order you a pizza, but it will help you reliably automate the web. Each Stagehand function takes in an atomic instruction, such as act("click the login button") or extract("find the red shoes"), generates the appropriate Playwright code to accomplish that instruction, and executes it.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    OpenClaw

    OpenClaw

    Your own personal AI assistant. Any OS. Any Platform.

    OpenClaw (formerly Clawdbot/Moltbot) is an open-source, self-hosted autonomous AI assistant designed to run on user-controlled hardware and bridge conversational natural language with real-world task execution, effectively acting as a proactive digital assistant rather than a reactive chatbot. It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such as managing calendars, sending emails or messages, browsing the web, executing system commands, and coordinating workflows across services — all while maintaining long-term memory and context across sessions. ...
    Downloads: 172 This Week
    Last Update:
    See Project
  • 7
    ai-renamer

    ai-renamer

    A Node.js CLI that uses Ollama and LM Studio models

    ...The tool supports batch processing, making it particularly useful for organizing large collections of files quickly and efficiently. It also provides customization options such as naming conventions, language preferences, and prompt modifications to tailor the output to specific use cases. By leveraging AI for semantic understanding, it significantly reduces the time spent on manual file organization and improves discoverability.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    AI App Lab

    AI App Lab

    Implementing large models into scenario-based applications

    AI App Lab is an open-source platform developed by Volcengine that provides tools, SDKs, and example applications for building real-world AI applications powered by large language models. The project focuses on helping developers bridge the gap between AI models and practical business use cases by offering a structured environment for creating production-ready AI systems. It includes a high-level SDK called Arkitect, which provides workflows and tools for integrating models, plugins, and multimodal capabilities such as text, image, and voice processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    CyberStrikeAI

    CyberStrikeAI

    CyberStrikeAI is an AI-native security testing platform built in Go

    CyberStrikeAI is an AI-native security testing platform built in Go that brings autonomous penetration testing, vulnerability discovery, and attack chain analysis into a unified interface. The platform integrates over 100 security tools out of the box and pairs them with an intelligent orchestration engine that can be directed via natural language or policy definitions, allowing users to automate reconnaissance, scanning, exploitation, and reporting without manual sequencing of tools. It supports role-based testing, letting teams define security roles with tailored tool access and prompts, and includes a skills system that encapsulates specialized testing strategies that the AI can incorporate into its planning. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    LLM Scraper

    LLM Scraper

    Extract structured data from webpages using LLM-powered scraping

    ...LLM Scraper integrates browser automation through Playwright, allowing it to load webpages and process their content before sending it to a language model for interpretation. Multiple content processing modes are supported, including raw HTML, cleaned HTML, Markdown, extracted text, screenshots, and custom inputs, making it adaptable to a wide range of scraping scenarios. LLM Scraper also provides streaming output and code generation capabilities that help developers build reusable scraping workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    AudioBookConverter

    AudioBookConverter

    Improved AudioBookConverter based on freeipodsoftware release

    AudioBookConverter is a lightweight desktop application designed to convert and organize audiobook files into optimized formats such as M4B for playback on modern devices. It allows users to combine multiple audio files into a single audiobook while preserving chapters and metadata for seamless listening. The software supports a wide range of input formats including MP3, FLAC, and AAC, and provides flexible output options for different devices. It includes intelligent artwork handling,...
    Downloads: 46 This Week
    Last Update:
    See Project
  • 12
    OP Vault

    OP Vault

    Give ChatGPT long-term memory using the OP Stack

    ...Users can query the system in natural language and receive answers that include references to specific files and sections, improving transparency and trust in the responses. The project is designed to handle large volumes of data, making it suitable for personal knowledge management, research archives, or enterprise documentation systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Text-to-image Playground

    Text-to-image Playground

    A playground to generate images from any text prompt using SD

    dalle-playground is an open-source web application that allows users to generate images from natural language text prompts using modern text-to-image generative models. Originally built around DALL-E Mini, the project later transitioned to using Stable Diffusion, enabling more detailed and higher-quality image synthesis. The system combines a backend machine learning service with a browser-based frontend interface that lets users experiment interactively with prompt engineering and generative AI. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    YourInfo

    YourInfo

    Real-time browser fingerprinting demo with cross-browser tracking

    YourInfo is a personal information management tool designed to let users securely store, structure, and retrieve their key data — such as contacts, credentials, personal notes, and preferences — while also enabling AI-assisted queries or reminders using that data. The platform prioritizes privacy by focusing on local storage or user-controlled databases, ensuring sensitive data stays under the user’s control rather than in third-party servers. Users can define types of information, tag...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Grounded-Segment-Anything

    Grounded-Segment-Anything

    Marrying Grounding DINO with Segment Anything & Stable Diffusion

    Grounded-Segment-Anything is a research-oriented project that combines powerful open-set object detection with pixel-level segmentation and subsequent creative workflows, effectively enabling detection, segmentation, and high-level vision tasks guided by free-form text prompts. The core idea behind the project is to pair Grounding DINO — a zero-shot object detector that can locate objects described by natural language — with Segment Anything Model (SAM), which can produce detailed masks for objects once they are localized. This fusion lets users provide arbitrary text descriptions (e.g., “a cat, a bicycle, or a coffee mug”), have the detection model find relevant bounding boxes, and then use SAM to generate precise segmentation masks that isolate each object in the scene.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    NeuTTS Air

    NeuTTS Air

    NeuTTS model built from small LLM backbones

    NeuTTS Air is an open-source collection of on-device text-to-speech speech language models from Neuphonic. It is built for natural-sounding voice generation that can run locally instead of relying on a remote web API. The project emphasizes instant voice cloning, real-time performance, and deployment on smaller devices such as phones, laptops, and Raspberry Pi-class hardware. Its LLM-based architecture is intended to bring more expressive and flexible speech generation to local applications. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    UI UX Pro Max

    UI UX Pro Max

    AI SKILL that provide design intelligence

    ...It uses an AI reasoning engine to generate complete design systems tailored to project requirements, recommending layouts, typography, colors, spacing, and component structures automatically based on natural language prompts. Users can ask for specific UI/UX tasks or design patterns, and the skill will produce guidelines, code snippets, and responsive implementation suggestions that align with industry best practices and accessibility standards. It supports a broad range of tech stacks including HTML/Tailwind, React, Vue, mobile UI frameworks, and more, making it versatile for designers and developers alike. ...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 18
    OpenPlanter

    OpenPlanter

    Language-model investigation agent with a terminal UI

    OpenPlanter is an open-source Python project focused on building an intelligent automated planting or gardening system powered by software control and data processing. The repository is designed to help developers and hobbyists create programmable plant management workflows that can monitor, schedule, and optimize growing conditions. It emphasizes automation and extensibility, allowing integration with sensors, environmental data, and control logic for smart cultivation setups. The system is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    GitHub Copilot SDK

    GitHub Copilot SDK

    Multi-platform SDK for integrating GitHub Copilot Agent into apps

    The GitHub Copilot SDK is a developer toolkit that enables creators to build custom AI-assisted experiences powered by Copilot models within their own applications, editors, and workflows. Instead of being limited to editors like VS Code, this SDK lets teams embed Copilot-style code suggestions, natural language assistance, and predictive completions anywhere they see fit—such as internal IDEs, browser extensions, documentation portals, or bespoke tools tailored to specific languages or frameworks. It provides a structured API surface for invoking the Copilot model in context with the surrounding user state, capturing document content, cursor position, and invocation triggers so suggestions are relevant and responsive. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    WhisperJAV

    WhisperJAV

    Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

    WhisperJAV is an open-source speech transcription pipeline designed specifically for generating subtitles for Japanese adult video content. The project addresses challenges that standard speech recognition models face when transcribing this type of audio, which often includes low signal-to-noise ratios and large numbers of non-verbal vocalizations. Traditional automatic speech recognition systems can misinterpret these sounds as words, leading to inaccurate transcripts. WhisperJAV introduces...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 21
    ESPnet

    ESPnet

    End-to-end speech processing toolkit

    ESPnet is a comprehensive end-to-end speech processing toolkit covering a wide spectrum of tasks, including automatic speech recognition (ASR), text-to-speech (TTS), speech translation (ST), speech enhancement, speaker diarization, and spoken language understanding. It uses PyTorch as its deep learning engine and adopts a Kaldi-style data processing pipeline for features, data formats, and experimental recipes.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Bolt.new

    Bolt.new

    Prompt, run, edit, and deploy full-stack web applications

    Bolt.new is an AI-powered full-stack development platform created by StackBlitz that enables users to build, run, edit, and deploy complete web applications directly from the browser without requiring any local setup or traditional development environment. It operates as an intelligent coding agent where users describe what they want to build in natural language, and the system generates functional applications, including frontend, backend, and infrastructure components. The platform is built on StackBlitz’s WebContainers technology, which allows Node.js environments to run entirely in the browser, eliminating the need for installations while maintaining real development capabilities. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 23
    Exq

    Exq

    Job processing library for Elixir - compatible with Resque / Sidekiq

    Exq is a job processing library compatible with Resque / Sidekiq for the Elixir language. Exq uses Redis as a store for background processing jobs. Exq handles concurrency, job persistence, job retries, reliable queueing and tracking so you don't have to. Jobs are persistent so they would survive across node restarts. You can use multiple Erlang nodes to process from the same pool of jobs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    wasm-bindgen

    wasm-bindgen

    Facilitating high-level interactions between Wasm modules and Js

    ...It allows developers to write performance-critical code in Rust, compile it to WebAssembly, and then expose functions, structs, and APIs directly to JavaScript in a natural and idiomatic way. The tool automates the process of bridging differences between Rust’s type system and JavaScript’s dynamic environment, making cross-language integration significantly easier. One of its key design principles is efficiency, following a “pay only for what you use” model where only the required bindings are generated, minimizing unnecessary overhead. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    Ultravox

    Ultravox

    Fast multimodal LLM for real-time voice interaction and AI apps

    Ultravox is an open source multimodal large language model designed specifically for real-time voice-based interactions. It is built to process both text and spoken audio directly, eliminating the need for a separate speech recognition stage and enabling more seamless conversational experiences. Ultravox works by combining text prompts with encoded audio inputs, allowing it to understand spoken language alongside written instructions in a unified pipeline. Internally, it leverages pretrained...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo