Showing 403 open source projects for "cross-platform"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 1
    Modular Platform

    Modular Platform

    The Modular Platform (includes MAX & Mojo)

    Modular is a high-performance AI infrastructure company repository focused on building next-generation compute and software tools for machine learning workloads. The project centers on enabling developers to run AI models faster and more efficiently by rethinking the traditional ML software stack. It is closely associated with the Mojo programming language and related tooling that aims to combine Python usability with systems-level performance. Modular’s ecosystem is designed to simplify...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Ultimate Vocal Remover (UVR5)

    Ultimate Vocal Remover (UVR5)

    GUI for a Vocal Remover that uses Deep Neural Networks

    This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).
    Downloads: 39,062 This Week
    Last Update:
    See Project
  • 3
    Wanwu AI Agent Platform

    Wanwu AI Agent Platform

    Enterprise AI agent platform for workflows, models, and RAG apps

    Wanwu is an enterprise-grade AI agent development platform designed to help organizations build and deploy intelligent applications at scale. It provides a multi-tenant environment that enables teams to create AI agents, orchestrate workflows, and implement retrieval-augmented generation systems within a unified framework. Wanwu integrates large language models with business process automation, allowing developers to design complex, production-ready AI solutions tailored to enterprise needs. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Hunyuan3D-2.1

    Hunyuan3D-2.1

    From Images to High-Fidelity 3D Assets

    ...It supports both shape generation (mesh geometry) and texture generation modules. Physically Based Rendering texture synthesis to model realistic material effects, including reflections, subsurface scattering, etc. Cross-platform support (MacOS, Windows, Linux) via Python / PyTorch, including diffusers-style APIs.
    Downloads: 16 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    pyttsx3

    pyttsx3

    Offline Text To Speech synthesis for python

    ...It is designed to work entirely without an internet connection, making it suitable for local automation, kiosks, accessibility tools, and embedded applications. On Windows it uses SAPI5, on Linux it typically uses eSpeak or eSpeak-NG, and on macOS it can use NSSpeechSynthesizer or AVSpeechSynthesizer, giving it broad cross-platform compatibility. The library exposes a simple but flexible API for controlling voice selection, speaking rate, volume, and other synthesis parameters from Python code. It supports both a high-level speak convenience function and a lower-level engine object with event hooks, queuing, and saving output to audio files. The repository includes examples and documentation that show how to adjust properties dynamically, persist synthesized output, and integrate pyttsx3 into GUIs or background services.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 6
    PennyLane

    PennyLane

    A cross-platform Python library for differentiable programming

    A cross-platform Python library for differentiable programming of quantum computers. Train a quantum computer the same way as a neural network. Built-in automatic differentiation of quantum circuits, using the near-term quantum devices directly. You can combine multiple quantum devices with classical processing arbitrarily! Support for hybrid quantum and classical models, and compatible with existing machine learning libraries.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 7
    CLI-Anything

    CLI-Anything

    Making ALL Software Agent-Native

    ...The system provides a methodology and tooling for generating CLI wrappers around existing applications, allowing them to be controlled programmatically using natural language instructions interpreted by AI agents. It integrates with multiple AI platforms such as Claude Code, OpenClaw, Codex, and GitHub Copilot CLI, enabling cross-platform compatibility and flexibility. CLI-Anything emphasizes structured outputs such as JSON to reduce parsing complexity and improve reliability in automation scenarios.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    LTX-2

    LTX-2

    Python inference and LoRA trainer package for the LTX-2 audio–video

    LTX-2 is a powerful, open-source toolkit developed by Lightricks that provides a modular, high-performance base for building real-time graphics and visual effects applications. It is architected to give developers low-level control over rendering pipelines, GPU resource management, shader orchestration, and cross-platform abstractions so they can craft visually compelling experiences without starting from scratch. Beyond basic rendering scaffolding, LTX-2 includes optimized math libraries, resource loaders, utilities for texture and buffer handling, and integration points for native event loops and input systems. The framework targets both interactive graphical applications and media-rich experiences, making it a solid foundation for games, creative tools, or visualization systems that demand both performance and flexibility. ...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 9
    Dolphin

    Dolphin

    Document Image Parsing via Heterogeneous Anchor Prompting”

    ...It seeks to combine performant media playback or handling (audio/video decoding, streaming, buffering) with a modular, developer-friendly API that allows easy embedding into larger applications or services. Because multimedia delivery requirements vary widely (adaptive streaming, live feeds, cross-platform compatibility, custom UI, performance constraints), Dolphin aims to offer a foundation that developers can build upon or adapt to their needs. It is designed to integrate with other tools and libraries and provide stable playback or media-processing pipelines, while remaining open-source so that users can inspect, extend, and adapt it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    Cua

    Cua

    Open-source infrastructure for Computer-Use Agents. Sandboxes

    ...It introduces a declarative syntax for specifying build scripts, automation pipelines, environment setups, and project-specific commands so contributors don’t need to memorize disparate scripts or tooling across languages and ecosystems. Cua can also manage task dependencies, handle cross-platform invocations, and simplify complex workflows into simple aliases or compound commands that are easy to share in teams. By centralizing shared commands in a structured, documented config, it helps reduce errors, accelerates onboarding of new contributors, and keeps task definitions versioned with the codebase. The CLI is typically lightweight, easy to install, and designed to integrate with existing toolchains and shells without friction.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    NoneBot

    NoneBot

    Asynchronous multi-platform robot framework written in Python

    ...Asynchronous priority development to improve operational efficiency. Simple and clear dependency injection system, built-in dependency functions reduce user code. NoneBot2 is a modern, cross-platform, and extensible Python chatbot framework. It is based on Python's type annotations and asynchronous features, and can provide convenient and flexible support for your needs. NoneBot2 is written based on Python asyncio , and has a certain degree of synchronous function compatibility based on the asynchronous mechanism. NoneBot2 provides an easy-to-use, interactive command-line tool -- nb-cli, making it easier to get started with NoneBot2 for the first time. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Open-LLM-VTuber

    Open-LLM-VTuber

    Open source AI VTuber platform with voice chat and Live2D avatars

    Open-LLM-VTuber is an open source platform designed to create AI-powered VTuber characters that can interact with users through voice and animated avatars. It enables hands-free conversations with large language models by combining speech recognition, language processing, and text-to-speech synthesis into a single system. Users can speak directly to the AI character, and the system can respond with a generated voice while animating a Live2D avatar to simulate a talking virtual personality. ...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 13
    KaTrain

    KaTrain

    Improve your Baduk skills by training with KataGo

    KaTrain is an advanced training and analysis tool for the board game Go that leverages the powerful KataGo AI engine to provide real-time feedback and in-depth game review capabilities. It is designed to help players of all skill levels improve by identifying mistakes, analyzing move efficiency, and offering alternative strategies based on AI evaluation. The application allows users to play against AI opponents with adjustable difficulty, including intentionally weakened versions of the...
    Downloads: 64 This Week
    Last Update:
    See Project
  • 14
    PaddleOCR

    PaddleOCR

    Awesome multilingual OCR toolkits based on PaddlePaddle

    PaddleOCR offers exceptional, multilingual, and practical Optical Character Recognition (OCR) tools that can help users train better models and apply them into practice. Inspired by PaddlePaddle, PaddleOCR is an ultra lightweight OCR system, with multilingual recognition, digit recognition, vertical text recognition, as well as long text recognition. It features a PPOCR series of high-quality pre-trained models, which includes: ultra lightweight ppocr_mobile series models, general...
    Downloads: 60 This Week
    Last Update:
    See Project
  • 15
    GPT4All

    GPT4All

    Run Local LLMs on Any Device. Open-source

    GPT4All is an open-source project that allows users to run large language models (LLMs) locally on their desktops or laptops, eliminating the need for API calls or GPUs. The software provides a simple, user-friendly application that can be downloaded and run on various platforms, including Windows, macOS, and Ubuntu, without requiring specialized hardware. It integrates with the llama.cpp implementation and supports multiple LLMs, allowing users to interact with AI models privately. This...
    Downloads: 137 This Week
    Last Update:
    See Project
  • 16
    Hermes Agent

    Hermes Agent

    The agent that grows with you

    Hermes Agent is a fully open-source autonomous AI agent designed to run persistently on your own machine or server, becoming more capable the longer it operates by learning from experience and building reusable procedural skills. Rather than functioning as a stateless chatbot, it maintains long-term memory across sessions and can generate searchable “Skill Documents” that capture how it solved complex tasks so it doesn’t start from scratch each time. The agent interfaces with messaging...
    Downloads: 52 This Week
    Last Update:
    See Project
  • 17
    runprompt

    runprompt

    Run LLM prompts from your shell

    ...The project emphasizes extensibility, letting users define custom actions, integrate with existing shell environments, and even leverage fuzzy matching or contextual prompts to narrow down options as you type. Designed to be cross-platform, RunPrompt works with standard shells on Windows, macOS, and Linux while honoring the user’s preferred environment and configurations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    OpenRecall

    OpenRecall

    OpenRecall is a fully open-source, privacy-first alternative

    ...Unlike proprietary alternatives, OpenRecall operates entirely locally, ensuring that all captured data remains on the user’s device and is never transmitted to external servers. The platform supports multiple operating systems, including Windows, macOS, and Linux, making it widely accessible across different environments.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    chatd

    chatd

    Chat with your documents using local AI

    ...The software focuses on privacy and security by ensuring that all document processing and inference occur entirely on the user’s computer without sending data to external cloud services. It includes a built-in integration with the Ollama runtime, which provides a cross-platform environment for running large language models locally. The application typically runs models such as Mistral-7B and allows users to load and analyze documents while asking questions in natural language. Unlike many document-chat tools that require manual installation of model servers, chatd packages the model runner with the application so that users can start interacting with documents immediately after launching the program.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    FaceFusion

    FaceFusion

    Industry leading face manipulation platform

    ...The tool is often used in content creation, visual effects experimentation, and research into generative media. Overall, FaceFusion functions as a flexible and extensible platform for AI-driven face replacement and enhancement tasks.
    Downloads: 393 This Week
    Last Update:
    See Project
  • 21
    Hugging Face Skills

    Hugging Face Skills

    Definitions for AI/ML tasks like dataset creation

    ...Each skill is a self-contained folder with structured metadata and guidance that tells an agent how to execute tasks such as dataset creation, model training, evaluation, or Hub operations. The project is designed to be interoperable across major agent ecosystems, including Claude Code, OpenAI Codex, Gemini CLI, and Cursor, making it a cross-platform building block for agent automation. By formalizing best practices and workflows, Skills helps transform general-purpose coding agents into domain-aware assistants that can execute complex ML pipelines with less manual prompting. The repository also includes ready-to-use skills for common Hugging Face operations and encourages teams to extend them with custom domain logic.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    GPT-SoVITS

    GPT-SoVITS

    1 min voice data can also be used to train a good TTS model

    GPT‑SoVITS is a state-of-the-art voice conversion and TTS system that enables zero‑shot and few‑shot synthesis based on a short vocal sample (e.g., 5 seconds). It supports cross‑lingual speech synthesis across English, Chinese, Japanese, Korean, Cantonese, and more. It's powered by VITS architecture enhanced for few‑sample adaptation and real‑time usability.
    Downloads: 44 This Week
    Last Update:
    See Project
  • 23
    MLC LLM

    MLC LLM

    Universal LLM Deployment Engine with ML Compilation

    MLC LLM is a machine learning compiler and deployment framework designed to enable efficient execution of large language models across a wide range of hardware platforms. The project focuses on compiling models into optimized runtimes that can run natively on devices such as GPUs, mobile processors, browsers, and edge hardware. By leveraging machine learning compilation techniques, mlc-llm produces high-performance inference engines that maintain consistent APIs across platforms. The system...
    Downloads: 27 This Week
    Last Update:
    See Project
  • 24
    Rhino

    Rhino

    On-device Speech-to-Intent engine powered by deep learning

    Rhino is Picovoice's Speech-to-Intent engine. It directly infers intent from spoken commands within a given context of interest, in real-time. The end-to-end platform for embedding private voice AI into any software in a few lines of code. Design with no limits on top of a modular platform. Create use-case-specific voice AI models in seconds. Develop voice features with a few lines of code using intuitive and cross-platform SDKs. Deliver voice AI everywhere: on-device, mobile, web browsers, on-premise, or cloud. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    OpenVoice

    OpenVoice

    Instant voice cloning by MIT and MyShell. Audio foundation model

    ...It is designed not only to match the timbre of the reference voice, but also to give granular control over style parameters such as emotion, accent, rhythm, pauses, and intonation. The model supports cross-lingual and even zero-shot cross-lingual voice cloning, so a speaker recorded in one language can be made to speak naturally in others. Architecturally, OpenVoice separates “tone color” cloning from style control, which makes it easier to keep a consistent identity while flexibly changing prosody or language. The project provides open-weight models, inference code, and examples, making it suitable both for research and for building production voice experiences. ...
    Downloads: 31 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
Auth0 Logo