  • 1
    RTP-LLM

    Alibaba's high-performance LLM inference engine for diverse apps

    RTP-LLM is an open-source large language model inference acceleration engine developed by Alibaba to provide high-performance serving infrastructure for modern LLM deployments. The system focuses on improving throughput, latency, and resource utilization when running large models in production environments. It achieves this by implementing optimized GPU kernels, batching strategies, and memory management techniques tailored for transformer inference workloads. The framework is designed for...
    Downloads: 0 This Week
  • 2
    UCCL

    UCCL is an efficient communication library for GPUs

    UCCL is a high-performance GPU communication library designed to support distributed machine learning workloads and large-scale AI systems. The library focuses on enabling efficient data transfer and collective communication between GPUs during training and inference processes. It supports a variety of communication patterns including collective operations such as all-reduce as well as peer-to-peer transfers that are commonly used in modern machine learning architectures. UCCL is designed to...
    Downloads: 0 This Week
  • 3
    nndeploy

    An Easy-to-Use and High-Performance AI Deployment Framework

    nndeploy is an open-source framework designed to simplify the deployment of artificial intelligence models across multiple hardware platforms and devices. The framework focuses on making it easier to transform trained AI models into production-ready applications that can run efficiently on desktops, mobile devices, servers, and edge computing hardware. Developers can use visual workflows to design and configure AI processing pipelines by connecting modular nodes that represent different...
    Downloads: 0 This Week
  • 4
    PowerInfer

    High-speed Large Language Model Serving for Local Deployment

    PowerInfer is a high-performance inference engine designed to run large language models efficiently on personal computers equipped with consumer-grade GPUs. The project focuses on improving the performance of local AI inference by optimizing how neural network computations are distributed between CPU and GPU resources. Its architecture exploits the observation that only a subset of neurons in large models are frequently activated, allowing the system to preload frequently used neurons into...
    Downloads: 0 This Week
  • 5
    RunAnywhere

    Production ready toolkit to run AI locally

    RunAnywhere SDKs are a set of cross-platform development tools that enable applications to run artificial intelligence models directly on user devices instead of relying on cloud infrastructure. The toolkit allows developers to integrate language models, speech recognition, and voice synthesis capabilities into mobile or desktop applications while keeping all computation local. By running models entirely on device, the platform eliminates network latency and protects user data because...
    Downloads: 0 This Week
  • 6
    Open Model Zoo

    Pre-trained Deep Learning models and demos

    ...In addition to model files, Open Model Zoo provides demo applications that show realistic usage patterns and help developers quickly prototype and understand inference pipelines in C++, Python, or via the OpenCV Graph API. Tools in the repository also help automate model downloads and other tasks, making it easier to incorporate these models into production systems or custom solutions.
    Downloads: 0 This Week
  • 7
    Seamless Communication

    Foundational Models for State-of-the-Art Speech and Text Translation

    Seamless Communication is a family of AI models from Meta for speech and text translation across languages. It includes SeamlessM4T, a multilingual and multimodal model that handles speech-to-speech, speech-to-text, text-to-speech, and text-to-text translation as well as automatic speech recognition; SeamlessExpressive, which aims to preserve the speaker's vocal style and prosody; and SeamlessStreaming, which translates with low latency while the speaker is still talking.
    Downloads: 0 This Week
  • 8
    3FS

    A high-performance distributed file system

    3FS (the Fire-Flyer File System) is a high-performance distributed file system developed by DeepSeek for AI training and inference workloads. It uses modern SSDs and RDMA networks to provide a shared storage layer that simplifies the development of distributed applications.
    Downloads: 0 This Week
  • 9
    DALI

    A GPU-accelerated library containing highly optimized building blocks

    The NVIDIA Data Loading Library (DALI) is a library for data loading and pre-processing to accelerate deep learning applications. It provides a collection of highly optimized building blocks for loading and processing image, video and audio data. It can be used as a portable drop-in replacement for built-in data loaders and data iterators in popular deep learning frameworks. Deep learning applications require complex, multi-stage data processing pipelines that include loading, decoding,...
    Downloads: 0 This Week
  • 10
    PyTensor

    Python library for defining and optimizing mathematical expressions

    ...Extensible graph framework is suitable for rapid development of custom operators and symbolic optimizations. Implements an extensible graph transpilation framework that currently provides compilation via C, JAX, and Numba. Based on one of the most widely-used Python tensor libraries: Theano.
    Downloads: 1 This Week
  • 11
    Generative AI for Beginners .NET

    Hands-on .NET course for building real-world generative AI apps

    Generative AI for Beginners .NET is a hands-on course that helps developers build real-world AI applications using the .NET ecosystem. It walks through core concepts such as text generation, chat-based interactions, and integrating large language models into applications. Each lesson includes short videos, working code samples, and step-by-step instructions, making it easy to follow and apply immediately. Generative AI for Beginners .NET supports tools like GitHub Models, Azure OpenAI...
    Downloads: 0 This Week
  • 12
    Mooncake

    Mooncake is the serving platform for Kimi

    Mooncake is an open-source infrastructure platform designed to optimize large language model serving by focusing on efficient management and transfer of model data and KV cache. The platform was originally developed as part of the serving infrastructure for the Kimi large language model system. Its architecture centers on a high-performance transfer engine that provides unified data transfer across different storage and networking technologies. This engine enables efficient movement of...
    Downloads: 0 This Week
  • 13
    openctp

    Provides CTP stock options and Zhongtai Securities XTP

    openctp is a technical service platform built around the CTP trading ecosystem that provides CTP compatible interfaces for a wide range of brokerage backends and markets. Its core idea is to wrap heterogeneous stock and derivatives trading gateways such as Zhongtai XTP, Huaxin Qidian TORA, and others with CTPAPI compatible interfaces, so existing CTP programs can connect simply by swapping dynamic libraries rather than rewriting code. The project offers a comprehensive simulation environment...
    Downloads: 0 This Week
  • 14
    FlashMLA

    FlashMLA: Efficient Multi-head Latent Attention Kernels

    FlashMLA is a high-performance decoding kernel library designed especially for Multi-Head Latent Attention (MLA) workloads, targeting NVIDIA Hopper GPU architectures. It provides optimized kernels for MLA decoding, including support for variable-length sequences, helping reduce latency and increase throughput in model inference systems using that attention style. The library supports both BF16 and FP16 data types, and includes a paged KV cache implementation with a block size of 64 to...
    Downloads: 0 This Week
  • 15
    GPTel

    A no-frills ChatGPT client for Emacs

    ...It will ask you for the key if you skipped the previous step. Run it with a prefix argument to start a new session. In the gptel buffer, send your prompt with M-x gptel-send, bound to C-c RET. Set chat parameters (GPT model, directives, etc.) for the session by calling gptel-send with a prefix argument.
    Downloads: 0 This Week
  • 16
    DSharpPlus

    A .NET Standard library for making bots using the Discord API

    All nightly versions are available on NuGet as pre-releases. These are cutting-edge versions automatically built from the latest commit in the master branch of this repository, and as such always contain the latest changes. If you want to use the latest features on Discord, you should use the nightlies. Despite the nature of pre-release software, all changes to the library are held under a level of scrutiny; for this library, unstable does not mean bad quality, rather it means that the API...
    Downloads: 0 This Week
  • 17
    oneDNN

    oneAPI Deep Neural Network Library (oneDNN)

    This software was previously known as Intel(R) Math Kernel Library for Deep Neural Networks (Intel(R) MKL-DNN) and Deep Neural Network Library (DNNL). oneAPI Deep Neural Network Library (oneDNN) is an open-source cross-platform performance library of basic building blocks for deep learning applications. oneDNN is part of oneAPI. The library is optimized for Intel(R) Architecture Processors, Intel Processor Graphics and Xe Architecture graphics. oneDNN has experimental support for the...
    Downloads: 0 This Week
  • 18
    AI4U

    Multi-engine plugin to specify agents with reinforcement learning

    AI4U is a multi-engine plugin (Godot and Unity) that lets you design non-player characters (NPCs) for games using an agent abstraction. In addition, AI4U has a low-level API that lets you connect the agent to any algorithm made available in Python by the reinforcement learning community in particular and by the artificial intelligence community in general. Reinforcement learning promises to overcome traditional navigation mesh mechanisms in games and to provide more autonomous...
    Downloads: 0 This Week
  • 19
    PyTorch/XLA

    Enabling PyTorch on Google TPU

    PyTorch/XLA is a Python package that uses the XLA deep learning compiler to connect the PyTorch deep learning framework and Cloud TPUs. You can try it right now, for free, on a single Cloud TPU with Google Colab, and use it in production and on Cloud TPU Pods with Google Cloud. Take a look at one of our Colab notebooks to quickly try different PyTorch networks running on Cloud TPUs and learn how to use Cloud TPUs as PyTorch devices. We are also introducing new TPU VMs for more transparent...
    Downloads: 0 This Week
  • 20
    AWS IoT FleetWise Edge

    AWS IoT FleetWise Edge Agent

    AWS IoT FleetWise makes it easy and cost-effective for automakers to collect, transform, and transfer vehicle data to the cloud in near real time and use it to build applications with analytics and machine learning that improve vehicle quality, safety, and autonomy. Train autonomous vehicles (AVs) and advanced driver assistance systems (ADAS) with camera data collected from a fleet of production vehicles....
    Downloads: 0 This Week
  • 21
    MIVisionX

    Set of comprehensive computer vision & machine intelligence libraries

    The MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions, along with a Convolutional Neural Network Model Compiler & Optimizer supporting the ONNX and Khronos NNEF™ exchange formats. The toolkit allows for rapid prototyping and deployment of optimized computer vision and machine learning...
    Downloads: 0 This Week
  • 22
    Megatron

    Ongoing research training transformer models at scale

    ...Megatron is also used in NeMo Megatron, a framework to help enterprises overcome the challenges of building and training sophisticated natural language processing models with billions and trillions of parameters.
    Downloads: 0 This Week
  • 23
    NErlNet

    Nerlnet is a framework for research and development

    NErlNet is a research-grade framework for distributed machine learning over IoT and edge devices. Built with Erlang (Cowboy HTTP), OpenNN, and Python (Flask), it enables simulation of clusters on a single machine or real deployment across heterogeneous devices.
    Downloads: 0 This Week
  • 24
    Model Context Protocol (MCP)

    Specification and documentation for the Model Context Protocol

    ...It gives developers a consistent way to expose tools, prompts, resources, and server capabilities to language models. Its broader ecosystem supports many languages, including TypeScript, Python, Java, Kotlin, C#, Go, PHP, Ruby, Rust, and Swift.
    Downloads: 0 This Week
  • 25
    pgvecto.rs

    Vector database plugin for Postgres, written in Rust

    pgvecto.rs is a Postgres extension that provides vector similarity search functions. It is written in Rust and based on pgrx. It is currently under heavy development, so take care when using it in production. Because pgvecto.rs is a Postgres extension, you can use it directly within your existing database, which makes it easy to integrate into your existing workflows and applications. pgvecto.rs supports filtering: you can set conditions when searching or retrieving points. This...
    Downloads: 0 This Week