27 projects for "identity" with 2 filters applied:

  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 1
    InfiniteYou

    InfiniteYou

    Flexible Photo Recrafting While Preserving Your Identity

    ...The team uses a multi-stage training strategy with synthetic multi-sample data per identity to fine-tune for both identity consistency and aesthetic quality. Compared to prior methods, InfiniteYou significantly improves on identity similarity, text-prompt adherence, overall image quality, and avoids common problems such as face copy-pasting artifacts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    DESIGN.md

    DESIGN.md

    A format specification for describing a visual identity

    design.md is an open specification created by Google Labs that defines a standardized way to describe design systems for AI coding agents. It allows developers to encode visual identity elements such as colors, typography, spacing, and components in a structured format. The file combines machine-readable design tokens with human-readable explanations, enabling agents to generate consistent user interfaces aligned with a brand. By providing persistent design context, it eliminates the need to repeatedly describe styling requirements to AI tools. ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 3
    AgentField

    AgentField

    Build and run AI agents like microservices

    ...Instead of treating agents as isolated scripts or prototypes, the system elevates them to first-class infrastructure components that can be deployed, orchestrated, and managed at scale across distributed environments. Developers define agents as typed functions, and the platform automatically handles orchestration, communication, identity, and execution, allowing agents to behave like APIs within a broader system architecture. The framework includes built-in support for asynchronous execution, long-running processes, and multi-agent coordination, enabling complex workflows that go far beyond simple prompt-response interactions. It also introduces strong identity and governance mechanisms, such as cryptographic identities and policy enforcement, ensuring that agents can operate securely.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Agent Auth Protocol

    Agent Auth Protocol

    Standard and implementation for AI agent authentication

    Agent Auth Protocol is a specification-oriented project within the Better Auth ecosystem that defines a structured protocol for authentication between applications and autonomous agents. It is designed to standardize how AI agents and services securely authenticate, communicate, and manage identity within modern software systems. Built alongside the broader Better Auth framework, it integrates with a comprehensive authentication system that supports features like OAuth, multi-factor authentication, and session management. The protocol focuses on enabling secure interactions in environments where automated agents act on behalf of users or systems, requiring robust identity verification and permission handling. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    HunyuanCustom

    HunyuanCustom

    Multimodal-Driven Architecture for Customized Video Generation

    HunyuanCustom is a multimodal video customization framework by Tencent Hunyuan, aimed at generating customized videos featuring particular subjects (people, characters) under flexible conditions, while maintaining subject/identity consistency. It supports conditioning via image, audio, video, and text, and can perform subject replacement in videos, generate avatars speaking given audio, or combine multiple subject images. The architecture builds on HunyuanVideo, with added modules for identity reinforcement and modality-specific condition injection. Text-image fusion module based on LLaVA for improved multimodal understanding. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    GitClaw

    GitClaw

    A universal git-native AI agent framework

    ...For example, identity and personality may be defined in files such as SOUL.md, while behavioral constraints and policies can be placed in rule definitions. Memory is persisted directly in the repository as version-controlled files, which means conversations, experiences, or learned data can be tracked over time using Git history.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    FLUX.2

    FLUX.2

    Official inference repo for FLUX.2 models

    ...FLUX.2 is built with a modern architecture (a flow-matching transformer + a revamped VAE + a strong vision-language encoder), enabling strong prompt adherence, correct rendering of text/typography in images, reliable lighting, layout, and physical realism, and consistent style/character/product identity across multiple generations or edits.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 8
    DreamO

    DreamO

    A Unified Framework for Image Customization

    DreamO is a unified, open-source framework from ByteDance for advanced image customization and generation that consolidates multiple “image manipulation” tasks into a single system, rather than requiring separate specialized models. Built on a diffusion-transformer (DiT) backbone, it supports a diverse set of tasks — including identity preservation, virtual “try-on” (e.g. clothing, accessories), style transfer, IP adaptation (objects/characters), and layout/condition-aware customizations — all handled within the same unified architecture. DreamO’s design introduces a feature routing constraint that helps disentangle different control conditions (like identity, style, clothing) when more than one is specified, which significantly reduces conflicts and artifacts when combining controls. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Universal Commerce Protocol (UCP)

    Universal Commerce Protocol (UCP)

    The common language for platforms, agents and businesses.

    ...Its modular, capability-based architecture allows businesses to expose only what they support while remaining flexible and extensible. By leveraging existing industry standards for payments, identity, and security, UCP avoids reinventing the wheel while ensuring reliability and trust. The result is a developer-friendly, future-ready protocol that simplifies commerce integration at global scale.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    HunyuanVideo-I2V

    HunyuanVideo-I2V

    A Customizable Image-to-Video Model based on HunyuanVideo

    HunyuanVideo-I2V is a customizable image-to-video generation framework from Tencent Hunyuan, built on their HunyuanVideo foundation. It extends video generation so that given a static reference image plus an optional prompt, it generates a video sequence that preserves the reference image’s identity (especially in the first frame) and allows stylized effects via LoRA adapters. The repository includes pretrained weights, inference and sampling scripts, training code for LoRA effects, and support for parallel inference via xDiT. Resolution, video length, stability mode, flow shift, seed, CPU offload etc. Parallel inference support using xDiT for multi-GPU speedups. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    UNO

    UNO

    A Universal Customization Method for Single and Multi Conditioning

    ...It suggests a framework for image (or more general generative) modeling where the model can be conditioned either on a single subject or multiple subjects — which may correspond to generating or customizing images featuring specific people, styles, or objects, possibly with fine-grained control over subject identity or composition. Because the project is new (see activity logs for 2025), it seems to aim at bridging between single-subject customization and multi-subject generation in generative modeling — potentially useful for personalized content creation, flexible composition, or controlled generation tasks. UNO likely offers tools to fine-tune or condition generation models so that they can incorporate novel subjects, enabling users to produce custom outputs beyond standard training distribution.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    JoyAI-Echo

    JoyAI-Echo

    Pushing the Frontier of Long Audio-Visual Generation

    ...It is designed to create minute-level, multi-shot video stories from structured prompts while preserving continuity across scenes. The system uses a paired cross-modal memory bank to maintain visual identity and voice consistency over longer sequences. It also uses a distilled DMD generator to reduce inference cost and improve generation speed compared with heavier multi-step pipelines. JoyAI-Echo focuses on text-to-video and multi-shot long-video generation, while image-to-video support is not part of the current release scope. It is most useful for research and experimental video workflows that need synchronized audio, coherent characters, and editable story-level generation.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    DramaBox

    DramaBox

    super expressive prompting model based on ltx2.3

    DramaBox is an expressive text-to-speech and voice cloning project from Resemble AI built on top of the LTX-2.3 audio branch. It generates speech from prompts that control not only the spoken text, but also speaker identity, emotion, delivery style, laughs, sighs, pauses, and transitions. Users can optionally provide a voice reference of around 10 seconds or more to clone the target timbre while still guiding performance through scene-style prompting. The project includes a warm inference server, a CLI workflow, and a Gradio app for interactive generation. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Cheshire Cat AI

    Cheshire Cat AI

    AI agent microservice

    ...It is highly extensible through a plugin system that supports custom tools, event hooks, and workflows, giving developers fine-grained control over agent behavior and interactions. Cheshire Cat also supports multi-user environments with granular permissions and identity provider integration, making it suitable for enterprise use cases.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    HunyuanVideo-Avatar

    HunyuanVideo-Avatar

    Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

    HunyuanVideo-Avatar is a multimodal diffusion transformer (MM-DiT) model by Tencent Hunyuan for animating static avatar images into dynamic, emotion-controllable, and multi-character dialogue videos, conditioned on audio. It addresses challenges of motion realism, identity consistency, and emotional alignment. Innovations include a character image injection module, an Audio Emotion Module for transferring emotion cues, and a Face-Aware Audio Adapter to isolate audio effects on faces, enabling multiple characters to be animated in a scene. Character image injection module for better consistency between training and inference conditioning. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    OuteTTS

    OuteTTS

    Interface for OuteTTS models

    ...The project supports multiple backends including llama.cpp (Python bindings and server), Hugging Face Transformers, ExLlamaV2, VLLM and a JavaScript interface via Transformers.js, allowing it to run on CPUs, NVIDIA CUDA GPUs, AMD ROCm, Vulkan-capable GPUs, and Apple Metal. It also includes a notion of speaker profiles: you can create a speaker from a short audio sample, save it as JSON, and reuse it for consistent voice identity across generations and sessions. For best quality, the model is designed to work with a reference speaker clip and will inherit emotion, style, and accent from that reference.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    MARS5

    MARS5

    MARS5 speech model (TTS) from CAMB.AI

    ...The model is built to handle prosodically challenging content such as sports commentary, anime dialogue, and other high-energy or highly varied speech patterns with realistic rhythm and intonation. To control speaker identity, MARS5 uses a short reference audio clip, typically between 2 and 12 seconds, from which it learns the voice characteristics. It supports two main inference modes: shallow clone, which is faster and only needs the reference audio, and deep clone, which additionally uses the transcript of the reference audio to increase similarity and naturalness at the cost of more computation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Make-A-Video - Pytorch (wip)

    Make-A-Video - Pytorch (wip)

    Implementation of Make-A-Video, new SOTA text to video generator

    Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch. They combine pseudo-3d convolutions (axial convolutions) and temporal attention and show much better temporal fusion. The pseudo-3d convolutions isn't a new concept. It has been explored before in other contexts, say for protein contact prediction as "dimensional hybrid residual networks". The gist of the paper comes down to, take a SOTA text-to-image model (here they use DALL-E2, but the same learning...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    CoTracker

    CoTracker

    CoTracker is a model for tracking any point (pixel) on a video

    CoTracker is a learning-based point tracking system that jointly follows many user-specified points across a video, rather than tracking each point independently. By reasoning about all tracks together, it can maintain temporal consistency, handle mutual occlusions, and reduce identity swaps when trajectories cross. The model takes sparse point queries on one frame and predicts their sub-pixel locations and a visibility score for every subsequent frame, producing long, coherent trajectories. Its transformer-style architecture aggregates information both along time and across points, allowing it to recover tracks even after brief disappearances. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    AEA Framework

    AEA Framework

    A framework for autonomous economic agent (AEA) development

    agents-aea by Fetch.ai is a framework for building autonomous economic agents (AEAs) that can act independently, communicate, and transact on decentralized networks. It focuses on enabling AI-driven agents to participate in digital marketplaces and ecosystems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Talking Head Anime from a Single Image

    Talking Head Anime from a Single Image

    Demo for the "Talking Head Anime from a Single Image"

    ...The underlying model uses deep learning techniques to predict how different facial features and body parts should move based on pose parameters or input signals. This allows the software to create realistic animated frames while preserving the identity and appearance of the original character. The repository includes demo applications that allow users to interact with the system through graphical controls or webcam input to drive character motion. These demonstrations illustrate how generative neural rendering can be used to build real-time avatar systems for virtual characters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    VGGFace2

    VGGFace2

    VGGFace2 Dataset for Face Recognition

    VGGFace2 is a large-scale face recognition dataset developed to support research on facial recognition across variations in pose, age, illumination, and identity. It consists of 3.31 million images covering 9,131 subjects, with an average of over 360 images per subject. The dataset was collected from Google Image Search, ensuring a wide diversity in ethnicity, profession, and real-world conditions. It is split into a training set with 8,631 identities and a test set with 500 identities, making it suitable for benchmarking and large-scale model training. ...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 23
    Resemblyzer

    Resemblyzer

    A python package to analyze and compare voices with deep learning

    ...These embeddings can then be used for speaker similarity, clustering, diarization experiments, voice comparison, and audio dataset exploration. The project is useful for researchers and developers who need a practical way to reason about speaker identity without building a voice encoder from scratch. It can help identify whether two recordings sound like the same speaker or visualize voice relationships across many samples. Its main value is making speaker representation accessible through a simple Python workflow.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    FaceAccess Facial Recognition System

    FaceAccess Facial Recognition System

    FaceAccess is an Access Control System based on Facial Recognition

    ...FaceAccess provides a prototype implementation as a "login module" of an information system. The goal is to enhance the level of system security by periodically checking the user’s identity without disrupting the user’s activities. Installation instructions can be found in the package. If you need anymore guidance, please use the Wiki to post any kind of inquiry. NB: Please Donate to support the development of this project. PM me for other means. Any kind of support will be very much appreciated. Thanks a bunch.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ANts P2P
    ANts P2P realizes a third generation P2P net. It protects your privacy while you are connected and makes you not trackable, hiding your identity (ip) and crypting everything you are sending/receiving from others.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo