Showing 72 open source projects for "face swap video"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 1
    Deep-Live-Cam

    Deep-Live-Cam

    Real time face swap and one-click video deepfake

    Real time face swap and one-click video deepfake with only a single image. Choose a face (image with the desired face) and the target image/video (image/video in which you want to replace the face) and click on Start. Open File Explorer and navigate to the directory you select your output to be in. You will find a directory named <video_title> where you can see the frames being swapped in real time.
    Downloads: 415 This Week
    Last Update:
    See Project
  • 2
    FaceFusion

    FaceFusion

    Industry leading face manipulation platform

    FaceFusion is an open-source face swapping and facial enhancement toolkit designed for high-quality video and image manipulation workflows. The project enables users to replace faces in images or videos while maintaining temporal consistency and visual realism. It integrates modern deep learning models for face detection, alignment, and blending to produce smoother results than traditional approaches.
    Downloads: 325 This Week
    Last Update:
    See Project
  • 3

    face detection from video file

    نرم افزار تشخیص چهره انسان از روی فیلم

    👁️ Advanced Face Detection Tool - ابزار پیشرفته تشخیص چهره یک نرم افزار مبتنی بر هوش مصنوعی برای شناسایی چهره در ویدیوها با قابلیتهای تعاملی جذاب
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    SoniTranslate

    SoniTranslate

    Synchronized Translation for Videos

    SoniTranslate is a video translation and dubbing system that produces synchronized target-language audio tracks for existing video content. It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets the generated dub track stay in sync with the original video structure. ...
    Downloads: 51 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    OptiScaler

    OptiScaler

    OptiScaler bridges upscaling/frame gen across GPUs

    OptiScaler is an open-source modding tool designed to enhance graphics performance and flexibility in modern PC games by allowing users to replace or modify built-in upscaling technologies. Instead of relying on the upscaling method originally integrated by a game developer, the software intercepts the game’s rendering pipeline and redirects it to alternative technologies chosen by the user. This makes it possible to swap technologies such as NVIDIA DLSS, AMD FSR, or Intel XeSS even if the...
    Downloads: 210 This Week
    Last Update:
    See Project
  • 6
    HunyuanVideo-Avatar

    HunyuanVideo-Avatar

    Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

    ...It addresses challenges of motion realism, identity consistency, and emotional alignment. Innovations include a character image injection module, an Audio Emotion Module for transferring emotion cues, and a Face-Aware Audio Adapter to isolate audio effects on faces, enabling multiple characters to be animated in a scene. Character image injection module for better consistency between training and inference conditioning. Emotion control by extracting emotion reference images and transferring emotional style into video sequences.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    TikTok-ViewBot

    TikTok-ViewBot

    ViewBot using requests updated 2025

    TikTok-ViewBot explores automated interactions with TikTok’s viewing mechanisms for research and educational purposes. The code demonstrates how scripted traffic might be generated and measured, highlighting the kinds of heuristics a platform could use to validate or discount views. It is often used to study rate limits, signature schemes, request patterns, and the fragility of naïve automation. Because it touches on automation against a third-party service, responsible use and adherence to...
    Downloads: 88 This Week
    Last Update:
    See Project
  • 8
    Delphi-OpenCV

    Delphi-OpenCV

    Project Delphi-OpenCV. Translation of OpenCV library header files

    Delphi-OpenCV is a set of Object Pascal bindings and tools that bring OpenCV functionality into the Delphi development environment. It provides access to computer vision capabilities such as image processing, feature detection, and video analysis within native Delphi applications. The project includes sample programs, preconfigured packages, and integration with related libraries like SDL and OpenGL. It supports multiple OpenCV versions and includes resources such as Haar cascades for face detection tasks. Developers can use it to build desktop applications that leverage real-time vision processing. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    MediaPipe Solutions

    MediaPipe Solutions

    Cross-platform, customizable ML solutions

    ...MediaPipe is widely used in computer vision and multimedia applications such as hand tracking, face detection, pose estimation, object recognition, and gesture analysis. The framework includes prebuilt solutions that developers can quickly integrate into applications as well as lower-level APIs that allow custom pipeline construction.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    Qwen-2.5-VL

    Qwen-2.5-VL

    Qwen2.5-VL is the multimodal large language model series

    Qwen2.5 is a series of large language models developed by the Qwen team at Alibaba Cloud, designed to enhance natural language understanding and generation across multiple languages. The models are available in various sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B parameters, catering to diverse computational requirements. Trained on a comprehensive dataset of up to 18 trillion tokens, Qwen2.5 models exhibit significant improvements in instruction following, long-text generation...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 11
    Perception Models

    Perception Models

    State-of-the-art Image & Video CLIP, Multimodal Large Language Models

    Perception Models is a state-of-the-art framework developed by Facebook Research for advanced image and video perception tasks. It introduces two primary components: the Perception Encoder (PE) for visual feature extraction and the Perception Language Model (PLM) for multimodal decoding and reasoning. The PE module is a family of vision encoders designed to excel in image and video understanding, surpassing models like SigLIP2, InternVideo2, and DINOv2 across multiple benchmarks. Meanwhile,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    WhisperJAV

    WhisperJAV

    Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

    WhisperJAV is an open-source speech transcription pipeline designed specifically for generating subtitles for Japanese adult video content. The project addresses challenges that standard speech recognition models face when transcribing this type of audio, which often includes low signal-to-noise ratios and large numbers of non-verbal vocalizations. Traditional automatic speech recognition systems can misinterpret these sounds as words, leading to inaccurate transcripts. ...
    Downloads: 22 This Week
    Last Update:
    See Project
  • 13
    Qwen2.5-Omni

    Qwen2.5-Omni

    Capable of understanding text, audio, vision, video

    Qwen2.5-Omni is an end-to-end multimodal flagship model in the Qwen series by Alibaba Cloud, designed to process multiple modalities (text, images, audio, video) and generate responses both as text and natural speech in streaming real-time. It supports “Thinker-Talker” architecture, and introduces innovations for aligning modalities over time (for example synchronizing video/audio), robust speech generation, and low-VRAM/quantized versions to make usage more accessible. It holds...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MediaPipe Face Detection

    MediaPipe Face Detection

    Detect faces in an image

    The MediaPipe Face Detection model is a high-performance, real-time face detection solution that uses machine learning to identify faces in images and video streams. It is optimized for mobile and embedded platforms, offering fast and accurate face detection while maintaining a small memory footprint. This model supports multiple face detections and is highly efficient, making it suitable for a variety of applications such as augmented reality, user authentication, and facial expression analysis.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    htop

    htop

    An interactive process viewer

    ...Check out the video and slides of Hisham's presentation at FOSDEM 2016 about how this came to be. The current releases support Linux, FreeBSD, OpenBSD, DragonFly BSD, MacOSX and Solaris. This is htop, a cross-platform interactive process viewer. It is a text-mode application (for console or X terminals) and requires ncurses.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 16
    Peinture

    Peinture

    A general-purpose AI image generation framework that supports HF

    Peinture is a sleek, dark-themed web application that brings AI-powered image generation to artists, designers, and casual creators through a modern interface built with React, TypeScript, and Tailwind CSS. Instead of tying users to a single service, Peinture integrates multiple backend providers, including Hugging Face, Gitee AI, Model Scope, and others, so you can switch between models or extend support to custom endpoints — giving you flexibility over where and how your images are...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    Armbian Linux Build Framework

    Armbian Linux Build Framework

    Armbian Linux Build Framework

    ...A distributed image is compressed to its real data size which starts below 1G. Login is possible via serial, HDMI/VGA or SSH. Boot loader and kernel optimizations, memory caching, ZRAM swap, and video acceleration where applicable. Images are made fully automatized from sources. Releases are PGP signed and code is regularly inspected by the community. Long-term support, quick security fixes, documentation and community-based end-user support. Make sure you have a good & reliable SD card and a proper power supply. Insert SD card into a slot and power the board. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    comfyui-mixlab-nodes

    comfyui-mixlab-nodes

    Workflow and speech recognition app

    ...It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. For audio and speech, it provides nodes for SpeechRecognition and SpeechSynthesis, plus workflows that combine voice generation with real-time face swapping and other audio-visual effects. On the AI side, it integrates multiple LLM providers (cloud and local), supports OpenAI-compatible endpoints, Siliconflow models, and includes prompt-focused utilities for random prompt generation, Chinese prompts, clip interrogation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Fast3R

    Fast3R

    Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

    Fast3R is Meta AI’s official CVPR 2025 release for “Towards 3D Reconstruction of 1000+ Images in One Forward Pass.” It represents a next-generation feedforward 3D reconstruction model capable of producing dense point clouds and camera poses for hundreds to thousands of images or video frames in a single inference pass—eliminating the need for slow, iterative structure-from-motion pipelines. Built on PyTorch Lightning and extending concepts from DUSt3R and Spann3r, Fast3R unifies multi-view...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Buzz

    Buzz

    Transcribe and translate audio offline on your personal computer

    Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API. Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for Windows: App is not signed, you will get a warning when you install it. ...
    Leader badge
    Downloads: 9,298 This Week
    Last Update:
    See Project
  • 21
    Morphing Editor

    Morphing Editor

    Java application for creating Morphing videos

    Java application for creating Morphing videos. You can create a Morphing effect with your chosen pictures. Then you will be able to see the morphing result and save the Morphing project in a xml file. You will have the option of creating a video with the Morphing effect of your project. An example of video created with the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Snap Camera Server

    Snap Camera Server

    Self hosted Docker server to use Snap Camera after the shutdown.

    Snap Camera Server is an alternative self hosted solution for Snapcamera. It will let you continue to use Snapcamera with all Snapchat filters after the shutdown on January 25, 2023. ✔️ No previously backed up files or third party server required. ✔️ Access all Snap Lenses from Snapchat without restrictions. ✔️ Import Lenses from local application cache e.g. AppData\Local\Snap\Snap Camera\cache\lenses. ✔️ All files will be stored inside a Docker Volume (e.g. on your local machine). ✔️...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 23

    Liveliness and Face Identification

    Leading free and open-source liveliness check &face recognition system

    This is a free and open-source liveliness check and face recognition project. Essentially, it is an application that can be used as a standalone server or deployed in the cloud. You don’t need prior machine learning skills to set up and use. The application is customizable react based mobile friendly UI and Python based backend. The program is a real-time face detection application. It allows you to detect faces using your webcam and displays the video feed with oval drawn around the detected faces. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Roop

    Roop

    One-click face swap

    Take a video and replace the face with a face of your choice. You only need one image of the desired face. No dataset, and no training.
    Downloads: 130 This Week
    Last Update:
    See Project
  • 25
    EvaDB

    EvaDB

    Database system for building simpler and faster AI-powered application

    ...They are accurate on various tasks ranging from question answering to object tracking in videos. To use an AI model, the user needs to program against multiple low-level libraries, like PyTorch, Hugging Face, Open AI, etc. This tedious process often leads to a complex AI app that glues together these libraries to accomplish the given task. This programming complexity prevents people who are experts in other domains from benefiting from these models. Running these deep learning models on large document or video datasets is costly and time-consuming. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB