picture 2 video free download

Showing 24 open source projects for "picture 2 video"

View related business solutions

Artificial Intelligence Mac Clear Filters & Widen Search

Our Free Plans just got better! | Auth0 by Okta
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.

Try free now
Save hundreds of developer hours with components built for SaaS applications.
The #1 Embedded Analytics Solution for SaaS Teams.

Whether you want full self-service analytics or simpler multi-tenant security, Qrvey’s embeddable components and scalable data management remove the guess work.

Try Developer Playground
1

Make-A-Video - Pytorch (wip)

Implementation of Make-A-Video, new SOTA text to video generator

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch. They combine pseudo-3d convolutions (axial convolutions) and temporal attention and show much better temporal fusion. The pseudo-3d convolutions isn't a new concept. It has been explored before in other contexts, say for protein contact prediction as "dimensional hybrid residual networks". The gist of the paper comes down to, take a SOTA text-to-image model (here they use DALL-E2, but the same learning...

Downloads: 1 This Week

Last Update: 2024-05-03
See Project
2

DeepFaceLive

Real-time face swap for PC streaming or video calls

You can swap your face from a webcam or the face in the video using trained face models. There is also a Face Animator module in DeepFaceLive app. You can control a static face picture using video or your own face from the camera. The quality is not the best, and requires fine face matching and tuning parameters for every face pair, but enough for funny videos and memes or real-time streaming at 25 fps using 35 TFLOPS GPU.

Downloads: 116 This Week

Last Update: 2024-07-31
See Project
3

YYeTsBot

Renren Film and Television bot, fully connected to Renren resources

Renren Film and Television bot, fully connected to all Renren Film and television resources without deletion of resources. The database of this site is permanently open source and free. You can directly send the name of the episode you want to watch, and you can choose to share the webpage or link (ed2k and magnet links). When searching for resources, it will be searched according to my predetermined priority (everyone video offline, subtitle man), of course, you can also use commands to force...

Downloads: 1 This Week

Last Update: 2024-01-07
See Project
4

UForm

Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion

UForm is a Multi-Modal Modal Inference package, designed to encode Multi-Lingual Texts, Images, and, soon, Audio, Video, and Documents, into a shared vector space! It comes with a set of homonymous pre-trained networks available on HuggingFace portal and extends the transfromers package to support Mid-fusion Models. Late-fusion models encode each modality independently, but into one shared vector space. Due to independent encoding late-fusion models are good at capturing coarse-grained features...

Downloads: 0 This Week

Last Update: 2024-10-01
See Project
Free CRM Software With Something for Everyone
216,000+ customers in over 135 countries grow their businesses with HubSpot

Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.

Get free CRM
5

Phenaki - Pytorch

Implementation of Phenaki Video, which uses Mask GIT

Implementation of Phenaki Video, which uses Mask GIT to produce text-guided videos of up to 2 minutes in length, in Pytorch. It will also combine another technique involving a token critic for potentially even better generations. A new paper suggests that instead of relying on the predicted probabilities of each token as a measure of confidence, one can train an extra critic to decide what to iteratively mask during sampling. This repository will also endeavor to allow the researcher to train...

Downloads: 0 This Week

Last Update: 2024-07-29
See Project
6

Recurrent Interface Network (RIN)

Implementation of Recurrent Interface Network (RIN)

Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in Pytorch. The author unawaredly reinvented the induced set-attention block from the set transformers paper. They also combine this with the self-conditioning technique from the Bit Diffusion paper, specifically for the latents. The last ingredient seems to be a new noise function based around the sigmoid, which the author claims is better than cosine scheduler...

Downloads: 0 This Week

Last Update: 2024-02-14
See Project
7

Conscious Artificial Intelligence

It's possible for machines to become self-aware.

This project is a quest for conscious artificial intelligence. A number of prototypes will be developed as the project progresses. This project has 2 subprojects: Object Pascal based CAI NEURAL API - https://github.com/joaopauloschuler/neural-api Python based K-CAI NEURAL API - https://github.com/joaopauloschuler/k-neural-api A video from the first prototype has been made: http://www.youtube.com/watch?v=qH-IQgYy9zg Above video shows a popperian agent collecting mining ore from 3...

3 Reviews

Downloads: 4 This Week

Last Update: 2024-10-07
See Project
8

MyBox

Easy Tools of PDF, Image, File, Network, Data, and Medias

javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contain packages need not java env nor installation. Jar packages need Java 16 or higher.

Downloads: 0 This Week

Last Update: 2024-05-18
See Project
9

Translate-Subtitle-File

Subtitle Creation Assistant

Subtitle group machine translation assistant - [Function 1: Translate subtitle file] .srt .ass .vtt [Function 2: Voice to text] (Drag in video or audio to recognize subtitles) (The latest version v4.1.0 Update time 2021 2 May 23) 12 translation service providers can be configured, such as Google, Baidu, Tencent, Caiyun, IBM, Azure, Amazon, etc. (6 voice service providers can be configured: Alibaba Cloud, Xunfei, Tencent Cloud, IBM, Azure, Amazon ) Advantages: 1. You can use multiple service...

Downloads: 1 This Week

Last Update: 2022-09-29
See Project
Red Hat Ansible Automation Platform on Microsoft Azure
Red Hat Ansible Automation Platform on Azure allows you to quickly deploy, automate, and manage resources securely and at scale.

Deploy Red Hat Ansible Automation Platform on Microsoft Azure for a strategic automation solution that allows you to orchestrate, govern and operationalize your Azure environment.

Learn More
10

Deep Exemplar-based Video Colorization

The source code of CVPR 2019 paper "Deep Exemplar-based Colorization"

The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization". End-to-end network for exemplar-based video colorization. The main challenge is to achieve temporal consistency while remaining faithful to the reference style. To address this issue, we introduce a recurrent framework that unifies the semantic correspondence and color propagation steps. Both steps allow a provided reference image to guide the colorization of every frame, thus reducing accumulated propagation errors...

Downloads: 3 This Week

Last Update: 2023-03-23
See Project
11

DCVGAN

DCVGAN: Depth Conditional Video Generation, ICIP 2019.

This paper proposes a new GAN architecture for video generation with depth videos and color videos. The proposed model explicitly uses the information of depth in a video sequence as additional information for a GAN-based video generation scheme to make the model understands scene dynamics more accurately. The model uses pairs of color video and depth video for training and generates a video using the two steps. Generate the depth video to model the scene dynamics based on the geometrical...

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
12

qqbot

A conversation robot base on Tencent's SmartQQ

qqbot is a QQ robot implemented in python and based on Tencent SmartQQ protocol, which can run on Linux, Windows and Mac OSX platforms. During the startup process, the QR code picture will pop up automatically. You need to scan the code with the mobile QQ client and authorize the login. After the startup is successful, the current login information will be saved to the local file. When the next startup, you can enter: qqbot -q qq number , first try to restore the login information from...

Downloads: 0 This Week

Last Update: 2022-01-14
See Project
13

NASH OS

Nash Operating System for Modern Ecommerce

The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.

Downloads: 0 This Week

Last Update: 2019-03-24
See Project
14

Face Recognition

World's simplest facial recognition api for Python & the command line

Face Recognition is the world's simplest face recognition library. It allows you to recognize and manipulate faces from Python or from the command line using dlib's (a C++ toolkit containing machine learning algorithms and tools) state-of-the-art face recognition built with deep learning. Face Recognition is highly accurate and is able to do a number of things. It can find faces in pictures, manipulate facial features in pictures, identify faces in pictures, and do face recognition on a...

Downloads: 10 This Week

Last Update: 2023-10-11
See Project
15

Five video classification methods

Code that accompanies my blog post outlining five video classification

Classifying video presents unique challenges for machine learning models. As I’ve covered in my previous posts, video has the added (and interesting) property of temporal features in addition to the spatial features present in 2D images. While this additional information provides us more to work with, it also requires different network architectures and, often, adds larger memory and computational demands.We won’t use any optical flow images. This reduces model complexity, training time...

Downloads: 0 This Week

Last Update: 2022-08-19
See Project
16

Lip Reading

Cross Audio-Visual Recognition using 3D Architectures

The input pipeline must be prepared by the users. This code is aimed to provide the implementation for Coupled 3D Convolutional Neural Networks for audio-visual matching. Lip-reading can be a specific application for this work. Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the...

Downloads: 12 This Week

Last Update: 2022-08-11
See Project
17

Scene

Scene is a computer vision framework that performs background subtraction and object tracking, using two traditional algorithms and three more recent algorithms based on neural networks and fuzzy classification rules. For each detected object, Scene sends TUIO messages to one or several client applications. The present release features GPU accelerated versions of all the background subtraction methods and morphological post processing of the object blobs with dilation and erosion filters...

3 Reviews

Downloads: 3 This Week

Last Update: 2016-03-14
See Project
18

Text to Speech for Video

create wav files for video character speech by typing in dialogue

Choose from the "voices" available, and type in what you want the computer to say. A wave file called sounds.wav is stored to the output sub folder. Output is intended primarily for users who need speech for animated characters in videos.

Downloads: 0 This Week

Last Update: 2015-10-16
See Project
19

Tower of Zaldagor

The classic intelligent role playing game

...! Find it here: http://market.android.com/details?id=toz.android See a promo video here: http://www.youtube.com/watch?v=Rh7GGlk5m30 Follow Sarquah on Twitter about the development: http://www.twitter.com/sarquah87

1 Review

Downloads: 0 This Week

Last Update: 2016-10-18
See Project
20

Swarm Wars

Safety in numbers.

REPOSITORY MOVED TO GITHUB: https://github.com/happyjack27/SwarmWars video sample: http://youtu.be/s5mLNbdBQGY A game where you evolve & compete AI swarms. The organisms use swarm intelligence & ant colony optimization. The organisms can communicate through 3-color signaling as well as by laying beacons. They can attack and repair other organisms. They can select mates, and they can gather and distribute food and material. This behavior is controlled by a genetically evolved...

Downloads: 1 This Week

Last Update: 2014-07-20
See Project
21

BayesianCortex

simple algorithm for a realtime interactive visual cortex for painting

A paint program where the canvas is the visual cortex of a simple kind of artificial intelligence. You paint with the mouse into its dreams and it responds by changing what you painted gradually. There will also be an API for using it with other programs as a general high-dimensional space. Each pixel's brightness is its own dimension. Bayesian nodes have exactly 3 childs because that is all thats needed to do NAND in a fuzzy way as Bayes' Rule which is NAND at certain extremes. NAND can be...

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
22

BATLIB

Experimental video-tracking library

Downloads: 0 This Week

Last Update: 2015-03-16
See Project
23

superscape

A landscape viewable outside of a window invites users to participate with the outside environment by actively changing it. A familiar space becomes disorienting when interaction with a familiar setup responds contradictorily to users expectations. This

Downloads: 0 This Week

Last Update: 2015-02-07
See Project
24

WHAY

WHAY is a Video-based Face Recognition tool written in MATLAB. It aims to exploits PCA recognizing as better as possible and tests the limits of this approach.

Downloads: 0 This Week

Last Update: 2015-11-08
See Project