Search Results for "ai text to video"

Showing 578 open source projects for "ai text to video"

1

Video Diffusion - Pytorch

Implementation of Video Diffusion Models

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch. It uses a special space-time factored U-net, extending generation from 2D images to 3D videos. 14k for difficult moving mnist (converging much faster and better than NUWA) - wip. Any new developments for text-to-video synthesis will be centralized at Imagen-pytorch...

Downloads: 7 This Week

Last Update: 2024-05-03
2

Make-A-Video - Pytorch (wip)

Implementation of Make-A-Video, new SOTA text to video generator

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch. They combine pseudo-3d convolutions (axial convolutions) and temporal attention and show much better temporal fusion. Pseudo-3d convolutions aren't a new concept. They have been explored before in other contexts, say for protein contact prediction as "dimensional hybrid residual networks". The gist of the paper comes down to this: take a SOTA text-to-image model (here they use DALL-E2, but the same learning...

Downloads: 1 This Week

Last Update: 2024-05-03
3

AI Models

A repository of trained models

All models (at least currently) are supported by chaiNNer, an upscaling GUI that lets you "chain" nodes together to complete both very simple and very complex tasks. Highly recommended for images. If you're looking to upscale videos using the models, use enhancr, since it supports TensorRT, which will allow you to upscale videos at incredible speeds! The GUI is one of the best looking applications out there and is personally my go to...

Downloads: 8 This Week

Last Update: 2023-03-29
4

Video-subtitle-extractor

A GUI tool for extracting hard-coded subtitles (hardsub) from videos

Extracts hard-coded subtitles from videos and generates an srt file. There is no need to apply for a third-party API; text recognition runs locally. A deep learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files. It uses local OCR, so there is no need to set up or call any API or to access online OCR services such as Baidu...

1 Review

Downloads: 42 This Week

Last Update: 2024-10-09
5

Text Generation Web UI

A gradio web UI for running Large Language Models like LLaMA

... efficient text streaming. Parameter presets, 8-bit mode. Layers splitting across GPU(s), CPU, and disk. CPU mode, FlexGen, DeepSpeed ZeRO-3, API with streaming and without streaming. LLaMA model, including 4-bit GPTQ. RWKV model, LoRA (loading and training), Softprompts, and extensions.

Downloads: 5 This Week

Last Update: 2024-10-25
6

Super Easy AI Installer Tool

Application that simplifies the installation of AI-related projects

"Super Easy AI Installer Tool" is a user-friendly application that simplifies the installation process of AI-related repositories for users. The tool is designed to provide an easy-to-use solution for accessing and installing AI repositories with little to no technical hassle. The tool automatically handles the installation process, making it easier for users to access and use AI tools. "Super Easy AI Installer Tool" is currently in an early development phase and may have a few bugs...

Downloads: 5 This Week

Last Update: 2023-09-14
7

AI Upscaler for Blender

AI Upscaler for Blender using Real-ESRGAN

Blender add-on to dramatically reduce render times using the Real-ESRGAN upscaler. Rendering an HD image in Blender takes 37 minutes. Upscaling can render a similar quality image in 5 mins total. Any PC or laptop can now do 3D rendering. 4k images can be rendered in the time it would take to render HD 1080p images. HD 1080p images can be rendered in record time on low-end hardware. Installation is easy. Just install the addon. No special hardware or GPU is required. Upscaling is done...

Downloads: 2 This Week

Last Update: 2023-08-08
8

Obsidian Text Generator Plugin

Text generator is a handy plugin for Obsidian

Text Generator is an open-source AI Assistant Tool that brings the power of Generative Artificial Intelligence to the power of knowledge creation and organization in Obsidian. For example, use Text Generator to generate ideas, attractive titles, summaries, outlines, and whole paragraphs based on your knowledge database.

Downloads: 1 This Week

Last Update: 24 hours ago
9

Vercel AI SDK

Build AI-powered applications with React, Svelte, Vue, and Solid

The Vercel AI SDK is a library for building AI-powered streaming text and chat UIs.

Downloads: 0 This Week

Last Update: 21 hours ago
10

Minimal text diffusion

A minimal implementation of diffusion models for text generation

A minimal implementation of diffusion models of text: it learns a diffusion model of a given text corpus, allowing you to generate text samples from the learned model. The main idea was to retain just enough code to allow training a simple diffusion model and generating samples, remove image-related terms, and make it easier to use. To train a model, run scripts/train.sh. By default, this will train a model on the simple corpus. However, you can change this to any text file using the --train_data...

Downloads: 0 This Week

Last Update: 2023-03-23
11

AI Atelier

A Chinese & English version of the AI art creation software, based on Disco Diffusion

Based on Disco Diffusion, we have developed a Chinese & English version of the AI art creation software "AI Atelier". We offer both Text-To-Image models (Disco Diffusion and VQGAN+CLIP) and Text-To-Text (GPT-J-6B and GPT-NEOX-20B) as options. Making available complete source code of licensed works and modifications, which include larger works using a licensed work, under the same license. Copyright and license notices must be preserved. When a modified version is used to provide a service...

Downloads: 6 This Week

Last Update: 2023-03-23
12

QualityScaler

Image/video AI upscaler app (BSRGAN)

QualityScaler is a Windows app that uses BSRGAN Artificial Intelligence to enhance, enlarge and reduce noise in photographs and videos. QualityScaler is completely written in Python, from the backend to the front end. Image/list of images upscale. Video upscale. Drag & drop files [image / multiple images / video]. Automatic image tiling and merging to avoid GPU VRAM limitation. Resize image/video before upscaling. Multiple GPU support. Compatible images - png, jpeg, bmp, webp, tif. Compatible video...

Downloads: 115 This Week

Last Update: 2024-09-25
13

ChatGPT Desktop Application

🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

ChatGPT Desktop Application (Mac, Windows and Linux)

1 Review

Downloads: 93 This Week

Last Update: 2023-08-03
14

LZ4

Extremely fast compression algorithm

LZ4 is a lossless compression algorithm, providing compression speed > 500 MB/s per core (>0.15 Bytes/cycle). It features an extremely fast decoder, with speed in multiple GB/s per core (~1 Byte/cycle). A high compression derivative, called LZ4_HC, is available, trading customizable CPU time for compression ratio. The LZ4 library is provided as open-source software using a BSD license. This benchmark simulates a simple "static content transfer" scenario such as OS Kernel compression or video game's...

Downloads: 166 This Week

Last Update: 2024-07-22
15

ComfyUI

The most powerful and modular diffusion model GUI, api and backend

The most powerful and modular diffusion model GUI and backend. This UI will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart-based interface. We are a team dedicated to iterating and improving ComfyUI, supporting the ComfyUI ecosystem with tools like node manager, node registry, cli, automated testing, and public documentation. Open source AI models will win in the long run against closed models and we are only at the beginning. Our core mission...

Downloads: 74 This Week

Last Update: 5 days ago
16

AI Chatbots based on GPT Architecture

Training & Implementation of chatbots leveraging GPT-like architecture

Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations. It sure seems like there are a lot of text-generation chatbots out there, but it's hard to find a Python package or model that is easy to tune around a simple text file of message data. This repo is a simple attempt to help solve that problem. ai-msgbot covers the practical use case of building a chatbot that sounds like you (or some dataset/persona you choose...

Downloads: 0 This Week

Last Update: 2023-03-23
17

InvokeAI

InvokeAI is a leading creative engine for Stable Diffusion models

InvokeAI is an implementation of Stable Diffusion, the open source text-to-image and image-to-image generator. It provides a streamlined process with various new features and options to aid the image generation process. It runs on Windows, Mac and Linux machines, and runs on GPU cards with as little as 4 GB of RAM. InvokeAI is a leading creative engine built to empower professionals and enthusiasts alike. Generate and create stunning visual media using the latest AI-driven technologies...

2 Reviews

Downloads: 17 This Week

Last Update: 3 days ago
18

enhancr

Video Frame Interpolation & Super Resolution using NVIDIA's TensorRT

enhancr is an elegant and easy to use GUI for Video Frame Interpolation and Video Upscaling which takes advantage of artificial intelligence - built using node.js and Electron. It was created to enhance the user experience for anyone interested in enhancing video footage using artificial intelligence. The GUI was designed to provide a stunning experience powered by state-of-the-art technologies without feeling clunky and outdated like other alternatives. It features blazing-fast TensorRT...

Downloads: 26 This Week

Last Update: 2023-06-07
19

Frigate

NVR with realtime local object detection for IP cameras

A complete and local NVR designed for Home Assistant with AI object detection. Uses OpenCV and Tensorflow to perform realtime object detection locally for IP cameras. Use of a Google Coral Accelerator is optional, but highly recommended. The Coral will outperform even the best CPUs and can process 100+ FPS with very little overhead.

Downloads: 42 This Week

Last Update: 2024-08-28
20

StoryTeller

Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.

A multimodal AI story teller, built with Stable Diffusion, GPT, and neural text-to-speech (TTS). Given a prompt as an opening line of a story, GPT writes the rest of the plot; Stable Diffusion draws an image for each sentence; a TTS model narrates each line, resulting in a fully animated video of a short story, replete with audio and visuals. To develop locally, install dev dependencies and install pre-commit hooks. This will automatically trigger linting and code quality checks before each...

Downloads: 7 This Week

Last Update: 2023-08-22
21

Coqui TTS

A deep learning toolkit for Text-to-Speech, battle-tested in research

TTS is a library for advanced Text-to-Speech generation. Built on the latest research, it was designed to achieve the best trade-off among ease-of-training, speed and quality. TTS comes with pre-trained models and tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. High-performance Deep Learning models for Text2Speech tasks. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings...

Downloads: 31 This Week

Last Update: 2023-12-12
22

Stable Diffusion v 2.1 web UI

Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img

Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img, inpaint and upscale4x. Gradio app for Stable Diffusion 2 by Stability AI. It uses the Hugging Face Diffusers implementation. Currently supported pipelines are text-to-image, image-to-image, inpainting, upscaling and depth-to-image.

Downloads: 14 This Week

Last Update: 2023-03-22
23

Whishper

Transcribe any audio to text, translate and edit subtitles 100% locally

Open-source, local-first audio transcription and subtitling suite with a simple web UI. Thanks to open-source technologies, Whishper can run 100% offline. Your data never leaves your computer. Whishper allows you to translate your transcriptions to and from more than 60 languages thanks to Argos Translate and LibreTranslate. Download the transcriptions in many formats (json, txt, vtt, srt). Easily edit your subtitles right in the Web-UI.

Downloads: 24 This Week

Last Update: 2024-09-10
24

Waifu2x-Extension-GUI

Video, Image and GIF upscale/enlarge (Super-Resolution)

... unstable than the stable builds because the beta builds have not been fully tested before release. Multimedia support: Supports processing Image, GIF, APNG, and Video at the same time. Full image style support: Multiple built-in algorithms; whether 2D anime or your daily photos & videos, this software can handle all of them. Video frame interpolation: Automatically use AI to interpolate frames after enlarging the video.

Downloads: 11 This Week

Last Update: 2 days ago
25

Vue-Video-Player

@videojs component for @vuejs

video.js player component for Vue. The Video.js player is a component. Like all components, you can define what children it includes, what order they appear in, and what options are passed to them. Components can be given custom options via the lower-camel-case variant of the component name (e.g. controlBar for ControlBar). These can be nested in a representation of grandchild relationships. Video.js playback technologies (i.e. "techs") can be given custom options as part of the options...

Downloads: 1 This Week

Last Update: 2022-08-14