Showing 83 open source projects for "ai video"

View related business solutions
  • Employee monitoring software with screenshots Icon
    Employee monitoring software with screenshots

    Clear visibility and insights into how employees work. Even remotely

    Our computer monitoring software allows employees, field contractors, and freelancers to manually clock in when they begin working on an assignment. The application will take screenshots randomly or at set intervals, which allows employers to observe the work process. The application only tracks activity when the employee is clocked in. No spying, only transparency.
  • Translate docs, audio, and videos in real time with Google AI Icon
    Translate docs, audio, and videos in real time with Google AI

    Make your content and apps multilingual with fast, dynamic machine translation available in thousands of language pairs.

    Google Cloud’s AI-powered APIs help you translate documents, websites, apps, audio files, videos, and more at scale with best-in-class quality and enterprise-grade control and security.
  • 1
    ComfyUI

    ComfyUI

    The most powerful and modular diffusion model GUI, api and backend

    The most powerful and modular diffusion model is GUI and backend. This UI will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart-based interface. We are a team dedicated to iterating and improving ComfyUI, supporting the ComfyUI ecosystem with tools like node manager, node registry, cli, automated testing, and public documentation. Open source AI models will win in the long run against closed models and we are only at the beginning. Our core mission...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Deep Lake

    Deep Lake

    Data Lake for Deep Learning. Build, manage, and query datasets

    Deep Lake (formerly known as Activeloop Hub) is a data lake for deep learning applications. Our open-source dataset format is optimized for rapid streaming and querying of data while training models at scale, and it includes a simple API for creating, storing, and collaborating on AI datasets of any size. It can be deployed locally or in the cloud, and it enables you to store all of your data in one place, ranging from simple annotations to large videos. Deep Lake is used by Google, Waymo, Red...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    DataGym.ai

    DataGym.ai

    Open source annotation and labeling tool for image and video assets

    DATAGYM enables data scientists and machine learning experts to label images up to 10x faster. AI-assisted annotation tools reduce manual labeling effort, give you more time to finetune ML models and speed up your go to market of new products. Accelerate your computer vision projects by cutting down data preparation time up to 50%. A machine learning model is only as good as its training data. DATAGYM is an end-to-end workbench to create, annotate, manage, and export the right training data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • NeoLoad is a very comprehensive tool if you are looking for a performance test tool for web applications and other applications Icon
    Your applications are all built differently, but they all need to perform. NeoLoad simplifies and scales performance testing for everything, from APIs and microservices, to end-to-end application testing through innovative protocol and browser-based capabilities.
  • 5
    Vearch

    Vearch

    A distributed system for embedding-based vector retrieval

    Vearch is the vector search infrastructure for deep learning and AI applications. Vearch is a distributed vector storage and retrieval system which can be easily extended to billions scale. Vearch implements a high-performance, lockless real-time vector indexing subsystem that utilizes various optimization techniques to support millisecond vector update and retrieval. End-to-end one-click deployment. Through the module of the plugin, a complete default visual search system can be deployed just...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Triton Inference Server

    Triton Inference Server

    The Triton Inference Server provides an optimized cloud

    Triton Inference Server is an open-source inference serving software that streamlines AI inferencing. Triton enables teams to deploy any AI model from multiple deep learning and machine learning frameworks, including TensorRT, TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more. Triton supports inference across cloud, data center, edge, and embedded devices on NVIDIA GPUs, x86 and ARM CPU, or AWS Inferentia. Triton delivers optimized performance for many query types, including...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    txtai

    txtai

    Build AI-powered semantic search applications

    txtai executes machine-learning workflows to transform data and build AI-powered semantic search applications. Traditional search systems use keywords to find data. Semantic search applications have an understanding of natural language and identify results that have the same meaning, not necessarily the same keywords. Backed by state-of-the-art machine learning models, data is transformed into vector representations for search (also known as embeddings). Innovation is happening at a rapid pace...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    DeepDetect

    DeepDetect

    Deep Learning API and Server in C++14 support for Caffe, PyTorch

    ... of image tagging, object detection, segmentation, OCR, Audio, Video, Text classification, CSV for tabular data and time series. Neural network templates for the most effective architectures for GPU, CPU, and Embedded devices. Training in a few hours and with small data thanks to 25+ pre-trained models. Full Open Source, with an ecosystem of tools (API clients, video, annotation, ...) Fast Server written in pure C++, a single codebase for Cloud, Desktop & Embedded.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Video Processor

    Comprehensive utility for streamlining video ad creation processes.

    The Video Processor V1 script emerged from a light-hearted debate about automation in the workplace, leading to the creation of a tool capable of automating aspects of ad production. Initially crafted as a jest, this script has evolved into a utility that others might find valuable for streamlining video ad creation processes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • ContractSafe: Contract Management Software Icon
    ContractSafe: Contract Management Software

    Take Control Of Your Contracts Without Wrecking The Budget

    Ditch those spreadsheets, shared drives & crazy-expensive solutions with too many bells & whistles. ContractSafe offers the simplest way to manage your contracts efficiently without breaking the bank.
  • 10
    GemAI

    GemAI

    Virtual Assistant with Limitless Possibilities 🌟

    GemAI is an advanced virtual assistant developed by GemGroup, founded by Sami Chemali and Ahmed Hassan. It combines GemBrain's cutting-edge algorithms with innovative AI technology. GemAI can write perfect essays with compelling ideas and pictures, analyze various media types, and communicate verbally through voice commands. It is multilingual, continuously learning, and delivering reliable results. With GemAI, the possibilities are truly limitless! 😊🌟
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    video-subtitle-remover

    video-subtitle-remover

    AI-based tool for removing hardsubs and text-like watermarks

    Video-subtitle-remover (VSR) is an AI-based software that removes hardcoded subtitles from videos or Pictures.
    Downloads: 45 This Week
    Last Update:
    See Project
  • 12

    ACTIV: AI-apps-ChatGPT- T2Image-Video

    Vuong-ACTIV:: AI-apps for Chatgpt, Text to Image and to Video

    _____ Project founder: Dr. The Anh Vuong (Admin) https://vuongblog.wordpress.com/vuong-publications/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    AI-Demo

    Demo of AI Application : ChatGPT, Text2Images, Text2Video

    AI Demo Applications for Chat GPT, Text2Images, Text2Video Variable AI Demo applications are developing by Students at University Frankfurt. Project founder: Dr. The Anh Vuong (Admin) https://vuongblog.wordpress.com/vuong-publications/ Support-WIKI https://sourceforge.net/p/vuong-aidemo/wiki/Home/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    G2SConverter

    G2SConverter

    Convert models from GoldSource engine to Source engine with AI

    Convert models from GoldSource engine to the Source engine with AI. This utility converts GoldSource engine models to Source engine models. A feature of this utility is the ability to improve the quality of textures of models using Upscaling, deblurring, and normal map generating. All operations to improve the quality of textures are performed by neural networks. To improve the quality of the texture, it is first Upscaled using RealESRGAN. The user can select scaling factor: x2, x4 or x8...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    TagUI

    TagUI

    Free RPA tool by AI Singapore

    Write flows in simple TagUI language and automate away repetitive time-consuming tasks on your computer. Tasks include those on websites (native support for Chrome and Edge), desktop apps, or the command line. The TagUI project is open-source and free forever. It's easy to setup and use, and works on Windows, macOS and Linux. Besides English, flows can be written in 22 other languages, so you can do RPA using your native language. Check out this demo video automating data collection in 4...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 16
    Disco Diffusion

    Disco Diffusion

    Notebooks, models and techniques for the generation of AI Art

    A frankensteinian amalgamation of notebooks, models, and techniques for the generation of AI art and animations. This project uses a special conversion tool to convert the Python files into notebooks for easier development. What this means is you do not have to touch the notebook directly to make changes to it. The tool being used is called Colab-Convert. Initial QoL improvements added, including user-friendly UI, settings+prompt saving, and improved google drive folder organization. Now...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    ruDALL-E

    ruDALL-E

    Generate images from texts. In Russian

    We present a family of generative models from SberDevices and Sber AI! Models allow you to create images that did not exist before. All you need is a text description in Russian or another language. Try to create unique images together with generative artists using your own formulations. Ask generative artists to depict something special for you as well. The Kandinsky 2.0 model uses the reverse diffusion method and creates colorful images on various topics in a matter of seconds by text query...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    NWT - Pytorch (wip)

    NWT - Pytorch (wip)

    Implementation of NWT, audio-to-video generation, in Pytorch

    Implementation of NWT, audio-to-video generation, in Pytorch. The paper proposes a new discrete latent representation named Memcodes, which can be succinctly described as a type of multi-head hard-attention to learned memory (codebook) key/values. They claim the need for less codes and smaller codebook dimensions in order to achieve better reconstructions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    AugLy

    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    PRESENTA Lib

    PRESENTA Lib

    The javascript presentation library for the automation era

    ... at a time. Each scene contains one or more block of content. The scene is responsible to keep blocks together. A block is a minimum unit that renders specific content from a given config object. PRESENTA Lib is designed to be extensible by using external plugins. Each scene can include one or more blocks. A block is responsible to render a specific content or media, such as text, image or video.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    BlackBelt Privacy Tor/i2p+WASTE+VidVoIP

    BlackBelt Privacy Tor/i2p+WASTE+VidVoIP

    Browse on Tor/i2p, Anon p2p Chat / FileTx, Conf / Video VoIP

    Open Source - GPLv3 inc images. *** PLEASE NOTE: There are now 2 seperate versions here. *** One is Pre Firefox 57. The other is Post Firefox 57. *** For those providing mirrors, please enable your users to realize this. Vidalia Based, Tor as a Service Solution. MicroSip: enables FREE PC to PC video calling with no account sign-up and no middleman server. WASTE: enables FREE Conference VoIP, chat, file transfer and support. *** AI Powered *** Tor/i2p: enables safer browsing. Tor/i2p Profile...
    Leader badge
    Downloads: 33 This Week
    Last Update:
    See Project
  • 22
    File-Studio

    File-Studio

    A tool that automates complex file operations.

    File studio is a tool that assists in handling complex file operations such as bulk renaming, organizing folders and more.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    Conversations

    Conversations

    App in java for chatting to a generative A.I. (involving tts and stt)

    Java application for chatting to generative AI Llama3. * The user can speak into the microphone (speechToText), edit the recognized text and send it to the AI. * The AI ​​responds and the server returns that response in real time, and the sentences converted to audio (textToSpeech), and the application broadcasts them through the speaker. The application is prepared so that only one user occupies the server's resources, so if the server is busy, in theory it will not let you connect...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    uweb browser: unlimited power

    minimal suckless android web browser with unlimited power

    - Powerful: html5 enhancement; any urls to host a website; javascript and shell scripting for general processing; and more with Termux. - Customizable: user-defined menus, (new) buttons and gestures for user agents, bookmarklets, url services, shell commands, internal functionality links and text processing etc. - Convenient: Any book (pdf/djvu)/dictionary (mdict)/txt/command line/app/webapp (web extensions) can be search engine. - Tiny: less than 200k - Fast: run fast, even with thousands...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    TikTok Filter Remover

    TikTok Filter Remover

    Removes some filters from TikTok videos

    Introducing our groundbreaking app designed to redefine your TikTok video editing experience! Tired of being limited by filters in downloaded TikTok videos? Our innovative program empowers you to effortlessly remove select filters, giving you unparalleled control over your content. Utilizing an intuitive interface, our app harnesses sophisticated algorithms and the robust capabilities of the Midjourney API to diligently reconstruct frames, striving for optimal results. While our process isn't...
    Downloads: 0 This Week
    Last Update:
    See Project