Search Results for "artificial intelligence script" - Page 2


Showing 90 open source projects for "artificial intelligence script"

1

Asteroid

The PyTorch-based audio source separation toolkit for researchers

Asteroid is a PyTorch-based audio source separation toolkit that enables fast experimentation on common datasets. It comes with source code that supports a large range of datasets and architectures, and a set of recipes to reproduce some important papers. Building blocks are designed to be seamlessly plugged together. Filterbanks, encoders, maskers, decoders and losses are all common building blocks that can be combined in a...

Downloads: 0 This Week

Last Update: 2023-10-12
2

DALL-E 2 - Pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as...

Downloads: 1 This Week

Last Update: 2023-10-19
3

MMAction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

OpenMMLab's next generation video understanding toolbox and benchmark. MMAction2 is an open-source toolbox for video understanding based on PyTorch. It is a part of the OpenMMLab project. Modular design: We decompose a video understanding framework into different components. One can easily construct a customized video understanding framework by combining different modules. Supports four major video understanding tasks: MMAction2 implements various algorithms for multiple video understanding...

Downloads: 0 This Week

Last Update: 2023-10-12
4

Stable Diffusion in Docker

Run the Stable Diffusion releases in a Docker container

Run the Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint. Run the Stable Diffusion releases on Hugging Face in a GPU-accelerated Docker container. By default, the pipeline uses the full model and weights, which require a CUDA-capable GPU with 8GB+ of VRAM. It should take a few seconds to create one image. On less powerful GPUs you may need to modify some of the options; see the Examples section for more details. If you lack a...

Downloads: 0 This Week

Last Update: 2023-09-22
5

Roop

One-click face swap

Take a video and replace the face with a face of your choice. You only need one image of the desired face. No dataset, and no training.

Downloads: 151 This Week

Last Update: 2024-09-03
6

MMClassification

OpenMMLab Image Classification Toolbox and Benchmark

MMClassification is an open-source image classification toolbox based on PyTorch. It is a part of the OpenMMLab project. Supports DenseNet, VAN and PoolFormer, and provides pre-trained models. Supports training on IPU. Supports a series of CSP networks, such as CSP-ResNet, CSP-ResNeXt and CSP-DarkNet. MMClassification is an open source project that is contributed by researchers and engineers from various colleges and companies. We appreciate all the contributors who implement their methods or...

Downloads: 0 This Week

Last Update: 2023-07-31
7

VATSG

Video automatic transcribe and translated subtitle generator

VATSG generates an SRT subtitle file from a video file in any source language that Whisper supports, and then produces a translated subtitle file in any target language that DeepL supports. It uses [moviepy](https://github.com/Zulko/moviepy) to generate an MP3, then [faster-whisper](https://github.com/guillaumekln/faster-whisper) for text recognition, and then the DeepL API to generate the subtitle file in your target language (SRT format). If you...

Downloads: 0 This Week

Last Update: 2023-09-19
8

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch

A fully featured audio diffusion library for PyTorch. Includes models for unconditional audio generation, text-conditional audio generation, diffusion autoencoding, upsampling, and vocoding. The provided models are waveform-based; however, the U-Net (built using a-unet), DiffusionModel, diffusion method, and diffusion samplers are all generic to any dimension and highly customizable to work on other formats. Note: no pre-trained models are provided here; this library is meant for research...

Downloads: 0 This Week

Last Update: 2023-03-29
9

AiHound

AI powered image classification for nudity and documents / id-cards

AI Hound is designed to run from a USB pen drive or any other kind of removable, writable media. The program checks all Office documents, images, and videos against various image categories. Currently it can recognize nudity/porn and scanned or photographed documents, ID cards, and credit cards. I am working on a model that also recognizes various types of drugs in images.

Downloads: 10 This Week

Last Update: 2023-04-20
10

FrankMocap

A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

FrankMocap is a monocular 3D human capture system that estimates body, hand, and optionally face pose from a single RGB image or video. It regresses parametric human models (e.g., SMPL/SMPL-X) directly, producing temporally stable meshes and joint angles suitable for animation or analytics. The pipeline couples a robust 2D keypoint detector with 3D mesh regression networks and priors that keep results anatomically plausible. It can run frame-by-frame or with temporal smoothing, and includes...

Downloads: 0 This Week

Last Update: 2025-10-07
11

AnimeGAN

A simple PyTorch Implementation of Generative Adversarial Networks

A simple PyTorch implementation of Generative Adversarial Networks, focusing on anime face drawing. The images are generated from a DCGAN model trained on 143,000 anime character faces for 100 epochs. Manipulating latent codes enables the transition from images in the first row to the last row. The images are not clean; some outliers can be observed, which degrades the quality of the generated images. Anime-style images of 126 tags are collected from danbooru.donmai.us using the crawler tool...

Downloads: 1 This Week

Last Update: 2023-03-21
12

Video Pre-Training

Learning to Act by Watching Unlabeled Online Videos

The Video PreTraining (VPT) repository provides code and model artifacts for a project where agents learn to act by watching human gameplay videos—specifically, gameplay of Minecraft—using behavioral cloning. The idea is to learn general priors of control from large-scale, unlabeled video data, and then optionally fine-tune those priors for more goal-directed behavior via environment interaction. The repository contains demonstration models of different widths, fine-tuned variants (e.g. for...

Downloads: 0 This Week

Last Update: 2025-10-03
13

KAIR

Image Restoration Toolbox (PyTorch). Training and testing codes

Image restoration toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSR/GAN, SwinIR.

Downloads: 14 This Week

Last Update: 2022-08-11
14

pyntcloud

pyntcloud is a Python library for working with 3D point clouds

This page will introduce the general concept of point clouds and illustrate the capabilities of pyntcloud as a point cloud processing tool. Point clouds are one of the most relevant entities for representing three-dimensional data these days, along with polygonal meshes (which are just a special case of point clouds with a connectivity graph attached). In its simplest form, a point cloud is a set of points in a Cartesian coordinate system. Accurate 3D point clouds can nowadays be (easily and...

Downloads: 0 This Week

Last Update: 2022-07-31
15

Alfi Face

Face Recognition based Attendance System for school, college...

ALFI FACE uses facial recognition technology to record attendance through a digital camera that detects and recognizes faces and compares them with the students' face images stored in a face database. Once a recognized face matches a stored image, attendance is marked in the attendance database for that person. Note: When adding a new student, you have to click the "Train the Recognizer" button. In the Excel sheet, dates appear as "#" so that no one can change them.

1 Review

Downloads: 0 This Week

Last Update: 2022-04-16
16

Photonix Photo Manager

A modern, web-based photo management server

A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms. This project is currently in development and not feature complete for a version 1.0 yet. If you don't mind putting up with broken parts or want to help out, run the Docker image and give it a go. I'd love for...

Downloads: 0 This Week

Last Update: 2022-09-02
17

TimeSformer

The official pytorch implementation of our paper

TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch...

Downloads: 0 This Week

Last Update: 2025-10-07
18

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the...

Downloads: 2 This Week

Last Update: 2021-04-08
19

Objectron

A dataset of short, object-centric video clips

The Objectron dataset is a collection of short, object-centric video clips, which are accompanied by AR session metadata that includes camera poses, sparse point-clouds and characterization of the planar surfaces in the surrounding environment. In each video, the camera moves around the object, capturing it from different angles. The data also contain manually annotated 3D bounding boxes for each object, which describe the object’s position, orientation, and dimensions. The dataset consists...

Downloads: 2 This Week

Last Update: 2022-02-21
20

Surface Defect Detection Dataset Papers

Constantly summarizing open source datasets and critical papers

At present, surface defect detection equipment based on machine vision has widely replaced manual visual inspection in various industrial fields, including 3C, automobiles, home appliances, machinery manufacturing, semiconductors and electronics, chemical, pharmaceutical, aerospace, light industry and other industries. Traditional surface defect detection methods based on machine vision often use conventional image processing algorithms or hand-designed features plus classifiers. Generally...

Downloads: 1 This Week

Last Update: 2022-08-17
21

GIMP ML

AI for GNU Image Manipulation Program

This repository introduces GIMP3-ML, a set of Python plugins for the widely popular GNU Image Manipulation Program (GIMP). It brings recent advances in computer vision to the conventional image editing pipeline. Applications from deep learning such as monocular depth estimation, semantic segmentation, mask generative adversarial networks, image super-resolution, de-noising and coloring have been incorporated with GIMP through Python-based plugins. Additionally, operations on...

Downloads: 17 This Week

Last Update: 2022-08-19
22

Consistent Depth

We estimate dense, flicker-free, geometrically consistent depth

Consistent Depth is a research project developed by Facebook Research that presents an algorithm for reconstructing dense and geometrically consistent depth information for all pixels in a monocular video. The system builds upon traditional structure-from-motion (SfM) techniques to provide geometric constraints while integrating a convolutional neural network trained for single-image depth estimation. During inference, the model fine-tunes itself to align with the geometric constraints of a...

Downloads: 0 This Week

Last Update: 11 hours ago
23

Linux-Intelligent-Ocr-Solution

Easy-OCR solution and Tesseract trainer for GNU/Linux

Linux-intelligent-ocr-solution (Lios) is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources, such as a PDF, an image, a folder containing images, or a screenshot. The program is fully accessible to visually impaired users. A Tesseract Trainer GUI is also shipped with this package. Forum : https://groups.google.com/forum/#!forum/lios Video Tutorial :...

5 Reviews

Downloads: 2 This Week

Last Update: 2020-10-19
24

Spectral Python

A Python module for hyperspectral image processing

Spectral Python (SPy) is a Python package for reading, viewing, manipulating, and classifying hyperspectral image (HSI) data. SPy includes functions for clustering, dimensionality reduction, supervised classification, and more.

4 Reviews

Downloads: 1 This Week

Last Update: 2020-08-01
25

DeepFaceLab

The leading software for creating deepfakes

DeepFaceLab is currently the world's leading software for creating deepfakes, with over 95% of deepfake videos created with DeepFaceLab. DeepFaceLab is an open-source deepfake system that enables users to swap the faces in images and in video. It offers an imperative and easy-to-use pipeline that even those without a comprehensive understanding of the deep learning framework or model implementation can use, and yet also provides a flexible and loosely coupled structure for those who want to...

1 Review

Downloads: 229 This Week

Last Update: 2023-09-07