Page 2 | gpu faster free download

'Lightweight' GAN

Implementation of 'lightweight' GAN, proposed in ICLR 2021

...You can turn on automatic mixed precision with one flag --amp. You should expect it to be 33% faster and save up to 40% memory. Aim is an open-source experiment tracker that logs your training runs, and enables a beautiful UI to compare them.

Downloads: 0 This Week

Last Update: 2025-01-12

See Project

DeepSpeed

Deep learning optimization library: makes distributed training easy

DeepSpeed is an easy-to-use deep learning optimization software suite that enables unprecedented scale and speed for deep learning training and inference. With DeepSpeed you can:
1. Train or run inference on dense or sparse models with billions or trillions of parameters
2. Achieve excellent system throughput and efficiently scale to thousands of GPUs
3. Train or run inference on resource-constrained GPU systems
4. Achieve unprecedented low latency and high throughput for inference
5. Achieve extreme...

Downloads: 0 This Week

Last Update: 2026-03-13

See Project

HunyuanVideo-I2V

A Customizable Image-to-Video Model based on HunyuanVideo

HunyuanVideo-I2V is a customizable image-to-video generation framework developed by Tencent, extending the capabilities of HunyuanVideo. It creates high-quality video from still images using PyTorch, and provides pre-trained model weights, inference code, and customizable training options. The system includes LoRA training code for adding special effects and enhancing video realism, aiming to offer versatile and scalable solutions for generating videos from static image inputs.

1 Review

Downloads: 2 This Week

Last Update: 2025-03-10

See Project

A2M — Audio to MIDI

A2M is a desktop app that converts AUDIO TO MIDI in one click.

...Using A2M is straightforward: Select an audio file, click Convert, and the application generates a MIDI file automatically in your Downloads/A2M folder. All processing is done locally on your device, no uploads, no accounts, and no telemetry. The app runs on CPU by default, with optional NVIDIA GPU acceleration for faster conversions. Project links: Website: justagwas.com/projects/a2m GitHub: github.com/Justagwas/a2m Documentation: https://github.com/Justagwas/a2m/wiki A2M is fully open source and operates only on the files you choose. VirusTotal scan result: https://www.virustotal.com/gui/file/cc2a961baaaac2f8932c2e9ed04f0c27a55309cc03ed0825e44c8af18e263ce6

Downloads: 23 This Week

Last Update: 2026-02-20

See Project

EvaDB

Database system for building simpler and faster AI-powered applications

Over the last decade, AI models have radically changed the world of natural language processing and computer vision. They are accurate on various tasks ranging from question answering to object tracking in videos. To use an AI model, the user needs to program against multiple low-level libraries, like PyTorch, Hugging Face, OpenAI, etc. This tedious process often leads to a complex AI app that glues together these libraries to accomplish the given task. This programming complexity prevents...

Downloads: 0 This Week

Last Update: 2023-11-19

See Project

Detectron

FAIR's research platform for object detection research

Detectron is an object detection and instance segmentation research framework that popularized many modern detection models in a single, reproducible codebase. Built on Caffe2 with custom CUDA/C++ operators, it provided reference implementations for models like Faster R-CNN, Mask R-CNN, RetinaNet, and Feature Pyramid Networks. The framework emphasized a clean configuration system, strong baselines, and a “model zoo” so researchers could compare results under consistent settings. It includes training and evaluation pipelines that handle multi-GPU setups, standard datasets, and common augmentations, which helped standardize experimental practice in detection research. ...

Downloads: 0 This Week

Last Update: 2025-10-06

See Project

text-dedup

All-in-one text de-duplication

text-dedup is a Python library that enables efficient deduplication of large text corpora by using MinHash and other probabilistic techniques to detect near-duplicate content. This is especially useful for NLP tasks where duplicated training data can skew model performance. text-dedup scales to billions of documents and offers tools for chunking, hashing, and comparing text efficiently with low memory usage. It supports Jaccard similarity thresholding, parallel execution, and flexible...
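As a rough illustration of the MinHash idea mentioned above, here is a minimal standard-library sketch. It is not text-dedup's API: the function names (`shingles`, `minhash_signature`, `estimate_jaccard`), the 5-character shingle size, and the 128 hash functions are all assumptions chosen for the example.

```python
import hashlib


def shingles(text, k=5):
    """Return the set of character k-grams of a lowercased, whitespace-normalized string."""
    text = " ".join(text.lower().split())
    if len(text) <= k:
        return {text}
    return {text[i:i + k] for i in range(len(text) - k + 1)}


def jaccard(a, b):
    """Exact Jaccard similarity |A & B| / |A | B| of two sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def minhash_signature(items, num_perm=128):
    """For each of num_perm salted hash functions, keep the minimum hash over the set."""
    signature = []
    for seed in range(num_perm):
        salt = seed.to_bytes(8, "little")  # a different salt acts as a different hash function
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(s.encode("utf-8"), digest_size=8, salt=salt).digest(),
                "little",
            )
            for s in items
        ))
    return signature


def estimate_jaccard(sig_a, sig_b):
    """The fraction of matching signature slots estimates the Jaccard similarity."""
    matches = sum(x == y for x, y in zip(sig_a, sig_b))
    return matches / len(sig_a)


if __name__ == "__main__":
    doc_a = shingles("The quick brown fox jumps over the lazy dog near the river bank.")
    doc_b = shingles("The quick brown fox jumps over the lazy dog near the river.")
    print("exact:", jaccard(doc_a, doc_b))
    print("estimate:", estimate_jaccard(minhash_signature(doc_a), minhash_signature(doc_b)))
```

Two documents would then be flagged as near-duplicates when the estimate exceeds a chosen threshold. Production tools typically add locality-sensitive hashing on top of the signatures so that not every pair has to be compared.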

Downloads: 0 This Week

Last Update: 2025-04-08

See Project

AI-powered enterprise search engine

Gerev is an open-source, AI-powered enterprise search engine designed to help organizations quickly locate and retrieve information scattered across multiple internal tools, documents, and communication platforms. It enables users to search across sources such as Slack, Confluence, Jira, Google Drive, and other enterprise systems, consolidating fragmented knowledge into a single, unified search experience. By leveraging natural language processing, Gerev allows...

Downloads: 15 This Week

Last Update: 2026-03-18

See Project

FFCV

Fast Forward Computer Vision (and other ML workloads!)

ffcv is a drop-in data loading system that dramatically increases data throughput in model training. From gridding to benchmarking to fast research iteration, there are many reasons to want faster model training. Below we present premade codebases for training on ImageNet and CIFAR, including both (a) extensible codebases and (b) numerous premade training configurations.

Downloads: 1 This Week

Last Update: 2024-08-07

See Project

Point-E

Point cloud diffusion for 3D model synthesis

point-e is the official repository for Point-E, a generative model developed by OpenAI that produces 3D point clouds from textual (or image) prompts. Its principal advantage is speed: it can generate 3D assets in just 1–2 minutes on a single GPU, which is significantly faster than many competing text-to-3D models. The model works via a two-stage diffusion approach: first, it uses a text → image diffusion network to produce a synthetic 2D view consistent with the prompt; then a second diffusion model converts that image into a 3D point cloud. While it does not match the fine detail of some slower methods, the tradeoff in speed makes it practical for prototyping and interactive 3D generation. ...

Downloads: 1 This Week

Last Update: 2025-10-02

See Project

G-Diffuser Bot

Discord bot and Interface for Stable Diffusion

The first release of the all-in-one installer version of G-Diffuser is here. This release no longer requires the installation of WSL or Docker and has a systray icon to keep track of and launch G-Diffuser components. The infinite zoom scripts have been updated with some improvements, notably a new compositor script that is hundreds of times faster than before. The installer notably features much easier "one-click" installation and updating, as well...

Downloads: 0 This Week

Last Update: 2023-03-23

See Project

Fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python

Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the...

Downloads: 1 This Week

Last Update: 2022-06-27

See Project

Tensorflow Transformers

State of the art faster Transformer with Tensorflow 2.0

...Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like speech recognition and audio classification. Faster autoregressive decoding, TFLite support, and simple creation of TFRecords. Auto-batching of tf.data.Dataset or tf.ragged tensors. Everything is a dictionary (inputs and outputs). Multiple mask modes such as causal, user-defined, and prefix. tensorflow-text tokenizer support. Supports GPU, TPU, and multi-GPU training with wandb, multiple callbacks, and auto TensorBoard.

Downloads: 0 This Week

Last Update: 2023-03-23

See Project

Guild AI

Experiment tracking, ML developer tools

Guild AI is an open-source experiment tracking toolkit designed to bring systematic control to machine learning workflows, enabling users to build better models faster. It automatically captures every detail of training runs as unique experiments, facilitating comprehensive tracking and analysis. Users can compare and analyze runs to deepen their understanding and incrementally improve models. Guild AI simplifies hyperparameter tuning by applying state-of-the-art algorithms through...

Downloads: 0 This Week

Last Update: 2024-11-13

See Project

TensorFlowOnSpark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters

By combining salient features from the TensorFlow deep learning framework with Apache Spark and Apache Hadoop, TensorFlowOnSpark enables distributed deep learning on a cluster of GPU and CPU servers. It enables both distributed TensorFlow training and inference on Spark clusters, with the goal of minimizing the code changes required to run existing TensorFlow programs on a shared grid.

Downloads: 0 This Week

Last Update: 2024-08-05

See Project

Hugging Face Transformer

CPU/GPU inference server for Hugging Face transformer models

...Both are great tools but not very performant for inference. Then, if you spend some time, you can build something on top of ONNX Runtime and Triton Inference Server. You will usually get 2X to 4X faster inference compared to vanilla PyTorch. It's cool! However, if you want best-in-class performance on GPU, there is only one possible combination: NVIDIA TensorRT and Triton. You will usually get 5X faster inference compared to vanilla PyTorch.

Downloads: 0 This Week

Last Update: 2022-08-22

See Project

Detectron2

Next-generation platform for object detection and segmentation

...Includes more features such as panoptic segmentation, DensePose, Cascade R-CNN, rotated bounding boxes, PointRend, DeepLab, etc. Can be used as a library to support different projects on top of it. We'll open source more research projects in this way. It trains much faster. Models can be exported to TorchScript format or Caffe2 format for deployment. With a new, more modular design, Detectron2 is flexible and extensible, and able to provide fast training on single or multiple GPU servers. Detectron2 includes high-quality implementations of state-of-the-art object detection algorithms.

Downloads: 1 This Week

Last Update: 2021-10-26

See Project

Tez

Tez is a super-simple and lightweight Trainer for PyTorch

Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep learning projects in PyTorch. tez (तेज़ / تیز) means sharp, fast & active. This is a simple, to-the-point, library to make your PyTorch training easy. This library is in early-stage currently! So, there might be breaking changes. Currently, tez supports cpu, single gpu and multi-gpu & tpu training. More coming soon! Using tez is super-easy. We don't want you to...

Downloads: 0 This Week

Last Update: 2022-08-19

See Project

Spleeter

Deezer source separation library including pretrained models

...It makes it easy to train music source separation models (assuming you have a dataset of isolated sources), and provides already trained state-of-the-art models for performing various flavours of separation. The 2-stem and 4-stem models have state-of-the-art performance on the musdb dataset. Spleeter is also very fast: it can separate audio files into 4 stems 100x faster than real time when run on a GPU. We designed Spleeter so you can use it straight from the command line as well as directly in your own development pipeline as a Python library. It can be installed with Conda or pip, or used with Docker.

1 Review

Downloads: 57 This Week

Last Update: 2021-09-03

See Project

HiFi-GAN

Generative Adversarial Networks for Efficient and High Fidelity Speech

...It introduces a generator architecture tailored to model the periodic structure of speech and a set of discriminators that focus on different scales and periods of the waveform to better capture naturalness. The model targets a sweet spot between sample quality and generation speed, outperforming many previous GAN vocoders while being far faster than typical autoregressive models. In experiments on LJSpeech, HiFi-GAN was shown to achieve mean opinion scores close to human recordings while synthesizing 22.05 kHz audio up to ~168× faster than real time on an NVIDIA V100 GPU. A smaller configuration trades a bit of quality for even higher speed and can run more than 13× faster than real time on CPU, making it suitable for deployment scenarios without powerful GPUs.
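To make the "faster than real time" figures above concrete, the following small sketch works through the arithmetic. The helper names are hypothetical, and the only inputs taken from the description are the 22.05 kHz sample rate and the ~168× and 13× speedups; the 10-second clip length is an assumption for illustration.

```python
def samples_per_second(sample_rate_hz, speedup):
    """Audio samples generated per wall-clock second at a given real-time speedup."""
    return sample_rate_hz * speedup


def synthesis_time(audio_seconds, speedup):
    """Wall-clock seconds needed to synthesize a clip of the given duration."""
    return audio_seconds / speedup


if __name__ == "__main__":
    sample_rate = 22_050  # 22.05 kHz, as quoted for the LJSpeech experiments
    for label, speedup in (("V100 GPU (~168x)", 168), ("CPU, small config (13x)", 13)):
        rate = samples_per_second(sample_rate, speedup)
        secs = synthesis_time(10.0, speedup)  # assumed 10-second clip
        print(f"{label}: {rate:,} samples/s, 10 s of audio in {secs:.3f} s")
```

At ~168× real time, a 22.05 kHz vocoder is producing roughly 3.7 million samples per wall-clock second.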

Downloads: 0 This Week

Last Update: 2025-11-28

See Project

PyText

A natural language modeling framework based on PyTorch

...We use PyText at Facebook to iterate quickly on new modeling ideas and then seamlessly ship them at scale. Distributed-training support built on the new C10d backend in PyTorch 1.0. Mixed precision training support through APEX (trains faster with less GPU memory on NVIDIA Tensor Cores). Extensible components that allow easy creation of new models and tasks.

Downloads: 0 This Week

Last Update: 2021-08-31

See Project

Bangla TTS

Bangla text to speech synthesis in python

Bangla text to speech: a multilingual (Bangla, English) speech synthesis library that runs in [almost] real time on a GPU.

Installation
* Install Anaconda
* conda create -n new_virtual_env python==3.6.8
* conda activate new_virtual_env
* pip install -r requirements.txt
* While running for the first time, keep your internet connection on to download the weights of the speech synthesis models (>500 MB)
* For...

Downloads: 5 This Week

Last Update: 2020-09-03

See Project

textgenrnn

Easily train your own text-generating neural network

...Train on and generate text at either the character-level or word-level. Configure RNN size, the number of RNN layers, and whether to use bidirectional RNNs. Train on any generic input text file, including large files. Train models on a GPU and then use them to generate text with a CPU. Utilize a powerful CuDNN implementation of RNNs when trained on the GPU, which massively speeds up training time as opposed to typical LSTM implementations. Train the model using contextual labels, allowing it to learn faster and produce better results in some cases.

Downloads: 0 This Week

Last Update: 2021-11-24

See Project

maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation

Mask R-CNN Benchmark is a PyTorch-based framework that provides high-performance implementations of object detection, instance segmentation, and keypoint detection models. Originally built to benchmark Mask R-CNN and related models, it offers a clean, modular design to train and evaluate detection systems efficiently on standard datasets like COCO. The framework integrates critical components—region proposal networks (RPNs), RoIAlign layers, mask heads, and backbone architectures such as...

Downloads: 0 This Week

Last Update: 2025-10-06

See Project

Tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed

Tensorpack is a neural network training interface based on TensorFlow v1. It uses TensorFlow in an efficient way with no extra overhead. On common CNNs, it runs training 1.2~5x faster than the equivalent Keras code. Your training can probably get faster if written with Tensorpack. A scalable data-parallel multi-GPU / distributed training strategy is available off the shelf. Squeeze the best data loading performance out of Python with tensorpack.dataflow. Symbolic programming (e.g. tf.data) does not offer the data processing flexibility needed in research. ...

Downloads: 0 This Week

Last Update: 2022-08-01

See Project

Search Results for "gpu faster" - Page 2

Showing 51 open source projects for "gpu faster"


