A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
Training Large Language Model to Reason in a Continuous Latent Space
High-resolution models for human tasks
CLIP, Predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
Multimodal-Driven Architecture for Customized Video Generation
Personalize Any Characters with a Scalable Diffusion Transformer
The NVIDIA AgentIQ toolkit is an open-source library
Extensible AGI Framework
AI agent that streamlines the entire process of data analysis
SWE-agent takes a GitHub issue and tries to automatically fix it
Multilingual Automatic Speech Recognition with word-level timestamps
A system for quickly generating training data with weak supervision
PyTorch version of Stable Baselines
Beta Machine Learning Toolkit
OpenDAN is an open source Personal AI OS
Low-code framework for building custom LLMs, neural networks
Open platform for training, serving, and evaluating language models
Integrate ChatGPT into your own discord bot
Implementation of Phenaki Video, which uses Mask GIT
Conditional GAN for generating synthetic tabular data
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
An open source implementation of CLIP