The Python code to reproduce illustrations from Machine Learning Book
Automatically translates the text of a video based on a subtitle file
Interface for OuteTTS models
Scalable data preprocessing and curation toolkit for LLMs
The open-source data curation platform for LLMs
User toolkit for analyzing and interfacing with Large Language Models
A very simple framework for state-of-the-art NLP
State-of-the-art (SoTA) text-to-video pre-trained model
Code and models for ICML 2024 paper, NExT-GPT
Documentation for Google's Gen AI site, including the Gemini API and Gemma
Implementation of the AudioLM audio generation model in PyTorch
An open-source implementation of NotebookLM with more flexibility
Adding guardrails to large language models
Framework for building, orchestrating, and deploying AI agents
A Multi-Modal World Model for Reconstructing, Generating, and Simulating
Foundational model for human-like, expressive TTS
Chinese and English multimodal conversational language model
Context-aware desktop AI assistant that understands screen content
Search all of YouTube from the command line
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Real-time voice interactive digital human
Capable of understanding text, audio, vision, video
Efficient few-shot learning with Sentence Transformers
Concatenate a directory full of files into a single prompt
Python framework for adversarial attacks and data augmentation