High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Machine learning, conversational dialog engine for creating chat bots
AlphaFold 3 inference pipeline
A Python library for audio data augmentation
Python inference and LoRA trainer package for the LTX-2 audio–video
Python Stream Processing
Shell command execution server implementing the Model Context Protocol
Speech recognition module for Python
TensorFlow is an open source library for machine learning
OCRmyPDF adds an OCR text layer to scanned PDF files
An open-source toolkit for monitoring Large Language Models (LLMs)
RGBD video generation model conditioned on camera input
Topic Modelling for Humans
Generate short videos with one click using an AI LLM
Chat & pretrained large audio language model proposed by Alibaba Cloud
Talk to Your AI Agents from Anywhere
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
The Multi-Agent Framework
A nearly-live implementation of OpenAI's Whisper
High-quality multi-lingual text-to-speech library by MyShell.ai
Large Audio Language Model built for natural interactions
Tool for visualizing and tracking your machine learning experiments
State-of-the-art (SoTA) text-to-video pre-trained model
This repository contains the official implementation of FastVLM
An experimental version of the DeepSeek model