Easy-to-use Speech Toolkit including Self-Supervised Learning model
Proofs, cases, concept supplements, and reference explanations
Tool for visualizing and tracking your machine learning experiments
Build cross-modal and multimodal applications on the cloud
Run the Stable Diffusion releases in a Docker container
Application that simplifies the installation of AI-related projects
Implementation of MusicLM music generation model in Pytorch
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Inference code for Llama models
Basaran, an open-source alternative to the OpenAI text completion API
An open-source framework for training large multimodal models
Unified embedding model
Open source annotation tool for machine learning practitioners
Task-oriented finetuning for better embeddings on neural search
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Implementation of "Tree of Thoughts
Dealing with all unstructured data, such as reverse image search
State-of-the-art Multilingual Question Answering research
Open-source framework that gives you AI Agents
A simple client for doccano API
Implementation / replication of DALL-E, OpenAI's Text to Image
Convert an image to text to spot intelligible words.
Chuyển đổi văn bản thành giọng nói không giới hạn