Large Audio Language Model built for natural interactions
Stable Diffusion web UI
Fast, powerful, git-native ticket tracking in a single bash script
A lightweight text-to-speech model with zero-shot voice cloning
95% token savings. 155x faster queries. 16 languages
An Open-Source AI Agent Platform for Financial Analysis using LLMs
Follow along with my AI Agents Masterclass videos
Open source AI Agents hosted on the oTTomator Live Agent Studio
Chinese Llama-3 LLMs developed from Meta Llama 3
Chinese XLNet pre-trained model
The official Python SDK for UCP
Inference script for Oasis 500M
Document Image Parsing via Heterogeneous Anchor Prompting
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Fast forecasting with statistical and econometric models
Generate Any 3D Scene in Seconds
Build GenAI applications quickly and easily
Omnilingual ASR: Open-Source Multilingual Speech Recognition
Fast and Universal 3D reconstruction model for versatile tasks
Implementation of Vision Transformer, a simple way to achieve SOTA
The best ChatGPT that $100 can buy
A secure sandbox environment for malware developers and red teamers
A Model Context Protocol server for searching and analyzing arXiv
This repository contains the official implementation of FastVLM