A framework to enable multimodal models to operate a computer
Uniform Manifold Approximation and Projection
Automate browser-based workflows with LLMs and Computer Vision
Python chatbot framework with Natural Language Understanding
MCP Server for IDA Pro
A full spaCy pipeline and models for scientific/biomedical documents
GLM-4 series: Open Multilingual Multimodal Chat LMs
Training and deploying machine learning models on Amazon SageMaker
Implementation of Vision Transformer, a simple way to achieve SOTA
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Structured outputs for llms
LLM
Models and examples built with TensorFlow
Industrial-strength Natural Language Processing (NLP)
OCR expert VLM powered by Hunyuan's native multimodal architecture
Simplifies the local serving of AI models from any source
ChatGLM2-6B: An Open Bilingual Chat LLM
A modular, primitive-first, python-first PyTorch library
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
The official Python SDK for the ElevenLabs API
A text-to-speech, speech-to-text and speech-to-speech library
ChatGPT interface with better UI
Enable AI to control your desktop, mobile and HMI devices
Operating LLMs in production
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX