Python inference and LoRA trainer package for the LTX-2 audio–video
A Pragmatic VLA Foundation Model
LLM-based reinforcement learning audio editing model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Instructions for using the Realtime API on microcontrollers
OpenAI’s open-weight 120B model optimized for reasoning and tooling
CTC-based forced aligner for audio-text in 158 languages
Ultra-efficient 3B multimodal instruct model built for edge deployment
Llama 3.2-1B: Multilingual, instruction-tuned model for mobile AI