Video understanding codebase from FAIR for reproducing video models
DeepMind model for tracking arbitrary points across videos & robotics
Math-specific large language models in the Qwen2 series
Designed for text embedding and ranking tasks
Build your chatbot within minutes on your favorite device
Toolkit for conversational AI
A library for deep learning end-to-end dialog systems and chatbots
Faster and easier training and deployments
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Synthetic data generators for tabular and time-series data
Models and examples built with TensorFlow
Focus on prompting and generating
Industry-leading face manipulation platform
Generate short videos with one click using an LLM
Towards Human-Sounding Speech
RGBD video generation model conditioned on camera input
The most powerful and modular diffusion model GUI, API, and backend
New family of code large language models (LLMs)
Democratizing Reinforcement Learning for LLMs
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
TTS with Kokoro and ONNX Runtime
SOTA discrete acoustic codec models with 40/75 tokens per second
Best practices on recommendation systems