LTX-Video Support for ComfyUI
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
DeepSeek Coder: Let the Code Write Itself
Access to Anthropic's safety-first language model APIs
A Powerful Native Multimodal Model for Image Generation
Sharp Monocular Metric Depth in Less Than a Second
Foundation Models for Time Series
This repository contains the official implementation of FastVLM
The official PyTorch implementation of Google's Gemma models
New set of lightweight state-of-the-art, open foundation models
Python example app from the OpenAI API quickstart tutorial
DeepSeek LLM: Let there be answers
Official code for Style Aligned Image Generation via Shared Attention
This repository contains the official implementation of research
A library for Multilingual Unsupervised or Supervised word Embeddings
Russian ASR model fine-tuned on Common Voice and CSS10 datasets
Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input
Multimodal 7B model for image, video, and text understanding tasks