Native and Compact Structured Latents for 3D Generation
Official inference repo for FLUX.2 models
Animated sprite editor & pixel art tool
Towards Human-Level Text-to-Speech through Style Diffusion
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K
Single-click installer of TRELLIS.2 (3d-model generator) for 8GB gpus
AI-driven neuro-symbolic solver for high-school geometry problems
The modern PHP app server
Ready-to-use OCR with 80+ supported languages
Models and examples built with TensorFlow
State-of-the-art TTS model under 25MB
PS2 Covers Collection
Operating LLMs in production
Library for OCR-related tasks powered by Deep Learning
Video understanding codebase from FAIR for reproducing video models
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Rust native ready-to-use NLP pipelines and transformer-based models
A speech-text foundation model for real time dialogue
Deep learning library
A modern tool for managing database schemas
State of the Art Natural Language Processing
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
OpenAI swift async text to image for SwiftUI app using OpenAI