Native and Compact Structured Latents for 3D Generation
Official inference repo for FLUX.2 models
Animated sprite editor & pixel art tool
Towards Human-Level Text-to-Speech through Style Diffusion
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K
Single-click installer for TRELLIS.2 (3D model generator) on 8 GB GPUs
AI-driven neuro-symbolic solver for high-school geometry problems
The modern PHP app server
Ready-to-use OCR with 80+ supported languages
Models and examples built with TensorFlow
State-of-the-art TTS model under 25MB
PS2 Covers Collection
Operating LLMs in production
Library for OCR-related tasks powered by Deep Learning
Video understanding codebase from FAIR for reproducing video models
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Deep learning library
Rust-native, ready-to-use NLP pipelines and transformer-based models
A speech-text foundation model for real-time dialogue
A modern tool for managing database schemas
Swift async text-to-image for SwiftUI apps using OpenAI
State of the Art Natural Language Processing
Usable Implementation of "Bootstrap Your Own Latent" self-supervised