Awesome multilingual OCR toolkits based on PaddlePaddle
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
An easy 1-click way to create beautiful artwork on your PC using AI
Inference script for Oasis 500M
Provides convenient access to the Anthropic REST API from any Python 3
Easy Docker setup for Stable Diffusion with user-friendly UI
Extension index for stable-diffusion-webui
A Systematic Framework for Interactive World Modeling
A Unified Framework for Text-to-3D and Image-to-3D Generation
Foundational Models for State-of-the-Art Speech and Text Translation
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Safety reasoning models built-upon gpt-oss
RGBD video generation model conditioned on camera input
ChatGPT interface with better UI
The ChatGPT Retrieval Plugin lets you easily find personal documents
AI Suite for upscaling, interpolating & restoring images/videos
StudioOllamaUI is a local, portable interface for Ollama
Open source large language model by Alibaba
Detect faces in an image
Praca z modelami AI w AvoTensor
Open Multilingual Multimodal Chat LMs
Example Discord bot written in Python that uses the completions API
Let us control diffusion models
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research
Dia-1.6B generates lifelike English dialogue and vocal expressions