Awesome multilingual OCR toolkits based on PaddlePaddle
The most powerful local music generation model
State-of-the-art TTS model under 25MB
Phi-3.5 for Mac: Locally-run Vision and Language Models
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Block Diffusion for Ultra-Fast Speculative Decoding
OCR expert VLM powered by Hunyuan's native multimodal architecture
26m function call model that runs on incredibly small devices
GLM-4 series: Open Multilingual Multimodal Chat LMs
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Personalize Any Characters with a Scalable Diffusion Transformer
ICLR2024 Spotlight: curation/training code, metadata, distribution
Memory-efficient and performant finetuning of Mistral's models
Uncommon Objects in 3D dataset
This repository contains the official implementation of research
PyTorch implementation of MAE
Reproduces results of "Fixing the train-test resolution discrepancy"
A library for Multilingual Unsupervised or Supervised word Embeddings
OpenAI’s compact 20B open model for fast, agentic, and local use