CLIP model fine-tuned for zero-shot fashion product classification
Multimodal Transformer for document image understanding and layout
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
Multimodal 7B model for image, video, and text understanding tasks
4-bit Command A+ model for enterprise agents and multilingual tasks
An advanced bilingual image editing with semantic control
VaultGemma: 1B DP-trained Gemma variant for private NLP tasks
Custom BLEURT model for evaluating text similarity using PyTorch
Robust BERT-based model for English with improved MLM training
Versatile 8B-base multimodal LLM, flexible foundation for custom AI
Fast uncensored Gemma model optimized for local chat and coding
Text-to-image model optimized for artistic quality and safe generation
Dense multimodal Qwen model for coding, agents, and long context
Open multimodal model for coding, agents, and long-context tasks
BGE-Large v1.5: High-accuracy English embedding model for retrieval
CTC-based forced aligner for audio-text in 158 languages
Dia-1.6B generates lifelike English dialogue and vocal expressions
Omnimodal AI model for agents, coding, and long-context tasks
FP8 Qwen model for efficient multimodal coding and agent tasks
Instruction-tuned 1.2B LLM for multilingual text generation by Meta
SmartMap is an easy desktop random world creator.
Open, non-commercial SDXL model for quality image generation