CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
boosting algo to reveal network generation mechansisms
Are you a composer looking for ideas? Or do you just enjoy listening..
A category-based approach to exploring film data.
Make your AI agent reach flow state safely
Multimodal Transformer for document image understanding and layout
Compact English sentence embedding model for semantic search tasks
Lightweight on-device model for private AI text redaction
CTC-based forced aligner for audio-text in 158 languages
Efficient English embedding model for semantic search and retrieval
GUI based toolkit for running common Machine Learning algorithms.
Small 3B-base multimodal model ideal for custom AI on edge hardware
Versatile 8B-base multimodal LLM, flexible foundation for custom AI