Robust BERT-based model for English with improved MLM training
Flexible text-to-text transformer model for multilingual NLP tasks
T5-Small: Lightweight text-to-text transformer for NLP tasks
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
A category-based approach to exploring film data.
Multimodal Transformer for document image understanding and layout
Compact English sentence embedding model for semantic search tasks
CTC-based forced aligner for audio-text in 158 languages
Efficient English embedding model for semantic search and retrieval
Small 3B-base multimodal model ideal for custom AI on edge hardware
Versatile 8B-base multimodal LLM, flexible foundation for custom AI