Page 5 | text classification free download

fashion-clip

CLIP model fine-tuned for zero-shot fashion product classification

FashionCLIP is a domain-adapted CLIP model fine-tuned specifically for the fashion industry, enabling zero-shot classification and retrieval of fashion products. Developed by Patrick John Chia and collaborators, it builds on the CLIP ViT-B/32 architecture and was trained on over 800K image-text pairs from the Farfetch dataset. The model learns to align product images and descriptive text using contrastive learning, enabling it to perform well across various fashion-related tasks without additional supervision. ...

Downloads: 0 This Week

Last Update: 2025-07-02

See Project

CLIP-ViT-bigG-14-laion2B-39B-b160k

CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B

CLIP-ViT-bigG-14-laion2B-39B-b160k is a powerful vision-language model trained on the English subset of the LAION-5B dataset using the OpenCLIP framework. Developed by LAION and trained by Mitchell Wortsman on Stability AI’s compute infrastructure, it pairs a ViT-bigG/14 vision transformer with a text encoder to perform contrastive learning on image-text pairs. This model excels at zero-shot image classification, image-to-text and text-to-image retrieval, and can be adapted for tasks such as image captioning or generation guidance. It achieves an impressive 80.1% top-1 accuracy on ImageNet-1k without any fine-tuning, showcasing its robustness in open-domain settings. ...

Downloads: 0 This Week

Last Update: 2025-07-02

See Project

roberta-base

Robust BERT-based model for English with improved MLM training

...RoBERTa is designed to be fine-tuned for a wide range of NLP tasks such as classification, QA, and sequence labeling, achieving strong performance on the GLUE benchmark and other downstream applications.

Downloads: 0 This Week

Last Update: 2025-07-01

See Project

layoutlm-base-uncased

Multimodal Transformer for document image understanding and layout

layoutlm-base-uncased is a multimodal transformer model developed by Microsoft for document image understanding tasks. It incorporates both text and layout (position) features to effectively process structured documents like forms, invoices, and receipts. This base version has 113 million parameters and is pre-trained on 11 million documents from the IIT-CDIP dataset. LayoutLM enables better performance in tasks where the spatial arrangement of text plays a crucial role. The model uses a standard BERT-like architecture but enriches input with 2D positional embeddings. ...

Downloads: 0 This Week

Last Update: 2025-07-02

See Project

Rampart

Lightweight on-device model for private AI text redaction

Rampart is a lightweight, on-device privacy protection model developed by the National Design Studio to detect and redact personally identifiable information (PII) before text leaves a user's device. Rather than relying on server-side filtering, Rampart performs token-level PII detection locally, enabling privacy-preserving AI interactions with minimal latency and without exposing sensitive information to external services. The released model is a 14.7 MB ONNX artifact based on a fine-tuned MiniLM-L6-H384 encoder with approximately 18.5 million parameters and a 35-label BIO classification head covering 17 entity types. ...

Downloads: 0 This Week

Last Update: 15 hours ago

See Project

mms-300m-1130-forced-aligner

CTC-based forced aligner for audio-text in 158 languages

mms-300m-1130-forced-aligner is a multilingual forced alignment model based on Meta’s MMS-300M wav2vec2 checkpoint, adapted for Hugging Face’s Transformers library. It supports forced alignment between audio and corresponding text across 158 languages, offering broad multilingual coverage. The model enables accurate word- or phoneme-level timestamping using Connectionist Temporal Classification (CTC) emissions. Unlike other tools, it provides significant memory efficiency compared to the TorchAudio forced alignment API. Users can integrate it easily through the Python package ctc-forced-aligner, and it supports GPU acceleration via PyTorch. ...

Downloads: 0 This Week

Last Update: 2025-07-02

See Project

Ministral 3 8B Base 2512

Versatile 8B-base multimodal LLM, flexible foundation for custom AI

...Because it comes from the edge-optimized Ministral 3 family, it remains deployable on reasonably powerful hardware while offering a good balance between capability and resource use. Its multilingual and multimodal pretraining enables broad applicability across languages and tasks — from generation to classification to vision-language tasks.

Downloads: 0 This Week

Last Update: 2025-12-03

See Project

Ministral 3 3B Base 2512

Small 3B-base multimodal model ideal for custom AI on edge hardware

Ministral 3 3B Base 2512 is the smallest model in the Ministral 3 family, offering a compact yet capable multimodal architecture suited for lightweight AI applications. It combines a 3.4B-parameter language model with a 0.4B vision encoder, enabling both text and image understanding in a tiny footprint. As the base pretrained model, it is not fine-tuned for instructions or reasoning, making it the ideal foundation for custom post-training, domain adaptation, or specialized downstream tasks....

Downloads: 0 This Week

Last Update: 2025-12-03

See Project

Search Results for "text classification" - Page 5

Showing 108 open source projects for "text classification"

fashion-clip

CLIP-ViT-bigG-14-laion2B-39B-b160k

roberta-base

layoutlm-base-uncased

Rampart

mms-300m-1130-forced-aligner

Ministral 3 8B Base 2512

Ministral 3 3B Base 2512

Search Results for "text classification" - Page 5

Showing 108 open source projects for "text classification"

fashion-clip

CLIP-ViT-bigG-14-laion2B-39B-b160k

roberta-base

layoutlm-base-uncased

Rampart

mms-300m-1130-forced-aligner

Ministral 3 8B Base 2512

Ministral 3 3B Base 2512

Related Categories