CLIP (Contrastive Language-Image Pretraining) is a neural model that links images and text in a shared embedding space, enabling zero-shot image classification, similarity search, and multimodal alignment. It was trained on a large corpus of (image, caption) pairs using a contrastive objective: an image and its matching caption are pulled together in embedding space, while mismatched pairs are pushed apart. Once trained, the model can be given an arbitrary set of text labels and asked which label best matches a given image, without any task-specific training for that classification problem.

The repository provides code for the model architecture, preprocessing transforms, evaluation pipelines, and example inference scripts. Because CLIP generalizes to arbitrary labels via text prompts, it is a powerful tool for tasks that involve interpreting images in terms of descriptive language.
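As a concrete illustration of zero-shot classification, the sketch below scores an image against a handful of candidate text labels. It assumes the interface of the openai/CLIP package (`clip.load`, `clip.tokenize`, and the model's image-text forward pass); the checkpoint name `"ViT-B/32"` and the file `example.jpg` are placeholders, so adjust them to the entry points this repository actually exposes.

```python
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"

# Load a pretrained CLIP model and its matching image preprocessing transform.
model, preprocess = clip.load("ViT-B/32", device=device)

# Candidate labels expressed as natural-language prompts.
labels = ["a photo of a dog", "a photo of a cat", "a diagram"]
text = clip.tokenize(labels).to(device)

# "example.jpg" is a placeholder path for whatever image you want to classify.
image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)

with torch.no_grad():
    # The forward pass returns image-text similarity logits; softmax turns
    # them into a probability distribution over the candidate labels.
    logits_per_image, logits_per_text = model(image, text)
    probs = logits_per_image.softmax(dim=-1).cpu().numpy()

for label, p in zip(labels, probs[0]):
    print(f"{label}: {p:.3f}")
```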
Features
- Shared embedding space for images and text enabling zero-shot classification
- Model code for architecture, preprocessing, training, and inference
- Support for custom prompt templates and label embeddings (see the prompt-ensembling sketch after this list)
- Image/text similarity scoring and retrieval pipelines (a retrieval sketch also follows the list)
- Example usage scripts and evaluation benchmarks
- Adaptation to new datasets or label sets through prompt design, with no retraining required
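To illustrate the prompt-template idea referenced above, the sketch below builds a zero-shot classifier weight for each label by averaging the embeddings of several prompt variants. The template strings and class names are hypothetical, and the `encode_text` call follows the openai/CLIP-style API rather than anything this repository is guaranteed to expose.

```python
import torch
import clip

device = "cuda" if torch.cuda.is_available() else "cpu"
model, _ = clip.load("ViT-B/32", device=device)

# Hypothetical prompt templates; real template sets are usually much larger.
templates = ["a photo of a {}.", "a blurry photo of a {}.", "a sketch of a {}."]
class_names = ["dog", "cat", "airplane"]

with torch.no_grad():
    weights = []
    for name in class_names:
        prompts = clip.tokenize([t.format(name) for t in templates]).to(device)
        emb = model.encode_text(prompts)
        emb = emb / emb.norm(dim=-1, keepdim=True)  # normalize each prompt embedding
        mean = emb.mean(dim=0)                      # average over templates
        weights.append(mean / mean.norm())          # re-normalize the class embedding
    zero_shot_weights = torch.stack(weights)        # shape: (num_classes, embed_dim)
```

Image features normalized the same way can then be classified with a single matrix product, e.g. `image_features @ zero_shot_weights.T`.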
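Similarly, the retrieval pipeline mentioned in the list reduces to ranking L2-normalized image embeddings by cosine similarity to a text query embedding. The sketch below again assumes the openai/CLIP API; the image paths and query string are placeholders.

```python
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

# Hypothetical image collection to search over.
paths = ["img_0.jpg", "img_1.jpg", "img_2.jpg"]
images = torch.stack([preprocess(Image.open(p)) for p in paths]).to(device)

query = clip.tokenize(["a red sports car"]).to(device)

with torch.no_grad():
    img_emb = model.encode_image(images)
    txt_emb = model.encode_text(query)

    # Cosine similarity = dot product of L2-normalized embeddings.
    img_emb = img_emb / img_emb.norm(dim=-1, keepdim=True)
    txt_emb = txt_emb / txt_emb.norm(dim=-1, keepdim=True)
    scores = (img_emb @ txt_emb.T).squeeze(1)

# Rank images by similarity to the query, best match first.
for idx in scores.argsort(descending=True):
    i = int(idx)
    print(paths[i], float(scores[i]))
```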