Aya Vision

Cohere
PaliGemma 2

Google

About

Aya Vision is a research model that advances multilingual multimodal AI through synthetic data generation, cross-modal model merging, and a comprehensive benchmark suite. It achieves state-of-the-art performance across 23 languages and surpasses larger models, while addressing data scarcity and catastrophic forgetting and reducing computational overhead by up to 40% through optimized training techniques.

About

PaliGemma 2, the next evolution in tunable vision-language models, builds on the Gemma 2 models by adding vision capabilities and making fine-tuning easier. The models can see, understand, and interact with visual input. PaliGemma 2 comes in multiple model sizes (3B, 10B, and 28B parameters) and input resolutions (224px, 448px, and 896px). It generates detailed, contextually relevant captions for images, going beyond simple object identification to describe actions, emotions, and the overall narrative of the scene. According to the technical report, it achieves leading performance in chemical formula recognition, music score recognition, spatial reasoning, and chest X-ray report generation. It is designed as a straightforward upgrade for existing PaliGemma users.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Researchers and developers building multilingual AI applications that require understanding and generating content from both text and images

Audience

Medical researchers seeking a tool to automate the generation of detailed reports from chest X-rays

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Pricing

Free
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Cohere
Founded: 2019
Canada
cohere.com/research/aya

Company Information

Google
Founded: 1998
United States
developers.googleblog.com/en/introducing-paligemma-2-powerful-vision-language-models-simple-fine-tuning/

Alternatives

Pixtral Large (Mistral AI)

Alternatives

MedGemma (Google DeepMind)
Gemma (Google)
Falcon 2 (Technology Innovation Institute (TII))
Gemma 3 (Google)
GLM-OCR (Z.ai)
Falcon 2 (Technology Innovation Institute (TII))
Qwen3.5 (Alibaba)
Gemma (Ceros)

Integrations

Gemma
Hugging Face
Kaggle
Keras
LLaMA-Factory
PyTorch

Integrations

Gemma
Hugging Face
Kaggle
Keras
LLaMA-Factory
PyTorch