Olmo 2

Olmo 2

Ai2
+
+

Related Products

  • Gemini Enterprise Agent Platform
    961 Ratings
    Visit Website
  • Google AI Studio
    12 Ratings
    Visit Website
  • LM-Kit.NET
    28 Ratings
    Visit Website
  • Checksum.ai
    1 Rating
    Visit Website
  • Iris Identity Protection
    3 Ratings
    Visit Website
  • DataHub
    10 Ratings
    Visit Website
  • SBS Asset Finance
    3 Ratings
    Visit Website
  • Enterprise Bot
    23 Ratings
    Visit Website
  • QuickApps
    Visit Website
  • Iru
    1,282 Ratings
    Visit Website

About

This repository contains the research preview of LongLLaMA, a large language model capable of handling long contexts of 256k tokens or even more. LongLLaMA is built upon the foundation of OpenLLaMA and fine-tuned using the Focused Transformer (FoT) method. LongLLaMA code is built upon the foundation of Code Llama. We release a smaller 3B base variant (not instruction tuned) of the LongLLaMA model on a permissive license (Apache 2.0) and inference code supporting longer contexts on hugging face. Our model weights can serve as the drop-in replacement of LLaMA in existing implementations (for short context up to 2048 tokens). Additionally, we provide evaluation results and comparisons against the original OpenLLaMA models.

About

Olmo 2 is a family of fully open language models developed by the Allen Institute for AI (AI2), designed to provide researchers and developers with transparent access to training data, open-source code, reproducible training recipes, and comprehensive evaluations. These models are trained on up to 5 trillion tokens and are competitive with leading open-weight models like Llama 3.1 on English academic benchmarks. Olmo 2 emphasizes training stability, implementing techniques to prevent loss spikes during long training runs, and utilizes staged training interventions during late pretraining to address capability deficiencies. The models incorporate state-of-the-art post-training methodologies from AI2's Tülu 3, resulting in the creation of Olmo 2-Instruct models. An actionable evaluation framework, the Open Language Modeling Evaluation System (OLMES), was established to guide improvements through development stages, consisting of 20 evaluation benchmarks assessing core capabilities.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Users interested in a powerful Large Language Model solution

Audience

Developers and researchers searching for a tool to streamline their AI research and operations

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

LongLLaMA
github.com/CStanKonrad/long_llama

Company Information

Ai2
Founded: 2014
United States
allenai.org/blog/olmo2

Alternatives

Llama 2

Llama 2

Meta

Alternatives

Molmo

Molmo

Ai2
Olmo 3

Olmo 3

Ai2
Kimi K2

Kimi K2

Moonshot AI
Llama 2

Llama 2

Meta
Olmo 3

Olmo 3

Ai2
GLM-5

GLM-5

Zhipu AI

Categories

Categories

Integrations

Molmo 2

Integrations

Molmo 2
Claim LongLLaMA and update features and information
Claim LongLLaMA and update features and information
Claim Olmo 2 and update features and information
Claim Olmo 2 and update features and information