Qwen2.5-1M

Alibaba

About (Qwen2.5-1M)

Qwen2.5-1M is a series of open-source language models developed by the Qwen team and designed to handle context lengths of up to one million tokens. The release includes two model variants, Qwen2.5-7B-Instruct-1M and Qwen2.5-14B-Instruct-1M, and marks the first time Qwen models have been upgraded to support context lengths of this size. To make deployment efficient, the team has also open-sourced an inference framework based on vLLM and integrated with sparse attention methods, which processes 1M-token inputs 3x to 7x faster. Technical details, including design insights and ablation experiments, are available in the accompanying technical report.

About (RedPajama)

Foundation models such as GPT-4 have driven rapid improvement in AI. However, the most capable models are either closed behind commercial APIs or only partially open, which limits research, customization, and use with sensitive data. Fully open-source models promise to remove these limitations if the open community can close the quality gap between open and closed models, and there has been much recent progress on this front. In many ways, AI is having its Linux moment: Stable Diffusion showed that open source can not only rival the quality of commercial offerings like DALL-E but can also lead to incredible creativity from broad community participation. RedPajama is a project to create a set of leading, fully open-source models. The first step of the project, now complete, is a reproduction of the LLaMA training dataset of over 1.2 trillion tokens.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience (Qwen2.5-1M)

AI researchers, developers, and organizations seeking an open-source large language model with extended context capabilities for advanced natural language processing tasks

Audience (RedPajama)

AI and LLM developers

Support

Phone Support
24/7 Live Support
Online

API

Offers API

Pricing

Free
Free Version
Free Trial

Training

Documentation
Webinars
Live Online
In Person

Company Information (Qwen2.5-1M)

Alibaba
Founded: 1999
China
qwenlm.github.io/blog/qwen2.5-1m/

Company Information (RedPajama)

RedPajama
Founded: 2023
www.together.xyz/blog/redpajama

Alternatives (Qwen2.5-1M)

Qwen2.5-Max, by Alibaba

Alternatives (RedPajama)

Alpaca, by Stanford Center for Research on Foundation Models (CRFM)
Dolly, by Databricks
CodeQwen, by Alibaba
Falcon-40B, by Technology Innovation Institute (TII)
Qwen3-Max, by Alibaba
Qwen2, by Alibaba
Falcon-7B, by Technology Innovation Institute (TII)

Integrations

Alibaba Cloud
C
C#
C++
Clojure
Elixir
F#
Hugging Face
Java
Julia
LM-Kit.NET
ModelScope
PHP
Python
R
Ruby
Rust
SQL
Scala
WebLLM
