OPT

OPT

Meta
Qwen-7B

Qwen-7B

Alibaba
+
+

Related Products

  • Vertex AI
    783 Ratings
    Visit Website
  • Google AI Studio
    11 Ratings
    Visit Website
  • LM-Kit.NET
    23 Ratings
    Visit Website
  • Gemini
    1,037,445 Ratings
    Visit Website
  • Claude
    38,813 Ratings
    Visit Website
  • RaimaDB
    9 Ratings
    Visit Website
  • ClickLearn
    65 Ratings
    Visit Website
  • Dragonfly
    16 Ratings
    Visit Website
  • Aizon
    1 Rating
    Visit Website
  • B2i
    2 Ratings
    Visit Website

About

Large language models, which are often trained for hundreds of thousands of compute days, have shown remarkable capabilities for zero- and few-shot learning. Given their computational cost, these models are difficult to replicate without significant capital. For the few that are available through APIs, no access is granted to the full model weights, making them difficult to study. We present Open Pre-trained Transformers (OPT), a suite of decoder-only pre-trained transformers ranging from 125M to 175B parameters, which we aim to fully and responsibly share with interested researchers. We show that OPT-175B is comparable to GPT-3, while requiring only 1/7th the carbon footprint to develop. We are also releasing our logbook detailing the infrastructure challenges we faced, along with code for experimenting with all of the released models.

About

Qwen-7B is the 7B-parameter version of the large language model series, Qwen (abbr. Tongyi Qianwen), proposed by Alibaba Cloud. Qwen-7B is a Transformer-based large language model, which is pretrained on a large volume of data, including web texts, books, codes, etc. Additionally, based on the pretrained Qwen-7B, we release Qwen-7B-Chat, a large-model-based AI assistant, which is trained with alignment techniques. The features of the Qwen-7B series include: Trained with high-quality pretraining data. We have pretrained Qwen-7B on a self-constructed large-scale high-quality dataset of over 2.2 trillion tokens. The dataset includes plain texts and codes, and it covers a wide range of domains, including general domain data and professional domain data. Strong performance. In comparison with the models of the similar model size, we outperform the competitors on a series of benchmark datasets, which evaluates natural language understanding, mathematics, coding, etc. And more.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

AI developers interested in a large language model

Audience

Large language model developers

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

No images available

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Meta
Founded: 2004
United States
www.meta.com

Company Information

Alibaba
Founded: 1999
China
github.com/QwenLM/Qwen-7B

Alternatives

Alternatives

Athene-V2

Athene-V2

Nexusflow
T5

T5

Google
ChatGLM

ChatGLM

Zhipu AI
CodeQwen

CodeQwen

Alibaba
Mistral 7B

Mistral 7B

Mistral AI
PanGu-α

PanGu-α

Huawei
CodeQwen

CodeQwen

Alibaba
Llama 2

Llama 2

Meta
Qwen2

Qwen2

Alibaba

Categories

Categories

Integrations

AiAssistWorks
Alibaba Cloud
C
C#
C++
Elixir
GaiaNet
HTML
Horay.ai
Hugging Face
Java
JavaScript
Julia
LM-Kit.NET
R
Rust
SQL
Scala
Sesterce
Visual Basic

Integrations

AiAssistWorks
Alibaba Cloud
C
C#
C++
Elixir
GaiaNet
HTML
Horay.ai
Hugging Face
Java
JavaScript
Julia
LM-Kit.NET
R
Rust
SQL
Scala
Sesterce
Visual Basic
Claim OPT and update features and information
Claim OPT and update features and information
Claim Qwen-7B and update features and information
Claim Qwen-7B and update features and information