Phi-2 is a 2.7 billion parameter Transformer model from Microsoft, designed for natural language understanding and code generation. It was trained on 1.4 trillion tokens drawn from filtered, high-quality web content and synthetic NLP texts generated with GPT-3.5. Despite receiving no instruction tuning or RLHF alignment, Phi-2 outperforms most models under 13B parameters on benchmarks for common sense, language understanding, and logical reasoning. It responds best to structured prompt formats for question answering, chat dialogue, and code completion, and supports a context length of 2048 tokens. Training took 14 days on 96 A100 GPUs using PyTorch, DeepSpeed, and FlashAttention. As a base model, it can still produce verbose or off-topic output, reflect societal biases, and generate inaccurate code or facts, so outputs should be verified. Phi-2 is released under the MIT license to support open research on safe, controllable language models.
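As a rough illustration, the sketch below loads Phi-2 through the Hugging Face transformers library and queries it with a QA-style "Instruct:/Output:" prompt. The model ID, generation settings, and prompt template are assumptions drawn from common usage, not a verbatim reference implementation; adjust dtype and device placement to your hardware.

```python
# Minimal sketch: load Phi-2 with transformers and prompt it in QA style.
# Assumes the "microsoft/phi-2" Hugging Face model ID and a single GPU;
# device_map="auto" requires the accelerate package.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/phi-2"  # assumed model ID
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision to fit the 2.7B model on one GPU
    device_map="auto",
)

# Phi-2 is not instruction-tuned, so a structured "Instruct:/Output:"
# template tends to work better than a free-form question.
prompt = "Instruct: Explain what a binary search tree is.\nOutput:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

outputs = model.generate(**inputs, max_new_tokens=200, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```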
Features
- 2.7B parameter Transformer optimized for QA, chat, and code
- Trained on 1.4T tokens from high-quality web and synthetic data
- Excels at logical reasoning and language comprehension tasks
- Supports next-token generation with a 2048-token context window (see the sketch after this list)
- Performs well without RLHF or instruction fine-tuning
- Built with DeepSpeed, FlashAttention, and PyTorch
- MIT-licensed and openly available for research and development
- Known issues include verbosity and limited instruction adherence
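The following sketch illustrates the other two structured prompt styles (chat and code completion) and keeps the prompt within the 2048-token context window. The "Alice:/Bob:" chat template and the docstring-style code prompt are illustrative assumptions about typical Phi-2 usage, not an official API.

```python
# Sketch: chat-style and code-completion prompting, with the input truncated
# so the prompt plus the generated continuation fit in the 2048-token context.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-2")
model = AutoModelForCausalLM.from_pretrained(
    "microsoft/phi-2", torch_dtype=torch.float16, device_map="auto"
)

CONTEXT_LEN = 2048  # Phi-2's maximum context length


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    # Truncate so the prompt and the new tokens both fit in the context window.
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        truncation=True,
        max_length=CONTEXT_LEN - max_new_tokens,
    ).to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)


# Chat-style prompt (assumed "Alice:/Bob:" turn format).
chat_prompt = "Alice: What's a quick way to reverse a string in Python?\nBob:"

# Code-completion prompt: a signature plus docstring for the model to complete.
code_prompt = (
    "def is_palindrome(s: str) -> bool:\n"
    '    """Return True if s reads the same forwards and backwards."""\n'
)

print(generate(chat_prompt))
print(generate(code_prompt))
```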