CodeT5

CodeT5

Salesforce
StarCoder

StarCoder

BigCode
+
+

Related Products

  • Windsurf Editor
    137 Ratings
    Visit Website
  • Cody
    87 Ratings
    Visit Website
  • Blackbird API Development
    1 Rating
    Visit Website
  • Google AI Studio
    4 Ratings
    Visit Website
  • Twilio
    1,298 Ratings
    Visit Website
  • Docmosis
    46 Ratings
    Visit Website
  • Google Cloud Run
    259 Ratings
    Visit Website
  • Adobe PDF Library SDK
    35 Ratings
    Visit Website
  • UserWay
    1,541 Ratings
    Visit Website
  • Vertex AI
    713 Ratings
    Visit Website

About

Code for CodeT5, a new code-aware pre-trained encoder-decoder model. Identifier-aware unified pre-trained encoder-decoder models for code understanding and generation. This is the official PyTorch implementation for the EMNLP 2021 paper from Salesforce Research. CodeT5-large-ntp-py is specially optimized for Python code generation tasks and employed as the foundation model for our CodeRL, yielding new SOTA results on the APPS Python competition-level program synthesis benchmark. This repo provides the code for reproducing the experiments in CodeT5. CodeT5 is a new pre-trained encoder-decoder model for programming languages, which is pre-trained on 8.35M functions in 8 programming languages (Python, Java, JavaScript, PHP, Ruby, Go, C, and C#). In total, it achieves state-of-the-art results on 14 sub-tasks in a code intelligence benchmark - CodeXGLUE. Generate code based on the natural language description.

About

StarCoder and StarCoderBase are Large Language Models for Code (Code LLMs) trained on permissively licensed data from GitHub, including from 80+ programming languages, Git commits, GitHub issues, and Jupyter notebooks. Similar to LLaMA, we trained a ~15B parameter model for 1 trillion tokens. We fine-tuned StarCoderBase model for 35B Python tokens, resulting in a new model that we call StarCoder. We found that StarCoderBase outperforms existing open Code LLMs on popular programming benchmarks and matches or surpasses closed models such as code-cushman-001 from OpenAI (the original Codex model that powered early versions of GitHub Copilot). With a context length of over 8,000 tokens, the StarCoder models can process more input than any other open LLM, enabling a wide range of interesting applications. For example, by prompting the StarCoder models with a series of dialogues, we enabled them to act as a technical assistant.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Developers and users interested in a solution to generate, summarize, and autocomplete code

Audience

Developers interested in an LLM for code generation

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Salesforce
github.com/salesforce/CodeT5

Company Information

BigCode
Founded: 2023
huggingface.co/blog/starcoder

Alternatives

Codestral

Codestral

Mistral AI

Alternatives

CodeGemma

CodeGemma

Google
StableCode

StableCode

Stability AI
CodeQwen

CodeQwen

Alibaba
StarCoder

StarCoder

BigCode
DeepSeek Coder

DeepSeek Coder

DeepSeek
CodeGeeX

CodeGeeX

AMiner
CodeGen

CodeGen

Salesforce

Categories

Categories

Integrations

Python
C
C#
ChatGPT
CodeQwen
Git
GitHub
Go
Java
JavaScript
LM Studio
OpenAI
PHP
Ruby
Tabby
Taylor AI
Visual Studio Code

Integrations

Python
C
C#
ChatGPT
CodeQwen
Git
GitHub
Go
Java
JavaScript
LM Studio
OpenAI
PHP
Ruby
Tabby
Taylor AI
Visual Studio Code
Claim CodeT5 and update features and information
Claim CodeT5 and update features and information
Claim StarCoder and update features and information
Claim StarCoder and update features and information