StarCoder

BigCode

Xgen-small

Salesforce

About

StarCoder and StarCoderBase are Large Language Models for Code (Code LLMs) trained on permissively licensed data from GitHub, including 80+ programming languages, Git commits, GitHub issues, and Jupyter notebooks. Similar to LLaMA, we trained a ~15B parameter model for 1 trillion tokens. We fine-tuned the StarCoderBase model on 35B Python tokens, resulting in a new model that we call StarCoder. We found that StarCoderBase outperforms existing open Code LLMs on popular programming benchmarks and matches or surpasses closed models such as code-cushman-001 from OpenAI (the original Codex model that powered early versions of GitHub Copilot). With a context length of over 8,000 tokens, the StarCoder models can process more input than any other open LLM, enabling a wide range of interesting applications. For example, by prompting the StarCoder models with a series of dialogues, we enabled them to act as a technical assistant.
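
The dialogue-prompting idea mentioned above can be illustrated with a minimal Python sketch. It only assembles the prompt text and does not call any model. The preamble wording, the `Human:`/`Assistant:` labels, the `-----` separator, and the four-characters-per-token estimate are all assumptions made for illustration; the description does not specify the actual prompt format or tokenizer.

```python
from typing import List, Tuple

# Hypothetical prompt conventions: not taken from the StarCoder description.
PREAMBLE = "Below are dialogues between a human and a technical assistant."
SEPARATOR = "-----"

Dialogue = List[Tuple[str, str]]  # (speaker, text) pairs


def build_dialogue_prompt(examples: List[Dialogue], question: str) -> str:
    """Join example dialogues, then append the new question with an open Assistant turn."""
    parts = [PREAMBLE]
    for dialogue in examples:
        parts.append(SEPARATOR)
        for speaker, text in dialogue:
            parts.append(f"{speaker}: {text}")
    parts.append(SEPARATOR)
    parts.append(f"Human: {question}")
    parts.append("Assistant:")  # left open for the model to complete
    return "\n".join(parts)


def fits_context(prompt: str, max_tokens: int = 8000, chars_per_token: float = 4.0) -> bool:
    """Rough size check; a real tokenizer would give an exact token count."""
    return len(prompt) / chars_per_token <= max_tokens


if __name__ == "__main__":
    examples = [[
        ("Human", "How do I reverse a list in Python?"),
        ("Assistant", "Use slicing: my_list[::-1]."),
    ]]
    prompt = build_dialogue_prompt(examples, "How do I sort a dict by value?")
    print(prompt)
    print("fits in context:", fits_context(prompt))
```

The resulting string would be passed to the model as the text to complete. A long context window matters here because every example dialogue added to the prompt consumes part of the token budget.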

About

Xgen-small is an enterprise-ready compact language model developed by Salesforce AI Research, designed to deliver long-context performance at a predictable, low cost. It combines domain-focused data curation, scalable pre-training, length extension, instruction fine-tuning, and reinforcement learning to meet the complex, high-volume inference demands of modern enterprises. Unlike traditional large models, Xgen-small offers efficient processing of extensive contexts, enabling the synthesis of information from internal documentation, code repositories, research reports, and real-time data streams. With sizes optimized at 4B and 9B parameters, it provides a strategic advantage by balancing cost efficiency, privacy safeguards, and long-context understanding, making it a sustainable and predictable solution for deploying Enterprise AI at scale.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Developers interested in an LLM for code generation

Audience

IT leaders and AI practitioners seeking a compact, efficient language model capable of processing long-context information while ensuring cost-effectiveness and data privacy

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Pricing

Free
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

BigCode
Founded: 2023
huggingface.co/blog/starcoder

Company Information

Salesforce
Founded: 1999
United States
www.salesforce.com/blog/xgen-small-enterprise-ready-small-language-models/

Alternatives

CodeGemma
Google

Alternatives

CodeQwen
Alibaba

DeepSeek Coder
DeepSeek

CodeGen
Salesforce

Mistral NeMo
Mistral AI

Kimi K2
Moonshot AI

Integrations

Agentforce Vibes
ChatGPT
CodeQwen
Git
GitHub
LM Studio
OpenAI
Python
Tabby
Taylor AI
Visual Studio Code

Integrations

Agentforce Vibes
ChatGPT
CodeQwen
Git
GitHub
LM Studio
OpenAI
Python
Tabby
Taylor AI
Visual Studio Code