Xgen-small

Salesforce

Related Products

  • Vertex AI
    727 Ratings
  • LM-Kit.NET
    22 Ratings
  • Google AI Studio
    9 Ratings
  • Amazon Bedrock
    77 Ratings
  • Ango Hub
    15 Ratings
  • DataHub
    8 Ratings
  • Iru
    1,380 Ratings
  • Enterprise Bot
    23 Ratings
  • Nexo
    16,346 Ratings
  • QuickApps

About

This repository contains the research preview of LongLLaMA, a large language model capable of handling long contexts of 256k tokens or more. LongLLaMA is built on OpenLLaMA and fine-tuned using the Focused Transformer (FoT) method; LongLLaMA Code is built on Code Llama. We release a smaller 3B base variant (not instruction-tuned) of the LongLLaMA model under a permissive license (Apache 2.0), along with inference code supporting longer contexts on Hugging Face. Our model weights can serve as a drop-in replacement for LLaMA in existing implementations (for short contexts of up to 2048 tokens). Additionally, we provide evaluation results and comparisons against the original OpenLLaMA models.

About

Xgen-small is an enterprise-ready compact language model developed by Salesforce AI Research, designed to deliver long-context performance at a predictable, low cost. It combines domain-focused data curation, scalable pre-training, length extension, instruction fine-tuning, and reinforcement learning to meet the complex, high-volume inference demands of modern enterprises. Xgen-small processes extensive contexts efficiently, enabling the synthesis of information from internal documentation, code repositories, research reports, and real-time data streams. Available in 4B and 9B parameter sizes, it balances cost efficiency, privacy safeguards, and long-context understanding, making it a sustainable and predictable option for deploying enterprise AI at scale.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Users interested in a powerful large language model solution

Audience

IT leaders and AI practitioners seeking a compact, efficient language model capable of processing long-context information while ensuring cost-effectiveness and data privacy

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Pricing

Free
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

LongLLaMA
github.com/CStanKonrad/long_llama

Company Information

Salesforce
Founded: 1999
United States
www.salesforce.com/blog/xgen-small-enterprise-ready-small-language-models/

Alternatives

Llama 2
Meta

Alternatives

Mistral NeMo
Mistral AI

Kimi K2
Moonshot AI

Integrations

Agentforce Vibes

Integrations

Agentforce Vibes