+
+

Related Products

  • RunPod
    206 Ratings
    Visit Website
  • Snowflake
    1,417 Ratings
    Visit Website
  • Gemini Enterprise Agent Platform
    961 Ratings
    Visit Website
  • Enterprise Bot
    23 Ratings
    Visit Website
  • RealEstateAPI (REAPI)
    47 Ratings
    Visit Website
  • Careerminds
    46 Ratings
    Visit Website
  • Google Compute Engine
    1,168 Ratings
    Visit Website
  • RaimaDB
    12 Ratings
    Visit Website
  • ActCAD Software
    401 Ratings
    Visit Website
  • Innoslate
    91 Ratings
    Visit Website

About

Featherless is an AI model provider that offers our subscribers access to a continually expanding library of Hugging Face models. With hundreds of new models daily, you need dedicated tools to keep up with the hype. No matter your use case, find and use the state-of-the-art AI model with Featherless. At present, we support LLaMA-3-based models, including LLaMA-3 and QWEN-2. Note that QWEN-2 models are only supported up to 16,000 context length. We plan to add more architectures to our supported list soon. We continuously onboard new models as they become available on Hugging Face. As we grow, we aim to automate this process to encompass all publicly available Hugging Face models with compatible architecture. To ensure fair individual account use, concurrent requests are limited according to the plan you've selected. Output is delivered at a speed of 10-40 tokens per second, depending on the model and prompt size.

About

This repository contains the research preview of LongLLaMA, a large language model capable of handling long contexts of 256k tokens or even more. LongLLaMA is built upon the foundation of OpenLLaMA and fine-tuned using the Focused Transformer (FoT) method. LongLLaMA code is built upon the foundation of Code Llama. We release a smaller 3B base variant (not instruction tuned) of the LongLLaMA model on a permissive license (Apache 2.0) and inference code supporting longer contexts on hugging face. Our model weights can serve as the drop-in replacement of LLaMA in existing implementations (for short context up to 2048 tokens). Additionally, we provide evaluation results and comparisons against the original OpenLLaMA models.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Anyone interested in a solution to run any model from Hugging Face

Audience

Users interested in a powerful Large Language Model solution

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

$10 per month
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Featherless
featherless.ai/

Company Information

LongLLaMA
github.com/CStanKonrad/long_llama

Alternatives

Alternatives

Llama 2

Llama 2

Meta
Qwen3

Qwen3

Alibaba
Qwen2.5-VL

Qwen2.5-VL

Alibaba
Llama 2

Llama 2

Meta
Kimi K2

Kimi K2

Moonshot AI
Qwen2.5-1M

Qwen2.5-1M

Alibaba
Olmo 3

Olmo 3

Ai2

Categories

Categories

Integrations

ChatGPT
Hugging Face
Llama
Llama 2
Llama 3
Llama 3.1
Llama 3.2
Llama 3.3
OpenAI
Qwen

Integrations

ChatGPT
Hugging Face
Llama
Llama 2
Llama 3
Llama 3.1
Llama 3.2
Llama 3.3
OpenAI
Qwen
Claim Featherless and update features and information
Claim Featherless and update features and information
Claim LongLLaMA and update features and information
Claim LongLLaMA and update features and information