

About

Featherless is an AI model provider that gives subscribers access to a continually expanding library of Hugging Face models. With hundreds of new models released daily, you need dedicated tools to keep up. Whatever your use case, Featherless lets you find and use state-of-the-art AI models. At present, we support models built on the LLaMA-3 and QWEN-2 architectures; QWEN-2 models are supported only up to a context length of 16,000 tokens. We plan to add more architectures to the supported list soon. We continuously onboard new models as they become available on Hugging Face and, as we grow, aim to automate this process to cover all publicly available Hugging Face models with a compatible architecture. To ensure fair use across individual accounts, the number of concurrent requests is limited according to the plan you select. Output is delivered at 10-40 tokens per second, depending on the model and prompt size.
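To make the stated 10-40 tokens per second figure concrete, the following is a minimal sketch that turns it into a rough generation-time window for a response of a given length. The function names and the 800-token example are illustrative assumptions, not part of any Featherless API, and the estimate ignores queueing, network latency, and prompt processing time.

```python
def generation_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Time to stream `output_tokens` at a steady `tokens_per_second` rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


def generation_window(output_tokens: int,
                      slow_tps: float = 10.0,
                      fast_tps: float = 40.0) -> tuple[float, float]:
    """Return (best_case, worst_case) seconds for the quoted speed range.

    The 10 and 40 defaults come from the listing; everything else here
    is a hypothetical helper for back-of-the-envelope planning.
    """
    best = generation_seconds(output_tokens, fast_tps)
    worst = generation_seconds(output_tokens, slow_tps)
    return best, worst


if __name__ == "__main__":
    best, worst = generation_window(800)  # hypothetical 800-token reply
    print(f"800 tokens: between {best:.0f} s and {worst:.0f} s")
```

At the quoted rates, an 800-token reply would take roughly 20 to 80 seconds to stream, which is worth factoring into client-side timeouts.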

About

Nebius Token Factory is a scalable AI inference platform designed to run open-source and custom AI models in production without manual infrastructure management. It offers enterprise-ready inference endpoints with predictable performance, autoscaling throughput, and sub-second latency, even at very high request volumes. It offers 99.9% uptime and supports unlimited or tailored traffic profiles based on workload needs, simplifying the transition from experimentation to global deployment. Nebius Token Factory supports a broad set of open-source models, including Llama, Qwen, DeepSeek, GPT-OSS, and Flux, and lets teams host and fine-tune models through an API or dashboard. Users can upload LoRA adapters or fully fine-tuned variants directly, and the same enterprise performance guarantees apply to custom models.
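As a rough illustration of what a 99.9% uptime figure allows, the sketch below converts an uptime percentage into a downtime budget. The listing does not say over which period the figure is measured, so the 30-day and 365-day windows are assumptions, and the helper name is hypothetical.

```python
def allowed_downtime_minutes(uptime_percent: float, period_hours: float) -> float:
    """Minutes of downtime permitted by `uptime_percent` over `period_hours`."""
    if not 0.0 <= uptime_percent <= 100.0:
        raise ValueError("uptime_percent must be between 0 and 100")
    downtime_fraction = 1.0 - uptime_percent / 100.0
    return downtime_fraction * period_hours * 60.0


if __name__ == "__main__":
    # Assumed measurement windows; the source does not specify one.
    monthly = allowed_downtime_minutes(99.9, 30 * 24)
    yearly = allowed_downtime_minutes(99.9, 365 * 24)
    print(f"30 days:  {monthly:.1f} minutes")
    print(f"365 days: {yearly / 60:.2f} hours")
```

Under these assumptions, 99.9% works out to about 43 minutes of downtime per 30 days, or just under 9 hours per year.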

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Anyone interested in a solution to run any model from Hugging Face

Audience

Engineering and data science teams that need a production-grade inference system to deploy, scale, and manage open-source or custom AI models reliably in enterprise environments

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Pricing

$10 per month
Free Version
Free Trial

Pricing

$0.02
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Featherless
featherless.ai/

Company Information

Nebius
Founded: 2022
Netherlands
nebius.com/services/token-factory/enterprise-grade-inference

Alternatives

  • Qwen3 (Alibaba)
  • Qwen2.5-VL (Alibaba)
  • FPT AI Factory (FPT Cloud)
  • Llama 2 (Meta)
  • Qwen2.5-1M (Alibaba)

Integrations

Llama
Llama 3.1
Llama 3.3
Qwen
DeepSeek R1
FLUX.1
GLM-4.5
GLM-4.5-Air
Gemma 3
Hermes 4
Kimi K2 Thinking
Kimi K2.6
Llama 3.2
Llama Guard
NVIDIA Llama Nemotron
Nebius
QwQ-32B
Qwen2.5
Qwen3-Coder
gpt-oss-120b

Integrations

Llama
Llama 3.1
Llama 3.3
Qwen
DeepSeek R1
FLUX.1
GLM-4.5
GLM-4.5-Air
Gemma 3
Hermes 4
Kimi K2 Thinking
Kimi K2.6
Llama 3.2
Llama Guard
NVIDIA Llama Nemotron
Nebius
QwQ-32B
Qwen2.5
Qwen3-Coder
gpt-oss-120b