Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
textgen-portable-3.14-windows-cuda11.7.zip | 2025-10-10 | 751.0 MB | |
textgen-portable-3.14-windows-cuda12.4.zip | 2025-10-10 | 861.3 MB | |
textgen-portable-3.14-windows-cpu.zip | 2025-10-10 | 196.8 MB | |
textgen-portable-3.14-windows-vulkan.zip | 2025-10-10 | 210.3 MB | |
textgen-portable-3.14-linux-cuda12.4.zip | 2025-10-10 | 870.5 MB | |
textgen-portable-3.14-linux-cuda11.7.zip | 2025-10-10 | 799.9 MB | |
textgen-portable-3.14-macos-x86_64.zip | 2025-10-10 | 197.6 MB | |
textgen-portable-3.14-linux-vulkan.zip | 2025-10-10 | 252.7 MB | |
textgen-portable-3.14-macos-arm64.zip | 2025-10-10 | 181.8 MB | |
textgen-portable-3.14-linux-cpu.zip | 2025-10-10 | 238.9 MB | |
README.md | 2025-10-10 | 1.5 kB | |
v3.14 source code.tar.gz | 2025-10-10 | 24.9 MB | |
v3.14 source code.zip | 2025-10-10 | 25.0 MB | |
Totals: 13 Items | 4.6 GB | 2 |
Changes
- Better handle multi-GPU setups when using Transformers with bitsandbytes (
load-in-8bit
andload-in-4bit
). - Implement the
/v1/internal/logits
endpoint for theexllamav3
andexllamav3_hf
loaders. - Make profile picture uploading safer.
- Add
fla
to the requirements for Exllamav3 to supportqwen3-next
models.
Bug fixes
- Fix an issue with loading certain chat histories in Instruct mode. Thanks, @Remowylliams.
- Fix portable builds for macOS x86 missing llama.cpp binaries (#7238). Thanks, @IonoclastBrigham.
Backend updates
- Update llama.cpp to https://github.com/ggml-org/llama.cpp/tree/d00cbea63c671cd85a57adaa50abf60b3b87d86f.
- Update transformers to 4.57.
- Update exllamav3 0.0.7.
- Update bitsandbytes to 0.48.
Portable builds
Below you can find self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip, and run.
Which version to download:
- Windows/Linux:
- NVIDIA GPU: Use
cuda12.4
for newer GPUs orcuda11.7
for older GPUs and systems with older drivers. - AMD/Intel GPU: Use
vulkan
builds. -
CPU only: Use
cpu
builds. -
Mac:
- Apple Silicon: Use
macos-arm64
. - Intel CPU: Use
macos-x86_64
.
Updating a portable install:
- Download and unzip the latest version.
- Replace the
user_data
folder with the one in your existing install. All your settings and models will be moved.