Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
textgen-portable-3.8-windows-cuda12.4.zip | 2025-07-19 | 820.3 MB | |
textgen-portable-3.8-windows-cuda11.7.zip | 2025-07-19 | 709.7 MB | |
textgen-portable-3.8-windows-cpu.zip | 2025-07-19 | 191.3 MB | |
textgen-portable-3.8-windows-vulkan.zip | 2025-07-19 | 200.0 MB | |
textgen-portable-3.8-linux-cuda12.4.zip | 2025-07-19 | 825.0 MB | |
textgen-portable-3.8-linux-cuda11.7.zip | 2025-07-19 | 754.0 MB | |
textgen-portable-3.8-macos-x86_64.zip | 2025-07-19 | 162.8 MB | |
textgen-portable-3.8-linux-cpu.zip | 2025-07-19 | 229.2 MB | |
textgen-portable-3.8-linux-vulkan.zip | 2025-07-19 | 238.0 MB | |
textgen-portable-3.8-macos-arm64.zip | 2025-07-19 | 174.5 MB | |
README.md | 2025-07-19 | 1.3 kB | |
v3.8 source code.tar.gz | 2025-07-19 | 24.9 MB | |
v3.8 source code.zip | 2025-07-19 | 25.0 MB | |
Totals: 13 Items | 4.4 GB | 9 |
Changes
- Replace
use_flash_attention_2
/use_eager_attention
with a unifiedattn_implementation
in the Transformers loader - Ignore
add_bos_token
in instruct prompts, let the jinja2 template decide - Add a "None" option for the speculative decoding model
Backend updates
- Update llama.cpp to https://github.com/ggml-org/llama.cpp/tree/90083283ec254fa8d33897746dea229aee401b37
- Update Transformers to 4.53
- Also update bitsandbytes/Accelerate/PEFT to the latest versions
- Update ExLlamaV3 to 0.0.5
- Update ExLlamaV2 to 0.3.2
Portable builds
Below you can find portable builds: self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip, and run.
Which version to download:
- Windows/Linux:
- NVIDIA GPU: Use
cuda12.4
for newer GPUs orcuda11.7
for older GPUs and systems with older drivers. - AMD/Intel GPU: Use
vulkan
builds. -
CPU only: Use
cpu
builds. -
Mac:
- Apple Silicon: Use
macos-arm64
. - Intel CPU: Use
macos-x86_64
.
Updating a portable install:
- Download and unzip the latest version.
- Replace the
user_data
folder with the one in your existing install. All your settings and models will be moved.