Download Latest Version textgen-portable-3.8-macos-x86_64.zip (162.8 MB)
Email in envelope

Get an email when there's a new version of Text Generation Web UI

Home / v3.8
Name Modified Size InfoDownloads / Week
Parent folder
textgen-portable-3.8-windows-cuda12.4.zip 2025-07-19 820.3 MB
textgen-portable-3.8-windows-cuda11.7.zip 2025-07-19 709.7 MB
textgen-portable-3.8-windows-cpu.zip 2025-07-19 191.3 MB
textgen-portable-3.8-windows-vulkan.zip 2025-07-19 200.0 MB
textgen-portable-3.8-linux-cuda12.4.zip 2025-07-19 825.0 MB
textgen-portable-3.8-linux-cuda11.7.zip 2025-07-19 754.0 MB
textgen-portable-3.8-macos-x86_64.zip 2025-07-19 162.8 MB
textgen-portable-3.8-linux-cpu.zip 2025-07-19 229.2 MB
textgen-portable-3.8-linux-vulkan.zip 2025-07-19 238.0 MB
textgen-portable-3.8-macos-arm64.zip 2025-07-19 174.5 MB
README.md 2025-07-19 1.3 kB
v3.8 source code.tar.gz 2025-07-19 24.9 MB
v3.8 source code.zip 2025-07-19 25.0 MB
Totals: 13 Items   4.4 GB 9

Changes

  • Replace use_flash_attention_2/use_eager_attention with a unified attn_implementation in the Transformers loader
  • Ignore add_bos_token in instruct prompts, let the jinja2 template decide
  • Add a "None" option for the speculative decoding model

Backend updates


Portable builds

Below you can find portable builds: self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip, and run.

Which version to download:

  • Windows/Linux:
  • NVIDIA GPU: Use cuda12.4 for newer GPUs or cuda11.7 for older GPUs and systems with older drivers.
  • AMD/Intel GPU: Use vulkan builds.
  • CPU only: Use cpu builds.

  • Mac:

  • Apple Silicon: Use macos-arm64.
  • Intel CPU: Use macos-x86_64.

Updating a portable install:

  1. Download and unzip the latest version.
  2. Replace the user_data folder with the one in your existing install. All your settings and models will be moved.
Source: README.md, updated 2025-07-19