Text Generation Web UI - Browse /v3.8 at SourceForge.net

Name	Modified	Size	Downloads / Week
textgen-portable-3.8-windows-cuda12.4.zip	2025-07-19	820.3 MB	1
textgen-portable-3.8-windows-cuda11.7.zip	2025-07-19	709.7 MB	0
textgen-portable-3.8-windows-cpu.zip	2025-07-19	191.3 MB	1
textgen-portable-3.8-windows-vulkan.zip	2025-07-19	200.0 MB	0
textgen-portable-3.8-linux-cuda12.4.zip	2025-07-19	825.0 MB	0
textgen-portable-3.8-linux-cuda11.7.zip	2025-07-19	754.0 MB	0
textgen-portable-3.8-macos-x86_64.zip	2025-07-19	162.8 MB	3
textgen-portable-3.8-linux-cpu.zip	2025-07-19	229.2 MB	4
textgen-portable-3.8-linux-vulkan.zip	2025-07-19	238.0 MB	0
textgen-portable-3.8-macos-arm64.zip	2025-07-19	174.5 MB	0
README.md	2025-07-19	1.3 kB	0
v3.8 source code.tar.gz	2025-07-19	24.9 MB	0
v3.8 source code.zip	2025-07-19	25.0 MB	0
Totals: 13 Items		4.4 GB	9

Changes

Replace use_flash_attention_2/use_eager_attention with a unified attn_implementation in the Transformers loader
Ignore add_bos_token in instruct prompts, let the jinja2 template decide
Add a "None" option for the speculative decoding model
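For anyone migrating old settings, the first change can be sketched as a small mapping from the two removed flags to a single value. This is only an illustration: the helper name `to_attn_implementation` is hypothetical, the value strings are assumed to follow the `attn_implementation` argument of Transformers' `from_pretrained` ("eager", "sdpa", "flash_attention_2"), and falling back to "sdpa" when neither flag is set is an assumption, not something these release notes state.

```python
def to_attn_implementation(use_flash_attention_2=False, use_eager_attention=False):
    """Map the two removed boolean flags to one attn_implementation string.

    Hypothetical helper for illustration; not part of the web UI's code.
    """
    if use_flash_attention_2 and use_eager_attention:
        # The old flags described mutually exclusive attention backends.
        raise ValueError("Only one attention flag can be enabled at a time")
    if use_flash_attention_2:
        return "flash_attention_2"
    if use_eager_attention:
        return "eager"
    # Assumption: use PyTorch scaled-dot-product attention when no flag is set.
    return "sdpa"


if __name__ == "__main__":
    print(to_attn_implementation(use_flash_attention_2=True))
```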

Backend updates

Update llama.cpp to https://github.com/ggml-org/llama.cpp/tree/90083283ec254fa8d33897746dea229aee401b37
Update Transformers to 4.53
Also update bitsandbytes/Accelerate/PEFT to the latest versions
Update ExLlamaV3 to 0.0.5
Update ExLlamaV2 to 0.3.2

Portable builds

Below you can find portable builds: self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip, and run.

Which version to download:

Windows/Linux:
NVIDIA GPU: Use cuda12.4 for newer GPUs or cuda11.7 for older GPUs and systems with older drivers.
AMD/Intel GPU: Use vulkan builds.
CPU only: Use cpu builds.
Mac:
Apple Silicon: Use macos-arm64.
Intel CPU: Use macos-x86_64.
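
The selection rules above can be sketched as a small function that returns the matching archive name from the file list. This is a rough sketch: the function name `portable_build_name` and its parameters (`gpu`, `older_nvidia`) are hypothetical, and GPU detection is left to the caller.

```python
import platform


def portable_build_name(system, machine, gpu=None, older_nvidia=False, version="3.8"):
    """Return the portable archive name for a system.

    system:  value of platform.system() ("Windows", "Linux", "Darwin")
    machine: value of platform.machine() (e.g. "arm64", "x86_64")
    gpu:     "nvidia", "amd", "intel", or None for CPU only (supplied by the caller)
    older_nvidia: True for older NVIDIA GPUs or systems with older drivers
    """
    if system == "Darwin":
        # Mac builds are chosen by CPU architecture only.
        variant = "macos-arm64" if machine.lower() in ("arm64", "aarch64") else "macos-x86_64"
    elif system in ("Windows", "Linux"):
        if gpu == "nvidia":
            backend = "cuda11.7" if older_nvidia else "cuda12.4"
        elif gpu in ("amd", "intel"):
            backend = "vulkan"
        else:
            backend = "cpu"
        variant = f"{system.lower()}-{backend}"
    else:
        raise ValueError(f"No portable build listed for {system!r}")
    return f"textgen-portable-{version}-{variant}.zip"


if __name__ == "__main__":
    # CPU-only guess for the current machine; pass gpu=... to refine it.
    print(portable_build_name(platform.system(), platform.machine()))
```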

Updating a portable install:

Download and unzip the latest version.
Replace the user_data folder in the new version with the one from your existing install. All your settings and models carry over.
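
The update steps could be scripted roughly as follows. This is a minimal sketch under stated assumptions: the helper name `carry_over_user_data` and both install paths are hypothetical, and it assumes the new archive has already been unzipped and the web UI is not running.

```python
import shutil
from pathlib import Path


def carry_over_user_data(old_install, new_install):
    """Move user_data from an existing install into a freshly unzipped one."""
    old_data = Path(old_install) / "user_data"
    new_data = Path(new_install) / "user_data"
    if not old_data.is_dir():
        raise FileNotFoundError(f"No user_data folder found in {old_install}")
    if new_data.exists():
        # Drop the folder that shipped with the new build.
        shutil.rmtree(new_data)
    # Move (not copy) so large model files are not duplicated on disk.
    shutil.move(str(old_data), str(new_data))
    return new_data
```

Moving rather than copying avoids doubling the disk space used by models; copy the folder first if you want to keep the old install usable.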

Source: README.md, updated 2025-07-19

Text Generation Web UI Files

A gradio web UI for running Large Language Models like LLaMA
