Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
textgen-portable-3.11-windows-cuda11.7.zip | < 21 hours ago | 730.3 MB | |
textgen-portable-3.11-windows-cuda12.4.zip | < 21 hours ago | 842.5 MB | |
textgen-portable-3.11-windows-vulkan.zip | < 21 hours ago | 203.4 MB | |
textgen-portable-3.11-windows-cpu.zip | < 21 hours ago | 193.9 MB | |
textgen-portable-3.11-linux-cuda11.7.zip | < 21 hours ago | 775.2 MB | |
textgen-portable-3.11-linux-cuda12.4.zip | < 21 hours ago | 847.8 MB | |
textgen-portable-3.11-macos-x86_64.zip | < 21 hours ago | 164.9 MB | |
textgen-portable-3.11-linux-cpu.zip | < 21 hours ago | 232.2 MB | |
textgen-portable-3.11-linux-vulkan.zip | < 21 hours ago | 241.8 MB | |
textgen-portable-3.11-macos-arm64.zip | < 21 hours ago | 177.1 MB | |
README.md | < 21 hours ago | 1.5 kB | |
v3.11 source code.tar.gz | < 21 hours ago | 24.9 MB | |
v3.11 source code.zip | < 21 hours ago | 25.0 MB | |
Totals: 13 Items | 4.5 GB | 0 |
Changes
- Add the Tensor Parallelism option to the ExLlamav3/ExLlamav3_HF loaders through the
--enable-tp
and--tp-backend
options. - Set multimodal status during Model Loading instead of checking every generation (#7199). Thanks, @altoiddealer.
- Improve the multimodal API examples slightly.
Bug fixes
- Make web search functional again
- mtmd: Fix a bug when "include past attachments" is unchecked
- Fix code blocks having an extra empty line in the UI
Backend updates
- Update llama.cpp to https://github.com/ggml-org/llama.cpp/commit/6d7f1117e3e3285d0c5c11b5ebb0439e27920082
- Update ExLlamaV3 to 0.0.6
- Update flash-attention to 2.8.3
Portable builds
Below you can find self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip, and run.
Which version to download:
- Windows/Linux:
- NVIDIA GPU: Use
cuda12.4
for newer GPUs orcuda11.7
for older GPUs and systems with older drivers. - AMD/Intel GPU: Use
vulkan
builds. -
CPU only: Use
cpu
builds. -
Mac:
- Apple Silicon: Use
macos-arm64
. - Intel CPU: Use
macos-x86_64
.
Updating a portable install:
- Download and unzip the latest version.
- Replace the
user_data
folder with the one in your existing install. All your settings and models will be moved.