Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
textgen-portable-3.6.1-windows-cuda12.4.zip | 2025-06-19 | 816.0 MB | |
textgen-portable-3.6.1-windows-cuda11.7.zip | 2025-06-19 | 706.7 MB | |
textgen-portable-3.6.1-windows-vulkan.zip | 2025-06-19 | 201.7 MB | |
textgen-portable-3.6.1-windows-cpu.zip | 2025-06-19 | 193.8 MB | |
textgen-portable-3.6.1-linux-cuda12.4.zip | 2025-06-19 | 820.3 MB | |
textgen-portable-3.6.1-linux-cuda11.7.zip | 2025-06-19 | 750.6 MB | |
textgen-portable-3.6.1-macos-x86_64.zip | 2025-06-19 | 166.0 MB | |
textgen-portable-3.6.1-linux-cpu.zip | 2025-06-19 | 231.1 MB | |
textgen-portable-3.6.1-linux-vulkan.zip | 2025-06-19 | 239.2 MB | |
textgen-portable-3.6.1-macos-arm64.zip | 2025-06-19 | 176.1 MB | |
README.md | 2025-06-19 | 3.6 kB | |
v3.6.1 source code.tar.gz | 2025-06-19 | 24.9 MB | |
v3.6.1 source code.zip | 2025-06-19 | 25.0 MB | |
Totals: 13 Items | 4.4 GB | 79 |
Changes
- Merge the Default and Notebook tabs into a single Notebook tab (#7078), with an option in the Session tab to switch between one and two columns.
- Autosave text in the Notebook tab (both generated and manually typed), and add "New" and "Rename" buttons for management.
- Saved prompts have been moved from
user_data/prompts
touser_data/logs/notebook
, move any existing ones there. - Add a new Character tab for character settings.
- Remember the last selected chat for each chat mode and character.
- Truncate web search results to at most 8192 tokens to handle edge cases like pages with infinite scrolling.
- Remove images and links from web search results to reduce noise and focus on the relevant text content.
- Add an option to exclude attachments from previous messages in the chat prompt. It can be found in the Session tab.
- Improve the wpp chat style.
- Increase the size of the enlarged character profile picture that appears when clicking the profile picture.
- Move 'Custom system message' to the Parameters > Generation tab.
- Hide the navigation bar on Ctrl+S / Show controls click.
- Always close/open the two sidebars at the same time when clicking their close buttons on desktop.
- Only save active extensions and extensions settings on manual settings save.
- More informative log message when the user input gets truncated.
- Small style improvements to the chat tab.
- Optimize scrolling in the chat tab.
- Optimize syntax highlighting on long conversations.
- Optimize the token count at the end of generation with llama.cpp.
- Disable message action icons during streaming for better performance.
- Expose real model list via /v1/models endpoint (#7088). Thanks @NoxWorld2660
- Improved API examples in the documentation.
- Show file sizes in the Model tab on "Get file list" (new in 3.6.1)
- Force dark theme on the Gradio login page (new in 3.6.1)
Bug fixes
- Ensure estimated vram is updated when switching between different models (#7071). Thanks @miriameng.
- Fix an edge case where the
gpu-layers
slider maximum is incorrectly limited. - Add error handling for non-llama.cpp models in portable mode.
- Fix the character profile picture sometimes not appearing when switching from instruct to chat modes.
- Fix jittering while typing in the Chat tab on Firefox.
- Fix the /v1/models output format (new in 3.6.1)
- Bump numpy to 2.2 to fix loading certain EXL3 models on Windows (new in 3.6.1)
- Fix obtaining the maximum number of GPU layers for DeepSeek-R1-0528-GGUF (new in 3.6.1)
Backend updates
- Update llama.cpp to https://github.com/ggml-org/llama.cpp/commit/e434e69183fd9e1031f4445002083178c331a28b
- Update exllamav3 to 0.0.4
Portable builds
Below you can find portable builds: self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip, and run.
Which version to download:
- Windows/Linux:
- NVIDIA GPU: Use
cuda12.4
for newer GPUs orcuda11.7
for older GPUs and systems with older drivers. - AMD/Intel GPU: Use
vulkan
builds. -
CPU only: Use
cpu
builds. -
Mac:
- Apple Silicon: Use
macos-arm64
. - Intel CPU: Use
macos-x86_64
.
Updating a portable install:
- Download and unzip the latest version.
- Replace the
user_data
folder with the one in your existing install. All your settings and models will be moved.