| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2025-11-29 | 3.5 kB | |
| v1.14.0 source code.tar.gz | 2025-11-29 | 54.9 MB | |
| v1.14.0 source code.zip | 2025-11-29 | 55.9 MB | |
| Totals: 3 Items | 110.7 MB | 0 | |
What's new in 1.14.0 (2025-11-30)
These are the changes in inference v1.14.0.
New features
- FEAT: add vLLM 0.11.1+ compatibility with v1 executor support by @amumu96 in https://github.com/xorbitsai/inference/pull/4252
- FEAT: [virtualenv] New v3 spec and list/delete virtual env APIs by @OliverBryant in https://github.com/xorbitsai/inference/pull/4254
- FEAT: [model] HunyuanOCR support by @OliverBryant in https://github.com/xorbitsai/inference/pull/4290
- FEAT: Add support of rerank model for llamacpp by @harryzwh in https://github.com/xorbitsai/inference/pull/4227
- FEAT: show reason why engines not available by @OliverBryant in https://github.com/xorbitsai/inference/pull/4261
- FEAT: Parallel startup model, add tooltips for startup progress, and p… by @leslie2046 in https://github.com/xorbitsai/inference/pull/4268
Enhancements
- BLD: fix model ui launch error with gradio 6.x by @OliverBryant in https://github.com/xorbitsai/inference/pull/4289
- BLD: add pr auto run gen_docs workflow. by @yiboyasss in https://github.com/xorbitsai/inference/pull/4260
- BLD: gen docs pr modify by @OliverBryant in https://github.com/xorbitsai/inference/pull/4294
- BLD: gen doc modify v2 by @OliverBryant in https://github.com/xorbitsai/inference/pull/4296
- BLD: gen docs pr modify v3 by @OliverBryant in https://github.com/xorbitsai/inference/pull/4297
- BLD: auto-run gen_docs.py from doc/source by @yiboyasss in https://github.com/xorbitsai/inference/pull/4300
- BLD: remove [skip ci] from auto docs commit by @yiboyasss in https://github.com/xorbitsai/inference/pull/4301
Bug fixes
- BUG: Compat with xllamacpp 0.2.5+ by @codingl2k1 in https://github.com/xorbitsai/inference/pull/4270
- BUG: add download_hubs for cluster by @OliverBryant in https://github.com/xorbitsai/inference/pull/4273
- BUG: sometimes cannot select gpu in CPU and GPU hybrid cluster by @leslie2046 in https://github.com/xorbitsai/inference/pull/4280
Documentation
- DOC: added v1.13.0 release notes by @qinxuye in https://github.com/xorbitsai/inference/pull/4250
- DOC: update gen_docs by @qinxuye in https://github.com/xorbitsai/inference/pull/4302
Others
- CHORE: expand stale and close time by @qinxuye in https://github.com/xorbitsai/inference/pull/4253
- chore: sync models JSON [audio, embedding, image, llm, rerank, video] by @XprobeBot in https://github.com/xorbitsai/inference/pull/4258
- chore: sync models JSON [llm] by @XprobeBot in https://github.com/xorbitsai/inference/pull/4272
- chore: sync model "Qwen3-Reranker-0.6B" JSON by @OliverBryant in https://github.com/xorbitsai/inference/pull/4277
- chore: sync model "bge-reranker-v2-m3" JSON by @OliverBryant in https://github.com/xorbitsai/inference/pull/4276
- chore: sync model "Qwen3-Reranker-4B" JSON by @OliverBryant in https://github.com/xorbitsai/inference/pull/4278
- chore: sync model "Qwen3-Reranker-8B" JSON by @OliverBryant in https://github.com/xorbitsai/inference/pull/4279
- chore: sync model "qwen3" JSON by @XprobeBot in https://github.com/xorbitsai/inference/pull/4287
- chore: sync models JSON [rerank] by @XprobeBot in https://github.com/xorbitsai/inference/pull/4284
- chore: sync model "FLUX.1-dev" JSON by @OliverBryant in https://github.com/xorbitsai/inference/pull/4293
- chore: sync model "FLUX.2-dev" JSON by @OliverBryant in https://github.com/xorbitsai/inference/pull/4292
- chore: sync models JSON [image] by @XprobeBot in https://github.com/xorbitsai/inference/pull/4303
Full Changelog: https://github.com/xorbitsai/inference/compare/v1.13.0...v1.14.0