Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
load_test.html | 2025-06-22 | 1.7 MB | |
load_test_stats.csv | 2025-06-22 | 534 Bytes | |
README.md | 2025-06-21 | 2.2 kB | |
v1.73.0-nightly source code.tar.gz | 2025-06-21 | 172.2 MB | |
v1.73.0-nightly source code.zip | 2025-06-21 | 174.0 MB | |
Totals: 5 Items | 347.9 MB | 0 |
What's Changed
- Update Azure o3 pricing to match OpenAI pricing ($2/$8 per 1M tokens) by @ervwalter in https://github.com/BerriAI/litellm/pull/11937
- [BugFix] Ollama response_format not working by @ThakeeNathees in https://github.com/BerriAI/litellm/pull/11880
- fix aws bedrock claude tool call index by @jnhyperion in https://github.com/BerriAI/litellm/pull/11842
- fix(acompletion): allow dict for tool_choice argument by @Jannchie in https://github.com/BerriAI/litellm/pull/11860
- [Chore] Check team counts on license when creating new team by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/11943
- [Docs] [Pre-Release] v1.73.0-stable by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/11950
- Show user all models they can call (Across teams) on UI by @krrishdholakia in https://github.com/BerriAI/litellm/pull/11948
New Contributors
- @ervwalter made their first contribution in https://github.com/BerriAI/litellm/pull/11937
- @ThakeeNathees made their first contribution in https://github.com/BerriAI/litellm/pull/11880
- @jnhyperion made their first contribution in https://github.com/BerriAI/litellm/pull/11842
- @Jannchie made their first contribution in https://github.com/BerriAI/litellm/pull/11860
Full Changelog: https://github.com/BerriAI/litellm/compare/v1.72.9-nightly...v1.73.0-nightly
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.73.0-nightly
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 260.0 | 272.8001474868273 | 6.0932611338548 | 0.0 | 1822 | 0 | 217.21155499994893 | 1591.5092660000028 |
Aggregated | Passed ✅ | 260.0 | 272.8001474868273 | 6.0932611338548 | 0.0 | 1822 | 0 | 217.21155499994893 | 1591.5092660000028 |