Name | Modified | Size | Downloads / Week |
---|---|---|---|
README.md | 2025-05-28 | 5.9 kB | |
v0.21.0 source code.tar.gz | 2025-05-28 | 4.6 MB | |
v0.21.0 source code.zip | 2025-05-28 | 5.1 MB | |
Totals: 3 Items | | 9.6 MB | 3 |
Summary
- New Models.
  - Xception: Added the Xception architecture for image classification tasks.
  - Qwen: Added Qwen2.5 large language models, with base and instruction-tuned presets ranging from 0.5 to 72 billion parameters.
  - Qwen MoE: Added a transformer-based, decoder-only Mixture of Experts (MoE) language model; the base variant activates 2.7B parameters at runtime.
  - Mixtral: Added the Mixtral LLM, a generative Sparse Mixture of Experts model, with pretrained and instruction-tuned variants having 7 billion activated parameters.
  - Moonshine: Added Moonshine, a speech recognition model.
  - CSPNet: Added the Cross Stage Partial Network (CSPNet) classification model.
- Llama3: Added support for Llama 3.1 and 3.2.
- Added sharded weight support to KerasPresetSaver and KerasPresetLoader, defaulting to a 10GB maximum shard size.
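
To illustrate what a maximum shard size means in practice, here is a minimal, library-independent sketch of size-capped sharding. It is not the KerasHub implementation: the function `plan_shards`, the weight names, the greedy grouping strategy, and the reading of "10GB" as 10 * 1024**3 bytes are all assumptions made for illustration.

```python
from typing import Dict, List

GB = 1024 ** 3  # assumption: "10GB" is read here as 10 * 1024**3 bytes


def plan_shards(
    weight_sizes: Dict[str, int],
    max_shard_bytes: int = 10 * GB,
) -> List[List[str]]:
    """Greedily group weights, in order, into shards of at most max_shard_bytes.

    Hypothetical helper for illustration only. A single weight larger than
    the limit is placed in a shard of its own rather than being split.
    """
    shards: List[List[str]] = []
    current: List[str] = []
    current_bytes = 0
    for name, size in weight_sizes.items():
        # Close the current shard if adding this weight would exceed the cap.
        if current and current_bytes + size > max_shard_bytes:
            shards.append(current)
            current, current_bytes = [], 0
        current.append(name)
        current_bytes += size
    if current:
        shards.append(current)
    return shards


# Hypothetical weight sizes, in bytes.
sizes = {
    "embedding": 6 * GB,
    "block_0": 3 * GB,
    "block_1": 3 * GB,
    "lm_head": 6 * GB,
}
print(plan_shards(sizes))
# [['embedding', 'block_0'], ['block_1', 'lm_head']]
```

With the default cap, the 18 GB of hypothetical weights above end up in two shards of 9 GB each; lowering `max_shard_bytes` produces more, smaller shards.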
What's Changed
- Fix Roformer export symbol by @abheesht17 in https://github.com/keras-team/keras-hub/pull/2199
- Bump up master version to 0.21 by @abheesht17 in https://github.com/keras-team/keras-hub/pull/2204
- reenable test by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2188
- Add xception model by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2179
- Make image converter built by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2206
- Qwen - Fix Preset Loader + Add Causal LM Test by @kanpuriyanawab in https://github.com/keras-team/keras-hub/pull/2193
- Update Qwen conversion script by @laxmareddyp in https://github.com/keras-team/keras-hub/pull/2207
- Revert "Do not export Qwen for release" by @sachinprasadhs in https://github.com/keras-team/keras-hub/pull/2208
- Fixes compute_output_shape for PaliGemmaVitEncoder and Gemma3VisionEncoderBlock by @JyotinderSingh in https://github.com/keras-team/keras-hub/pull/2210
- Python 3.12 fix by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2211
- Small Gemma3 doc-string edits by @abheesht17 in https://github.com/keras-team/keras-hub/pull/2214
- Llama3.1 by @pctablet505 in https://github.com/keras-team/keras-hub/pull/2132
- Update gemma3_causal_lm_preprocessor.py by @pctablet505 in https://github.com/keras-team/keras-hub/pull/2217
- fix: apply `weights_only = True` by @b8zhong in https://github.com/keras-team/keras-hub/pull/2215
- Fix the keras_hub package for typecheckers and IDEs by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2222
- Add utility to map COCO IDs to class names by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2219
- Set GPU timeouts to 2 hours by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2226
- Fix nightly by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2227
- Another fix for nightly builds by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2229
- Cast a few more input to tensors in SD3 by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2234
- Fix up package build scripts again by @mattdangerw in https://github.com/keras-team/keras-hub/pull/2230
- Add qwen presets by @laxmareddyp in https://github.com/keras-team/keras-hub/pull/2241
- script for converting retinanet weights from torchvision by @sineeli in https://github.com/keras-team/keras-hub/pull/2233
- Sharded weights support by @james77777778 in https://github.com/keras-team/keras-hub/pull/2218
- Add Qwen Moe by @kanpuriyanawab in https://github.com/keras-team/keras-hub/pull/2163
- Add Mixtral by @kanpuriyanawab in https://github.com/keras-team/keras-hub/pull/2196
- Made label data optional for inference and adopted other required changes by @laxmareddyp in https://github.com/keras-team/keras-hub/pull/2183
- Fix the layer names by @kanpuriyanawab in https://github.com/keras-team/keras-hub/pull/2247
- Add new CSPNet preset and add manual padding. by @sachinprasadhs in https://github.com/keras-team/keras-hub/pull/2212
- Update the int8 quant logic in `ReversibleEmbedding` by @james77777778 in https://github.com/keras-team/keras-hub/pull/2250
- Add Moonshine to KerasHub by @harshaljanjani in https://github.com/keras-team/keras-hub/pull/2093
- Add Kaggle handle for moonshine presets by @laxmareddyp in https://github.com/keras-team/keras-hub/pull/2253
- Update requirements-jax-cuda.txt by @pctablet505 in https://github.com/keras-team/keras-hub/pull/2252
- Add Mixtral,Qwen-MoE presets and Update conversion script. by @laxmareddyp in https://github.com/keras-team/keras-hub/pull/2248
- fix flash attention test by @divyashreepathihalli in https://github.com/keras-team/keras-hub/pull/2263
- Fix JAX bugs for qwen moe & mixtral by @kanpuriyanawab in https://github.com/keras-team/keras-hub/pull/2258
- Create pull_request_template.md by @sachinprasadhs in https://github.com/keras-team/keras-hub/pull/2262
- Update preset versions for sharded models by @laxmareddyp in https://github.com/keras-team/keras-hub/pull/2264
- Add AudioToText and AudioToTextPreprocessor class stubs to enable auto class functionality by @harshaljanjani in https://github.com/keras-team/keras-hub/pull/2265
- register moonshine presets by @sachinprasadhs in https://github.com/keras-team/keras-hub/pull/2267
- Version bump 0.21.0.dev1 by @laxmareddyp in https://github.com/keras-team/keras-hub/pull/2273
- Version bump to 0.21.0 by @laxmareddyp in https://github.com/keras-team/keras-hub/pull/2275
New Contributors
- @JyotinderSingh made their first contribution in https://github.com/keras-team/keras-hub/pull/2210
- @pctablet505 made their first contribution in https://github.com/keras-team/keras-hub/pull/2132
- @b8zhong made their first contribution in https://github.com/keras-team/keras-hub/pull/2215
Full Changelog: https://github.com/keras-team/keras-hub/compare/v0.20.0...v0.21.0