Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
README.md | 2025-03-25 | 743 Bytes | |
v0.1.4 source code.tar.gz | 2025-03-25 | 570.8 kB | |
v0.1.4 source code.zip | 2025-03-25 | 684.6 kB | |
Totals: 3 Items | 1.3 MB | 1 |
What's Changed
- Refactor Gemma ctor and improve pool NUMA support by @copybara-service in https://github.com/google/gemma.cpp/pull/520
- Fix the prompt wrapping of gemma3-1b by @ufownl in https://github.com/google/gemma.cpp/pull/523
- Add note on attention length and SFP by @copybara-service in https://github.com/google/gemma.cpp/pull/521
- Add support for a secondary EOS token by @copybara-service in https://github.com/google/gemma.cpp/pull/525
- Update app argument documentation by @copybara-service in https://github.com/google/gemma.cpp/pull/526
- Set the secondary EOS for Gemma2 by @ufownl in https://github.com/google/gemma.cpp/pull/527
Full Changelog: https://github.com/google/gemma.cpp/compare/v0.1.3...v0.1.4