Infinity Files

Low-latency REST API for serving text-embeddings

This is an exact mirror of the Infinity project, hosted at https://github.com/michaelfeil/infinity. SourceForge is not affiliated with Infinity.

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
0.0.76 source code.tar.gz	2025-03-16	2.8 MB	0
0.0.76 source code.zip	2025-03-16	2.9 MB	0
README.md	2025-03-16	944 Bytes	0
Totals: 3 Items		5.7 MB	0

torch=2.6.0 update - 5-10% faster attention on hopper -> previously 2.4.1 -> does no longer work with torch.compile + bettertransformers. We recommend disabling torch.compile for this model class.
flash-attn included in docker image for nvidia.

What's Changed

bump client version by @wirthual in https://github.com/michaelfeil/infinity/pull/522
add new st version by @michaelfeil in https://github.com/michaelfeil/infinity/pull/523
Version check step by @wirthual in https://github.com/michaelfeil/infinity/pull/524
README: add example for using local model wtth docker container by @wirthual in https://github.com/michaelfeil/infinity/pull/528
add vision client template by @wirthual in https://github.com/michaelfeil/infinity/pull/526
bump to 2.6 torch by @michaelfeil in https://github.com/michaelfeil/infinity/pull/556

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.75...0.0.76

Source: README.md, updated 2025-03-16

Other Useful Business Software

Try Google Cloud Risk-Free With $300 in Credit Icon

Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free

AI-generated apps that pass security review Icon

AI-generated apps that pass security review

Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free

MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free

Recommended Projects

Infinity Virtualization Platform
A virtual infrastructure without the dedicated physical infrastructure
Shaiya Infinity
Shaiya Infinity Game Client
Infinity
music visualization plugin
Infinity Roller
Automatic initial attribute roller for infinity games
Near Infinity
The source code for this project is now served on GitHub. The latest version history can be found here: https://github.com/FredSRichardson/NearInfinity Near Infinity - An Infinity Engine Browser and Editor by Jon Olav Hauglid (http://www.idi.ntnu.no/~joh/ni/index.html)