Showing 31 open source projects for "wise memory optimizer"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    NVIDIA Model Optimizer

    NVIDIA Model Optimizer

    A unified library of SOTA model optimization techniques

    Model Optimizer is a unified library that provides state-of-the-art techniques for compressing and optimizing deep learning models to improve inference efficiency and deployment performance. It brings together multiple optimization strategies such as quantization, pruning, distillation, and speculative decoding into a single cohesive framework. The library is designed to reduce model size and computational requirements while maintaining accuracy, making it particularly valuable for deploying...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    FPS Booster

    FPS Booster

    Optimizer FPS boosts performance for Roblox, CS reducing ping ,memory

    Optimizer FPS,Fps Booster is a lightweight tool 🎮 that boosts your system’s performance for popular games like Roblox, CS, and Valorant. It’s particularly effective for Roblox, where it helps to reduce ping 🌐 and improve overall gameplay 🚀. Even if you’re using a low-end PC, Optimizer FPS can enhance your gaming experience by optimizing settings and providing smoother, more responsive gameplay. Here’s a detailed look at what it can do:
    Downloads: 54 This Week
    Last Update:
    See Project
  • 3
    AirLLM

    AirLLM

    AirLLM 70B inference with single 4GB GPU

    AirLLM is an open source Python library that enables extremely large language models to run on consumer hardware with very limited GPU memory. The project addresses one of the main barriers to local LLM experimentation by introducing a memory-efficient inference technique that loads model layers sequentially rather than storing the entire model in GPU memory. This layer-wise inference approach allows models with tens of billions of parameters to run on devices with only a few gigabytes of VRAM. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Roblox Ping Optimizer

    Roblox Ping Optimizer

    Roblox Ping Optimizer ping, safe & clean, no exploits. Enhance!

    🚀 Roblox Ping Optimizer- Created by Akash Siddique! 🎮 🔥 Fresh Update: Sleek New Modern UI! ⚡ Boost Your Gameplay Experience! Elevate your Roblox sessions with ultra-fast responses and optimized performance. Conquer the virtual world smoother and faster than ever. Join the revolution and take your gaming to the next level! 🌟 Why Choose Roblox Ping Booster? Unleash your gaming potential with just one click! 🚀
    Leader badge
    Downloads: 137 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Model Explorer

    Model Explorer

    A modern model graph visualizer and debugger

    Model Explorer is a visual tool for exploring, debugging, and optimizing ML models deployed on edge devices. Developed by Google AI Edge, it offers a browser-based interface to inspect layer-wise performance, memory usage, and inference timing of TensorFlow Lite and other supported models. It’s a powerful utility for developers optimizing models for constrained environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    how-to-optim-algorithm-in-cuda

    how-to-optim-algorithm-in-cuda

    How to optimize some algorithm in cuda

    ...The project combines technical notes, code examples, and practical experiments that demonstrate how common computational kernels can be optimized to improve speed and memory efficiency. Instead of presenting only theoretical explanations, the repository includes hand-written CUDA implementations of fundamental operations such as reductions, element-wise computations, softmax, and attention mechanisms. These examples show how different optimization techniques influence performance on modern GPU hardware and allow readers to experiment with real implementations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    FSRS4Anki

    FSRS4Anki

    A modern Anki custom scheduling based on Free Spaced Repetition

    A modern spaced-repetition scheduler for Anki based on the Free Spaced Repetition Scheduler algorithm.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    InMemoryDatasets.jl

    InMemoryDatasets.jl

    Multithreaded package for working with tabular data in Julia

    InMemoryDatasets.jl is a multithreaded package for data manipulation and is designed for Julia 1.6+ (64-bit OS). The core computation engine of the package is a set of customized algorithms developed specifically for columnar tables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    LightGBM

    Gradient boosting framework based on decision tree algorithms

    LightGBM or Light Gradient Boosting Machine is a high-performance, open source gradient boosting framework based on decision tree algorithms. Compared to other boosting frameworks, LightGBM offers several advantages in terms of speed, efficiency and accuracy. Parallel experiments have shown that LightGBM can attain linear speed-up through multiple machines for training in specific settings, all while consuming less memory. LightGBM supports parallel and GPU learning, and can handle...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    rtabmap

    rtabmap

    RTAB-Map library and standalone application

    ...The loop closure detector uses a bag-of-words approach to determine how likely a new image comes from a previous location or a new location. When a loop closure hypothesis is accepted, a new constraint is added to the map’s graph, then a graph optimizer minimizes the errors in the map. A memory management approach is used to limit the number of locations used for loop closure detection and graph optimization so that real-time constraints on large-scale environments are always respected. RTAB-Map can be used alone with a handheld Kinect, a stereo camera or a 3D lidar for 6DoF mapping, or on a robot equipped with a laser rangefinder for 3DoF mapping.
    Downloads: 32 This Week
    Last Update:
    See Project
  • 11
    Polars

    Polars

    Dataframes powered by a multithreaded, vectorized query engine

    Polars is a high-performance, multi-language DataFrame library built in Rust using Apache Arrow. It delivers blazing-fast, vectorized, and parallel data manipulation with both eager and lazy execution, making it an excellent tool for data processing in Python, Rust, Node.js, R, and SQL contexts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    verl-agent

    verl-agent

    Designed for training LLM/VLM agents via RL

    ...The framework supports multi-turn interactions between agents and their environments, allowing the system to receive feedback after each step and adjust its strategy accordingly. This step-wise interaction model makes it possible to train agents to operate in long-horizon scenarios where decisions depend on cumulative context and previous outcomes. Developers can configure memory modules that determine how historical information is stored and incorporated into each step of the reasoning process.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    This script is a system tuning script for Linux (especially aimed at EasyOS / Puppy Linux) that tries to make the machine feel faster, more responsive, and more SSD-friendly by changing a number of kernel and runtime settings. Here is the explanation in English: What this script does The script creates a small log file in: /tmp/pp4mnk-blackfire.log Then it waits 4 seconds and starts applying a series of performance tweaks in these areas: • Memory • CPU • SSD / disk...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    PhoneNumberKit

    PhoneNumberKit

    A Swift framework for parsing and formatting phone numbers

    ...All of your interactions with PhoneNumberKit happen through a PhoneNumberKit object. The first step you should take is to allocate one. A PhoneNumberKit instance is relatively expensive to allocate (it parses the metadata and keeps it in memory for the object's lifecycle), you should try and make sure PhoneNumberKit is allocated once and deallocated when no longer needed. To parse a string, use the parse function. The region code is automatically computed but can be overridden if needed. PhoneNumberKit automatically does a hard type validation to ensure that the object created is valid, this can be quite costly performance-wise and can be turned off if needed.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    TensorRT

    TensorRT

    C++ library for high performance inference on NVIDIA GPUs

    NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference. It includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for deep learning inference applications. TensorRT-based applications perform up to 40X faster than CPU-only platforms during inference. With TensorRT, you can optimize neural network models trained in all major frameworks, calibrate for lower precision with high accuracy, and deploy to hyperscale data centers,...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 16

    rw

    rw calculates rank-width and rank-decompositions.

    ...It is based on ideas from "Computing rank-width exactly" by Sang-il Oum, "Sopra una formula numerica" by Ernesto Pascal, "Generation of a Vector from the Lexicographical Index" by B.P. Buckles and M. Lybanon and "Fast additions on masked integers" by Michael D. Adams and David S. Wise. On 2009's computers it works quite well up to graph sizes of about 28 nodes. Runtime and memory usage are exponential in the graph size.
    Downloads: 73 This Week
    Last Update:
    See Project
  • 17
    rathole

    rathole

    A lightweight and high-performance reverse proxy for NAT traversal

    ...High Performance Much higher throughput can be achieved than frp, and more stable when handling a large volume of connections. Low Resource Consumption Consumes much fewer memory than similar tools. See Benchmark. The binary can be as small as ~500KiB to fit the constraints of devices, like embedded devices as routers. Security Tokens of services are mandatory and service-wise. The server and clients are responsible for their own configs. With the optional Noise Protocol, encryption can be configured at ease. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    FairScale

    FairScale

    PyTorch extensions for high performance and large scale training

    FairScale is a collection of PyTorch performance and scaling primitives that pioneered many of the ideas now used for large-model training. It introduced Fully Sharded Data Parallel (FSDP) style techniques that shard model parameters, gradients, and optimizer states across ranks to fit bigger models into the same memory budget. The library also provides pipeline parallelism, activation checkpointing, mixed precision, optimizer state sharding (OSS), and auto-wrapping policies that reduce boilerplate in complex distributed setups. Its components are modular, so teams can adopt just the sharding optimizer or the pipeline engine without rewriting their training loop. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Fairseq

    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Img Optimizer Gradle Plugin

    Img Optimizer Gradle Plugin

    Gradle plugin for optimizing PNGs

    ...Because it is integrated into the build, it can process only changed images and skip redundant work, improving performance. Its configuration is minimal, making it easy to adopt in existing Android projects. For apps sensitive to download size or memory usage, this plugin offers a practical way to squeeze out extra gains in deployment efficiency.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    SimSiam

    SimSiam

    PyTorch implementation of SimSiam

    SimSiam is a PyTorch implementation of “Exploring Simple Siamese Representation Learning” by Xinlei Chen and Kaiming He. The project introduces a minimalist approach to self-supervised learning that avoids negative pairs, momentum encoders, or large memory banks—key complexities of prior contrastive methods. SimSiam learns image representations by maximizing similarity between two augmented views of the same image through a Siamese neural network with a stop-gradient operation, preventing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    ZiteCalculator

    A simple Windows Calculator with Radix conversion and Memory Storage

    A simple Windows Calculator with Radix conversion, bit-wise operations(with 32-bit numbers only), other operations with 64-bit numbers. All radices from 2 to 10 & 16, binary, Decimal, Hex, Octa etc supported. The current operation will shown in a separate part to keep track. Multiple answers can be stored in Memory variables for further use. All Memory items stored, can be viewed in a separate window and its operation details also viewable, to help users select correct part and substitute during big evaluations done part by part. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Rdbtools

    Rdbtools

    Parse Redis dump.rdb files, Analyze Memory, and Export Data to JSON

    Rdbtools is a parser for Redis' dump.rdb files. The parser generates events similar to an XML sax parser and is very efficient memory-wise. Rdbtools is written in Python, though there are similar projects in other languages. Every run of RDB Tool requires to specify a command to indicate what should be done with the parsed RDB data. Valid commands are JSON, diff, justkeys, justkeyvals and protocol. The JSON command output is UTF-8 encoded JSON. By default, the callback try to parse RDB data using UTF-8 and escape non 'ASCII printable' characters with the \U notation, or non-UTF-8 parsable bytes with \x. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 24
    Stacer

    Stacer

    Linux System Optimizer and Monitoring

    Stacer is an open source system optimizer and application monitor that helps users to manage the entire system with different aspects, it's an all-in-one system utility. In the Startup Apps tab, you can view the applications the system launches at boot time and set up new startup apps. This is especially handy if you work with different distributions: You do not always need to think about where you need to set up applications that run at boot time on the respective systems, and you can also...
    Downloads: 43 This Week
    Last Update:
    See Project
  • 25

    JILRuntime/JewelScript

    An object-oriented script language to embed in any application

    A general purpose, object-oriented script language that compiles into code for a register based virtual machine. The language is quite similar to object-oriented high-level languages like Java and C#. The library is entirely self-sufficient and ANSI C compliant. It's main purpose is to be embedded in any application to allow automation of that application through scripting. An integrated C++ binding code generator allows you to create bindings for your application's classes in seconds....
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB