network performance free download

xLSTM

Neural Network architecture based on ideas of the original LSTM

xLSTM is an open-source machine learning architecture that reimagines the classic Long Short-Term Memory (LSTM) network for modern large-scale language modeling and sequence processing tasks. The project introduces a new recurrent neural network design that incorporates exponential gating mechanisms and enhanced memory structures to overcome limitations of traditional LSTM models. By introducing innovations such as matrix-based memory and improved normalization techniques, xLSTM improves the ability of recurrent networks to capture long-range dependencies in sequential data. ...

Downloads: 1 This Week

Last Update: 2026-03-06

See Project

Agents 2.0

An Open-source Framework for Data-centric Language Agents

Agents is an open-source framework designed to build and train autonomous language agents through a data-centric and learning-oriented architecture. The project introduces a concept known as agent symbolic learning, which treats an agent pipeline similarly to a neural network computational graph. In this framework, each node in the pipeline represents a step in the reasoning or action process, while prompts and tools act as adjustable parameters analogous to neural network weights. During...

Downloads: 0 This Week

Last Update: 2026-03-04

See Project

MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models

...Introduced in the ICML 2024 paper “MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases”, it focuses on delivering strong reasoning and generalization capabilities in models under one billion parameters. The framework integrates several architectural innovations—SwiGLU activation, deep and thin network design, embedding sharing, and grouped-query attention (GQA)—to achieve a superior trade-off between model size, inference speed, and accuracy. MobileLLM demonstrates remarkable performance, with the 125M and 350M variants outperforming previous state-of-the-art models of the same scale by up to 4.3% on zero-shot commonsense reasoning tasks.

Downloads: 2 This Week

Last Update: 2 days ago

See Project

MatMul-Free LM

Implementation for MatMul-free LM

MatMul-Free LM is an experimental implementation of a large language model architecture designed to eliminate traditional matrix multiplication operations used in transformer networks. Since matrix multiplication is one of the most computationally expensive components of modern language models, the project explores alternative computational strategies that reduce hardware requirements while maintaining comparable performance. The architecture relies on quantization-aware training and...

Downloads: 0 This Week

Last Update: 2026-03-05

See Project

Alpa

Training and serving large-scale neural networks

Alpa is a system for training and serving large-scale neural networks. Scaling neural networks to hundreds of billions of parameters has enabled dramatic breakthroughs such as GPT-3, but training and serving these large-scale neural networks require complicated distributed system techniques. Alpa aims to automate large-scale distributed training and serving with just a few lines of code.

Downloads: 0 This Week

Last Update: 2023-03-23

See Project

Search Results for "network performance"

Showing 5 open source projects for "network performance"

xLSTM

Agents 2.0

MobileLLM

MatMul-Free LM

Alpa

Search Results for "network performance"

Showing 5 open source projects for "network performance"

xLSTM

Agents 2.0

MobileLLM

MatMul-Free LM

Alpa

Related Categories