fast performance free download

GLM-4.5

GLM-4.5: Open-source LLM for intelligent agents by Z.ai

GLM-4.5 is a cutting-edge open-source large language model designed by Z.ai for intelligent agent applications. The flagship GLM-4.5 model has 355 billion total parameters with 32 billion active parameters, while the compact GLM-4.5-Air version offers 106 billion total parameters and 12 billion active parameters. Both models unify reasoning, coding, and intelligent agent capabilities, providing two modes: a thinking mode for complex reasoning and tool usage, and a non-thinking mode for...

1 Review

Downloads: 134 This Week

Last Update: 2026-01-20

RWKV Runner

An RWKV management and startup tool: fully automated, only 8MB

RWKV (pronounced RwaKuv) is an RNN with GPT-level LLM performance that can also be trained directly like a GPT transformer (parallelizable). It combines the best of RNNs and transformers: great performance, fast inference, fast training, low VRAM use, "infinite" ctxlen, and free text embedding. It is also 100% attention-free. The default configs enable custom CUDA kernel acceleration, which is much faster and consumes much less VRAM.

Downloads: 2 This Week

Last Update: 3 days ago

llama.cpp

Port of Facebook's LLaMA model in C/C++

The llama.cpp project enables the inference of Meta's LLaMA model (and other models) in pure C/C++ without requiring a Python runtime. It is designed for efficient and fast model execution, offering easy integration for applications needing LLM-based capabilities. The repository focuses on providing a highly optimized and portable implementation for running large language models directly within C/C++ environments.

1 Review

Downloads: 162 This Week

Last Update: 9 hours ago

Engram

A New Axis of Sparsity for Large Language Models

Engram is a high-performance embedding and similarity search library focused on making retrieval-augmented workflows efficient, scalable, and easy to adopt by developers building search, recommendation, or semantic matching systems. It provides utilities to generate embeddings from text or other structured data, index them using efficient approximate nearest neighbor algorithms, and perform real-time similarity queries even on large corpora. Engineered with speed and memory efficiency in...

Downloads: 0 This Week

Last Update: 1 day ago

Learn AI Engineering

Learn AI and LLMs from scratch using free resources

Learn AI Engineering is a learning path for AI engineering that consolidates high-quality, free resources across the full stack: math, Python foundations, machine learning, deep learning, LLMs, agents, tooling, and deployment. Rather than a loose bookmark list, it organizes topics into a progression so learners can start from fundamentals and move toward practical, production-oriented skills. It mixes courses, articles, code labs, and videos, emphasizing materials that teach both concepts...

Downloads: 0 This Week

Last Update: 2025-11-12

FastEdit

Editing large language models within 10 seconds

...For applied teams, FastEdit offers a toolbox to keep models current and compliant while minimizing collateral damage to overall performance.

Downloads: 0 This Week

Last Update: 2025-11-10

llm

An ecosystem of Rust libraries for working with large language models

llm is an ecosystem of Rust libraries for working with large language models, built on top of the fast, efficient GGML library for machine learning. The primary entry point for developers is the llm crate, which wraps llm-base and the supported model crates. Documentation for the released version is available on Docs.rs. For end users, there is a CLI application, llm-cli, which provides a convenient interface for interacting with supported models. Text generation can be done as a...

Downloads: 0 This Week

Last Update: 2023-08-21

Search Results for "fast performance"

Showing 7 open source projects for "fast performance"
