llama-cpp-static free download

DeepSeek R1

Open-source, high-performance AI model with advanced reasoning

...This approach has resulted in performance comparable to leading models like OpenAI's o1, while maintaining cost-efficiency. To further support the research community, DeepSeek has released distilled versions of the model based on architectures such as LLaMA and Qwen.

1 Review

Downloads: 150 This Week

Last Update: 2025-07-09

See Project

Cosmos-RL

Cosmos-RL is a flexible and scalable Reinforcement Learning framework

...The framework supports multiple parallelism strategies, including tensor, pipeline, and data parallelism, allowing it to leverage large GPU clusters effectively. It is built with compatibility in mind, supporting popular model families such as LLaMA, Qwen, and diffusion-based world models, as well as integration with Hugging Face ecosystems. cosmos-rl also includes support for advanced RL algorithms, low-precision training, and fault-tolerant execution, making it suitable for large-scale production workloads.

Downloads: 2 This Week

Last Update: 3 days ago

See Project

Atropos

Language Model Reinforcement Learning Environments frameworks

...Designed as a scalable ecosystem of environment microservices, Atropos allows researchers and developers to collect, evaluate, and manage trajectories (sequences of actions and outcomes) generated by LLMs across a variety of tasks—from static dataset benchmarks to dynamic interactive games and real-world scenario environments. It provides foundational tooling for asynchronous RL loops where environment services communicate with trainers and inference engines, enabling complex workflow orchestration in distributed and parallel setups. This framework facilitates experimentation with RLHF (Reinforcement Learning from Human Feedback), RLAIF, or multi-turn training approaches by abstracting environment logic, scoring, and logging into reusable components.

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

RecNN

Reinforced Recommendation toolkit built around pytorch 1.7

This is my school project. It focuses on Reinforcement Learning for personalized news recommendation. The main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library with SOTA algorithms for reinforcement learning recommendation, providing the level of abstraction you like.

Downloads: 0 This Week

Last Update: 2024-06-04

See Project

TorchCraft

Connecting Torch to StarCraft

We present TorchCraft, a library that enables deep learning research on Real-Time Strategy (RTS) games such as StarCraft: Brood War, by making it easier to control these games from a machine learning framework, here Torch. This white paper argues for using RTS games as a benchmark for AI research, and describes the design and components of TorchCraft. TorchCraft is a BWAPI module that sends StarCraft data out over a ZMQ connection. This lets you parse StarCraft data and interact with BWAPI...

Downloads: 0 This Week

Last Update: 2022-08-16

See Project

Search Results for "llama-cpp-static"

Showing 5 open source projects for "llama-cpp-static"

DeepSeek R1

Cosmos-RL

Atropos

RecNN

TorchCraft

Search Results for "llama-cpp-static"

Showing 5 open source projects for "llama-cpp-static"

DeepSeek R1

Cosmos-RL

Atropos

RecNN

TorchCraft

Related Searches

Related Categories