Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "python q learning" - Page 6

x

Sort By:

Relevance

Clear All Filters

OS

ChromeOS 449
Linux 448
Mac 448
More...
Windows 448
BSD 447
Mobile Operating Systems 5
Desktop Operating Systems 1

Category

Artificial Intelligence 449
Scientific/Engineering 20
Software Development 20
Multimedia 5
Business 4
Internet 2
Productivity 2
Social sciences 2
System 2
Communications 1
Database 1
Education 1
Games 1
Security 1

License

OSI-Approved Open Source 363
Creative Commons Attribution License 5
Other License 1

Translations

English 14
Spanish 2
Brazilian Portuguese 1
Chinese (Simplified) 1
More...
Chinese (Traditional) 1

Programming Language

Python 395
Java 9
JavaScript 9
C++ 6
More...
C 4
Go 4
TypeScript 4
Unix Shell 4
MATLAB 3
Rust 3
Julia 2
PHP 2
C# 1
Lua 1
Prolog 1
S/R 1
Swift 1
Yacc 1

Status

Beta 14
Alpha 8
Production/Stable 8
Pre-Alpha 5
More...
Planning 2

449 projects for "python q learning" with 2 filters applied:

Artificial Intelligence ChromeOS Clear Filters & Widen Search

Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
1

Ploomber

The fastest way to build data pipelines

Ploomber is an open-source framework designed to simplify the development and deployment of data science and machine learning pipelines. It allows developers to transform exploratory data analysis workflows into production-ready pipelines without rewriting large portions of code. The system integrates with common development environments such as Jupyter Notebook, VS Code, and PyCharm, enabling data scientists to continue working with familiar tools while building scalable workflows. Ploomber...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
2

ComfyUI-3D-Pack

An extensive node suite that enables ComfyUI to process 3D inputs

ComfyUI-3D-Pack is an extension package for the ComfyUI visual AI workflow environment that enables users to generate and manipulate 3D assets using advanced machine learning techniques. ComfyUI itself is a node-based interface for designing and executing generative AI pipelines, and this extension expands its capabilities by introducing nodes specifically designed for working with three-dimensional data. The package allows the platform to process inputs such as meshes and UV textures and...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
3

Kaggle Solutions

Collection of Kaggle Solutions and Ideas

Kaggle Solutions is an open-source repository that compiles winning solutions, insights, and educational resources from hundreds of Kaggle data science competitions. The repository acts as a knowledge base for competitive machine learning by collecting solution write-ups, discussion threads, code notebooks, and tutorial resources shared by top Kaggle participants. Each competition entry typically includes information about the dataset, evaluation metrics, modeling strategies, and techniques...

Downloads: 0 This Week

Last Update: 2026-05-06
See Project
4

verl-agent

Designed for training LLM/VLM agents via RL

verl-agent is an open-source reinforcement learning framework designed to train large language model agents and vision-language model agents for complex interactive environments. Built as an extension of the veRL reinforcement learning infrastructure, the project focuses on enabling scalable training for agents that perform multi-step reasoning and decision-making tasks. The framework supports multi-turn interactions between agents and their environments, allowing the system to receive...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

AgentEvolver

Towards Efficient Self-Evolving Agent System

AgentEvolver is an open-source research framework for building self-evolving AI agents powered by large language models. The system focuses on improving the efficiency and scalability of training autonomous agents by allowing them to generate tasks, explore environments, and refine strategies without heavy reliance on manually curated datasets. Its architecture combines reinforcement learning with LLM-driven reasoning mechanisms to guide exploration and learning. The framework introduces...

Downloads: 0 This Week

Last Update: 2026-03-28
See Project
6

RLHF-Reward-Modeling

Recipes to train reward model for RLHF

RLHF-Reward-Modeling is an open-source research framework focused on training reward models used in reinforcement learning from human feedback for large language models. In RLHF pipelines, reward models are responsible for evaluating generated responses and assigning scores that guide the model toward outputs that better match human preferences. The repository provides training recipes and implementations for building reward and preference models using modern machine learning frameworks. It...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
7

PRIME

Scalable RL solution for advanced reasoning of language models

PRIME is an open-source reinforcement learning framework designed to improve the reasoning capabilities of large language models through process-level rewards rather than relying only on final outputs. The system introduces the concept of process reinforcement through implicit rewards, allowing models to receive feedback on intermediate reasoning steps instead of evaluating only the final answer. This approach helps models learn better reasoning strategies and encourages them to generate...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
8

Dash Data Agent

Self-learning data agent that grounds its answers in layers of content

Dash is a self-learning data agent built by the Agno AI community that generates grounded answers to English queries over structured data by synthesizing SQL and reasoning based on six layers of context, improving automatically with each run. It sidesteps common limitations of simple text-to-SQL agents by incorporating multiple context layers — including schema structure, human annotations, known query patterns, institutional knowledge from docs, machine-discovered error patterns, and live...

Downloads: 0 This Week

Last Update: 2026-04-08
See Project
9

DLRM

An implementation of a deep learning recommendation model (DLRM)

DLRM (Deep Learning Recommendation Model) is Meta’s open-source reference implementation for large-scale recommendation systems built to handle extremely high-dimensional sparse features and embedding tables. The architecture combines dense (MLP) and sparse (embedding) branches, then interacts features via dot product or feature interactions before passing through further dense layers to predict click-through, ranking scores, or conversion probabilities. The implementation is optimized for...

Downloads: 0 This Week

Last Update: 2026-01-12
See Project
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
10

PyTorch3D

PyTorch3D is FAIR's library of reusable components for deep learning

PyTorch3D is a comprehensive library for 3D deep learning that brings differentiable rendering, geometric operations, and 3D data structures into the PyTorch ecosystem. It’s designed to make it easy to build and train neural networks that work directly with 3D data such as meshes, point clouds, and implicit surfaces. The library provides fast GPU-accelerated implementations of rendering pipelines, transformations, rasterization, and lighting—making it possible to compute gradients through...

Downloads: 0 This Week

Last Update: 2025-11-27
See Project
11

AutoViz

Automatically Visualize any dataset, any size

...The system also includes built-in tools for evaluating data quality and identifying potential issues such as missing values or unusual distributions. By automating the visualization process, AutoViz allows users to rapidly explore datasets before applying machine learning models or statistical analysis.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
12

mosaicml composer

Supercharge Your Model Training

composer is a deep learning training framework built on PyTorch and designed to make large-scale model training more efficient, scalable, and customizable. At the center of the project is a highly optimized Trainer abstraction that simplifies the management of training loops, parallelization, metrics, logging, and data loading. The framework is intended for modern workloads that may span anything from a single GPU to very large distributed training environments, which makes it suitable for...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
13

how-to-optim-algorithm-in-cuda

How to optimize some algorithm in cuda

how-to-optim-algorithm-in-cuda is an open educational repository focused on teaching developers how to optimize algorithms for high-performance execution on GPUs using CUDA. The project combines technical notes, code examples, and practical experiments that demonstrate how common computational kernels can be optimized to improve speed and memory efficiency. Instead of presenting only theoretical explanations, the repository includes hand-written CUDA implementations of fundamental operations...

Downloads: 6 This Week

Last Update: 2026-05-22
See Project
14

LiteMultiAgent

The Library for LLM-based multi-agent applications

LiteMultiAgent is a lightweight and extensible multi-agent reinforcement learning (MARL) platform designed for rapid experimentation. It allows researchers to design and test coordination, competition, and collaboration scenarios in simulated environments.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
15

mlforecast

Scalable machine learning for time series forecasting

mlforecast is a time-series forecasting framework built around machine-learning models, designed to make forecasting both efficient and scalable. It lets you apply any regressor that follows the typical scikit-learn API, for example, gradient-boosted trees or linear models, to time-series data by automating much of the messy feature engineering and data preparation. Instead of writing custom code to build lagged features, rolling statistics, and date-based predictors, mlforecast generates...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
16

TurboQuant+

Implementation of TurboQuant (ICLR 2026)

TurboQuant Plus is an extended and enhanced version of quantization tooling aimed at improving neural network efficiency through advanced compression and optimization strategies. It builds upon the concept of reducing model precision to accelerate inference while attempting to maintain or recover accuracy through refined techniques. The project explores additional enhancements such as improved calibration, adaptive quantization, and potentially hybrid precision approaches that combine...

Downloads: 2 This Week

Last Update: 2026-05-04
See Project
17

Koila

Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code

Koila is a lightweight Python library designed to help developers avoid memory errors when training deep learning models with PyTorch. The library introduces a lazy evaluation mechanism that delays computation until it is actually required, allowing the framework to better estimate the memory requirements of a model before execution. By building a computational graph first and executing operations only when necessary, koila reduces the risk of running out of GPU memory during the forward pass of neural network training. ...

Downloads: 0 This Week

Last Update: 4 days ago
See Project
18

Academic Research Skills for Claude Code

Academic Research Skills for Claude Code

Academic Research Skills is a structured learning repository aimed at improving users’ ability to conduct rigorous academic research, particularly in technical and scientific domains. It compiles methodologies, frameworks, and best practices for literature review, critical analysis, and research writing. The project is designed as a self-guided resource, helping learners understand how to evaluate sources, synthesize information, and develop strong arguments. It likely integrates examples,...

Downloads: 4 This Week

Last Update: 2026-05-18
See Project
19

Finance

150+ quantitative finance Python programs

Finance is a repository that compiles structured notes and educational material related to financial analysis, markets, and quantitative finance concepts. The project focuses on explaining key principles used in finance and investment analysis, including topics such as financial statements, valuation models, portfolio theory, and financial markets. The repository is designed as a study reference for students and professionals who want to understand financial systems and the analytical...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
20

Reflexion

Reflexion: Language Agents with Verbal Reinforcement Learning

Reflexion is a research-oriented AI framework that focuses on improving the reasoning and problem-solving capabilities of language model agents through iterative self-reflection and feedback loops. Instead of relying solely on a single-pass response, Reflexion enables agents to evaluate their own outputs, identify errors, and refine their reasoning over multiple iterations, leading to more accurate and reliable results. The framework introduces a mechanism where agents maintain a memory of...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
21

Google Research: Language

Shared repository for open-sourced projects from the Google AI Lang

Google Research: Language is a shared repository maintained by Google Research that contains open-source projects developed by the Google AI Language team. The repository hosts multiple subprojects related to natural language processing, machine learning, and large-scale language understanding systems. Many of the projects included in the repository correspond to research papers released by Google researchers and provide implementations of new NLP algorithms or experimental frameworks. These...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
22

Quantitative Trading System

A comprehensive quantitative trading system with AI-powered analysis

Quantitative Trading System is a comprehensive quantitative trading platform that integrates artificial intelligence, financial data analysis, and automated strategy execution within a unified software system. The project is designed to provide an end-to-end infrastructure for building and operating algorithmic trading strategies in financial markets. It includes tools for collecting and processing market data from multiple sources, performing statistical and machine learning analysis, and...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
23

LLMs-Zero-to-Hero

From nobody to big model (LLM) hero

LLMs-Zero-to-Hero is an open-source educational project designed to guide learners through the complete process of understanding and building large language models from the ground up. The repository presents a structured learning pathway that begins with fundamental concepts in machine learning and progresses toward advanced topics such as model pre-training, fine-tuning, and deployment. Rather than relying entirely on existing frameworks, the project encourages readers to implement...

Downloads: 0 This Week

Last Update: 2026-05-04
See Project
24

AI Engineering Academy

Mastering Applied AI, One Concept at a Time

AI-Engineering.academy is a community-driven educational repository that organizes practical knowledge and learning paths for applied AI engineering. The project aims to make complex AI concepts accessible by structuring them into progressive learning modules covering topics such as prompt engineering, retrieval-augmented generation, LLM deployment, and AI agents. Rather than focusing purely on theoretical explanations, the repository emphasizes hands-on understanding of how modern AI...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
25

SwanLab

An open-source, modern-design AI training tracking and visualization

SwanLab is an open-source experiment tracking and visualization platform designed to help machine learning engineers monitor, compare, and analyze the training of artificial intelligence models. The tool records training metrics, hyperparameters, model outputs, and experiment configurations so that developers can easily understand how different experiments perform over time. It provides a modern user interface for visualizing results, enabling teams to compare runs, track model performance...

Downloads: 0 This Week

Last Update: 2026-05-18
See Project

Previous
2
3
4
5
You're on page 6
7
8
9
10
Next

Related Searches

computer

Related Categories

Artificial Intelligence

Scientific/Engineering

Software Development

Multimedia

Business

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise