Showing 428 open source projects for "task"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    OpenClaw-RL

    OpenClaw-RL

    Train any agents simply by 'talking'

    ...One of its key ideas is allowing users to train an AI agent simply by interacting with it conversationally, using natural language feedback to guide the learning process. The system incorporates reinforcement learning techniques to refine the agent’s policies for tool use, decision making, and task completion over time. It also explores approaches such as online policy distillation and hindsight feedback signals to strengthen training signals from real interactions. The framework operates asynchronously and does not require external API keys, making it easier to experiment with local agent training workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    AgentEvolver

    AgentEvolver

    Towards Efficient Self-Evolving Agent System

    AgentEvolver is an open-source research framework for building self-evolving AI agents powered by large language models. The system focuses on improving the efficiency and scalability of training autonomous agents by allowing them to generate tasks, explore environments, and refine strategies without heavy reliance on manually curated datasets. Its architecture combines reinforcement learning with LLM-driven reasoning mechanisms to guide exploration and learning. The framework introduces...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    LISA

    LISA

    LISA: Reasoning Segmentation via Large Language Model

    ...The model integrates multimodal capabilities by combining language understanding with visual perception so that text instructions guide the segmentation process. Researchers created a specialized task called reasoning segmentation, where the model must generate a mask for regions described in natural language instructions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Agents 2.0

    Agents 2.0

    An Open-source Framework for Data-centric Language Agents

    ...In this framework, each node in the pipeline represents a step in the reasoning or action process, while prompts and tools act as adjustable parameters analogous to neural network weights. During training, the system performs a forward execution where the agent completes a task and records the trajectory of prompts, outputs, and tool usage. A prompt-based loss function is then applied to evaluate the quality of the outcome, generating language-based gradients that guide improvements to the agent pipeline.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    Ling-V2

    Ling-V2

    Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI

    ...Trained on more than 20 trillion tokens of high-quality data and enhanced through multi-stage supervised fine-tuning and reinforcement learning, Ling-V2’s models demonstrate strong general reasoning, mathematical problem-solving, coding understanding, and knowledge-intensive task performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    ralph-loop-agent

    ralph-loop-agent

    Continuous Autonomy for the AI SDK

    ralph-loop-agent is an experimental autonomous agent framework from Vercel Labs that brings continuous autonomy to the AI SDK, enabling AI solutions to perform long-running, iterative tasks without manual stop/start intervention. Rather than simply answering a single request and stopping, Ralph Loop implements a loop control architecture that allows an agent to repeatedly evaluate its progress, adjust its approach, and continue working toward a defined completion criteria until tasks are...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ticket

    ticket

    Fast, powerful, git-native ticket tracking in a single bash script

    ...It stores each ticket as a Markdown file with YAML frontmatter, making them human-readable and easy to version control alongside your code, while also allowing IDEs to jump straight to ticket definitions. The CLI provides common subcommands to create, list, edit, close, and manage dependencies between tickets, enabling clear hierarchical task structures and visual dependency trees. Its design is rooted in the Unix philosophy of simplicity, composability, and transparency, meaning it integrates well with other standard tools like grep, jq, and ripgrep when installed. Teams can use ticket to track bugs, features, chores, and epics with priority levels and tags, all by staying within the terminal and Git ecosystem.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Lingvo

    Lingvo

    Framework for building neural networks

    ...The framework provides a structured way to define models, input pipelines, and training configurations using a common interface for layers, which encourages reuse across different tasks. It has been used to implement state of the art architectures such as recurrent neural networks, Transformer models, variational autoencoder hybrids, and multi task systems. Lingvo includes reference models and configurations for domains like machine translation, automatic speech recognition, language modeling, image understanding, and 3D object detection. Centralized hyperparameter configuration files allow researchers to share exact experiment setups so others can retrain and compare results reliably.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    NVIDIA Isaac GR00T

    NVIDIA Isaac GR00T

    NVIDIA Isaac GR00T N1.5 is the world's first open foundation model

    NVIDIA Isaac‑GR00T N1.5 is an open-source foundation model engineered for generalized humanoid robot reasoning and manipulation skills. It accepts multimodal inputs—such as language and images—and uses a diffusion transformer architecture built upon vision-language encoders, enabling adaptive robot behaviors across diverse environments. It is designed to be customizable via post-training with real or synthetic data. The vision-language model remains frozen during both pretraining and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 10
    Monoio

    Monoio

    Rust async runtime based on io-uring

    ...Its design philosophy centers on a “thread-per-core” model where each core runs its own event loop, minimizing cross-thread synchronization needs, avoiding the overhead and complexity of task scheduling, and letting developers write efficient, low-overhead asynchronous networking or I/O code. Because tasks do not need to be Send or Sync and can make use of thread-local data safely, Monoio simplifies certain concurrency paradigms while delivering performance benefits for workloads like high-throughput network servers, proxies, or real-time services. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Orpheus TTS

    Orpheus TTS

    Towards Human-Sounding Speech

    Orpheus TTS is a state-of-the-art open-source text-to-speech system built on a Llama-3B backbone, treating speech synthesis as a large language model problem instead of a traditional TTS pipeline. It is designed to produce human-like speech with natural intonation, emotion, and rhythm, targeting quality comparable to or better than many closed-source systems. The project ships both pretrained and finetuned English models, as well as a family of multilingual models released as a research...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    ERNIE

    ERNIE

    The official repository for ERNIE 4.5 and ERNIEKit

    ERNIE is an open-source large-model toolkit and model family from the PaddlePaddle ecosystem that focuses on training, fine-tuning, compression, and practical application of ERNIE large language models. The repository positions ERNIEKit as an industrial-grade development toolkit, emphasizing end-to-end workflows that span high-performance pre-training, supervised fine-tuning, and alignment. It supports both full-parameter training and parameter-efficient approaches so teams can choose...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Zypher Agent

    Zypher Agent

    A minimal yet powerful framework for creating AI agents

    Zypher Agent is an open-source framework for building full-featured AI agents that can be embedded directly into applications, enabling reactive decision loops where the agent dynamically chooses its next actions. Unlike workflow-style orchestrators, it uses a reactive agent loop that interprets the task, reasons about next steps via LLMs, and integrates directly with extensible tools and external services. Zypher prioritizes native support for multiple model providers such as OpenAI and Anthropic Claude, while also offering a rich set of tools for file system operations, search, and terminal execution. It includes mechanisms like checkpointing to version agent decision states, concurrency protections, error handling, and operational interceptors to customize behavior after each reasoning step. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MAI-UI

    MAI-UI

    Real-World Centric Foundation GUI Agents

    ...Unlike traditional UI frameworks, MAI-UI emphasizes realistic deployment by supporting agent–user interaction (clarifying ambiguous instructions), integration with external tool APIs using MCP calls, and a device–cloud collaboration mechanism that dynamically routes computation to on-device or cloud models based on task state and privacy constraints.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    GenAI Agents

    GenAI Agents

    Implementations for various Generative AI Agent techniques

    ...The repo is structured by categories like beginner agents, framework tutorials, educational agents, business agents, creative agents, analysis agents, news bots, shopping assistants, task management agents, QA bots, and advanced systems such as controllable RAG agents. For each agent, you typically get an overview, implementation notes, and external resources (blog posts, videos, documentation) to deepen understanding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Map-Anything

    Map-Anything

    MapAnything: Universal Feed-Forward Metric 3D Reconstruction

    Map-Anything is a universal, feed-forward transformer for metric 3D reconstruction that predicts a scene’s geometry and camera parameters directly from visual inputs. Instead of stitching together many task-specific models, it uses a single architecture that supports a wide range of 3D tasks—multi-image structure-from-motion, multi-view stereo, monocular metric depth, registration, depth completion, and more. The model flexibly accepts different input combinations (images, intrinsics, poses, sparse or dense depth) and produces a rich set of outputs including per-pixel 3D points, camera intrinsics, camera poses, ray directions, confidence maps, and validity masks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Pytorch-toolbelt

    Pytorch-toolbelt

    PyTorch extensions for fast R&D prototyping and Kaggle farming

    ...By design, both encoder and decoder produces a list of tensors, from fine (high-resolution, indexed 0) to coarse (low-resolution) feature maps. Access to all intermediate feature maps is beneficial if you want to apply deep supervision losses on them or encoder-decoder of object detection task.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    PaSa

    PaSa

    An advanced paper search agent powered by large language models

    ...Instead of simply translating a query into keywords and returning a flat list of matching papers, PaSa uses a dual-agent architecture (Crawler + Selector) that can iteratively search, read, analyze, and filter academic publications — simulating how a researcher might dig through citation networks, expand references, and evaluate relevance based on both metadata and content. Given a complex scholarly question (for example, “Which works focus on non-stationary reinforcement learning with UCB-based value methods?”), PaSa decomposes the task: the Crawler generates search queries, retrieves candidate papers (via search tools and citation expansion), then adds them to a “paper queue.” The Selector then reads abstracts or full text (depending on what’s available) and decides which papers are relevant.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    BotSharp

    BotSharp

    AI Multi-Agent Framework in .NET

    Conversation as a platform (CaaP) is the future, so it's perfect that we're already offering the whole toolkits to our .NET developers using the BotSharp AI BOT Platform Builder to build a CaaP. It opens up as much learning power as possible for your own robots and precisely control every step of the AI processing pipeline. BotSharp is an open source machine learning framework for AI Bot platform builder. This project involves natural language understanding, computer vision and audio...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    gpt-engineer

    gpt-engineer

    Full stack AI software engineer

    gpt-engineer is an open-source platform designed to help developers automate the software development process using natural language. The platform allows users to specify software requirements in plain language, and the AI generates and executes the corresponding code. It can also handle improvements and iterative development, giving users more control over the software they’re building. Built with a terminal-based interface, gpt-engineer is customizable, enabling developers to experiment...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    CogVLM

    CogVLM

    A state-of-the-art open visual language model

    ...The repo provides multiple ways to run models (CLI, web demo, and OpenAI-Vision–style APIs), along with quantization options that reduce VRAM needs (e.g., 4-bit). It includes checkpoints for chat, base, and grounding variants, plus recipes for model-parallel inference and LoRA fine-tuning. The documentation covers task prompts for general dialogue, visual grounding (box→caption, caption→box, caption+boxes), and GUI agent workflows that produce structured actions with bounding boxes.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 22
    OpenPipe

    OpenPipe

    Turn expensive prompts into cheap fine-tuned models

    OpenPipe is an open-source platform focused on improving the efficiency and performance of AI systems by transforming expensive prompt-based workflows into optimized, fine-tuned models and reinforcement-trained agents. It provides tools for training language models and agents using real-world feedback, enabling systems to learn from interactions and improve over time rather than relying solely on static prompts. One of its core components, the Agent Reinforcement Trainer, allows developers...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    OWL

    OWL

    Optimized Workforce Learning for General Multi-Agent Assistance

    ...It is built on the CAMEL-AI Framework and stands as a leader in open-source solutions for task automation.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Mentat

    Mentat

    Mentat - The AI Coding Assistant

    Mentat is the AI tool that assists you with any coding task, right from your command line. Unlike Copilot, Mentat coordinates edits across multiple locations and files. And unlike ChatGPT, Mentat already has the context of your project, no copy and pasting is required. Run Mentat from within your project directory. Mentat uses Git, so if your project doesn't already have Git set up, run git init.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Qwen-Audio

    Qwen-Audio

    Chat & pretrained large audio language model proposed by Alibaba Cloud

    ...There is also an instruction-tuned version called Qwen-Audio-Chat which supports conversational interaction (multi-round), audio + text input, creative tasks and reasoning over audio. It uses multi-task training over many different audio tasks (30+), and achieves strong multi-benchmarks performance without task-specific fine‐tuning. It includes features such as flexible multi-run chat, audio understanding/reasoning, music appreciation, and also tool usage (e.g. voice editing).
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo