Search Results for "artificial intelligence" - Page 31

Showing 2882 open source projects for "artificial intelligence"

1

LuxTTS

A high-quality rapid TTS voice cloning model

LuxTTS is an open-source text-to-speech (TTS) system focused on delivering high-quality, rapid voice synthesis and voice cloning that runs extremely fast and efficiently on consumer hardware. It implements a lightweight architecture based on ZipVoice and optimized sampling techniques so that it can generate speech at speeds up to roughly 150 times real-time on a single GPU and faster than real-time on CPU, all while producing audio at high fidelity with 48 kHz quality. The project supports...

Downloads: 10 This Week

Last Update: 2026-06-05
See Project
2

Inspect Petri

An alignment auditing agent capable of exploring alignment hypotheses

Inspect Petri is an open-source alignment auditing agent that lets researchers rapidly test concrete safety hypotheses against target models using realistic, multi-turn scenarios. Instead of building bespoke evals, Inspect Petri automatically generates audit environments from seed “special instructions,” orchestrates an auditor model to probe a target model, and simulates tool use and rollbacks to surface risky behaviors. Each interaction transcript is then scored by a judge model using a...

Downloads: 2 This Week

Last Update: 2026-04-25
See Project
3

HunyuanImage-3.0

A Powerful Native Multimodal Model for Image Generation

HunyuanImage-3.0 is a powerful, native multimodal text-to-image generation model released by Tencent’s Hunyuan team. It unifies multimodal understanding and generation in a single autoregressive framework, combining text and image modalities seamlessly rather than relying on separate image-only diffusion components. It uses a Mixture-of-Experts (MoE) architecture with many expert subnetworks to scale efficiently, deploying only a subset of experts per token, which allows large parameter...

1 Review

Downloads: 2 This Week

Last Update: 2026-02-03
See Project
4

Qwen3 Embedding

Designed for text embedding and ranking tasks

Qwen3-Embedding is a model series from the Qwen family designed specifically for text embedding and ranking tasks. It builds upon the Qwen3 base/dense models and offers several sizes (0.6B, 4B, 8B parameters), for both embedding and reranking, with high multilingual capability, long-context understanding, and reasoning. It achieves state-of-the-art performance on benchmarks like MTEB (Massive Text Embedding Benchmark) and supports instruction-aware embedding (i.e. embedding task...

Downloads: 2 This Week

Last Update: 2025-09-30
See Project
5

img2dataset

Easily turn large sets of image URLs into an image dataset

img2dataset can download, resize, and package 100M URLs in 20 hours on one machine. It also supports saving captions for URL+caption datasets. Opt-out directives: websites can send the HTTP headers X-Robots-Tag: noai, X-Robots-Tag: noindex, X-Robots-Tag: noimageai, and X-Robots-Tag: noimageindex. By default, img2dataset ignores images served with such headers.
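The opt-out check described above can be illustrated with a short sketch. This is not img2dataset's own code; the function name `is_opted_out` and the simple comma-splitting of the header value are assumptions made for illustration, and user-agent-scoped directives are not handled.

```python
from typing import Mapping

# Directives named in the project description; treated here as opt-outs.
DISALLOWED = frozenset({"noai", "noindex", "noimageai", "noimageindex"})


def is_opted_out(headers: Mapping[str, str],
                 disallowed: frozenset = DISALLOWED) -> bool:
    """Return True if an X-Robots-Tag header carries a disallowed directive.

    Hypothetical helper: header names are matched case-insensitively and
    the value is split on commas into individual directives.
    """
    for name, value in headers.items():
        if name.lower() != "x-robots-tag":
            continue
        directives = {part.strip().lower() for part in value.split(",")}
        if directives & disallowed:
            return True
    return False
```

A downloader following this idea would call `is_opted_out(response_headers)` after fetching each URL and skip the image when it returns True.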

Downloads: 4 This Week

Last Update: 2025-08-09
See Project
6

CLI-Anything

Making ALL Software Agent-Native

CLI-Anything is a framework designed to transform traditional software applications into agent-native command-line interfaces that can be directly controlled by AI systems. It is built on the idea that the command-line interface is the most universal, structured, and composable interface for both humans and AI agents, enabling deterministic and predictable execution of workflows. The system provides a methodology and tooling for generating CLI wrappers around existing applications, allowing...

Downloads: 3 This Week

Last Update: 2026-04-24
See Project
7

OpenPlanter

Language-model investigation agent with a terminal UI

OpenPlanter is an open-source Python project focused on building an intelligent automated planting or gardening system powered by software control and data processing. The repository is designed to help developers and hobbyists create programmable plant management workflows that can monitor, schedule, and optimize growing conditions. It emphasizes automation and extensibility, allowing integration with sensors, environmental data, and control logic for smart cultivation setups. The system is...

Downloads: 3 This Week

Last Update: 2026-03-06
See Project
8

Hamilton DAGWorks

Helps scientists define testable, modular, self-documenting dataflow

Hamilton is a lightweight Python library for directed acyclic graphs (DAGs) of data transformations. Your DAG is portable; it runs anywhere Python runs, whether it's a script, notebook, Airflow pipeline, FastAPI server, etc. Your DAG is expressive; Hamilton has extensive features to define and modify the execution of a DAG (e.g., data validation, experiment tracking, remote execution). To create a DAG, write regular Python functions that specify their dependencies with their parameters. As...
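The idea of functions declaring their dependencies through parameter names can be sketched in plain Python. This does not use Hamilton's API; the `resolve` helper and the three example functions are hypothetical and only illustrate how parameter names can define the edges of a DAG.

```python
import inspect


def raw_value() -> int:
    # A node with no parameters, so it has no upstream dependencies.
    return 4


def doubled(raw_value: int) -> int:
    # The parameter name "raw_value" marks a dependency on raw_value().
    return raw_value * 2


def total(raw_value: int, doubled: int) -> int:
    # Depends on both upstream nodes.
    return raw_value + doubled


def resolve(name, functions, cache=None):
    """Compute node `name` by recursively resolving its parameters.

    Results are cached so each node runs once per top-level call.
    """
    cache = {} if cache is None else cache
    if name not in cache:
        func = functions[name]
        params = inspect.signature(func).parameters
        kwargs = {p: resolve(p, functions, cache) for p in params}
        cache[name] = func(**kwargs)
    return cache[name]


# Register the nodes by function name.
FUNCTIONS = {f.__name__: f for f in (raw_value, doubled, total)}
result = resolve("total", FUNCTIONS)
```

Requesting `total` pulls in `raw_value` and `doubled` automatically because their names appear as parameters, which is the property the description attributes to Hamilton-style dataflows.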

Downloads: 3 This Week

Last Update: 2026-04-04
See Project
9

bbox-visualizer

Make drawing and labeling bounding boxes easy as cake

This package helps users draw bounding boxes around objects, without doing the clumsy math that you'd need to do for positioning the labels. It also has a few different types of visualizations you can use for labeling objects after identifying them. There are optional functions that can draw multiple bounding boxes and/or write multiple labels on the same image, but it is advisable to use the above functions in a loop in order to have...

Downloads: 3 This Week

Last Update: 2026-01-29
See Project
10

x-transformers

A simple but complete full-attention transformer

A simple but complete full-attention transformer with a set of promising experimental features from various papers. One paper proposes adding learned memory key/values prior to attending; its authors were able to remove feedforwards altogether and attain performance similar to the original transformers. I have found that keeping the feedforwards and adding the memory key/values leads to even better performance. Another proposes adding learned tokens, akin to CLS tokens, named memory tokens, that are passed through...

Downloads: 3 This Week

Last Update: 2026-02-12
See Project
11

Sinas

Open-source platform for building AI agents and serverless automation

Sinas is an open-source platform for building AI agents and serverless automation with fine-grained access control. It provides a self-hosted backend where developers can configure agents, connect LLM providers, write Python functions, and trigger workflows through webhooks or schedules. The platform supports isolated container execution for functions, which helps separate automation logic from the rest of the system. It also includes reusable skills, state stores, document collections,...

Downloads: 2 This Week

Last Update: 4 days ago
See Project
12

Pika Skills

A collection of open-source skills for AI coding agents

Pika Skills is an open-source framework designed to extend the capabilities of AI coding agents by introducing modular, reusable “skills” that can be dynamically invoked during development workflows. Each skill acts as a self-contained unit composed of structured instructions, executable scripts, and dependency definitions, enabling agents to autonomously perform complex tasks without requiring manual configuration or orchestration. The system is tightly integrated with the Pika Developer...

Downloads: 0 This Week

Last Update: 2026-04-24
See Project
13

TabPFN

Foundation Model for Tabular Data

TabPFN is an open-source machine learning system that introduces a foundation model designed specifically for tabular data analysis. The model is based on transformer architectures and implements a prior-data fitted network that can perform supervised learning tasks such as classification and regression with minimal configuration. Unlike many traditional machine learning workflows that require extensive hyperparameter tuning and training cycles, TabPFN is pre-trained to perform inference...

Downloads: 2 This Week

Last Update: 3 days ago
See Project
14

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

GLM-4 is a family of open models from ZhipuAI that spans base, chat, and reasoning variants at both 32B and 9B scales, with long-context support and practical local-deployment options. The GLM-4-32B-0414 models are trained on ~15T high-quality data (including substantial synthetic reasoning data), then post-trained with preference alignment, rejection sampling, and reinforcement learning to improve instruction following, coding, function calling, and agent-style behaviors. The...

Downloads: 8 This Week

Last Update: 7 days ago
See Project
15

fastdup

An unsupervised and free tool for image and video dataset analysis

fastdup is a free tool designed to rapidly extract insights from your image and video datasets. It helps you improve the quality of your dataset's images and labels and reduce your data operations costs at scale.

Downloads: 2 This Week

Last Update: 2024-08-16
See Project
16

Rogue

AI Agent Evaluator & Red Team Platform

Rogue is an open-source evaluation and red-team framework designed to test the reliability, safety, and policy compliance of AI agents. The platform automatically interacts with an AI agent by generating dynamic scenarios and multi-turn conversations that simulate real-world interactions. Instead of relying solely on static test scripts, Rogue uses an agent-as-a-judge architecture where one agent probes another agent to detect failures or unexpected behaviors. The system allows developers to...

Downloads: 9 This Week

Last Update: 2026-04-29
See Project
17

Qwen-2.5-VL

Qwen2.5-VL is the multimodal large language model series

Qwen2.5 is a series of large language models developed by the Qwen team at Alibaba Cloud, designed to enhance natural language understanding and generation across multiple languages. The models are available in various sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B parameters, catering to diverse computational requirements. Trained on a comprehensive dataset of up to 18 trillion tokens, Qwen2.5 models exhibit significant improvements in instruction following, long-text generation...

Downloads: 11 This Week

Last Update: 2026-01-30
See Project
18

InvokeAI

InvokeAI is a leading creative engine for Stable Diffusion models

InvokeAI is an implementation of Stable Diffusion, the open source text-to-image and image-to-image generator. It provides a streamlined process with various new features and options to aid the image generation process. It runs on Windows, Mac and Linux machines, and runs on GPU cards with as little as 4 GB of RAM. InvokeAI is a leading creative engine built to empower professionals and enthusiasts alike. Generate and create stunning visual media using the latest AI-driven technologies....

1 Review

Downloads: 11 This Week

Last Update: 2026-05-27
See Project
19

GenMedia Creative Studio

AI generative media user experience highlighting use of APIs

GenMedia Creative Studio is a Google Cloud reference application for experimenting with generative media workflows on Vertex AI. It provides a user experience for working with models and APIs such as Gemini, Veo, Imagen, Gemini Image, Gemini TTS, Chirp 3, and Lyria. The project is built to showcase multimodal creation across text, image, video, speech, and music from one deployable interface. It is useful for creators, marketers, developers, and technical teams that want to prototype...

Downloads: 5 This Week

Last Update: 2026-06-02
See Project
20

macai

All-in-one native macOS AI chat application

macai is a native macOS AI chat application that consolidates access to multiple AI providers into a single, polished desktop interface. It is built specifically for macOS using Swift and SwiftUI, delivering a lightweight and responsive experience that integrates seamlessly with the operating system. The app supports a wide range of providers, including OpenAI, Anthropic, Google Gemini, xAI, Perplexity, and Ollama, allowing users to switch between local and cloud-based models without...

Downloads: 5 This Week

Last Update: 2026-04-20
See Project
21

Ollama RAG Chatbot

Chat with multiple PDFs locally

Ollama RAG Chatbot is a local-first retrieval chatbot project built to let users chat with the contents of multiple PDF documents through a simple interface. The project is framed as an experiment, but its setup and packaging make it approachable for practical local use as well. It supports running on a local machine or in Kaggle, which lowers the barrier for users who want to test RAG workflows without building everything from scratch. Model support is flexible, with compatibility for both...

Downloads: 5 This Week

Last Update: 2026-04-20
See Project
22

Cosmos-RL

Cosmos-RL is a flexible and scalable Reinforcement Learning framework

Cosmos-RL is a scalable reinforcement learning framework designed specifically for physical AI systems such as robotics, autonomous agents, and multimodal models. It provides a distributed training architecture that separates policy learning and environment rollout processes, enabling efficient and asynchronous reinforcement learning at scale. The framework supports multiple parallelism strategies, including tensor, pipeline, and data parallelism, allowing it to leverage large GPU clusters...

Downloads: 5 This Week

Last Update: 2026-04-14
See Project
23

Parallax

Parallax is a distributed model serving framework

Parallax is a decentralized inference framework designed to run large language models across distributed computing resources. Instead of relying on centralized GPU clusters in data centers, the system allows multiple heterogeneous machines to collaborate in serving AI inference workloads. Parallax divides model layers across different nodes and dynamically coordinates them to form a complete inference pipeline. A two-stage scheduling architecture determines how model layers are allocated to...

Downloads: 5 This Week

Last Update: 2026-03-09
See Project
24

oterm

the terminal client for Ollama

Oterm is an open-source terminal client designed to provide a lightweight command-line interface for interacting with large language models through the Ollama ecosystem. The tool allows users to chat with local AI models directly from the terminal without needing a graphical interface or web application. Its interface is designed to be simple and intuitive, enabling developers to launch conversations quickly using a single command. Oterm supports persistent chat sessions that store...

Downloads: 5 This Week

Last Update: 6 days ago
See Project
25

Claude Cognitive

Persistent context and multi-instance coordination

Claude Cognitive is an advanced memory and context-management extension designed to address the stateless limitations of Claude Code by giving the model a form of persistent “working memory” and multi-instance coordination. It introduces an attention-based context router that prioritizes files and content relevant to the current development discussion — tagging them as HOT, WARM, or COLD based on recency and keyword activation — so Claude Code doesn’t waste token budget rereading irrelevant...

Downloads: 5 This Week

Last Update: 2026-01-28
See Project