artificial intelligence python free download

llama.cpp Python Bindings

Python bindings for llama.cpp

llama-cpp-python provides Python bindings for llama.cpp, enabling the integration of LLaMA (Large Language Model Meta AI) language models into Python applications. This facilitates the use of LLaMA's capabilities in natural language processing tasks within Python environments.

Downloads: 5 This Week

Last Update: 2026-07-12

See Project

Anthropic SDK Python

Provides convenient access to the Anthropic REST API from any Python 3

The anthropic-sdk-python repository is the official Python client library for interacting with the Anthropic (Claude) REST API. It is designed to provide a user-friendly, type-safe, and asynchronous/synchronous capable interface for making chat/completion requests to models like Claude. The library includes definitions for all request and response parameters using Python typed objects, automatically handles serialization and deserialization, and wraps HTTP logic (timeouts, retries, error...

Downloads: 5 This Week

Last Update: 3 days ago

See Project

Claude Code SDK Python

Python SDK for Claude Agent

claude-code-sdk-python is the Python SDK for Claude Code, Anthropic’s agentic coding system. It provides abstractions to easily query Claude Code (with streaming support) and conduct interactive sessions. The SDK includes core client classes, asynchronous query functions, and support for custom tools and hooks within Claude sessions. It is designed to integrate with local Python workflows and allow developers to embed Claude Code capabilities directly in their applications or scripts. The...

Downloads: 1 This Week

Last Update: 7 days ago

See Project

LTX-2.3

Official Python inference and LoRA trainer package

LTX-2.3 is an open-source multimodal artificial intelligence foundation model developed by Lightricks for generating synchronized video and audio from prompts or other inputs. Unlike most earlier video generation systems that only produced silent clips, LTX-2 combines video and audio generation in a unified architecture capable of producing coherent audiovisual scenes. The model uses a diffusion-transformer-based architecture designed to generate high-fidelity visual frames while simultaneously producing corresponding audio elements such as speech, music, ambient sound, or effects. ...

Downloads: 50 This Week

Last Update: 2026-07-08

See Project

FLUX.1

Official inference repo for FLUX.1 models

FLUX.1 repository contains inference code and tooling for the FLUX.1 text-to-image diffusion models, enabling developers and researchers to generate and edit images from natural-language prompts using open-weight versions of the model on their own hardware or within custom applications. The project is part of a larger family of FLUX models developed by Black Forest Labs, designed to produce high-quality, detailed visuals from text descriptions with competitive prompt adherence and artistic...

Downloads: 90 This Week

Last Update: 2026-01-19

See Project

Kitten TTS

State-of-the-art TTS model under 25MB

KittenTTS is an open-source, ultra-lightweight, and high-quality text-to-speech model featuring just 15 million parameters and a binary size under 25 MB. It is designed for real-time CPU-based deployment across diverse platforms. Ultra-lightweight, model size less than 25MB. CPU-optimized, runs without GPU on any device. High-quality voices, several premium voice options available. Fast inference, optimized for real-time speech synthesis.

Downloads: 21 This Week

Last Update: 2026-02-24

See Project

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle

PaddleOCR offers exceptional, multilingual, and practical Optical Character Recognition (OCR) tools that can help users train better models and apply them into practice. Inspired by PaddlePaddle, PaddleOCR is an ultra lightweight OCR system, with multilingual recognition, digit recognition, vertical text recognition, as well as long text recognition. It features a PPOCR series of high-quality pre-trained models, which includes: ultra lightweight ppocr_mobile series models, general...

Downloads: 98 This Week

Last Update: 2026-06-11

See Project

DB-GPT

Revolutionizing Database Interactions with Private LLM Technology

DB-GPT is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure.

Downloads: 0 This Week

Last Update: 2026-06-18

See Project

ACE-Step 1.5

The most powerful local music generation model

ACE-Step 1.5 is an advanced open-source foundation model for AI-driven music generation that pushes beyond traditional limitations in speed, musical coherence, and controllability by innovating in architecture and training design. It integrates cutting-edge generative techniques—such as diffusion-based synthesis combined with compressed autoencoders and lightweight transformer elements—to produce high-quality full-length music tracks with rapid inference times, capable of generating a...

Downloads: 64 This Week

Last Update: 2026-05-18

See Project

Phi-3-MLX

Phi-3.5 for Mac: Locally-run Vision and Language Models

Phi-3-Vision-MLX is an Apple MLX (machine learning on Apple silicon) implementation of Phi-3 Vision, a lightweight multi-modal model designed for vision and language tasks. It focuses on running vision-language AI efficiently on Apple hardware like M1 and M2 chips.

Downloads: 0 This Week

Last Update: 8 hours ago

See Project

FastSD CPU

Fast stable diffusion on CPU and AI PC

FastSD CPU is an optimized fork of Stable Diffusion designed to run efficiently on CPUs and devices without dedicated GPUs by leveraging Latent Consistency Models and Adversarial Diffusion Distillation techniques that accelerate inference. It focuses on bringing fast text-to-image generation to mainstream hardware like desktop CPUs, lower-end laptops, or edge devices without requiring high-end graphics processors. The repository contains multiple interfaces including a desktop GUI for simple...

Downloads: 21 This Week

Last Update: 2026-07-05

See Project

Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models

Qwen3-TTS is an open-source text-to-speech (TTS) project built around the Qwen3 large language model family, focused on generating high-quality, natural-sounding speech from plain text input. It provides researchers and developers with tools to transform text into expressive, intelligible audio, supporting multiple languages and voice characteristics tuned for clarity and fluidity. The project includes pre-trained models and inference scripts that let users synthesize speech locally or...

Downloads: 20 This Week

Last Update: 2026-03-17

See Project

Bonsai 27B

Run Bonsai (1-bit) and Ternary-Bonsai language models locally

Bonsai 27B is a repository for downloading, configuring, and running PrismML’s highly compressed Bonsai language models on local hardware. It supports the 1-bit Bonsai and higher-quality Ternary-Bonsai families in 1.7B, 4B, 8B, and 27B sizes. The models can run on macOS, Linux, and Windows through CPU, Metal, CUDA, Vulkan, ROCm, llama.cpp, or MLX backends. Its 27B models process text, images, screenshots, and PDFs while supporting reasoning and long-context conversations. They also provide...

Downloads: 130 This Week

Last Update: 2 days ago

See Project

DeepSeek-OCR 2

Visual Causal Flow

DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents...

Downloads: 8 This Week

Last Update: 2026-02-03

See Project

LTX-Video

Official repository for LTX-Video

LTX-Video is a sophisticated multimedia processing framework from Lightricks designed to handle high-quality video editing, compositing, and transformation tasks with performance and scalability. It provides runtime components that efficiently decode, encode, and manipulate video streams, frame buffers, and audio tracks while exposing a rich API for building customized editing features like transitions, effects, color grading, and keyframe automation. The toolkit is built with both real-time...

Downloads: 15 This Week

Last Update: 2026-01-11

See Project

ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

ComfyUI-LTXVideo is a bridge between ComfyUI’s node-based generative workflow environment and the LTX-Video multimedia processing framework, enabling creators to orchestrate complex video tasks within a visual graph paradigm. Instead of writing code to apply effects, transitions, edits, and data flows, users can assemble nodes that represent video inputs, transformations, and outputs, letting them prototype and automate video production pipelines visually. This integration empowers...

Downloads: 13 This Week

Last Update: 4 days ago

See Project

Gorden Super PPT Skills

AI PPT Track Terminator, the strongest PPT Skill ever

Gorden Super PPT Skills is an AI presentation skill package for generating high-density visual presentations and converting them into editable PowerPoint files. The workflow is split into three skills that can be used separately or together. One skill generates image-based presentation pages from a topic or content brief. Another skill reconstructs image slides into editable PPTX files by separating backgrounds, layout structures, icons, decorations, and text. The orchestration skill chains...

Downloads: 8 This Week

Last Update: 2026-06-17

See Project

Hunyuan3D-2.1

From Images to High-Fidelity 3D Assets

Hunyuan3D-2.1 is Tencent Hunyuan’s advanced 3D asset generation system that produces high-fidelity 3D models with Physically Based Rendering (PBR) textures. It is fully open-source with released model weights, training, and inference code. It improves on prior versions by using a PBR texture pipeline (enabling realistic material effects like reflections and subsurface scattering) and allowing community fine-tuning and extension. It supports both shape generation (mesh geometry) and texture...

Downloads: 24 This Week

Last Update: 2025-10-17

See Project

LTX-2

Python inference and LoRA trainer package for the LTX-2 audio–video

LTX-2 is a powerful, open-source toolkit developed by Lightricks that provides a modular, high-performance base for building real-time graphics and visual effects applications. It is architected to give developers low-level control over rendering pipelines, GPU resource management, shader orchestration, and cross-platform abstractions so they can craft visually compelling experiences without starting from scratch. Beyond basic rendering scaffolding, LTX-2 includes optimized math libraries,...

Downloads: 14 This Week

Last Update: 2026-07-08

See Project

InstantCharacter

Personalize Any Characters with a Scalable Diffusion Transformer

InstantCharacter is a tuning-free diffusion transformer framework created by Tencent Hunyuan / InstantX team, which enables generating images of a specific character (subject) from a single reference image, preserving identity and character features. Uses adapters, so full fine-tuning of the base model is not required. Demo scripts and pipeline API (via infer_demo.py, pipeline.py) included. It works by adapting a base image generation model with a lightweight adapter so that you can produce...

Downloads: 2 This Week

Last Update: 2025-09-23

See Project

DeepSeek-V3

Powerful AI language model (MoE) optimized for efficiency/performance

DeepSeek-V3 is a robust Mixture-of-Experts (MoE) language model developed by DeepSeek, featuring a total of 671 billion parameters, with 37 billion activated per token. It employs Multi-head Latent Attention (MLA) and the DeepSeekMoE architecture to enhance computational efficiency. The model introduces an auxiliary-loss-free load balancing strategy and a multi-token prediction training objective to boost performance. Trained on 14.8 trillion diverse, high-quality tokens, DeepSeek-V3...

1 Review

Downloads: 48 This Week

Last Update: 2025-07-09

See Project

Wan2.1

Wan2.1: Open and Advanced Large-Scale Video Generative Model

Wan2.1 is a foundational open-source large-scale video generative model developed by the Wan team, providing high-quality video generation from text and images. It employs advanced diffusion-based architectures to produce coherent, temporally consistent videos with realistic motion and visual fidelity. Wan2.1 focuses on efficient video synthesis while maintaining rich semantic and aesthetic detail, enabling applications in content creation, entertainment, and research. The model supports...

1 Review

Downloads: 64 This Week

Last Update: 2026-03-05

See Project

IndexTTS2

Industrial-level controllable zero-shot text-to-speech system

IndexTTS is a modern, zero-shot text-to-speech (TTS) system engineered to deliver high-quality, natural-sounding speech synthesis with few requirements and strong voice-cloning capabilities. It builds on state-of-the-art models such as XTTS and other modern neural TTS backbones, improving them with a conformer-based speech conditional encoder and upgrading the decoder to a high-quality vocoder (BigVGAN2), leading to clearer and more natural audio output. The system supports zero-shot voice...

Downloads: 7 This Week

Last Update: 2025-11-27

See Project

DeepSeek-OCR

Contexts Optical Compression

DeepSeek-OCR is an open-source optical character recognition solution built as part of the broader DeepSeek AI vision-language ecosystem. It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body...

Downloads: 3 This Week

Last Update: 2026-01-27

See Project

Wan2.2

Wan2.2: Open and Advanced Large-Scale Video Generative Model

Wan2.2 is a major upgrade to the Wan series of open and advanced large-scale video generative models, incorporating cutting-edge innovations to boost video generation quality and efficiency. It introduces a Mixture-of-Experts (MoE) architecture that splits the denoising process across specialized expert models, increasing total model capacity without raising computational costs. Wan2.2 integrates meticulously curated cinematic aesthetic data, enabling precise control over lighting,...

1 Review

Downloads: 123 This Week

Last Update: 2026-03-17

See Project

Search Results for "artificial intelligence python"

Showing 279 open source projects for "artificial intelligence python"

llama.cpp Python Bindings

Anthropic SDK Python

Claude Code SDK Python

LTX-2.3

FLUX.1

Kitten TTS

PaddleOCR

DB-GPT

ACE-Step 1.5

Phi-3-MLX

FastSD CPU

Qwen3-TTS

Bonsai 27B

DeepSeek-OCR 2

LTX-Video

ComfyUI-LTXVideo

Gorden Super PPT Skills

Hunyuan3D-2.1

LTX-2

InstantCharacter

DeepSeek-V3

Wan2.1

IndexTTS2

DeepSeek-OCR

Wan2.2

Search Results for "artificial intelligence python"

Showing 279 open source projects for "artificial intelligence python"

llama.cpp Python Bindings

Anthropic SDK Python

Claude Code SDK Python

LTX-2.3

FLUX.1

Kitten TTS

PaddleOCR

DB-GPT

ACE-Step 1.5

Phi-3-MLX

FastSD CPU

Qwen3-TTS

Bonsai 27B

DeepSeek-OCR 2

LTX-Video

ComfyUI-LTXVideo

Gorden Super PPT Skills

Hunyuan3D-2.1

LTX-2

InstantCharacter

DeepSeek-V3

Wan2.1

IndexTTS2

DeepSeek-OCR

Wan2.2

Related Searches

Related Categories