composition free download

Showing 28 open source projects for "composition"

View related business solutions

Artificial Intelligence Python Clear Filters & Widen Search

Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
1

Adapters

A Unified Library for Parameter-Efficient Learning

...Adapters provide a unified interface for efficient fine-tuning and modular transfer learning, supporting a myriad of features like full-precision or quantized training (e.g. Q-LoRA, Q-Bottleneck Adapters, or Q-PrefixTuning), adapter merging via task arithmetics or the composition of multiple adapters via composition blocks, allowing advanced research in parameter-efficient transfer learning for NLP tasks.

Downloads: 0 This Week

Last Update: 2026-04-26
See Project
2

Ideogram 4

Open image model at the forefront of design

Ideogram 4 is an open-weight text-to-image model focused on high-quality visual generation, design control, and accurate text rendering inside images. It is built for users who need more than generic image generation, especially when layout, typography, composition, color, and language understanding matter. The project introduces a structured JSON prompting workflow that gives creators more explicit control over scene details and visual constraints. It can also accept plain-text prompts, making it accessible to users who prefer a simpler generation style. Ideogram 4 is especially useful for design-heavy outputs such as posters, ads, mockups, branded graphics, and images that include readable text. ...

Downloads: 1 This Week

Last Update: 2026-06-30
See Project
3

Swarms

Enterprise multi-agent orchestration framework for scalable AI apps

...It supports integration with multiple model providers and existing ecosystems, allowing developers to combine different AI tools and frameworks within a unified system. Swarms also includes mechanisms for agent lifecycle management, memory handling, and dynamic composition, making it adaptable to evolving workloads. Additionally, it focuses on developer productivity through APIs, CLI tools, and templates that simplify building and deploying agent-based applications.

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
4

Sygil WebUI

Stable Diffusion web UI

Sygil WebUI is a browser-based interface for running Stable Diffusion image generation locally or on a server, wrapping common text-to-image and image-to-image workflows into a practical UI. It provides multiple UI modes (including a legacy Gradio interface) and focuses on making iterative prompting, parameter tuning, and post-processing accessible without writing code. The UI exposes core generation controls like resolution, CFG guidance, sampling steps, samplers, seeds, and batch...

Downloads: 0 This Week

Last Update: 2026-07-17
See Project
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
5

ImageBind

ImageBind One Embedding Space to Bind Them All

...The model is trained using large-scale contrastive learning, leveraging diverse datasets from natural images, videos, audio clips, and sensor data. Once trained, it can perform cross-modal retrieval, zero-shot classification, and multimodal composition without additional fine-tuning.

Downloads: 0 This Week

Last Update: 2025-11-21
See Project
6

Wan2.2

Wan2.2: Open and Advanced Large-Scale Video Generative Model

...It introduces a Mixture-of-Experts (MoE) architecture that splits the denoising process across specialized expert models, increasing total model capacity without raising computational costs. Wan2.2 integrates meticulously curated cinematic aesthetic data, enabling precise control over lighting, composition, color tone, and more, for high-quality, customizable video styles. The model is trained on significantly larger datasets than its predecessor, greatly enhancing motion complexity, semantic understanding, and aesthetic diversity. Wan2.2 also open-sources a 5-billion parameter high-compression VAE-based hybrid text-image-to-video (TI2V) model that supports 720P video generation at 24fps on consumer-grade GPUs like the RTX 4090. ...

1 Review

Downloads: 116 This Week

Last Update: 2026-03-17
See Project
7

DeepSpeed

Deep learning optimization library: makes distributed training easy

DeepSpeed is an easy-to-use deep learning optimization software suite that enables unprecedented scale and speed for Deep Learning Training and Inference. With DeepSpeed you can: 1. Train/Inference dense or sparse models with billions or trillions of parameters 2. Achieve excellent system throughput and efficiently scale to thousands of GPUs 3. Train/Inference on resource constrained GPU systems 4. Achieve unprecedented low latency and high throughput for inference 5. Achieve extreme...

Downloads: 5 This Week

Last Update: 2026-07-22
See Project
8

HY-World 2.0

A Multi-Modal World Model for Reconstructing, Generating, Simulation

...For text and single-image inputs, it generates high-fidelity 3D Gaussian Splatting scenes through a multi-stage pipeline that includes panorama generation, trajectory planning, world expansion, and world composition. The system also improves reconstruction from multi-view images and video by upgrading its feed-forward 3D prediction components and its memory-aware view generation process. Another major part of the project is WorldLens, a rendering platform designed for interactive exploration with an engine-agnostic architecture, automatic image-based lighting, collision detection, and support for character interaction.

Downloads: 0 This Week

Last Update: 2026-05-27
See Project
9

UNO

A Universal Customization Method for Single and Multi Conditioning

...It suggests a framework for image (or more general generative) modeling where the model can be conditioned either on a single subject or multiple subjects — which may correspond to generating or customizing images featuring specific people, styles, or objects, possibly with fine-grained control over subject identity or composition. Because the project is new (see activity logs for 2025), it seems to aim at bridging between single-subject customization and multi-subject generation in generative modeling — potentially useful for personalized content creation, flexible composition, or controlled generation tasks. UNO likely offers tools to fine-tune or condition generation models so that they can incorporate novel subjects, enabling users to produce custom outputs beyond standard training distribution.

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

AgentScope

Build and run agents you can see, understand and trust

AgentScope is a production-ready agent framework designed to help developers build, deploy, and scale intelligent agentic applications. It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. AgentScope...

Downloads: 5 This Week

Last Update: 2026-07-23
See Project
11

ToolUniverse

Democratizing AI scientists with ToolUniverse

ToolUniverse is a comprehensive open-source ecosystem designed to transform any large language model into an autonomous “AI scientist” capable of performing real scientific research tasks through structured tool interaction. It standardizes how AI systems discover, select, and execute tools by introducing a unified AI-Tool Interaction Protocol that allows models to seamlessly connect with hundreds of scientific resources, including machine learning models, datasets, APIs, and analytical...

Downloads: 2 This Week

Last Update: 2026-07-24
See Project
12

UltraRAG

Less Code, Lower Barrier, Faster Deployment

...The toolkit comes with built-in support for popular RAG datasets, large corpora, and canonical baselines, plus documentation that walks from “quick start” to debugging and case analysis. It encourages pipeline composition via configuration, enabling researchers to swap retrievers, rerankers, and generators without heavy refactoring. Community posts highlight its focus on reducing engineering overhead so more effort goes to experimental design. Backed by the OpenBMB org, it is actively maintained with tutorials and updates.

Downloads: 0 This Week

Last Update: 2026-04-09
See Project
13

YuE

Open source AI model for generating full songs from lyrics prompts

YuE is an open source project that provides a foundation model designed for full-song music generation using artificial intelligence. It focuses on transforming text inputs such as lyrics and genre prompts into complete musical compositions that include both vocal and instrumental tracks. Unlike many shorter audio generators, the model is capable of producing songs that last several minutes while maintaining coherent musical structure and alignment with the provided lyrics. YuE introduces a...

Downloads: 1 This Week

Last Update: 7 hours ago
See Project
14

Agentex

Open source codebase for Scale Agentex

AgentEX is an open framework from Scale for building, running, and evaluating agentic workflows, with an emphasis on reproducibility and measurable outcomes rather than ad-hoc demos. It treats an “agent” as a composition of a policy (the LLM), tools, memory, and an execution runtime so you can test the whole loop, not just prompting. The repo focuses on structured experiments: standardized tasks, canonical tool interfaces, and logs that make it possible to compare models, prompts, and tool sets fairly. It also includes evaluation harnesses that capture success criteria and partial credit, plus traces you can inspect to understand where reasoning or tool use failed. ...

Downloads: 1 This Week

Last Update: 2 days ago
See Project
15

OmAgent

Build multimodal language agents for fast prototype and production

...Instead of forcing developers to implement complex orchestration logic manually, the system manages task scheduling, worker coordination, and node optimization behind the scenes. Its architecture uses a graph-based workflow engine where tasks are represented as nodes in a directed workflow, enabling modular composition of complex reasoning pipelines. The framework also includes support for various reasoning strategies commonly used in language agents, such as chain-of-thought prompting, self-consistency reasoning, and ReAct-style decision loops.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
16

InternLM-XComposer-2.5

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System

...The framework also supports tasks such as image captioning, multimodal reasoning, and layout generation for structured visual documents. By combining language generation with visual composition capabilities, the system enables new forms of content creation that integrate written explanations with automatically generated visual components.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
17

InfiAgent

Build your own Cowork, AI Scientist and other SoTA Agents

infiAgent is an open-source AI agent framework for building powerful, long-running autonomous agents capable of tackling complex tasks without collapsing under growing context or tool invocation histories. Designed as a “Multi-Level Agent” (MLA) system, it externalizes persistent state to the file system so that agents can operate over unlimited runtime without the need for token-intensive context compression, enabling workflows such as research paper drafting, experiments, coding, and...

Downloads: 0 This Week

Last Update: 2026-03-30
See Project
18

Thinc

A refreshing functional take on deep learning

Thinc is a lightweight deep learning library that offers an elegant, type-checked, functional-programming API for composing models, with support for layers defined in other frameworks such as PyTorch, TensorFlow and MXNet. You can use Thinc as an interface layer, a standalone toolkit or a flexible way to develop new models. Previous versions of Thinc have been running quietly in production in thousands of companies, via both spaCy and Prodigy. We wrote the new version to let users compose,...

Downloads: 0 This Week

Last Update: 2026-03-23
See Project
19

DreamO

A Unified Framework for Image Customization

...It also uses a “placeholder strategy” to precisely align conditional inputs (e.g. where to place clothing or objects) in generated images, giving users fine-grained control over composition.

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
20

MuseGAN

An AI for Music Generation

MuseGAN is a deep learning research project designed to generate symbolic music using generative adversarial networks. The system focuses specifically on generating multi-track polyphonic music, meaning that it can simultaneously produce multiple instrument parts such as drums, bass, piano, guitar, and strings. Instead of generating raw audio, the model operates on piano-roll representations of music, which encode notes as time-pitch matrices for each instrument track. This representation...

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
21

Evals

Evals is a framework for evaluating LLMs and LLM systems

...It’s designed to let you define “evals” (evaluation tasks) in a structured way and run them against different models or agents, with the ability to score, compare, and analyze results. The framework supports templated YAML eval definitions, solver-based evaluations, custom metrics, and composition of multi-step evaluations. It includes utilities and APIs to plug in completion functions, manage prompts, wrap retries or error handling, and register new evaluation types. It also maintains a growing registry of standard benchmarks or “evals” that users can reuse (for example, tasks measuring reasoning, factual accuracy, or chain-of-thought capabilities). ...

Downloads: 0 This Week

Last Update: 2025-10-05
See Project
22

Style Aligned

Official code for Style Aligned Image Generation via Shared Attention

...Instead of fully re-generating an image—and risking changes to lighting, texture, or rendering choices—the method aligns internal features across denoising steps so the target edit inherits the source style. This alignment acts like a constraint on the model’s evolution, steering composition, palette, and brushwork even as objects or attributes change. The result is more consistent edits across a set, which is crucial for workflows like product variations, character sheets, or brand-coherent art. The repository provides reproducible scripts, reference prompts, and guidance for tuning strengths so users can dial in subtle retouches or bolder substitutions. ...

Downloads: 0 This Week

Last Update: 2025-10-10
See Project
23

Lightning-Hydra-Template

PyTorch Lightning + Hydra. A very user-friendly template

...Think of it as a framework for organizing your PyTorch code. Hydra, a framework for elegantly configuring complex applications. The key feature is the ability to dynamically create a hierarchical configuration by composition and override it through config files and the command line.

Downloads: 0 This Week

Last Update: 2023-09-01
See Project
24

Nougat

Implementation of Nougat Neural Optical Understanding

Nougat is a multi-modal generative modeling framework that bridges vision and text modalities with structured generation control (e.g. layout, scene composition) rather than treating images as flat contexts. It combines object-centric modules with transformer-based reasoning to propose, refine, and render scenes in a generative pipeline. The architecture allows you to specify or prompt a layout (which objects should be where) and then the model fills in appearance, context, lighting, and relations coherently. ...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
25

Piano transcription

Task of transcribing piano recordings into MIDI files

Piano transcription is an open-source high-resolution piano transcription system by ByteDance that converts raw audio recordings of piano performance into symbolic MIDI files — detecting note onsets, offsets, pitch, velocity, and even pedal usage. The system is implemented in Python (PyTorch) and is capable of accurate transcription of polyphonic piano recordings, even with complex passages and pedal techniques, making it suitable for classical piano music. By using this transcription tool,...

Downloads: 8 This Week

Last Update: 2025-12-02
See Project