Page 2 | manipulation free download

Poetiq

Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1

...Instead of relying on a single prompt or fixed strategy, their solver dynamically adapts the reasoning path, selecting what to ask or analyze next depending on intermediate results — effectively compositing reasoning, perception, and program synthesis (or symbolic manipulation) in a loop. The repository allows others to reproduce their results, experiment with different LLM backends (e.g. the user may supply keys for supported models), and observe how their adaptive meta-system handles the logic and abstraction challenges.

Downloads: 0 This Week

Last Update: 2025-12-07

See Project

DreamO

A Unified Framework for Image Customization

DreamO is a unified, open-source framework from ByteDance for advanced image customization and generation that consolidates multiple “image manipulation” tasks into a single system, rather than requiring separate specialized models. Built on a diffusion-transformer (DiT) backbone, it supports a diverse set of tasks — including identity preservation, virtual “try-on” (e.g. clothing, accessories), style transfer, IP adaptation (objects/characters), and layout/condition-aware customizations — all handled within the same unified architecture. ...

Downloads: 0 This Week

Last Update: 2025-12-02

See Project

Vidi2

Large Multimodal Models for Video Understanding and Editing

Vidi is a family of large multimodal models developed for deep video understanding and editing tasks, integrating vision, audio, and language to allow sophisticated querying and manipulation of video content. It’s designed to process long-form, real-world videos and answer complex queries such as “when in this clip does X happen?” or “where in the frame is object Y during that moment?” — offering temporal retrieval, spatio-temporal grounding (i.e. locating objects over time + space), and even video question answering. ...

Downloads: 0 This Week

Last Update: 2026-03-04

See Project

Step-Audio-EditX

LLM-based Reinforcement Learning audio edit model

Step-Audio-EditX is an open-source, 3 billion-parameter audio model from StepFun AI designed to make expressive and precise editing of speech and audio as easy as text editing. Rather than treating audio editing as low-level waveform manipulation, this model converts speech into a sequence of discrete “audio tokens” (via a dual-codebook tokenizer) — combining a linguistic token stream and a semantic (prosody/emotion/style) token stream — thereby abstracting audio editing into high-level token operations. This allows users to modify not only what is said (the text) but also how it's said: emotion, tone, speaking style, prosody, accent, even paralinguistic cues. ...

Downloads: 0 This Week

Last Update: 2026-03-16

See Project

MuJoCo MPC

Real-time behaviour synthesis with MuJoCo, using Predictive Control

...The system supports multi-shooting optimization, enabling precise motion planning across diverse domains like quadruped locomotion, humanoid tracking, and dexterous manipulation. In addition to its C++ core, MJPC includes an experimental Python API, enabling integration with custom models and MuJoCo tasks for flexible scripting and experimentation.

Downloads: 0 This Week

Last Update: 2025-10-09

See Project

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

...DragGAN has gained attention for making complex image edits, such as pose changes or shape adjustments, accessible through an intuitive interface. The repository provides code and GUI tooling that allow researchers and advanced users to experiment with this next-generation controllable image manipulation technique.

Downloads: 0 This Week

Last Update: 2026-02-24

See Project

OpenAI Glow

Copy code in "Glow: Generative Flow with Invertible 1x1 Convolutions"

...Unlike models that rely on approximate inference, Glow uses invertible transformations to directly learn the data distribution, allowing for exact likelihood computation and efficient sampling. The model is capable of producing high-quality synthetic images while maintaining interpretable latent spaces that enable meaningful manipulation of generated outputs. Glow’s architecture is based on reversible layers and efficient flow operations, which allow large-scale training while keeping memory usage manageable. The repository provides training code, pretrained models, and scripts for generating samples or reproducing key results from the original research. Glow is primarily intended for researchers and practitioners exploring generative modeling, likelihood-based training, and interpretable deep learning systems.

Downloads: 3 This Week

Last Update: 7 days ago

See Project

ALAE

Adversarial Latent Autoencoders

ALAE (Adversarial Latent Autoencoders) is a deep learning research implementation that combines autoencoders with generative adversarial networks to produce high-quality image synthesis models. The project implements the architecture introduced in the CVPR research paper on Adversarial Latent Autoencoders, which focuses on improving generative modeling by learning latent representations aligned with adversarial training objectives. Unlike traditional GANs that directly generate images from...

Downloads: 0 This Week

Last Update: 2026-03-11

See Project

GIMP ML

AI for GNU Image Manipulation Program

This repository introduces GIMP3-ML, a set of Python plugins for the widely popular GNU Image Manipulation Program (GIMP). It enables the use of recent advances in computer vision to the conventional image editing pipeline. Applications from deep learning such as monocular depth estimation, semantic segmentation, mask generative adversarial networks, image super-resolution, de-noising and coloring have been incorporated with GIMP through Python-based plugins.

Downloads: 13 This Week

Last Update: 2022-08-19

See Project

StarGAN

Official PyTorch Implementation

...Unlike earlier GAN approaches that required separate models for each domain pair, StarGAN enables flexible attribute transfer across multiple domains within one network, significantly improving efficiency and scalability. The repository includes full training and inference pipelines for tasks such as facial attribute manipulation and style transfer. It demonstrates adversarial training strategies, domain classification losses, and generator-discriminator coordination required for stable multi-domain translation. Researchers and practitioners often use the project as a reference when studying conditional GANs and advanced image synthesis techniques. Overall, the repository provides a clean and practical baseline for experimenting with multi-domain generative modeling in PyTorch.

Downloads: 0 This Week

Last Update: 2026-02-19

See Project

TensorFlow Machine Learning Cookbook

Code for Tensorflow Machine Learning Cookbook

...The repository contains numerous Python scripts and Jupyter notebooks that demonstrate how to implement machine learning algorithms and neural networks using the TensorFlow framework. Each section focuses on a different aspect of machine learning development, including tensor manipulation, model training, optimization strategies, and data processing techniques. The examples illustrate how TensorFlow operations and tensors can be used to build machine learning pipelines and perform tasks such as regression, classification, and clustering. By combining theoretical explanations with executable code, the project helps developers understand how TensorFlow algorithms operate internally while also providing working examples that can be adapted for real projects.

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

Neural Photo Editor

A simple interface for editing natural photos

Neural Photo Editor is an experimental machine learning application that demonstrates how generative neural networks can be used as an interactive photo editing tool. The project implements the system described in the research paper Neural Photo Editing with Introspective Adversarial Networks, which introduces a generative model capable of modifying images in semantically meaningful ways. Instead of editing images by directly manipulating pixels, the software allows users to influence...

Downloads: 0 This Week

Last Update: 2026-03-12

See Project

Tesseract-gui

Tessract-GUI is not a front-end for tesseract-ocr. It is just a graphical way to use it with simple image manipulation thru ImageMagick.

2 Reviews

Downloads: 12 This Week

Last Update: 2014-06-29

See Project

CMU Personal Robotics ROS Packages

The CMU personal robotics package offers many robotics algorithms/controllers/drivers that enable robots to perform basic tasks like manipulation and vision. The main infrastructure used is OpenRAVE and Robot Operating System (ROS).

Downloads: 0 This Week

Last Update: 2014-07-19

See Project

Search Results for "manipulation" - Page 2

Showing 39 open source projects for "manipulation"

Poetiq

DreamO

Vidi2

Step-Audio-EditX

MuJoCo MPC

DragGAN

OpenAI Glow

ALAE

GIMP ML

StarGAN

TensorFlow Machine Learning Cookbook

Neural Photo Editor

Tesseract-gui

CMU Personal Robotics ROS Packages

Search Results for "manipulation" - Page 2

Showing 39 open source projects for "manipulation"

Poetiq

DreamO

Vidi2

Step-Audio-EditX

MuJoCo MPC

DragGAN

OpenAI Glow

ALAE

GIMP ML

StarGAN

TensorFlow Machine Learning Cookbook

Neural Photo Editor

Tesseract-gui

CMU Personal Robotics ROS Packages

Related Searches

Related Categories