make free download - SourceForge

Showing 18 open source projects for "make"

Filters: AI Models, Python

1

FramePack

Let's make video diffusion practical

FramePack explores compact representations for sequences of image frames, targeting tasks where many near-duplicate frames carry redundant information. The idea is to “pack” frames by detecting shared structure and storing differences efficiently, which can accelerate training or inference on video-like data. By reducing I/O and memory bandwidth, datasets become lighter to load while models still see the essential temporal variation. The repository demonstrates both packing and unpacking...

Downloads: 31 This Week

Last Update: 2025-10-21
See Project
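The "pack frames by storing differences" idea in the FramePack description can be illustrated with a tiny delta encoder. This is a minimal sketch under stated assumptions, not FramePack's actual method or API: frames are flat lists of pixel values, and the names `pack_frames` and `unpack_frames` are hypothetical.

```python
from typing import List, Tuple

Frame = List[int]              # a flattened frame of pixel values (assumption)
Delta = List[Tuple[int, int]]  # (pixel index, new value) pairs


def pack_frames(frames: List[Frame]) -> Tuple[Frame, List[Delta]]:
    """Keep the first frame in full, then only the pixels that change."""
    if not frames:
        raise ValueError("need at least one frame")
    key = list(frames[0])
    deltas: List[Delta] = []
    prev = key
    for frame in frames[1:]:
        if len(frame) != len(key):
            raise ValueError("all frames must have the same length")
        # Record only positions whose value differs from the previous frame.
        deltas.append([(i, v) for i, (p, v) in enumerate(zip(prev, frame)) if p != v])
        prev = frame
    return key, deltas


def unpack_frames(key: Frame, deltas: List[Delta]) -> List[Frame]:
    """Rebuild every frame by replaying the stored differences in order."""
    frames = [list(key)]
    for delta in deltas:
        frame = list(frames[-1])
        for i, v in delta:
            frame[i] = v
        frames.append(frame)
    return frames


frames = [[0, 0, 0, 0], [0, 0, 9, 0], [0, 0, 9, 0]]
key, deltas = pack_frames(frames)
restored = unpack_frames(key, deltas)
```

Near-duplicate frames produce short or empty deltas, which is where the storage saving comes from in this toy version.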
2

DreamCraft3D

Official implementation of DreamCraft3D

...“make this object metallic,” “add textures”) via prompt conditioning or guides.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
3

MiniMind-O

A 0.1B Omni model trained from scratch

...It extends the MiniMind family by exploring a model that can handle text, audio, and image inputs while producing text and streaming speech outputs. The project is designed to make multimodal AI training more accessible by keeping the model size small enough for ordinary personal hardware. It includes both mini and full training data paths, allowing learners to run a complete workflow quickly or reproduce the released model setup more closely. The implementation emphasizes native PyTorch code instead of relying on high-level third-party abstractions. MiniMind-O is most useful for developers and researchers who want to understand how multimodal and speech-capable AI systems are built from the ground up.

Downloads: 4 This Week

Last Update: 2026-06-28
See Project
4

HRM-Text

1B text generation model based on the HRM architecture

HRM-Text is a one-billion-parameter text generation model and pretraining framework based on the Hierarchical Reasoning Model architecture. It is designed to make foundation model pretraining more accessible by reducing compute and data requirements compared with traditional scaling-heavy approaches. The system combines hierarchical recurrent design, task-completion strengthening, and latent-space reasoning. Its training stack includes PrefixLM sequence packing, FlashAttention 3 kernels, PyTorch FSDP2, evaluation scripts, and checkpoint conversion tools. ...

Downloads: 1 This Week

Last Update: 2026-06-17
See Project
5

Step-Audio-EditX

LLM-based Reinforcement Learning audio edit model

...Because the model is trained with a “large-margin learning” objective over many synthesized and natural speech samples, it gains robust control over expressive attributes, and can perform iterative editing: e.g. you could record a line, then ask the model to “make it sadder,” “speak slower,” or “change accent to X.”

Downloads: 0 This Week

Last Update: 2026-04-09
See Project
6

MiMo Audio

Audio Language Models are Few-Shot Learners

...It supports audio understanding, speech intelligence, spoken dialogue, instruction-following audio generation, and text-to-speech-style tasks. The architecture combines audio tokenization, patch encoding, a language model, and patch decoding to make high-rate audio sequences more efficient to model. Overall, it is useful for researchers and developers experimenting with advanced audio LLMs, speech generation, audio reasoning, and instruction-tuned multimodal systems.

Downloads: 0 This Week

Last Update: 2026-06-29
See Project
7

TimesFM

Pretrained time-series foundation model developed by Google Research

...It provides a decoder-only model approach to forecasting, aiming for strong performance even in zero-shot or low-data settings where traditional models often struggle. The project includes code and an inference API intended to make it practical to run forecasts programmatically, with options to use different backends such as Torch or Flax depending on your environment and performance needs. Newer releases emphasize expanded context handling and more flexible forecasting outputs, including quantile forecasting so users can get uncertainty estimates rather than only point predictions. ...

Downloads: 0 This Week

Last Update: 2026-07-02
See Project
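The difference between a point prediction and a quantile forecast, mentioned in the TimesFM description, can be shown with the standard library alone. This is a rough sketch and does not use the TimesFM API: the simulated samples stand in for a model's predictive distribution, and `quantile_forecast` is a hypothetical helper.

```python
import random
import statistics
from typing import Dict, List


def quantile_forecast(samples: List[float]) -> Dict[str, float]:
    """Summarize sampled forecasts for one time step as 10%/50%/90% quantiles."""
    cuts = statistics.quantiles(samples, n=10)  # 9 cut points: 10%, 20%, ..., 90%
    return {"p10": cuts[0], "p50": cuts[4], "p90": cuts[8]}


# Simulated forecast samples for a single future step (assumption: the
# predictive distribution is roughly normal around 100 with spread 5).
rng = random.Random(0)
samples = [100 + rng.gauss(0, 5) for _ in range(1000)]

point = statistics.mean(samples)       # a single point prediction
bands = quantile_forecast(samples)     # an interval that conveys uncertainty
```

The point forecast gives one number, while the `p10` to `p90` band tells the user how far the outcome might plausibly deviate from it.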
8

FastVLM

This repository contains the official implementation of FastVLM

...Apple’s research brief frames FastVLM as targeting real-time or latency-sensitive scenarios, where lowering visual token pressure is critical to interactive UX. In short, it’s a practical recipe to make VLMs fast without exotic token-selection heuristics.

Downloads: 0 This Week

Last Update: 2025-10-08
See Project
9

Step1X-Edit

A SOTA open-source image editing model

Step1X-Edit is a state-of-the-art open-source image editing model/framework that uses a multimodal large language model (LLM) together with a diffusion-based image decoder to let users edit images simply via natural-language instructions plus a reference image. You supply an existing image and a textual command — e.g. “add a ruby pendant on the girl’s neck” or “make the background a sunset over mountains” — and the model interprets the instruction, computes a latent embedding combining the image content and user intent, then decodes a new image implementing the edit. The model targets general-purpose editing: from object addition/removal, style changes, recoloring, retouching, background replacement, to complex transformations like changing lighting, mood, or art style. ...

Downloads: 0 This Week

Last Update: 2026-04-29
See Project
10

Qwen2.5-Omni

Capable of understanding text, audio, vision, video

...It uses a “Thinker-Talker” architecture and introduces innovations for aligning modalities over time (for example, synchronizing video and audio), robust speech generation, and low-VRAM/quantized versions that make usage more accessible. It achieves state-of-the-art performance on many multimodal benchmarks, particularly spoken language understanding, audio reasoning, and image/video understanding, and it often outperforms or matches single-modality models at a similar scale. ...

Downloads: 0 This Week

Last Update: 2025-09-23
See Project
11

Free AI Watermark Remover - FreeRepair

AI-powered tool to quickly remove watermarks from images flawlessly

FreeRepair is a free, open-source AI watermark remover and image enhancement tool, modeled on IOPaint and built with Python's Django framework, with no sign-up required. It makes it easy to remove watermarks, logos, text, or clutter from images, and it can make blurry images clearer or larger. It needs no installation or internet connection, works out of the box, and places no limits on usage.

1 Review

Downloads: 35 This Week

Last Update: 2026-03-30
See Project
12

ChatGPT Retrieval Plugin

The ChatGPT Retrieval Plugin lets you easily find personal documents

...It also contains plugin manifest files (OpenAPI spec, plugin JSON) so that the retrieval backend can be registered in a plugin ecosystem. Because retrieval is often needed to make LLMs “know what’s in your docs” without leaking everything, this plugin aims to be a secure, flexible building block for retrieval-augmented generation (RAG) systems.

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
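The retrieval step behind retrieval-augmented generation, as described for the ChatGPT Retrieval Plugin, amounts to ranking documents by similarity to a query and passing only the best matches to the model. The sketch below is an illustration only, not the plugin's code: word-count vectors stand in for learned embeddings, and `embed`, `cosine`, and `retrieve` are hypothetical names.

```python
import math
from collections import Counter
from typing import List


def embed(text: str) -> Counter:
    """Toy embedding: a bag-of-words count vector (assumption, not a real model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, docs: List[str], k: int = 2) -> List[str]:
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


docs = [
    "invoice for march rent",
    "recipe for apple pie",
    "march travel itinerary",
]
top = retrieve("march rent invoice", docs, k=1)
```

Only the retrieved passages are handed to the language model, which is how a RAG system can answer from private documents without exposing the whole collection.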
13

FastViT

This repository contains the official implementation of research

...Training and inference recipes highlight straightforward integration into common vision tasks such as classification, detection, and segmentation. The codebase provides reference implementations and checkpoints that make it easy to evaluate or fine-tune on downstream datasets. In practice, FastViT offers drop-in backbones that reduce compute and memory pressure without exotic training tricks.

Downloads: 0 This Week

Last Update: 2025-10-08
See Project
14

Stable-Dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion

A PyTorch implementation of the text-to-3D model Dreamfusion, powered by the Stable Diffusion text-to-2D model. This project is a work in progress and differs from the paper in many ways. The current generation quality cannot match the results from the original paper, and many prompts still fail badly! Since the Imagen model is not publicly available, we use Stable Diffusion to replace it (implementation from diffusers). Unlike Imagen, Stable Diffusion is a latent diffusion...

Downloads: 0 This Week

Last Update: 2023-05-15
See Project
15

GPT-NeoX

Implementation of model parallel autoregressive transformers on GPUs

...Our current framework is based on NVIDIA's Megatron Language Model and has been augmented with techniques from DeepSpeed as well as some novel optimizations. We aim to make this repo a centralized and accessible place to gather techniques for training large-scale autoregressive language models, and accelerate research into large-scale training. For those looking for a TPU-centric codebase, we recommend Mesh Transformer JAX. If you are not looking to train models with billions of parameters from scratch, this is likely the wrong library to use. ...

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
16

Mask2Former

Code release for "Masked-attention Mask Transformer..."

...The project provides extensive configurations and pretrained models across popular benchmarks like COCO, ADE20K, and Cityscapes. Built on top of Detectron2, it includes training scripts, inference tools, and visualization utilities that make experimentation straightforward.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
17

MaskFormer

Per-Pixel Classification is Not All You Need for Semantic Segmentation

...Its successor, Mask2Former, extends the same meta-architecture to achieve state-of-the-art results across all major segmentation benchmarks. MaskFormer’s modular design, dataset integration, and compatibility with existing Detectron2 models make it an essential research tool.

Downloads: 1 This Week

Last Update: 4 days ago
See Project
18

TimeSformer

The official PyTorch implementation of our paper

...Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch provides configurations, pretrained models, and training scripts that make it straightforward to evaluate or fine-tune on video datasets. TimeSformer was influential in showing that pure transformer architectures—without convolutional backbones—can perform strongly on video classification tasks. Its flexible attention design allows experimenting with different factoring (spatial-then-temporal, joint, etc.) to trade off compute, memory, and accuracy.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
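The compute trade-off between joint and factored (spatial-then-temporal) attention mentioned in the TimeSformer description can be estimated by counting query-key pairs. This is a back-of-the-envelope sketch, not code from the repository: it ignores the classification token, attention heads, and layer count, and the function names are hypothetical.

```python
def joint_attention_pairs(frames: int, patches: int) -> int:
    """Joint space-time attention: every token attends to every token."""
    tokens = frames * patches
    return tokens * tokens


def divided_attention_pairs(frames: int, patches: int) -> int:
    """Divided attention: each token attends across time (same patch position
    in every frame), then across space (every patch in its own frame)."""
    tokens = frames * patches
    return tokens * (frames + patches)


# Example clip: 8 frames, each split into a 14 x 14 grid of patches.
T, N = 8, 14 * 14
joint = joint_attention_pairs(T, N)      # 2,458,624 pairs
divided = divided_attention_pairs(T, N)  # 319,872 pairs
```

For this clip size the divided scheme compares roughly 7.7 times fewer token pairs per layer, which is the kind of compute and memory saving the factoring is meant to buy.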