Page 4 | foundation free download

Showing 107 open source projects for "foundation"

Filters: Artificial Intelligence, Python

1

Meta-World

Collections of robotics environments

Meta-World is an open-source benchmark suite of robotic manipulation environments focused on multi-task and meta reinforcement learning. It provides a large collection of continuous-control tasks, such as reaching, pushing, opening doors, and manipulating objects with a simulated robot arm. The library defines standardized benchmarks like MT1, MT10, and MT50 for multi-task learning, where a single policy is trained across different numbers of tasks. It also offers meta-learning benchmarks...

Downloads: 0 This Week

Last Update: 2025-11-25
2

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project

...With its modular design, Kaldi allows users to adapt the system to a wide range of languages and domains. As one of the most influential projects in speech recognition, it has become a foundation for much of the modern work in ASR.

Downloads: 0 This Week

Last Update: 4 days ago
3

MONAI

AI Toolkit for Healthcare Imaging

The MONAI framework is the open-source foundation being created by Project MONAI. MONAI is a freely available, community-supported, PyTorch-based framework for deep learning in healthcare imaging. It provides domain-optimized foundational capabilities for developing healthcare imaging training workflows in a native PyTorch paradigm. Project MONAI also includes MONAI Label, an intelligent open source image labeling and learning tool that helps researchers and clinicians collaborate, create annotated datasets, and build AI models in a standardized MONAI paradigm. ...

Downloads: 0 This Week

Last Update: 2026-06-08
4

Qwen3-Omni

Qwen3-Omni is a natively end-to-end, omni-modal LLM

Qwen3-Omni is a natively end-to-end multilingual omni-modal foundation model that processes text, images, audio, and video and delivers real-time streaming responses in text and natural speech. It uses a Thinker-Talker architecture with a Mixture-of-Experts (MoE) design, early text-first pretraining, and mixed multimodal training to support strong performance across all modalities without sacrificing text or image quality.

Downloads: 1 This Week

Last Update: 2026-04-23
5

DocArray

The data structure for multimodal data

...It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to the multimodal world: a super-expressive data structure for representing complicated, mixed, or nested text, image, video, audio, and 3D mesh data. It is the foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt, etc. Data science powerhouse: greatly accelerates data scientists’ work on embedding, k-NN matching, querying, visualizing, and evaluating via Torch/TensorFlow/ONNX/PaddlePaddle on CPU/GPU. Data in transit: optimized for network communication, ready to wire at any time with fast and compressed serialization in Protobuf, bytes, base64, JSON, CSV, DataFrame. ...

Downloads: 0 This Week

Last Update: 2025-03-21
6

GLM-4.6V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...

Downloads: 0 This Week

Last Update: 2026-05-16
7

MiniMax-M1

Open-weight, large-scale hybrid-attention reasoning model

MiniMax-M1 is presented as the world’s first open-weight, large-scale hybrid-attention reasoning model, designed to push the frontier of long-context, tool-using, and deeply “thinking” language models. It is built on the MiniMax-Text-01 foundation and keeps the same massive parameter budget, but reworks the attention and training setup for better reasoning and test-time compute scaling. Architecturally, it combines Mixture-of-Experts layers with lightning attention, enabling the model to support a native context length of 1 million tokens while using far fewer FLOPs than comparable reasoning models for very long generations. ...

Downloads: 0 This Week

Last Update: 2025-12-01
8

GeneralAI

Large-scale Self-supervised Pre-training Across Tasks, Languages, etc.

Fundamental research to develop new architectures for foundation models and AI, focusing on modeling generality and capability, as well as training stability and efficiency.

Downloads: 0 This Week

Last Update: 2024-05-09
9

InternVL

A Pioneering Open-Source Alternative to GPT-4o

InternVL is a large-scale multimodal foundation model designed to integrate computer vision and language understanding within a unified architecture. The project focuses on scaling vision models and aligning them with large language models so that they can perform tasks involving both visual and textual information. InternVL is trained on massive collections of image-text data, enabling it to learn representations that capture both visual patterns and semantic meaning.

Downloads: 0 This Week

Last Update: 2026-03-04
10

DeepSeek VL

Towards Real-World Vision-Language Understanding

DeepSeek-VL is DeepSeek’s initial vision-language model that anchors their multimodal stack. It enables understanding and generation across visual and textual modalities—meaning it can process an image + a prompt, answer questions about images, caption, classify, or reason about visuals in context. The model is likely used internally as the visual encoder backbone for agent use cases, to ground perception in downstream tasks (e.g. answering questions about a screenshot). The repository...

Downloads: 1 This Week

Last Update: 2025-10-03
11

Improved Diffusion

Release for Improved Denoising Diffusion Probabilistic Models

...The implementation is intended for researchers and practitioners who want to explore the theoretical and practical aspects of diffusion models in deep learning. By making this code available, OpenAI provides a foundation for further experimentation and development in generative modeling research.

Downloads: 0 This Week

Last Update: 4 days ago
12

Autodistill

Images to inference with no labeling

Autodistill uses big, slower foundation models to train small, faster supervised models. Using autodistill, you can go from unlabeled images to inference on a custom model running at the edge with no human intervention in between. You can use Autodistill on your own hardware, or use the Roboflow hosted version of Autodistill to label images in the cloud.

Downloads: 0 This Week

Last Update: 2024-08-14
13

Plugins Quickstart

Get a ChatGPT plugin up and running in under 5 minutes

plugins-quickstart is a starter project created by OpenAI to help developers build and deploy ChatGPT plugins quickly. It provides a minimal but complete example of how to structure a plugin, implement an API, and define the necessary configuration files. The repository demonstrates how a plugin can be served, authenticated, and integrated with ChatGPT for real-world use. By including both the backend code and plugin manifest, it guides developers through the end-to-end development workflow....

Downloads: 2 This Week

Last Update: 1 day ago
14

InternLM

Official release of InternLM series

InternLM is an open-source family of multilingual foundation and chat models, accompanied by an ecosystem that supports training, inference, and application development. The repository highlights multiple model sizes intended to serve different needs, from efficient research and prototyping to more capable deployments for complex scenarios. Beyond model weights, the project emphasizes an ecosystem view, pointing developers to compatible tools and projects across training and inference so teams can build end-to-end workflows. ...

Downloads: 0 This Week

Last Update: 2026-03-04
15

RWKV

RNN with great LLM performance

...This gives RWKV important advantages for long-context use, including lower memory pressure and no traditional key-value cache requirement. The repository includes training code, model notes, research material, and references to current RWKV weights. Its main value is providing the foundation for experimenting with efficient large language models that combine transformer-like scalability with RNN-like runtime behavior.

Downloads: 0 This Week

Last Update: 2026-06-10
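
The point about avoiding a key-value cache can be illustrated with a toy scalar recurrence. This is a simplified sketch of a decayed, softmax-like weighted average, not RWKV's actual time-mixing formula; the function names and the `decay` parameter are hypothetical. The recurrent version keeps only two numbers of state, while the reference version revisits every past token, as a cache-based approach would.

```python
import math


def recurrent_mix(keys, values, decay):
    """Process a sequence with constant-size state (two floats)."""
    num, den = 0.0, 0.0  # running weighted sum and running weight total
    outputs = []
    for k, v in zip(keys, values):
        w = math.exp(k)
        # Older contributions fade by `decay`; the new token is added.
        num = decay * num + w * v
        den = decay * den + w
        outputs.append(num / den)
    return outputs


def full_history_mix(keys, values, decay):
    """Reference: same outputs, recomputed from all stored past tokens."""
    outputs = []
    for t in range(len(keys)):
        weights = [decay ** (t - i) * math.exp(keys[i]) for i in range(t + 1)]
        weighted = sum(w * v for w, v in zip(weights, values))
        outputs.append(weighted / sum(weights))
    return outputs


keys = [0.1, -0.3, 0.7, 0.0]
values = [1.0, 2.0, -1.0, 0.5]
fast = recurrent_mix(keys, values, decay=0.9)
slow = full_history_mix(keys, values, decay=0.9)
```

Both functions produce the same outputs up to floating-point rounding, but only the second needs memory that grows with the sequence length.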
16

Maia

MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.

Downloads: 0 This Week

Last Update: 2024-04-21
17

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

We're thrilled to present OneLLM, an ensembling Meta-Transformer framework with Multimodal Large Language Models, which performs multimodal joint training, supports more modalities including fMRI, Depth, and Normal Maps, and demonstrates impressive performance on 25 benchmarks.

Downloads: 0 This Week

Last Update: 2024-08-19
18

TaskMatrix

Enable sending and receiving images during chatting

...The architecture focuses on modularity, allowing new APIs and foundation models to be integrated as interchangeable task-solving components. The project also explores low-code human-AI interaction workflows that improve controllability and transparency during complex task execution.

Downloads: 0 This Week

Last Update: 2026-05-07
19

RasaGPT

Headless Rasa chatbot platform with LLM integration and APIs

...It also enables Telegram bot integration and remote access via ngrok. Docker support allows easier setup and deployment, particularly on macOS environments. While designed as a working prototype, it provides a practical foundation for developers building LLM-powered chatbot applications with extensible architecture and preconfigured components.

Downloads: 0 This Week

Last Update: 5 days ago
20

pyllama

LLaMA: Open and Efficient Foundation Language Models

📢 pyllama is a hacked version of LLaMA based on Facebook's original implementation, but more convenient to run on a single consumer-grade GPU.

Downloads: 0 This Week

Last Update: 2023-08-24
21

MaskFormer

Per-Pixel Classification is Not All You Need for Semantic Segmentation

MaskFormer is a unified framework for image segmentation developed by Facebook Research, designed to bridge the gap between semantic, instance, and panoptic segmentation within a single architecture. Unlike traditional segmentation pipelines that treat these tasks separately, MaskFormer reformulates segmentation as a mask classification problem, enabling a consistent and efficient approach across multiple segmentation domains. Built on top of Detectron2, it supports a wide range of datasets...

Downloads: 0 This Week

Last Update: 4 days ago
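
The mask classification idea can be sketched in a few lines of plain Python. Each of N predicted segments carries a class distribution and a per-pixel mask probability, and a semantic label map is obtained by weighting masks with class probabilities and taking the best class per pixel. This is a minimal illustration under assumed input shapes, not code from the MaskFormer repository; `semantic_inference` and its arguments are hypothetical names.

```python
def semantic_inference(class_probs, mask_probs):
    """Combine per-segment class and mask predictions into a label map.

    class_probs: N x K nested list, probability of each of K classes per
                 segment (any "no object" column is assumed already dropped).
    mask_probs:  N x H x W nested list, per-pixel mask probability per segment.
    Returns an H x W nested list of class indices.
    """
    num_segments = len(class_probs)
    num_classes = len(class_probs[0])
    height, width = len(mask_probs[0]), len(mask_probs[0][0])
    labels = []
    for y in range(height):
        row = []
        for x in range(width):
            # Score of class c at this pixel: sum over segments of
            # (class probability) * (mask probability).
            scores = [
                sum(class_probs[q][c] * mask_probs[q][y][x]
                    for q in range(num_segments))
                for c in range(num_classes)
            ]
            row.append(max(range(num_classes), key=scores.__getitem__))
        labels.append(row)
    return labels


# Two segments, two classes, a 1 x 2 image.
class_probs = [[0.9, 0.1], [0.2, 0.8]]
mask_probs = [[[0.9, 0.1]], [[0.1, 0.9]]]
labels = semantic_inference(class_probs, mask_probs)  # [[0, 1]]
```

The same per-segment outputs can in principle be post-processed differently for instance or panoptic results, which is what makes the formulation task-agnostic.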
22

PixelCNN

Code for the paper "PixelCNN++: A PixelCNN Implementation..."

PixelCNN is OpenAI's official implementation of PixelCNN++, an autoregressive generative model that builds on the one described in the paper Conditional Image Generation with PixelCNN Decoders. It provides code for training and evaluating PixelCNN models on image datasets, focusing on conditional image modeling where pixels are generated sequentially based on the values of previously generated pixels. The repository demonstrates how to apply masked convolutions to enforce autoregressive dependencies and achieve...

Downloads: 0 This Week

Last Update: 4 days ago
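
As a rough illustration of how a masked convolution enforces autoregressive ordering, the sketch below builds a raster-scan kernel mask and applies it at a single pixel. It is a generic, dependency-free example with hypothetical function names, not the repository's code, and the actual implementation may achieve the same effect differently.

```python
def causal_mask(kernel_size, include_center):
    """Build a 0/1 mask that keeps only pixels earlier in raster-scan order.

    include_center=False hides the current pixel as well (first layer);
    include_center=True lets later layers see their own position.
    """
    if kernel_size % 2 == 0:
        raise ValueError("kernel_size must be odd")
    c = kernel_size // 2
    mask = []
    for row in range(kernel_size):
        mask_row = []
        for col in range(kernel_size):
            above = row < c                      # rows above the center
            left = row == c and col < c          # same row, to the left
            center = row == c and col == c
            keep = above or left or (center and include_center)
            mask_row.append(1 if keep else 0)
        mask.append(mask_row)
    return mask


def masked_conv_at(image, weights, mask, y, x):
    """Evaluate one masked kernel at pixel (y, x) with zero padding."""
    k = len(weights)
    c = k // 2
    total = 0.0
    for i in range(k):
        for j in range(k):
            yy, xx = y + i - c, x + j - c
            if 0 <= yy < len(image) and 0 <= xx < len(image[0]):
                total += image[yy][xx] * weights[i][j] * mask[i][j]
    return total


first_layer = causal_mask(3, include_center=False)
# [[1, 1, 1],
#  [1, 0, 0],
#  [0, 0, 0]]
```

Because the mask zeroes every weight at or after the current position, changing any "future" pixel leaves the output at (y, x) unchanged, which is the property sequential sampling relies on.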
23

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures! Optimization courses which form the foundation for ML, DL, RL. Computer Vision courses which are DL & ML heavy. Speech recognition courses which are DL heavy. Structured Courses on Geometric, Graph Neural Networks. Section on Autonomous Vehicles. Section on Computer Graphics with ML/DL focus.

Downloads: 0 This Week

Last Update: 2022-07-29
24

maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation

...The framework integrates critical components—region proposal networks (RPNs), RoIAlign layers, mask heads, and backbone architectures such as ResNet and FPN—optimized for both accuracy and speed. It supports multi-GPU distributed training, mixed precision, and custom data loaders for new datasets. Built as a reference implementation, it became a foundation for the next-generation Detectron2, yet remains widely used for research needing a stable, reproducible environment. Visualization tools, model zoo checkpoints, and benchmark scripts make it easy to replicate state-of-the-art results or fine-tune models for custom tasks.

Downloads: 0 This Week

Last Update: 2025-10-06
25

Virtual Laboratory Environment

A multi-modeling and simulation environment to study complex systems

...VLE is based on the discrete event specification DEVS, and it implements the DSDE formalism (a merge of Dynamic Structure DEVS, DSDEVS, with Parallel DEVS, PDEVS). VLE provides a complete set of C++ libraries, called VFL (VLE Foundation Libraries), to develop DEVS models, to get results of simulations, and to launch simulations on a cluster. The models can be developed with the DEVS formalism or with classical mathematical formalisms: ordinary differential equations with an Euler, Runge-Kutta, or QSS integrator, or finite state automata (FDDEVS, UML state chart, hybrid Petri net). ...

2 Reviews

Downloads: 0 This Week

Last Update: 2019-02-04
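
To make the difference between the Euler and Runge-Kutta integrators mentioned above concrete, here is a small standalone sketch that integrates dy/dt = -y with both. It does not use VLE or VFL (which are C++ libraries); the function names are hypothetical and the fixed-step fourth-order Runge-Kutta scheme is assumed as the representative Runge-Kutta method.

```python
import math


def euler_step(f, t, y, h):
    """One explicit Euler step: first-order accurate."""
    return y + h * f(t, y)


def rk4_step(f, t, y, h):
    """One classical fourth-order Runge-Kutta step."""
    k1 = f(t, y)
    k2 = f(t + h / 2, y + h / 2 * k1)
    k3 = f(t + h / 2, y + h / 2 * k2)
    k4 = f(t + h, y + h * k3)
    return y + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)


def integrate(step, f, y0, t0, t1, n_steps):
    """Advance y from t0 to t1 using n_steps fixed-size steps."""
    h = (t1 - t0) / n_steps
    t, y = t0, y0
    for i in range(n_steps):
        y = step(f, t, y, h)
        t = t0 + (i + 1) * h
    return y


def decay(t, y):
    """Right-hand side of dy/dt = -y, whose exact solution is exp(-t)."""
    return -y


exact = math.exp(-1.0)
euler_error = abs(integrate(euler_step, decay, 1.0, 0.0, 1.0, 10) - exact)
rk4_error = abs(integrate(rk4_step, decay, 1.0, 0.0, 1.0, 10) - exact)
```

With the same ten steps, the Runge-Kutta error is several orders of magnitude smaller than the Euler error, at the cost of four function evaluations per step instead of one.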