Showing 419 open source projects for "foundation"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    LLMs-from-scratch

    LLMs-from-scratch

    Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

    LLMs-from-scratch is an educational codebase that walks through implementing modern large-language-model components step by step. It emphasizes building blocks—tokenization, embeddings, attention, feed-forward layers, normalization, and training loops—so learners understand not just how to use a model but how it works internally. The repository favors clear Python and NumPy or PyTorch implementations that can be run and modified without heavyweight frameworks obscuring the logic. Chapters...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    SAM 3D Objects

    SAM 3D Objects

    Models for object and human mesh reconstruction

    SAM 3D Objects is a foundation model that reconstructs full 3D geometry, texture, and spatial layout of objects and scenes from a single image. Given one RGB image and object masks (for example, from the Segment Anything family), it can generate a textured 3D mesh for each object, including pose and approximate scene layout. The model is specifically designed to be robust in real-world images with clutter, occlusions, small objects, and unusual viewpoints, where many earlier 3D-from-image systems struggle. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    UI-TARS

    UI-TARS

    UI-TARS-desktop version that can operate on your local personal device

    ...This allows it to perform complex, multi-step tasks such as filling forms, downloading files, navigating applications, and even controlling in-game actions — all by understanding the UI as a human would. The project is open-source, supports deployment locally or remotely, and offers a foundation for building GUI automation agents that are more robust, and adaptable.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 4
    nuwa-skill

    nuwa-skill

    Mental models, decision heuristics, expressing DNA

    ...The architecture is typically designed for flexibility, making it suitable for applications in conversational AI, automation, or intelligent assistants. Overall, it serves as a foundation for building scalable and extensible AI-driven systems.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    Cybersecurity AI

    Cybersecurity AI

    Cybersecurity AI (CAI), the framework for AI Security

    CAI (Cybersecurity AI) is a lightweight open-source framework intended to help security practitioners build and deploy AI-assisted automation for defensive and offensive security workflows. The project frames itself as a practical foundation for “AI security,” focusing on turning security tasks into agentic workflows that can be composed, executed, and iterated on by practitioners. Rather than being a single-purpose tool, CAI is positioned as a framework that supports building multiple security automations and integrating them into existing processes. It is designed for real-world usability, aiming to reduce friction for teams experimenting with AI agents in security operations, assessment, and response contexts. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    SenseVoice

    SenseVoice

    Multilingual speech recognition and audio understanding model

    SenseVoice is a speech foundation model designed to perform multiple voice understanding tasks from audio input. It provides capabilities such as automatic speech recognition, spoken language identification, speech emotion recognition, and audio event detection within a single system. SenseVoice is trained on more than 400,000 hours of speech data and supports over 50 languages for multilingual recognition tasks.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    GLM-OCR

    GLM-OCR

    Accurate × Fast × Comprehensive

    GLM-OCR is an open-source multimodal optical character recognition (OCR) model built on a GLM-V encoder–decoder foundation that brings robust, accurate document understanding to complex real-world layouts and modalities. Designed to handle text recognition, table parsing, formula extraction, and general information retrieval from documents containing mixed content, GLM-OCR excels across major benchmarks while remaining highly efficient with a relatively compact parameter size (~0.9B), enabling deployment in high-concurrency services and edge environments. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Depth Pro

    Depth Pro

    Sharp Monocular Metric Depth in Less Than a Second

    Depth Pro is a foundation model for zero-shot metric monocular depth estimation, producing sharp, high-frequency depth maps with absolute scale from a single image. Unlike many prior approaches, it does not require camera intrinsics or extra metadata, yet still outputs metric depth suitable for downstream 3D tasks. Apple highlights both accuracy and speed: the model can synthesize a ~2.25-megapixel depth map in around 0.3 seconds on a standard GPU, enabling near real-time applications. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    GenericAgent

    GenericAgent

    Self-evolving autonomous agent framework

    ...The project also focuses on extensibility, allowing developers to plug in custom tools or APIs and tailor agent behavior to specific use cases. By abstracting common agent patterns, it reduces the overhead of building agent systems from scratch. Overall, GenericAgent provides a foundation for scalable and reusable AI agent development.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Book5_Essentials-Probability-Statistics

    Book5_Essentials-Probability-Statistics

    The book 5 of statistics in simplicity

    ...The material connects probability theory directly to real analytical workflows, helping learners understand how statistics supports predictive modeling. Like the other books in the series, it blends mathematical explanation with Python-based experimentation. Overall, the project provides a practical statistical foundation for students advancing into AI and data science.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Gymnasium

    Gymnasium

    An API standard for single-agent reinforcement learning environments

    Gymnasium is a fork of OpenAI Gym, maintained by the Farama Foundation, that provides a standardized API for reinforcement learning environments. It improves upon Gym with better support, maintenance, and additional features while maintaining backward compatibility.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Nixtla TimeGPT

    Nixtla TimeGPT

    TimeGPT-1: production ready pre-trained Time Series Foundation Model

    TimeGPT is a production ready, generative pretrained transformer for time series. It's capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code. Whether you're a bank forecasting market trends or a startup predicting product demand, TimeGPT democratizes access to cutting-edge predictive insights, eliminating the need for a dedicated team of machine learning engineers. A generative model for time series. TimeGPT is capable of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Cal.diy

    Cal.diy

    Scheduling infrastructure for absolutely everyone

    ...The platform enables users to manage bookings, availability, and integrations without relying on external services or subscription models. Built with modern web technologies such as Next.js, React, and PostgreSQL, it offers a robust foundation for building custom scheduling workflows. The system is designed for developers and advanced users who prefer to host and secure their own data rather than depend on third-party providers. It also supports integration with external tools through APIs and authentication systems, allowing flexible customization.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Clawith

    Clawith

    OpenClaw for Teams

    ...It may also include monitoring and control features to ensure that agent behavior remains aligned with user goals. The project reflects a broader trend toward building AI systems that act as autonomous operators rather than passive assistants. Overall, Clawith serves as a foundation for building advanced, action-oriented AI workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    InvokeAI

    InvokeAI

    InvokeAI is a leading creative engine for Stable Diffusion models

    ...Generate and create stunning visual media using the latest AI-driven technologies. InvokeAI offers an industry leading Web Interface, interactive Command Line Interface, and also serves as the foundation for multiple commercial products. This fork is supported across Linux, Windows and Macintosh. Linux users can use either an Nvidia-based card (with CUDA support) or an AMD card (using the ROCm driver). We do not recommend the GTX 1650 or 1660 series video cards. They are unable to run in half-precision mode and do not have sufficient VRAM to render 512x512 images.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 16
    Hugging Face - Speech To Speech

    Hugging Face - Speech To Speech

    Open speech-to-speech models and pipelines by Hugging Face toolkit AI

    ...It is designed to help researchers and developers experiment with multilingual and cross-lingual voice applications. It integrates with the broader Hugging Face ecosystem, making it easier to load pretrained models and run inference. It also serves as a foundation for building real-time or batch audio transformation systems. Overall, it highlights an emerging approach to voice technology that reduces latency and preserves more of the original speech characteristics.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    CogView4

    CogView4

    CogView4, CogView3-Plus and CogView3(ECCV 2024)

    CogView4 is the latest generation in the CogView series of vision-language foundation models, developed as a bilingual (Chinese and English) open-source system for high-quality image understanding and generation. Built on top of the GLM framework, it supports multimodal tasks including text-to-image synthesis, image captioning, and visual reasoning. Compared to previous CogView versions, CogView4 introduces architectural upgrades, improved training pipelines, and larger-scale datasets, enabling stronger alignment between textual prompts and generated visual content. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    memsearch

    memsearch

    A Markdown-first memory system, a standalone library for any AI agent

    ...Memsearch is designed to be agent-friendly, making it easy to plug into existing AI workflows and enhance reasoning capabilities. Its markdown-first approach ensures transparency and portability of stored knowledge. Overall, it provides a robust foundation for building AI systems with persistent and intelligent memory.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    LingBot-Map

    LingBot-Map

    A feed-forward 3D foundation model for reconstructing scenes

    LingBot-Map is a specialized project focused on mapping conversational or linguistic interactions within chatbot or AI-driven systems, providing a structured way to visualize and organize dialogue flows. It is designed to help developers understand how conversations evolve across different states, enabling better debugging and optimization of chatbot behavior. The system emphasizes mapping relationships between intents, responses, and transitions, creating a clear representation of...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    HolmesGPT

    HolmesGPT

    CNCF Sandbox Project

    ...Rather than requiring engineers to manually correlate large volumes of monitoring data, HolmesGPT automatically synthesizes evidence and presents explanations in natural language. The project is developed by Robusta and has been accepted as a Cloud Native Computing Foundation Sandbox project, highlighting its relevance to the cloud-native ecosystem. It is designed to operate as an automated troubleshooting assistant that can analyze incidents continuously and support on-call engineers during outages.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    BCEmbedding

    BCEmbedding

    Netease Youdao's open-source embedding and reranker models

    ...It includes an EmbeddingModel for semantic vector generation and a RerankerModel for refining and ordering search results. The project is optimized for bilingual and cross-lingual retrieval, especially across Chinese and English. It is used as a foundation for RAG systems such as QAnything and other Youdao products. The models are designed to work directly without fine-tuning across common business scenarios such as education, medicine, law, finance, literature, FAQs, textbooks, and general conversation. BCEmbedding also provides integrations for popular RAG frameworks, making it easier to add semantic search and reranking to AI applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Book4_Power-of-Matrix

    Book4_Power-of-Matrix

    Book_4_Matrix Power | The Iris Book: From Addition, Subtraction

    ...The repository is continuously updated and intended to accompany the broader Visualize-ML learning ecosystem. Overall, it serves as a visually driven mathematical foundation for students preparing for data science and machine learning work.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Hyperledger Cello

    Hyperledger Cello

    Operating System for Enterprise Blockchain

    Hyperledger Cello is a blockchain operation and provisioning system designed to automate the deployment, management, and scaling of Hyperledger Fabric networks. As part of the Hyperledger project under the Linux Foundation, Cello aims to offer Blockchain-as-a-Service (BaaS) by abstracting the complexity of infrastructure setup for consortiums and enterprises. It provides a dashboard, APIs, and orchestration tools to help users create, monitor, and manage blockchain nodes, ledgers, and applications efficiently across cloud and on-premise environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    AG2

    AG2

    Framework for building and orchestrating multi-agent AI systems

    ...AG2 is intended for developers experimenting with autonomous systems, research prototypes, or production-grade agent pipelines. AG2 emphasizes flexibility, allowing users to integrate different models and customize behaviors depending on their use case. Overall, it serves as a foundation for building scalable and modular AI agent ecosystems.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Positron

    Positron

    Positron, a next-generation data science IDE

    ...It aims to unify exploratory data analysis, production code, and data-app authoring in a single environment so that data scientists move from “question → insight → application” without switching tools. Built on the open-source Code-OSS foundation, Positron provides a familiar coding experience along with specialized panes and tooling for variable inspection, data-frame viewing, plotting previews, and interactive consoles designed for analytical work. The IDE supports notebook and script workflows, integration of data-app frameworks (such as Shiny, Streamlit, Dash), database and cloud connections, and built-in AI-assisted capabilities to help write code, explore data, and build models.
    Downloads: 2 This Week
    Last Update:
    See Project
Auth0 Logo