Search Results for "foundation visual art design"

Showing 48 open source projects for "foundation visual art design"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    Oumi

    Oumi

    Everything you need to build state-of-the-art foundation models

    Oumi is an open-source framework that provides everything needed to build state-of-the-art foundation models, end-to-end. It aims to simplify the development of large-scale machine-learning models.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    LongCat-Image

    LongCat-Image

    Foundation model for image generation

    LongCat-Image is an open-source foundation model for image generation and editing created by the LongCat team at Meituan, designed to deliver high-quality visual outputs while remaining efficient and accessible for developers and researchers. Rather than relying on massive parameter counts typical of many cutting-edge models, LongCat-Image achieves strong photorealism, stable structure, and accurate bilingual (Chinese and English) text rendering with a more compact ~6-billion parameter architecture, making it competitive with much larger alternatives despite its relatively lean design.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Salesforce Lightning Design System

    Salesforce Lightning Design System

    Salesforce Lightning design system

    Ready-to-use HTML and CSS UI elements provide the foundation for Salesforce experience development. Visual design values and attributes that ensure branding and UI consistency at scale. Design principles and best practices that guide beautiful, consistent, user-friendly product experiences. Easy-to-use tools help all Trailblazers optimize workflows and bring Salesforce ideas to life. Start building immediately, without worrying about detailed specs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    StarVector

    StarVector

    StarVector is a foundation model for SVG generation

    ...The model converts raster images or text instructions into structured vector representations, enabling high-quality vectorization and design generation. This approach allows StarVector to create scalable graphics that maintain visual quality regardless of resolution, which is especially useful for design tools and illustration workflows. Because the model produces SVG code rather than pixel images, the output can be edited programmatically or integrated directly into web and design environments.
    Downloads: 1 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 5
    WPF UI

    WPF UI

    Make your application written in WPF keep up with modern design trends

    ...Most styles and controls work right away in Visual Studio Designer. WPF UI is a library designed for building modern user interfaces in a Windows Presentation Foundation (WPF) environment. Windows Presentation Foundation (WPF) is a free and open-source graphical subsystem (similar to WinForms) originally developed by Microsoft for rendering user interfaces in Windows-based applications.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    MetBrewer

    MetBrewer

    Color palette package inspired by Metropolitan Museum of Art in NY

    MetBrewer is an R package that provides color palettes inspired by artworks and collections in the Metropolitan Museum of Art (The Met). The idea is to draw on the rich visual heritage of fine art to generate palettes that are aesthetically pleasing and grounded in real-world artistic color usage. The palettes are curated, named after artworks or styles, and often include notes about colorblind-friendliness and contrast. The package supports both discrete and continuous palette types, with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Reka UI

    Reka UI

    An open-source UI component library for building design systems

    Reka UI is an open-source UI component library designed for building accessible and customizable web interfaces in Vue-based applications. The library provides a collection of unstyled UI primitives that developers can use as the foundation for creating fully customized design systems. Rather than forcing a predefined visual style, Reka UI focuses on accessibility, composability, and flexibility, allowing developers to implement their own visual themes and branding. The project evolved from the Radix Vue initiative and aims to bring the same accessibility-focused component patterns to the Vue ecosystem. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    DeepSeek VL

    DeepSeek VL

    Towards Real-World Vision-Language Understanding

    DeepSeek-VL is DeepSeek’s initial vision-language model that anchors their multimodal stack. It enables understanding and generation across visual and textual modalities—meaning it can process an image + a prompt, answer questions about images, caption, classify, or reason about visuals in context. The model is likely used internally as the visual encoder backbone for agent use cases, to ground perception in downstream tasks (e.g. answering questions about a screenshot). The repository...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 9
    Qwen3-Omni

    Qwen3-Omni

    Qwen3-omni is a natively end-to-end, omni-modal LLM

    Qwen3-Omni is a natively end-to-end multilingual omni-modal foundation model that processes text, images, audio, and video and delivers real-time streaming responses in text and natural speech. It uses a Thinker-Talker architecture with a Mixture-of-Experts (MoE) design, early text-first pretraining, and mixed multimodal training to support strong performance across all modalities without sacrificing text or image quality.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 10
    MiniMax-M2.5

    MiniMax-M2.5

    State of the art LLM and coding model

    MiniMax-M2.5 is a state-of-the-art foundation model extensively trained with reinforcement learning across hundreds of thousands of real-world environments. It delivers leading performance in coding, agentic tool use, search, and complex office workflows, achieving top benchmark scores such as 80.2% on SWE-Bench Verified and 76.3% on BrowseComp. Designed to reason efficiently and decompose tasks like an experienced architect, M2.5 plans features, structure, and system design before generating code. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    GLM-4.6V

    GLM-4.6V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    vengi

    vengi

    Free and open source voxel art tools - editor, thumbnailer and format

    This is a collection of voxel tools. It includes a voxel editor with animation support and loading/saving into a lot of voxel volume formats. There are other tools e.g. a thumbnail for your file manager and a command line tool to convert or modify between several voxel formats. Use a voxel editor or to convert your voxel models into an obj or ply file. If you came here as a developer, make sure to check out the dependencies and set up your development environment. The build scripts are...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Magic Resume

    Magic Resume

    free online AI resume editor

    Magic Resume is a modern, open-source AI-powered resume builder designed to simplify the process of creating professional resumes through an interactive, visually rich web interface. Built with modern frontend technologies such as TanStack Start, TypeScript, and Tailwind CSS, it provides a smooth and responsive user experience enhanced by animation frameworks that make editing intuitive and engaging. The platform offers real-time preview capabilities, allowing users to instantly visualize...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    SAM 3

    SAM 3

    Code for running inference and finetuning with SAM 3 model

    SAM 3 (Segment Anything Model 3) is a unified foundation model for promptable segmentation in both images and videos, capable of detecting, segmenting, and tracking objects. It accepts both text prompts (open-vocabulary concepts like “red car” or “goalkeeper in white”) and visual prompts (points, boxes, masks) and returns high-quality masks, boxes, and scores for the requested concepts. Compared with SAM 2, SAM 3 introduces the ability to exhaustively segment all instances of an...
    Downloads: 42 This Week
    Last Update:
    See Project
  • 15
    Agent S

    Agent S

    Agent S: an open agentic framework that uses computers like a human

    ...Built to operate graphical user interfaces like a human, it allows AI agents to perceive screens, reason about tasks, and execute actions across macOS, Windows, and Linux systems. The latest version, Agent S3, surpasses human-level performance on the OSWorld benchmark, demonstrating state-of-the-art results in complex multi-step computer tasks. Agent S combines powerful foundation models (such as GPT-5) with grounding models like UI-TARS to translate visual inputs into precise executable actions. It supports flexible deployment via CLI, SDK, or cloud, and integrates with multiple model providers including OpenAI, Anthropic, Gemini, Azure, and Hugging Face endpoints. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    _s

    _s

    Inactive project

    Underscores is a no-frills WordPress starter theme engineered to give developers a clean, standards-based foundation for building custom themes. It provides semantic templates, accessibility-minded markup, and a minimal stylesheet that sets sensible defaults without imposing a visual identity. The codebase includes template tags, custom header and menu scaffolding, widget areas, and translation-ready strings to speed up common theme tasks. Its structure is intentionally opinionated about where code lives and how templates compose, which makes large customizations predictable and maintainable. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Primer CSS

    Primer CSS

    The CSS design system that powers GitHub

    Our goal is to create a system that enables us to build consistent user experiences with ease, yet with enough flexibility to support the broad spectrum of GitHub websites. This goal is embedded in our design and code decisions. Our approach to CSS is influenced by Object-Oriented CSS principles, functional CSS, and BEM architecture. Styles can be mixed and matched to achieve many different layouts, independent of their location. These styles fall into three categories. Single-purpose, immutable styles, that do one thing well. Abstracted patterns for frequently used visual styles. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Thunder Engine

    Thunder Engine

    Cross-platform 2D and 3D game engine with modular architecture

    Thunder is a lightweight, modular game engine written in C++ that provides a robust foundation for creating 2D and 3D games across platforms. It emphasizes a clean, component-based architecture and is designed for flexibility, performance, and scalability. Thunder includes built-in support for scripting, resource management, physics, animation, and rendering, with tools that facilitate efficient game development and deployment. Its cross-platform nature and minimal dependencies make it...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    Qwen3-VL-Embedding

    Qwen3-VL-Embedding

    Multimodal embedding and reranking models built on Qwen3-VL

    Qwen3-VL-Embedding (with its companion Qwen3-VL-Reranker) is a state-of-the-art multimodal embedding and reranking model suite built on the open-sourced Qwen3-VL foundation, developed to handle diverse inputs including text, images, screenshots, and videos. The core embedding model maps such inputs into semantically rich vectors in a unified representation space, enabling similarity search, clustering, and cross-modal retrieval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    SwissGL

    SwissGL

    SwissGL is a minimalistic wrapper on top of WebGL2 JS API

    SwissGL is a compact JavaScript library that provides a streamlined abstraction layer over the WebGL2 API, designed to minimize boilerplate when building GPU-accelerated graphics, simulations, and procedural visualizations. Acting as a "Swiss Army knife" for WebGL2, it simplifies shader, texture, and framebuffer management into a single, expressive interface that enables developers to write complex GPU workflows in just a few lines of code. The library centers around one main function that...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    beautiful-mermaid

    beautiful-mermaid

    Render Mermaid diagrams as beautiful SVGs or ASCII art

    beautiful-mermaid is a styling and rendering toolkit built to produce visually enhanced diagrams from Mermaid syntax, aiming to bridge the gap between simple technical diagrams and rich, presentation-ready visualizations, all while preserving the lightweight text-to-diagram workflow that Mermaid offers. Instead of plain, utilitarian shapes and lines, Beautiful Mermaid applies themes, typography enhancements, color palettes, and layout optimizations so diagrams look polished and professional...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Kaldi

    Kaldi

    kaldi-asr/kaldi is the official location of the Kaldi project

    ...It includes extensive tools for data preparation, feature extraction, acoustic and language modeling, decoding, and evaluation. With its modular design, Kaldi allows users to adapt the system to a wide range of languages and domains. As one of the most influential projects in speech recognition, it has become a foundation for much of the modern work in ASR.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    Wan2.1

    Wan2.1

    Wan2.1: Open and Advanced Large-Scale Video Generative Model

    Wan2.1 is a foundational open-source large-scale video generative model developed by the Wan team, providing high-quality video generation from text and images. It employs advanced diffusion-based architectures to produce coherent, temporally consistent videos with realistic motion and visual fidelity. Wan2.1 focuses on efficient video synthesis while maintaining rich semantic and aesthetic detail, enabling applications in content creation, entertainment, and research. The model supports...
    Downloads: 55 This Week
    Last Update:
    See Project
  • 24
    GLM-4.5V

    GLM-4.5V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    GLM-4.5V is the preceding iteration in the GLM-V series that laid much of the groundwork for general multimodal reasoning and vision-language understanding. It embodies the design philosophy of mixing visual and textual modalities into a unified model capable of general-purpose reasoning, content understanding, and generation, while already supporting a wide variety of tasks: from image captioning and visual question answering to content recognition, GUI-based agents, video understanding, and long-document interpretation. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Mudlet

    Mudlet

    Cross-platform, open source, and super fast MUD client with scripting

    Play immersive, multiplayer, pure-text games on Mudlet. Performance defined Mudlet from its birth. The custom text display and the Lua-powered scripting system will handle the biggest raids without issues. Mudlet’s entire user interface is designed to be modded – from the space within the window to the look and feel of the client itself. Make yourself a beautiful portal to your text world. Mudlet is completely free to download, modify, and extend. Build on top of a powerful foundation and...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB