depth-violet free download

Showing 89 open source projects for "depth-violet"

View related business solutions

Artificial Intelligence Clear Filters & Widen Search

Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

Depth Pro

Sharp Monocular Metric Depth in Less Than a Second

Depth Pro is a foundation model for zero-shot metric monocular depth estimation, producing sharp, high-frequency depth maps with absolute scale from a single image. Unlike many prior approaches, it does not require camera intrinsics or extra metadata, yet still outputs metric depth suitable for downstream 3D tasks. Apple highlights both accuracy and speed: the model can synthesize a ~2.25-megapixel depth map in around 0.3 seconds on a standard GPU, enabling near real-time applications. ...

Downloads: 5 This Week

Last Update: 2025-10-08
See Project
2

Depth Anything 3

Recovering the Visual Space from Any Views

Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity.

Downloads: 6 This Week

Last Update: 2026-03-21
See Project
3

OpenMythos

A theoretical reconstruction of the Claude Mythos architecture

...The architecture incorporates advanced techniques such as mixture-of-experts routing, adaptive computation time, and multiple attention mechanisms to dynamically allocate compute where needed. It is highly configurable through a centralized configuration system, allowing experimentation with different architectural parameters such as loop depth, attention type.

Downloads: 43 This Week

Last Update: 5 days ago
See Project
4

Map-Anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

...Instead of stitching together many task-specific models, it uses a single architecture that supports a wide range of 3D tasks—multi-image structure-from-motion, multi-view stereo, monocular metric depth, registration, depth completion, and more. The model flexibly accepts different input combinations (images, intrinsics, poses, sparse or dense depth) and produces a rich set of outputs including per-pixel 3D points, camera intrinsics, camera poses, ray directions, confidence maps, and validity masks. Its inference path is fully feed-forward with optional mixed-precision and memory-efficient modes, making it practical to scale to long image sequences while keeping latency predictable.

Downloads: 0 This Week

Last Update: 2026-03-23
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Dream Textures

Stable Diffusion built-in to Blender

Create textures, concept art, background assets, and more with a simple text prompt. Use the 'Seamless' option to create textures that tile perfectly with no visible seam. Texture entire scenes with 'Project Dream Texture' and depth to image. Re-style animations with the Cycles render pass. Run the models on your machine to iterate without slowdowns from a service. Create textures, concept art, and more with text prompts. Learn how to use the various configuration options to get exactly what you're looking for. Texture entire models and scenes with depth to image. ...

Downloads: 9 This Week

Last Update: 2024-08-26
See Project
6

ImageBind

ImageBind One Embedding Space to Bind Them All

ImageBind is a multimodal embedding framework that learns a shared representation space across six modalities—images, text, audio, depth, thermal, and IMU (inertial motion) data—without requiring explicit pairwise training for every modality combination. Instead of aligning each pair independently, ImageBind uses image data as the central binding modality, aligning all other modalities to it so they can interoperate zero-shot. This creates a unified embedding space where representations from any modality can be compared or retrieved against any other (e.g., matching sound to text or depth to image). ...

Downloads: 0 This Week

Last Update: 2025-11-21
See Project
7

VGGT

[CVPR 2025 Best Paper Award] VGGT

VGGT is a transformer-based framework aimed at unifying classic visual geometry tasks—such as depth estimation, camera pose recovery, point tracking, and correspondence—under a single model. Rather than training separate networks per task, it shares an encoder and leverages geometric heads/decoders to infer structure and motion from images or short clips. The design emphasizes consistent geometric reasoning: outputs from one head (e.g., correspondences or tracks) reinforce others (e.g., pose or depth), making the system more robust to challenging viewpoints and textures. ...

Downloads: 0 This Week

Last Update: 2025-10-11
See Project
8

The Hypersim Dataset

Photorealistic Synthetic Dataset for Holistic Indoor Scene

Hypersim is a large-scale, photorealistic synthetic dataset and tooling suite for indoor scene understanding research. It provides richly annotated renderings—RGB, depth, surface normals, instance and semantic segmentations, and material/lighting metadata—produced from high-fidelity virtual environments. The dataset spans diverse furniture layouts, room types, and camera trajectories, enabling robust training for geometry, segmentation, and SLAM-adjacent tasks. Rendering pipelines and utilities allow researchers to reproduce sequences, generate novel views, or extract task-specific supervision. ...

Downloads: 1 This Week

Last Update: 2026-01-09
See Project
9

CS-Ebook

Curated list of classic, high-quality computer science books

CS-Ebook is a curated repository that compiles high-quality and classic computer science books across a wide range of software-related fields. It focuses on depth over volume, selecting only well-regarded titles that support structured learning and long-term skill development. It spans core areas such as computer fundamentals, programming languages, software engineering, mathematics, data science, and artificial intelligence, making it suitable for learners at different stages. Rather than hosting files, the project serves as a discovery guide, helping users identify essential reading materials and build a strong technical foundation. ...

Downloads: 2 This Week

Last Update: 23 hours ago
See Project
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
10

Qwen-Image

Qwen-Image is a powerful image generation foundation model

...Qwen-Image supports sophisticated editing tasks such as style transfer, object insertion and removal, detail enhancement, and even human pose manipulation, making it suitable for both professional and casual users. It also includes advanced image understanding capabilities like object detection, semantic segmentation, depth and edge estimation, and novel view synthesis.

1 Review

Downloads: 16 This Week

Last Update: 2026-02-10
See Project
11

MESHROOM

3D reconstruction software

Photogrammetry is the science of making measurements from photographs. It infers the geometry of a scene from a set of unordered photographies or videos. Photography is the projection of a 3D scene onto a 2D plane, losing depth information. The goal of photogrammetry is to reverse this process. The dense modeling of the scene is the result yielded by chaining two computer vision-based pipelines, “Structure-from-Motion” (SfM) and “Multi View Stereo” (MVS). Fusion of Multi-bracketing LDR images into HDR. Alignment of panorama images. Support for fisheye optics. ...

1 Review

Downloads: 146 This Week

Last Update: 2025-08-19
See Project
12

KaTrain

Improve your Baduk skills by training with KataGo

KaTrain is an advanced training and analysis tool for the board game Go that leverages the powerful KataGo AI engine to provide real-time feedback and in-depth game review capabilities. It is designed to help players of all skill levels improve by identifying mistakes, analyzing move efficiency, and offering alternative strategies based on AI evaluation. The application allows users to play against AI opponents with adjustable difficulty, including intentionally weakened versions of the engine that simulate human-like play styles. ...

Downloads: 53 This Week

Last Update: 2026-03-19
See Project
13

HunyuanWorld-Voyager

RGBD video generation model conditioned on camera input

...By leveraging user-defined camera paths, it enables immersive scene exploration and supports controllable video synthesis with high realism. The system jointly produces aligned RGB and depth video sequences, making it directly applicable to 3D reconstruction tasks. At its core, Voyager integrates a world-consistent video diffusion model with an efficient long-range world exploration engine powered by auto-regressive inference. To support training, the team built a scalable data engine that automatically curates large video datasets with camera pose estimation and metric depth prediction. ...

Downloads: 8 This Week

Last Update: 2026-04-15
See Project
14

Attention Residuals (AttnRes)

Drop-in replacement for standard residual connections in Transformers

...Attention Residuals introduces a learnable softmax attention mechanism that allows each layer to selectively retrieve and weight useful representations from earlier layers, making depth dynamically adaptive rather than uniformly aggregated. This approach improves gradient stability, preserves meaningful signals throughout the network, and enhances performance in reasoning-heavy tasks such as coding, mathematics, and multi-step problem solving.

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
15

HunyuanDiT

Diffusion Transformer with Fine-Grained Chinese Understanding

...It trains a diffusion model in latent space using a transformer backbone and integrates a Multimodal Large Language Model (MLLM) to refine captions and support conversational image generation. It supports adapters like ControlNet, IP-Adapter, LoRA, and can run under constrained VRAM via distillation versions. LoRA, ControlNet (pose, depth, canny), IP-adapter to extend control over generation. Integration with Gradio for web demos and diffusers / command-line compatibility. Supports multi-turn T2I (text-to-image) interactions so users can iteratively refine their images via dialogue.

Downloads: 1 This Week

Last Update: 2025-11-27
See Project
16

WanGP

AI video generator optimized for low VRAM and older GPUs use

...Wan2GP provides a full web-based interface that simplifies interaction with complex generative pipelines, making it easier to configure prompts, models, and rendering settings. It also integrates a wide range of utilities such as prompt enhancement, mask editing, motion design, and extraction tools for pose, depth, and flow data to support advanced video workflows.

Downloads: 55 This Week

Last Update: 24 hours ago
See Project
17

Open Deep Research

An AI-powered research assistant that performs iterative research

...Its core goal is to provide the simplest possible implementation of a deep research workflow so developers can study and extend agent behavior without dealing with large, opaque codebases. The system exposes parameters such as breadth and depth to control how widely and how deeply the agent explores information sources. It is intentionally kept compact, with a codebase under roughly 500 lines, making it highly approachable for experimentation and learning. The architecture demonstrates how modern agent pipelines can continuously gather evidence, extract learnings, and adjust research direction over time.

Downloads: 0 This Week

Last Update: 2026-04-11
See Project
18

GPT Crawler

Crawl a site to generate knowledge files to create your own custom GPT

...Developers can integrate it into automated pipelines to keep knowledge sources fresh and synchronized with live websites. The overall architecture emphasizes extensibility, allowing users to customize crawling depth, parsing rules, and output handling.

Downloads: 0 This Week

Last Update: 2026-03-02
See Project
19

CO3D (Common Objects in 3D)

Tooling for the Common Objects In 3D dataset

...Each of the thousands of sequences in CO3Dv2 captures a common object (from categories like cars, chairs, or plants) from multiple real-world viewpoints. The dataset includes RGB images, depth maps, masks, and camera poses for each frame, along with pre-defined training, validation, and testing splits for both few-view and many-view reconstruction tasks.

Downloads: 1 This Week

Last Update: 7 days ago
See Project
20

AI Engineering Hub

In-depth tutorials on LLMs, RAGs and real-world AI agent applications

...Projects range from OCR applications and local chatbot UIs to multimodal RAG systems and multi-agent automation pipelines, making the hub valuable both as a learning resource and as a practical reference. The repository provides in-depth notebooks, example code, and integration patterns that illustrate how to implement, adapt, and scale AI features in real applications.

Downloads: 0 This Week

Last Update: 2026-03-13
See Project
21

HunyuanWorld-Mirror

Fast and Universal 3D reconstruction model for versatile tasks

HunyuanWorld-Mirror focuses on fast, universal 3D reconstruction that can ingest varied inputs and produce multiple kinds of 3D outputs. The model accepts combinations of images, camera intrinsics and poses, or even depth cues, then reconstructs consistent 3D geometry suitable for downstream rendering or editing. The pipeline emphasizes both speed and flexibility so creators can go from casual captures to assets without elaborate capture rigs. Outputs can include point clouds, estimated camera parameters, and other 3D representations that plug into typical graphics workflows. ...

Downloads: 0 This Week

Last Update: 2026-04-15
See Project
22

Fast3R

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

...It represents a next-generation feedforward 3D reconstruction model capable of producing dense point clouds and camera poses for hundreds to thousands of images or video frames in a single inference pass—eliminating the need for slow, iterative structure-from-motion pipelines. Built on PyTorch Lightning and extending concepts from DUSt3R and Spann3r, Fast3R unifies multi-view geometry, depth estimation, and camera registration within a single transformer-based architecture. It outputs high-quality 3D scene representations from unordered or sequential views, scaling to large datasets and varied camera intrinsics. The repository includes pretrained models, Gradio-based demos, and modular APIs for direct integration into research or production workflows.

Downloads: 0 This Week

Last Update: 7 days ago
See Project
23

fklearn

Functional Machine Learning

fklearn uses functional programming principles to make it easier to solve real problems with Machine Learning.

Downloads: 0 This Week

Last Update: 2026-04-22
See Project
24

VGGSfM

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

VGGSfM is an advanced structure-from-motion (SfM) framework jointly developed by Meta AI Research (GenAI) and the University of Oxford’s Visual Geometry Group (VGG). It reconstructs 3D geometry, dense depth, and camera poses directly from unordered or sequential images and videos. The system combines learned feature matching and geometric optimization to generate high-quality camera calibrations, sparse/dense point clouds, and depth maps in standard COLMAP format. Version 2.0 adds support for dynamic scene handling, dense point cloud export, video-based reconstruction (1000+ frames), and integration with Gaussian Splatting pipelines. ...

Downloads: 0 This Week

Last Update: 7 days ago
See Project
25

Perplexica

Perplexica is an AI-powered answering engine.

...Powered by SearxNG, it aggregates results from multiple search engines while keeping your identity and queries private. Perplexica offers multiple search modes—Speed, Balanced, and Quality—so you can trade off latency and depth depending on the task. It also enhances the experience with widgets and rich search types like images, videos, and domain-limited queries. With local search history, file uploads, and an easy Docker-based setup, it’s built for everyday research without sacrificing control or privacy.

Downloads: 17 This Week

Last Update: 2026-04-09
See Project