Search Results for "atom 3d model" - Page 2

Sort By:

Showing 311 open source projects for "atom 3d model"

View related business solutions

Linux Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Error to trace to log to deploy. One click. No SSH.
Catch the cause before the pager goes off.

AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.

Free 30 days.
1

HunyuanWorld-Voyager

RGBD video generation model conditioned on camera input

HunyuanWorld-Voyager is a next-generation video diffusion framework developed by Tencent-Hunyuan for generating world-consistent 3D scene videos from a single input image. By leveraging user-defined camera paths, it enables immersive scene exploration and supports controllable video synthesis with high realism. The system jointly produces aligned RGB and depth video sequences, making it directly applicable to 3D reconstruction tasks. At its core, Voyager integrates a world-consistent video diffusion model with an efficient long-range world exploration engine powered by auto-regressive inference. ...

Downloads: 1 This Week

Last Update: 2026-04-15
See Project
2

3DCellForge

AI-powered interactive 3D cell generation and exploration studio

...The project also supports optional image-to-3D generation through cloud providers and local backends, while keeping API keys on the server side instead of exposing them in the frontend bundle. It includes cached demo models so the experience can work without generating a new model every time. Overall, it is a creative research and education tool for visualizing cell structures in a more interactive way than static diagrams.

Downloads: 1 This Week

Last Update: 2026-05-14
See Project
3

Hunyuan3D-1

A Unified Framework for Text-to-3D and Image-to-3D Generation

Hunyuan3D-1 is an earlier version in the same 3D generation line (the unified framework for text-to-3D and image-to-3D tasks) by Tencent Hunyuan. It provides a framework combining shape generation and texture synthesis, enabling users to create 3D assets from images or text conditions. While less advanced than version 2.1, it laid the foundations for the later PBR, higher resolution, and open-source enhancements. (Note: less detailed public documentation was found for Hunyuan3D-1 compared to...

Downloads: 0 This Week

Last Update: 2025-11-19
See Project
4

VGGT-Ω

[CVPR 2026 Oral] VGGT Omega

VGGT-Omega is a Facebook Research computer vision project for feed-forward camera and depth reconstruction. It takes images as input and predicts camera parameters, depth maps, confidence values, and related scene tokens. The project is associated with 3D understanding workflows where models infer scene geometry without a traditional multi-stage reconstruction pipeline. It includes pretrained model variants with different resolutions and text-alignment capabilities, though checkpoint access may require approval. The repository also provides a Gradio demo that can visualize predicted cameras and depth-unprojected point clouds as a GLB scene. ...

Downloads: 3 This Week

Last Update: 2026-05-26
See Project
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
5

glslViewer

Console-based GLSL Sandbox for 2D/3D shaders shaders

GlslViewer is a flexible console-based OpenGL Sandbox to display 2D/3D GLSL shaders without the need of a UI. You can definitely make your own UI or wrapper using the Python Module (include) or any other tool that communicates back/forth with the GPS viewer through the standard POSIX console In/Out or OSC. Default vert/frag shaders for 2D shader and 3D material shaders with PBR lighting model. Hot reload of files on changes.

Downloads: 7 This Week

Last Update: 2026-02-07
See Project
6

Depth Pro

Sharp Monocular Metric Depth in Less Than a Second

Depth Pro is a foundation model for zero-shot metric monocular depth estimation, producing sharp, high-frequency depth maps with absolute scale from a single image. Unlike many prior approaches, it does not require camera intrinsics or extra metadata, yet still outputs metric depth suitable for downstream 3D tasks. Apple highlights both accuracy and speed: the model can synthesize a ~2.25-megapixel depth map in around 0.3 seconds on a standard GPU, enabling near real-time applications. ...

Downloads: 0 This Week

Last Update: 2025-10-08
See Project
7

Lyra 2

Project Lyra: Open Generative 3D World Models

The Lyra 2 project is a research-driven framework developed by NVIDIA that focuses on building open generative 3D world models using advanced diffusion-based techniques. It enables the creation of fully explorable 3D environments from minimal inputs such as a single image or video, leveraging self-distillation methods to generate consistent spatial representations. The system evolves across versions, with newer iterations introducing long-horizon generation and improved 3D consistency across...

Downloads: 1 This Week

Last Update: 2026-06-11
See Project
8

Step-Video-T2V

State-of-the-art (SoTA) text-to-video pre-trained model

...The model handles bilingual input (e.g. English and Chinese) thanks to dual encoders, and supports end-to-end text-to-video generation without requiring external assets. Its training and generation pipeline includes techniques like flow-matching, full 3D attention for temporal consistency, and fine-tuning approaches (e.g. video-based DPO) to improve fidelity and reduce artifacts.

Downloads: 2 This Week

Last Update: 2025-12-02
See Project
9

Draper

Decorators/view-models for Rails applications

Draper adds an object-oriented layer of presentation logic to your Rails application. Without Draper, this functionality might have been tangled up in procedural helpers or adding bulk to your models. With Draper decorators, you can wrap your models with presentation-related logic to organize and test this layer of your app much more effectively. Imagine your application has an Article model. With Draper, you'd create a corresponding ArticleDecorator. The decorator wraps the model, and deals...

Downloads: 0 This Week

Last Update: 2025-11-16
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

Fast3R

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Fast3R is Meta AI’s official CVPR 2025 release for “Towards 3D Reconstruction of 1000+ Images in One Forward Pass.” It represents a next-generation feedforward 3D reconstruction model capable of producing dense point clouds and camera poses for hundreds to thousands of images or video frames in a single inference pass—eliminating the need for slow, iterative structure-from-motion pipelines. Built on PyTorch Lightning and extending concepts from DUSt3R and Spann3r, Fast3R unifies multi-view geometry, depth estimation, and camera registration within a single transformer-based architecture. ...

Downloads: 2 This Week

Last Update: 5 days ago
See Project
11

Copulas

A library to model multivariate data using copulas

...Choose from a variety of univariate distributions and copulas – including Archimedian Copulas, Gaussian Copulas and Vine Copulas. Compare real and synthetic data visually after building your model. Visualizations are available as 1D histograms, 2D scatterplots and 3D scatterplots. Access & manipulate learned parameters. With complete access to the internals of the model, set or tune parameters to your choosing.

Downloads: 0 This Week

Last Update: 2026-02-05
See Project
12

Depth Anything 3

Recovering the Visual Space from Any Views

Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity. The model can be applied to photography, AR/VR content creation, robotics perception, and 3D reconstruction workflows, making it versatile across industries and research domains. ...

Downloads: 6 This Week

Last Update: 2026-03-21
See Project
13

Chili3D

A browser-based 3D CAD application for online model design

Chili3D is an open-source, browser-based 3D CAD application that enables users to design, edit, and visualize complex 3D models directly within a web environment without requiring local installation. It is built using TypeScript and leverages WebAssembly to compile the OpenCascade geometric modeling kernel, allowing it to achieve near-native performance inside the browser. The application integrates with modern rendering libraries to provide real-time visualization, interactive modeling, and...

Downloads: 1 This Week

Last Update: 2026-04-07
See Project
14

Automated Tool for Optimized Modelling

Automated Tool for Optimized Modelling

...How many times have you conducted the same action to pre-process a raw dataset? How many times have you copy-and-pasted code from an old repository to re-use it in a new use case? ATOM is here to help solve these common issues. The package acts as a wrapper of the whole machine learning pipeline, helping the data scientist to rapidly find a good model for his problem.

Downloads: 0 This Week

Last Update: 2024-07-05
See Project
15

WorldGen

Generate Any 3D Scene in Seconds

WorldGen is an AI model and library that can generate full 3D scenes in a matter of seconds from either text prompts or reference images. It is designed to create interactive environments suitable for games, simulations, robotics research, and virtual reality, rather than just static 3D assets. The core idea is that you describe a world in natural language and WorldGen produces a navigable 3D scene that you can freely explore in 360 degrees, with loop closure so that the space remains consistent as you move around. ...

Downloads: 0 This Week

Last Update: 2026-04-12
See Project
16

Map-Anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

...The model flexibly accepts different input combinations (images, intrinsics, poses, sparse or dense depth) and produces a rich set of outputs including per-pixel 3D points, camera intrinsics, camera poses, ray directions, confidence maps, and validity masks. Its inference path is fully feed-forward with optional mixed-precision and memory-efficient modes, making it practical to scale to long image sequences while keeping latency predictable.

Downloads: 0 This Week

Last Update: 2026-05-30
See Project
17

Qwen3-VL

Qwen3-VL, the multimodal large language model series by Alibaba Cloud

Qwen3-VL is the latest multimodal large language model series from Alibaba Cloud’s Qwen team, designed to integrate advanced vision and language understanding. It represents a major upgrade in the Qwen lineup, with stronger text generation, deeper visual reasoning, and expanded multimodal comprehension. The model supports dense and Mixture-of-Experts (MoE) architectures, making it scalable from edge devices to cloud deployments, and is available in both instruction-tuned and...

Downloads: 5 This Week

Last Update: 5 days ago
See Project
18

LingBot-Map

A feed-forward 3D foundation model for reconstructing scenes

LingBot-Map is a specialized project focused on mapping conversational or linguistic interactions within chatbot or AI-driven systems, providing a structured way to visualize and organize dialogue flows. It is designed to help developers understand how conversations evolve across different states, enabling better debugging and optimization of chatbot behavior. The system emphasizes mapping relationships between intents, responses, and transitions, creating a clear representation of...

Downloads: 1 This Week

Last Update: 2026-06-02
See Project
19

Filament

Real-time physically based rendering engine for Android, iOS, and more

...Filament is a physically based rendering (PBR) engine for Android. The goal of Filament is to offer a set of tools and APIs for Android developers that will enable them to create high quality 2D and 3D rendering with ease. For both artists and developers, our system will rely on as few parameters as possible to reduce trial and error and allow users to quickly master the material model. A physically based approach must not preclude non-realistic rendering. User interfaces for instance will need unlit materials. Our primary goal is to design and implement a rendering system able to perform efficiently on mobile platforms.

Downloads: 5 This Week

Last Update: 2 days ago
See Project
20

Tracking Any Point (TAP)

DeepMind model for tracking arbitrary points across videos & robotics

TAPNet is the official Google DeepMind repository for Tracking Any Point (TAP), bundling datasets, models, benchmarks, and demos for precise point tracking in videos. The project includes the TAP-Vid and TAPVid-3D benchmarks, which evaluate long-range tracking of arbitrary points in 2D and 3D across diverse real and synthetic videos. Its flagship models—TAPIR, BootsTAPIR, and the latest TAPNext—use matching plus temporal refinement or next-token style propagation to achieve state-of-the-art...

Downloads: 1 This Week

Last Update: 4 days ago
See Project
21

SCAIL

Towards Studio-Grade Character Animation via In-Context Learning of 3D

SCAIL is a project developed by the ZAI Organization, focusing on AI-driven research initiatives. While specific documentation about SCAIL’s exact goals and implementation is limited from the repository context alone, the project appears to be part of a collection of machine learning and AI research tools that facilitate scalable model development, evaluation, or application workflows. Given its listing alongside other ZAI projects like speech recognition and text-to-speech systems, SCAIL...

Downloads: 0 This Week

Last Update: 2026-05-06
See Project
22

SynaBun

Persistent vector memory for AI assistants

...One of its defining characteristics is its Neural Interface, a browser-based 3D visualization that represents stored memories as nodes in an interactive graph, allowing users to explore relationships, edit entries, and manage knowledge visually.

Downloads: 0 This Week

Last Update: 2026-05-15
See Project
23

video2robot

End-to-end pipeline converting generative videos

video2robot is an end-to-end open-source pipeline that converts generative video or prompt-driven motion content into executable humanoid robot motion sequences, enabling researchers and developers to go from high-level action descriptions or videos to robot-ready motion data. The pipeline supports both prompt-to-video generation using models like Veo/Sora and video upload processing, followed by human pose extraction through a 3D pose model and retargeting of that motion to robot joints using a general motion retargeting system. This workflow allows users to generate robot motion files that specify joint angles, root positions, and orientations that can be deployed on supported robot platforms (e.g., Unitree models). Video2robot includes scripts for each stage of the pipeline (generation, extraction, conversion, visualization) and can run as a CLI or through a basic web UI.

Downloads: 0 This Week

Last Update: 2026-01-30
See Project
24

VoxelMorph

Unsupervised Learning for Image Registration

VoxelMorph is an open-source deep learning framework designed for medical image registration, a process that aligns multiple medical scans into a common spatial coordinate system. Traditional image registration techniques typically rely on optimization procedures that must be executed separately for each pair of images, which can be computationally expensive and slow. VoxelMorph approaches the problem using neural networks that learn to predict deformation fields that transform one image so...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
25

Lingvo

Framework for building neural networks

...Lingvo includes reference models and configurations for domains like machine translation, automatic speech recognition, language modeling, image understanding, and 3D object detection. Centralized hyperparameter configuration files allow researchers to share exact experiment setups so others can retrain and compare results reliably.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project