3d text free download

Showing 197 open source projects for "3d text"

View related business solutions

AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
1

HunyuanWorld 1.0

Generating Immersive, Explorable, and Interactive 3D Worlds

HunyuanWorld-1.0 is an open-source, simulation-capable 3D world generation model developed by Tencent Hunyuan that creates immersive, explorable, and interactive 3D environments from text or image inputs. It combines the strengths of video-based diversity and 3D-based geometric consistency through a novel framework using panoramic world proxies and semantically layered 3D mesh representations.

Downloads: 17 This Week

Last Update: 2026-04-15
See Project
2

LLaMA-Mesh

Unifying 3D Mesh Generation with Language Models

LLaMA-Mesh is a research framework that extends large language models so they can understand and generate 3D mesh data alongside text. The system introduces a method for representing 3D meshes in a textual format by encoding vertex coordinates and face definitions as sequences that can be processed by a language model. By serializing 3D geometry into text tokens, the approach allows existing transformer architectures to generate and interpret 3D models without requiring specialized visual tokenizers. ...

Downloads: 1 This Week

Last Update: 2026-03-09
See Project
3

Hunyuan3D-1

A Unified Framework for Text-to-3D and Image-to-3D Generation

Hunyuan3D-1 is an earlier version in the same 3D generation line (the unified framework for text-to-3D and image-to-3D tasks) by Tencent Hunyuan. It provides a framework combining shape generation and texture synthesis, enabling users to create 3D assets from images or text conditions. While less advanced than version 2.1, it laid the foundations for the later PBR, higher resolution, and open-source enhancements.

Downloads: 3 This Week

Last Update: 2025-11-19
See Project
4

three.js

JavaScript 3D library

three.js is a lightweight and easy-to-use JavaScript 3D library with its own default WebGL renderer. It works by creating a scene with a camera and a geometric cube, the latter of which is added to the scene. This will become the animated object within the scene. A WebGL renderer is created for the scene and the camera, with an added viewport to the document.body element. Finally, the cube is animated within the scene for the camera. A plethora of examples can be found in the project’s...

Downloads: 72 This Week

Last Update: 2026-04-16
See Project
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
5

HY-World 2.0

A Multi-Modal World Model for Reconstructing, Generating, Simulation

HY-World 2.0 is a multi-modal world model framework for reconstructing, generating, and simulating navigable 3D worlds from diverse inputs. It accepts text prompts, single-view images, multi-view images, and videos, and produces 3D world representations rather than limiting output to flat video generation. For text and single-image inputs, it generates high-fidelity 3D Gaussian Splatting scenes through a multi-stage pipeline that includes panorama generation, trajectory planning, world expansion, and world composition. ...

Downloads: 17 This Week

Last Update: 4 days ago
See Project
6

Hunyuan3D 2.0

High-Resolution 3D Assets Generation with Large Scale Diffusion Models

The Hunyuan3D-2 model, developed by Tencent, is designed for generating high-resolution 3D assets using large-scale diffusion models. This model offers advanced capabilities for creating detailed 3D models, including texture enhancements, multi-view shape generation, and rapid inference for real-time applications. It is particularly useful for industries requiring high-quality 3D content, such as gaming, film, and virtual reality. Hunyuan3D-2 supports various enhancements and is available...

Downloads: 37 This Week

Last Update: 2025-10-28
See Project
7

CadQuery

A python parametric CAD scripting framework based on OCCT

...Provide a non-proprietary, plain text model format that can be edited and executed with only a web browser. The scripts use a standard programming language, Python, and thus can benefit from the associated infrastructure. This includes many standard libraries and IDEs. CadQuery's CAD kernel Open CASCADE Technology (OCCT) is much more powerful than the CGAL used by OpenSCAD.

Downloads: 52 This Week

Last Update: 2026-02-13
See Project
8

HY-Motion 1.0

HY-Motion model for 3D character animation generation

HY-Motion 1.0 is an open-source, large-scale AI model suite developed by Tencent’s Hunyuan team that generates high-quality 3D human motion from simple text prompts, enabling the automatic production of fluid, diverse, and semantically accurate animations without manual keyframing or rigging. Built on advanced deep learning architectures that combine Diffusion Transformer (DiT) and flow matching techniques, HY-Motion scales these approaches to the billion-parameter level, resulting in strong instruction-following capabilities and richer motion outputs compared to existing open-source models. ...

Downloads: 11 This Week

Last Update: 2026-01-29
See Project
9

MLX-Audio

A text-to-speech, speech-to-text and speech-to-speech library

...It includes examples such as audiobook generation to demonstrate long-form synthesis and joined audio segments. On top of that, MLX-Audio offers a modern web interface powered by FastAPI, with real-time waveform and 3D visualizations, file upload, and audio management.

Downloads: 5 This Week

Last Update: 2026-03-30
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

WorldGen

Generate Any 3D Scene in Seconds

WorldGen is an AI model and library that can generate full 3D scenes in a matter of seconds from either text prompts or reference images. It is designed to create interactive environments suitable for games, simulations, robotics research, and virtual reality, rather than just static 3D assets. The core idea is that you describe a world in natural language and WorldGen produces a navigable 3D scene that you can freely explore in 360 degrees, with loop closure so that the space remains consistent as you move around. ...

Downloads: 0 This Week

Last Update: 2026-04-12
See Project
11

Make-A-Video - Pytorch (wip)

Implementation of Make-A-Video, new SOTA text to video generator

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch. They combine pseudo-3d convolutions (axial convolutions) and temporal attention and show much better temporal fusion. The pseudo-3d convolutions isn't a new concept. It has been explored before in other contexts, say for protein contact prediction as "dimensional hybrid residual networks".

Downloads: 1 This Week

Last Update: 2024-05-03
See Project
12

Step-Video-T2V

State-of-the-art (SoTA) text-to-video pre-trained model

...The model handles bilingual input (e.g. English and Chinese) thanks to dual encoders, and supports end-to-end text-to-video generation without requiring external assets. Its training and generation pipeline includes techniques like flow-matching, full 3D attention for temporal consistency, and fine-tuning approaches (e.g. video-based DPO) to improve fidelity and reduce artifacts. As a result, Step-Video-T2V aims to push the frontier of open-source video generation.

Downloads: 1 This Week

Last Update: 2025-12-02
See Project
13

DreamCraft3D

Official implementation of DreamCraft3D

DreamCraft3D is DeepSeek’s generative 3D modeling framework / model family that likely extends their earlier 3D efforts (e.g. Shap-E or Point-E style models) with more capability, control, or expression. The name suggests a “dream crafting” metaphor—users probably supply textual or image prompts and generate 3D assets (point clouds, meshes, scenes). The repository includes model code, inference scripts, sample prompts, and possibly dataset preparation pipelines. It may integrate rendering or...

Downloads: 1 This Week

Last Update: 2025-10-03
See Project
14

Video Diffusion - Pytorch

Implementation of Video Diffusion Models

...Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch. It uses a special space-time factored U-net, extending generation from 2D images to 3D videos. 14k for difficult moving mnist (converging much faster and better than NUWA) - wip. Any new developments for text-to-video synthesis will be centralized at Imagen-pytorch. For conditioning on text, they derived text embeddings by first passing the tokenized text through BERT-large. You can also directly pass in the descriptions of the video as strings, if you plan on using BERT-base for text conditioning. ...

Downloads: 0 This Week

Last Update: 2024-05-03
See Project
15

Radegast

Lightweight client for connecting to Second Life and OpenSim

Radegast is a virtual world client compatible with Second Life and OpenSimulator. Its main purpose is to provide an alternative client to Linden Lab-derived virtual world viewers. There is a strong focus on accessibility and non-3D interaction. Given the current nature of changes in Second Life, I felt it was prudent to take on another abandoned text-focused viewer. Introducing MEGAbolt, a fork of the METAbolt viewer which was abandoned by its author almost eight years ago. Keep in mind, this is the first version and much has changed under the hood. It has been brought up to date and only supports Windows x64 currently and is built against the .NET 5 platform. ...

3 Reviews

Downloads: 32 This Week

Last Update: 2026-03-04
See Project
16

Microsoft Paint 3D

Free app for creating 3D art

...Paint 3D includes a built-in library of 3D objects and supports remixing models created by other users, making it ideal for beginners and hobbyists. The app also features tools for adding 3D text, stickers, and realistic textures to personalize projects. Beyond 3D, Paint 3D allows simple 2D drawing with brushes and effects similar to traditional digital painting apps.

1 Review

Downloads: 111 This Week

Last Update: 2025-07-14
See Project
17

HY-World 1.5

A Systematic Framework for Interactive World Modeling

HY-WorldPlay is a Hunyuan AI project focusing on immersive multimodal content generation and interaction within virtual worlds or simulated environments. It aims to empower AI agents with the capability to both understand and generate multimedia content — including text, audio, image, and potentially 3D or game-world elements — enabling lifelike dialogue, environmental interpretations, and responsive world behavior. The platform targets use cases in digital entertainment, game worlds, training simulators, and interactive storytelling, where AI agents need to adapt to real-time user inputs and changes in environment state. ...

Downloads: 23 This Week

Last Update: 2026-04-15
See Project
18

Axmol Engine

Multi-platform Engine for Desktop, XBOX (UWP) and Mobile games

Axmol is a modern C++ game engine forked from Cocos2d-x, designed to support high-performance 2D and lightweight 3D game development across multiple platforms. It improves upon the original Cocos2d-x with a cleaner architecture, better tooling, and support for modern C++ standards. Axmol supports scripting with Lua and JavaScript, and is suitable for both indie developers and studios targeting mobile, desktop, and web platforms. With an active community and frequent updates, Axmol is a solid...

Downloads: 5 This Week

Last Update: 2026-02-23
See Project
19

FAY

Framework for building AI-powered interactive digital humans and agent

...It acts as a middleware layer that connects digital character technologies with conversational AI systems and business applications. Fay supports various types of digital humans, including 2.5D and 3D avatars, and can be integrated with applications running on mobile devices, PCs, web platforms, and embedded systems. Its architecture allows developers to combine different AI components such as speech recognition, text-to-speech, and large language models to create conversational digital agents. Fay provides multiple interfaces for text, voice, and digital human control, enabling developers to build interactive assistants, virtual presenters, or automated service agents. ...

Downloads: 4 This Week

Last Update: 2 hours ago
See Project
20

OpenUSD

Universal Scene Description

Universal Scene Description (USD) is an open source framework developed by Pixar Animation Studios for the interchange of 3D graphics data across various digital content creation tools. USD provides a rich and powerful toolset for reading, writing, editing, and rapidly previewing 3D scene data. With many of its features geared towards performance and large-scale collaboration among many artists, USD is ideal for the complexities of the modern pipeline. One such feature is Hydra, a high-performance preview renderer capable of interactively displaying large data sets. ...

Downloads: 6 This Week

Last Update: 10 hours ago
See Project
21

CADAM

CADAM is the open source text-to-CAD web application

CADAM is an open-source “text-to-CAD” web application that turns natural-language prompts into parametric 3D models, targeting a workflow where you describe what you want and the system produces editable geometry. It is designed around a browser-first experience, so users can iterate quickly without installing a heavyweight desktop CAD suite for every experiment.

Downloads: 13 This Week

Last Update: 2026-03-03
See Project
22

HyperTools

A Python toolbox for gaining geometric insights

...It is built on top of matplotlib (for plotting), seaborn (for plot styling), and scikit-learn (for data manipulation). Functions for plotting high-dimensional datasets in 2/3D. Static and animated plots. Simple API for customizing plot styles. Set of powerful data manipulation tools including hyperalignment, k-means clustering, normalizing and more. Support for lists of Numpy arrays, Pandas dataframes, text or (mixed) lists. Applying topic models and other text vectorization methods to text data. HyperTools is designed to facilitate dimensionality reduction-based visual explorations of high-dimensional data. ...

Downloads: 0 This Week

Last Update: 2026-01-29
See Project
23

Qwen3-VL

Qwen3-VL, the multimodal large language model series by Alibaba Cloud

Qwen3-VL is the latest multimodal large language model series from Alibaba Cloud’s Qwen team, designed to integrate advanced vision and language understanding. It represents a major upgrade in the Qwen lineup, with stronger text generation, deeper visual reasoning, and expanded multimodal comprehension. The model supports dense and Mixture-of-Experts (MoE) architectures, making it scalable from edge devices to cloud deployments, and is available in both instruction-tuned and...

Downloads: 7 This Week

Last Update: 2026-04-18
See Project
24

DocArray

The data structure for multimodal data

DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. ...

Downloads: 0 This Week

Last Update: 2025-03-21
See Project
25

comfyui-mixlab-nodes

Workflow and speech recognition app

comfyui-mixlab-nodes is a large collection of custom nodes for ComfyUI that turns workflows into interactive apps and adds real-time multimedia, LLM, and TTS capabilities. It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project