Alternatives to SAM 3D

Compare SAM 3D alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to SAM 3D in 2026. Compare features, ratings, user reviews, pricing, and more from SAM 3D competitors and alternatives in order to make an informed decision for your business.

  • 1
    ReconstructMe

    ReconstructMe

    ReconstructMe’s usage concept is similar to that of an ordinary video camera: simply move around the object to be captured. However, instead of a video stream you get a complete 3D model in real time. Scanning with ReconstructMe scales from smaller objects such as human faces up to entire rooms and runs on commodity computer hardware. ReconstructMe can capture and process the color information of the object being scanned, as long as the sensor provides the necessary color stream. Integrate ReconstructMe into your application using our powerful SDK.
    Starting Price: $279 one-time payment
  • 2
    Imverse LiveMaker
    Use LiveMaker™ to make photorealistic 3D scenes for virtual reality experiences, volumetric videos, movie previsualization, video games, immersive training, virtual showrooms, and much more! LiveMaker™ is the first software that enables you to build 3D models from inside of virtual reality. It’s easy to use, and requires no special programming skills. Using proprietary voxel technology, LiveMaker™ lets you import 360° photos and reconstruct their geometry, retexture occlusions, create new objects, and relight the entire scene. It also allows you to import and integrate external media and assets, static or dynamic, low or high quality, so you can design your virtual scene without limitations. You can use LiveMaker™ to create complete environments or for quick visual prototyping, and the 3D models created with LiveMaker™ can be easily exported and used in other tools depending on your needs and workflow.
  • 3
    alwaysAI

    alwaysAI

    alwaysAI provides developers with a simple and flexible way to build, train, and deploy computer vision applications to a wide variety of IoT devices. Select from a catalog of deep learning models or upload your own. Use our flexible and customizable APIs to quickly enable core computer vision services. Quickly prototype, test, and iterate with a variety of camera-enabled ARM-32, ARM-64, and x86 devices. Identify objects in an image by name or classification. Identify and count objects appearing in a real-time video feed. Follow the same object across a series of frames. Find faces or full bodies in a scene to count or track. Locate and define borders around separate objects. Separate key objects in an image from background visuals. Detect human body poses, falls, and emotions. Use our model training toolkit to train an object detection model to identify virtually any object. Create a model tailored to your specific use case.
  • 4
    Parallel Domain Replica Sim
    Parallel Domain Replica Sim enables the creation of high-fidelity, fully annotated, simulation-ready environments from users’ own captured data (photos, videos, scans). With PD Replica, you can generate near-pixel-perfect reconstructions of real-world scenes, transforming them into virtual environments that preserve visual detail and realism. PD Sim provides a Python API through which perception, machine learning, and autonomy teams can configure and run large-scale test scenarios and simulate sensor inputs (camera, lidar, radar, etc.) in either open- or closed-loop mode. These simulated sensor feeds come with full annotations, so developers can test their perception systems under a wide variety of conditions, lighting, weather, object configurations, and edge cases, without needing to collect real-world data for every scenario.
  • 5
    3D House Planner

    3D House Planner

    3D House Planner is a professional home design web application for designing houses and apartments. No installation is required; it runs in your browser. 3D House Planner is free for everyone. You can import or export 3D models for personal or commercial purposes. Browse our catalog and select from thousands of objects to furnish and decorate the interior and exterior of your home with furniture, decorative accessories, electric devices, and household appliances. We also have a texture library with a wide range of high-quality textures. Most textures contain albedo, normal, ambient occlusion, metalness, and roughness maps. Apart from home design, you can import your own 3D objects, change the appearance and position of objects, make videos, take snapshots, and more.
  • 6
    Mudbox

    Autodesk

    3D digital painting and sculpting software. Create beautiful characters and environments with Mudbox. Sculpt and paint highly detailed 3D geometry and textures. Mudbox® 3D digital sculpting and texture painting software gives you an intuitive, tactile toolset. Create highly detailed 3D characters and environments using an intuitive set of digital tools based on real sculpting techniques. Paint directly on your 3D assets across multiple channels. Add resolution to a mesh only in areas that need it with an artist-friendly, camera-based workflow. Create clean, production-quality meshes from scanned, imported, or sculpted data. Bake normal, displacement, and ambient occlusion maps. Get effective, brush-based workflows for polygons and textures. Bring assets from Maya into Mudbox to add detailed geometry. Send characters from Maya LT to Mudbox for sculpting and texturing. Then transfer your model back to Maya LT. Take your 3D assets and environments from first draft to final frame.
    Starting Price: $7 per month
  • 7
    NVIDIA Picasso
    NVIDIA Picasso is a cloud service for building generative AI–powered visual applications. Enterprises, software creators, and service providers can run inference on their models, train NVIDIA Edify foundation models on proprietary data, or start from pre-trained models to generate image, video, and 3D content from text prompts. Picasso service is fully optimized for GPUs and streamlines training, optimization, and inference on NVIDIA DGX Cloud. Organizations and developers can train NVIDIA’s Edify models on their proprietary data or get started with models pre-trained with our premier partners. Expert denoising network to generate photorealistic 4K images. Temporal layers and novel video denoiser generate high-fidelity videos with temporal consistency. A novel optimization framework for generating 3D objects and meshes with high-quality geometry. Cloud service for building and deploying generative AI-powered image, video, and 3D applications.
  • 8
    BodyPaint 3D
    Maxon's BodyPaint 3D is the ultimate tool for creating high-end textures and unique sculptures. Wave good-bye to UV seams, inaccurate texturing, and constant back-and-forth switching to your 2D image editor. Say hello to hassle-free texturing that lets you quickly paint highly detailed textures directly on your 3D objects. BodyPaint 3D also offers a comprehensive set of sculpting tools that let you turn a simple object into a detailed work of art. When you use BodyPaint 3D to paint complete materials onto your 3D models, you’ll immediately see how the texture fits the contour of the model, how the bump or displacement reacts to lighting, and how the transparency and reflection interact with the environment. There’s no need to waste time transitioning textures between environments; you’ll always see an accurate depiction of the texture so you can concentrate on making it look great.
    Starting Price: $22 per month
  • 9
    OptiTrack Motive
    Motive + OptiTrack cameras deliver the best-performing real-time human and object tracking available today. Vastly improved skeletal tracking precision. Robust, accurate bone tracking, even during heavy occlusion of markers. “Solver” in human motion tracking terms refers to the programmatic process of estimating the pose (6 DoF) of each bone, deduced by the actually measured markers, at each frame of measurement. A precision solver, like that developed for Motive 3.0, accurately defines the skeleton movement of the tracked subject(s) which yields higher confidence and more nuanced performance capture for character animation. A robust solver will also perform precision marker labeling and skeletal tracking even when many markers are hidden from cameras or lost, providing more reliable tracking data and vastly reduced editing time across all applications. Motive processes OptiTrack camera data to deliver global 3D positions, marker IDs, and rotational data.
    Starting Price: $999 one-time payment
  • 10
    Seed3D

    ByteDance

    Seed3D 1.0 is a foundation-model pipeline that takes a single input image and generates a simulation-ready 3D asset, including closed manifold geometry, UV-mapped textures, and physically-based rendering material maps, designed for immediate integration into physics engines and embodied-AI simulators. It uses a hybrid architecture combining a 3D variational autoencoder for latent geometry encoding, and a diffusion-transformer stack to generate detailed 3D shapes, followed by multi-view texture synthesis, PBR material estimation, and UV texture completion. The geometry branch produces watertight meshes with fine structural details (e.g., thin protrusions, holes, text), while the texture/material branch yields multi-view consistent albedo, metallic, and roughness maps at high resolution, enabling realistic appearance under varied lighting. Assets generated by Seed3D 1.0 require minimal cleanup or manual tuning.
  • 11
    SeedEdit

    ByteDance

    SeedEdit is an advanced AI image-editing model developed by the ByteDance Seed team that enables users to revise an existing image using natural-language text prompts while preserving unedited regions with high fidelity. It accepts an input image plus a text description of the change (such as style conversion, object removal or replacement, background swap, lighting shift, or text change), and produces a seamlessly edited result that maintains structural integrity, resolution, and identity of the original content. The model leverages a diffusion-based architecture trained via a meta-information embedding pipeline and joint loss (combining diffusion and reward losses) to balance image reconstruction and re-generation, resulting in strong editing controllability, detail retention, and prompt adherence. The latest version (SeedEdit 3.0) supports high-resolution edits (up to 4 K), delivers fast inference (under ~10-15 seconds in many cases), and handles multi-round sequential edits.
  • 12
    Symage

    Symage

    Symage is a synthetic data platform that generates custom, photorealistic image datasets with automated pixel-perfect labeling to support training and improving AI and computer vision models; using physics-based rendering and simulation rather than generative AI, it produces high-fidelity synthetic images that mirror real-world conditions and handle diverse scenarios, lighting, camera angles, object motion, and edge cases with controlled precision, which helps eliminate data bias, reduce manual labeling, and dramatically cut data preparation time by up to 90%. Designed to give teams the right data for model training rather than relying on limited real datasets, Symage lets users tailor environments and variables to match specific use cases, ensuring datasets are balanced, scalable, and accurately labeled at every pixel. It is built on decades of expertise in robotics, AI, machine learning, and simulation, offering a way to overcome data scarcity and boost model accuracy.
  • 13
    SeedEdit 3.0

    ByteDance

    SeedEdit is a generative AI image editing model from ByteDance’s Seed team that enables text-guided, high-quality image modification by applying natural language instructions to change specific parts of an image while maintaining consistency in the rest of the scene. Built on advanced diffusion and multimodal learning techniques, later versions like SeedEdit 3.0 improve on earlier releases with enhanced fidelity, accurate instruction following, and the ability to edit at high resolution (including up to 4K outputs) while preserving original subjects, backgrounds, and fine visual details. It supports common edit tasks such as portrait retouching, background replacement, object removal, lighting and perspective changes, and stylistic transformations without manual masking or tools, and achieves higher usability and visual quality than previous models by balancing between reconstruction and regeneration of images.
  • 14
    Qwen-Image

    Alibaba

    Qwen-Image is a multimodal diffusion transformer (MMDiT) foundation model offering state-of-the-art image generation, text rendering, editing, and understanding. It excels at complex text integration, seamlessly embedding alphabetic and logographic scripts into visuals with typographic fidelity, and supports diverse artistic styles from photorealism to impressionism, anime, and minimalist design. Beyond creation, it enables advanced image editing operations such as style transfer, object insertion or removal, detail enhancement, in-image text editing, and human pose manipulation through intuitive prompts. Its built-in vision understanding tasks, including object detection, semantic segmentation, depth and edge estimation, novel view synthesis, and super-resolution, extend its capabilities into intelligent visual comprehension. Qwen-Image is accessible via popular libraries like Hugging Face Diffusers and integrates prompt-enhancement tools for multilingual support.
    Starting Price: Free
  • 15
    OmniHuman-1

    ByteDance

    OmniHuman-1 is a cutting-edge AI framework developed by ByteDance that generates realistic human videos from a single image and motion signals, such as audio or video. The platform utilizes multimodal motion conditioning to create lifelike avatars with accurate gestures, lip-syncing, and expressions that align with speech or music. OmniHuman-1 can work with a range of inputs, including portraits, half-body, and full-body images, and is capable of producing high-quality video content even from weak signals like audio-only input. The model's versatility extends beyond human figures, enabling the animation of cartoons, animals, and even objects, making it suitable for various creative applications like virtual influencers, education, and entertainment. OmniHuman-1 offers a revolutionary way to bring static images to life, with realistic results across different video formats and aspect ratios.
  • 16
    ActiveCube

    Virtalis

    Professional interactive 3D visualization system, designed to transform how organizations build and engage. Wrap your teams in a human-scale virtual space where they effortlessly and naturally interact with the scenario and with each other. Thanks to the high-resolution 3D images that surround the user, the ActiveCube achieves a high level of immersion without the isolation of HMDs. Being able to see the real world helps to reduce nausea often experienced with HMDs. Get stronger insight and appreciation of your data reinforced by natural, real-time tracking and human-scale interaction with virtual and real-world objects. See other users, read body language and use other devices as you would normally for a more comfortable working environment. ActiveCubes can be configured to 2 or more walls with images that surround the user. Virtalis has the expertise necessary to design and deliver such complex systems seamlessly, as attested by satisfied Fortune 500 customers.
  • 17
    Movmi

    Movmi

    Movmi gives human body motion developers a high-quality tool for capturing humanoid motion from 2D media (images and video). Use footage from any camera, from smartphones to professional cameras, shot in any everyday scene. The Movmi Store offers a collection of fully textured characters for any purpose: cartoon, fantasy, and CG projects. It also offers a library of full-body character animations covering many poses and actions, which you can apply to any Movmi character. The 3D characters in the Movmi Store are free of charge, so motion developers are free to use them in their own development.
    Starting Price: Free
  • 18
    VGSTUDIO

    Volume Graphics

    VGSTUDIO is the ideal choice for visual quality inspection in industrial applications, e.g., in the electronics industry, but also for the visualization of data in fields of academic research such as archaeology, geology, and life sciences. VGSTUDIO covers the entire workflow, from the precise reconstruction of three-dimensional volume data sets using the images taken by your CT scanner to visualization (in 3D and 2D) and the creation of impressive animations. 3D visualization of even very large CT data sets, with almost no limit on data volume. Real-time ray tracing for a photo-realistic look. Combined visualization of voxel and mesh data, including textured meshes. Arbitrary orientation of 2D slices, 2D slice rotation view around a customizable axis. Gray-value classification of a data set, and a wide variety of 3D clipping options. Unrolling of objects or leveling of freeform surfaces in a 2D view. Combination of consecutive slices into a single 2D view.
  • 19
    Frost 3D Universal
    Frost 3D software allows you to develop scientific models of permafrost thermal regimes under the thermal influence of pipelines, production wells, hydraulic constructions, etc., taking into account the thermal stabilization of the ground. The software package is based on ten years of experience in programming, computational geometry, numerical methods, 3D visualization, and parallelization of computational algorithms. Features include: creation of a 3D computational domain with surface topography and soil lithology; 3D reconstruction of pipelines, boreholes, basements, and building foundations; import of 3D objects in Wavefront (OBJ), Stereolithography (STL), 3D Studio Max (3DS), and Frost 3D Objects (F3O) formats; a library of thermophysical properties of the ground, building elements, climatic factors, and cooling unit parameters; and specification of the thermal and hydrological properties of 3D objects and the heat transfer parameters on object surfaces.
  • 20
    FindFace

    NtechLab

    The NtechLab platform processes video and recognizes human faces, bodies, and actions, as well as cars and license plate numbers. AI-powered technology enables record-breaking accuracy and high recognition speed. The multi-object and analytical capabilities of FindFace Multi unlock new scenarios for responding to the challenges of the public sector and business. FindFace Multi quickly and accurately recognizes faces, human bodies, cars, and license plate numbers in a live video stream or in a video archive. Faces, bodies, and vehicles can be searched for in a database or an archive either by a photo sample or by specific features, for example, age, clothing color, or vehicle model. NtechLab developers are constantly improving the recognition algorithms, increasing their performance and accuracy. With FindFace Multi it takes less than a second to detect a face in a video stream, recognize it, and search for a match in a database with billions of images.
  • 21
    Photo Eraser

    Toscanapps

    Powered by advanced AI technology, Photo Eraser is a photo eraser that not only removes unwanted objects from your pictures but also seamlessly reconstructs the background to give you that perfect shot you've always desired. No more distractions in your photos. With Photo Eraser's cutting-edge erase elements function, you can effortlessly eliminate any unwanted object, person, or background clutter from your images. The app's AI capabilities ensure that the area previously occupied by the removed item is filled in with an accurate and natural-looking background, making the edit invisible. The Photo Eraser feature comes equipped with a range of intuitive tools designed to speed up the editing process, ensuring you achieve professional-quality results in just a few taps. The AI detection feature automatically identifies objects and people that you may want to remove. This intelligent detection capability saves you time and effort.
    Starting Price: Free
  • 22
    Mocha Pro

    Boris FX

    Mocha Pro is the world-renowned software for planar tracking, rotoscoping, and object removal. Essential to visual effects and post-production workflows, Mocha has been recognized with prestigious Academy and Emmy Awards for its contribution to the film and television industry. Mocha Pro has recently been used on global hits including The Mandalorian, Stranger Things, Avengers: Endgame, and many more. The next evolution of Mocha. PowerMesh enables a powerful new sub-planar tracking engine for VFX, roto, and stabilization. Warped surface tracking and roto that sticks. Track complex organic surfaces through occlusions and blur using Mocha’s intuitive layer-based interface. Simple to use and faster than most optical-flow-based techniques. Apply to source files for realistic match moves, convert to AE Nulls to drive motion graphics, render a mesh-warped stabilize/reverse-stabilize plate for compositing, or export dense tracking data to host applications.
    Starting Price: $27.75 per month
  • 23
    Shap-E

    OpenAI

    This is the official code and model release for Shap-E. Generate 3D objects conditioned on text or images. Sample a 3D model conditioned on a text prompt or on a synthetic view image. To get the best result, you should remove the background from the input image. Load a 3D model or a trimesh, create a batch of multiview renders and a point cloud, encode them into a latent, and render it back. For this to work, install Blender version 3.3.1 or higher.
    Starting Price: Free
  • 24
    openMVG

    openMVG

    Extend awareness of the power of 3D reconstruction from images and photogrammetry by developing a C++ framework. Simplify reproducible research with easy-to-read and accurate implementations of state-of-the-art and "classic" algorithms. OpenMVG is designed to be easy to read, learn, modify, and use. Thanks to its strict test-driven development and samples, the library lets you build larger systems you can trust. OpenMVG provides an end-to-end framework for 3D reconstruction from images, composed of libraries, binaries, and pipelines. The libraries provide easy access to features such as image manipulation, feature description and matching, feature tracking, camera models, multiple-view geometry, robust estimation, and structure-from-motion algorithms. The binaries solve unit tasks that a pipeline could require: scene initialization, feature detection and matching, and structure-from-motion reconstruction.
  • 25
    SURE Aerial
    nFrames SURE software delivers an efficient solution for dense image surface reconstruction for mapping, surveying, geo-information, and research organizations. The SURE software derives precise point clouds, DSMs, True Orthophotos, and textured Meshes from small, medium, and large frame images. This advanced solution is designed for applications including countrywide mapping, monitoring projects that use manned aircraft and UAVs, cadaster, infrastructure planning, and 3D modeling. SURE Aerial is specifically designed for aerial image datasets captured with large frame nadir cameras, oblique cameras, and hybrid systems with additional LiDAR sensors. Without limitation in image resolution, it empowers the production of 3D Meshes, True Orthophotos, Point Clouds, and Digital Surface Models on common workstation hardware and in cluster environments. Simple to set up and operate, SURE Aerial is compliant with mapping industry standards and accessible for web streaming technologies.
  • 26
    Ilus AI

    Ilus AI

    The quickest way to get started with our illustration generator is to use pre-made models. If you want to depict a style or an object that is not available in the pre-made models, you can train your own fine-tune by uploading 5-15 illustrations. There are no limits to fine-tuning: you can use it for illustrations, icons, or any assets you need. Illustrations are exportable in PNG and SVG formats. Fine-tuning allows you to train the Stable Diffusion AI model on a particular object or style and create a new model that generates images of those objects or styles. The fine-tuning will be only as good as the data you provide. Images can be of any unique object or style. Images should contain only the subject itself, without background noise or other objects. Images must not include any gradients or shadows if you want to export the result as SVG later. PNG export still works fine with gradients and shadows.
    Starting Price: $0.06 per credit
  • 27
    Astria

    Astria

    Tailor-made AI image generation: start creating your unique images. Align your crew with the most detailed, custom-made visual references. Previs to the max. Find the most attractive visualization for your product. Instant realization of your vision, with limitless variations. Realize your super-specific concepts with augmented creativity. Experiment, modify, and fine-tune. Upload 10-20 pictures of your subject, preferably shot or cropped to a 1:1 aspect ratio. We recommend uploading 3 photos of the full body or entire object, 5 medium-shot photos from the chest up, and 10 close-ups. Change body poses for every picture, use pictures from different days, backgrounds, and lighting, and show a variety of expressions and emotions. Make sure you capture the subject's eyes looking in different directions in different images, and take one with closed eyes. Every picture of your subject should introduce new information about your subject.
    Starting Price: $0.10 per prompt
  • 28
    Imagen 3

    Google

    Imagen 3 is the next evolution of Google's cutting-edge text-to-image AI generation technology. Building on the strengths of its predecessors, Imagen 3 offers significant advancements in image fidelity, resolution, and semantic alignment with user prompts. By employing enhanced diffusion models and more sophisticated natural language understanding, it can produce hyper-realistic, high-resolution images with intricate textures, vivid colors, and precise object interactions. Imagen 3 also introduces better handling of complex prompts, including abstract concepts and multi-object scenes, while reducing artifacts and improving coherence. With its powerful capabilities, Imagen 3 is poised to revolutionize creative industries, from advertising and design to gaming and entertainment, by providing artists, developers, and creators with an intuitive tool for visual storytelling and ideation.
  • 29
    Bifrost

    Bifrost AI

    Quickly and easily generate diverse and realistic synthetic data and high-fidelity 3D worlds to enhance model performance. Bifrost's platform is the fastest way to generate the high-quality synthetic images that you need to improve ML performance and overcome real-world data limitations. Prototype and test up to 30x faster by circumventing costly and time-consuming real-world data collection and annotation. Generate data to account for rare scenarios underrepresented in real data, resulting in more balanced datasets. Manual annotation and labeling is an error-prone, resource-intensive process; instead, easily and quickly generate data that is pre-labeled and pixel-perfect. Real-world data can inherit the biases of the conditions under which it was collected; generate data to correct for these cases.
  • 30
    Imagen3D

    Imagen3D

    Imagen3D is an AI-powered online tool that instantly converts photos into high-quality 3D models with industry-standard topology, watertight geometry, and realistic PBR texture maps, eliminating the need for manual modeling cleanup and delivering production-ready assets for rendering, animation, 3D printing, AR or VR, and game workflows in minutes. It uses advanced image-to-3D technology to preserve fine surface details from your source images and offers flexible quality options (Fast, Pro, Ultra) so you can balance speed versus detail, generating models often in under three minutes. It supports uploading single images or multiple views for enhanced reconstruction accuracy and outputs to universal formats such as GLB, OBJ, STL, GLTF, USDZ, and MP4 for seamless use in Blender, Unity, Unreal, Maya, web viewers, and more.
    Starting Price: $10 per month
  • 31
    DEEPMOTION

    DEEPMOTION

    Say hello to a revolutionary solution for capturing and reconstructing full body motion. Animate 3D lets you turn videos into 3D animations for use in games, augmented/virtual reality, and other applications. Simply upload a video clip, select output formats and job settings, and RUN! It's that simple. Animate 3D lets you create animations from video clips in seconds, drastically reducing development time and costs. And with pioneering features such as Physics Simulation, Foot Locking, Slow Motion handling and now full body motion combined with Face Tracking you have more control and flexibility to create high-fidelity 3D animations. Upload custom FBX or GLB characters, or create new models directly through Animate 3D, and our AI will automatically retarget animations onto your custom characters. Plus with an interactive animation previewer you can verify your 3D animation results immediately before downloading and copying into your solution.
    Starting Price: $12 per month
  • 32
    HunyuanWorld
    HunyuanWorld-1.0 is an open source AI framework and generative model developed by Tencent Hunyuan that creates immersive, explorable, and interactive 3D worlds from text prompts or image inputs by combining the strengths of 2D and 3D generation techniques into a unified pipeline. At its core, the project features a semantically layered 3D mesh representation that uses 360° panoramic world proxies to decompose and reconstruct scenes with geometric consistency and semantic awareness, enabling the creation of diverse, coherent environments that can be navigated and interacted with. Unlike traditional 3D generation methods that struggle with either limited diversity or inefficient data representations, HunyuanWorld-1.0 integrates panoramic proxy generation, hierarchical 3D reconstruction, and semantic layering to balance high visual quality and structural integrity while enabling exportable meshes compatible with common graphics workflows.
    Starting Price: Free
  • 33
    RecFusion

    RecFusion

    With RecFusion you can create 3D models of people, pets, furniture and many other objects, even your motorcycle! All you need is a depth-sensor like the Microsoft Kinect or the Asus Xtion. Just move the sensor around the object and you can see the model building up on your screen in real-time and in color. Use the built-in post-processing functions to prepare your models for 3D printing and publish your models on the web to show them to your friends. Download RecFusion now and start creating your own models today! For customers from any domain, ImFusion GmbH is offering custom solutions. Use the 3D-reconstruction as a third-party component in your software. Supports custom measurement and scanning applications. Registration of 3D data is available. Branded versions of the application are supported as well. RecFusion provides you with custom image processing and computer vision solutions.
    Starting Price: €145 one-time payment
  • 34
    AI Edit

    AI Edit

    AI Edit is a complete creative AI platform for images, video, audio, and design that brings together the best models and tools in one unified interface. It provides everything you need for visual and audio content creation in a single workspace:
    - Extensive model library with 100+ of the latest and most powerful AI models.
    - Image generation and editing: editing with natural-language prompts, reference images, and angle modifications; background change and removal; upscaling; cropping; expansion to various aspect ratios; photo restoration; 360° panorama creation; remixing, which creates 4-9 variations of an uploaded image in one generation and upscales one of them; a pose editor for changing human poses through an intuitive 3D model interface; inpainting and object removal tools for enhancing specific image areas; a YouTube thumbnail generator; vector generation; and virtual try-on and try-off.
    - Video generation and continuation.
    - Audio and music creation.
    - Chat mode.
  • 35
    ZenCtrl

    Fotographer AI

    ZenCtrl is an open source AI image generation toolkit developed by Fotographer AI, designed to produce high-quality, multi-view, and diverse-scene outputs from a single image without any training. It enables precise regeneration of objects and subjects from any angle and background, offering real-time element regeneration that provides both stability and flexibility in creative workflows. ZenCtrl allows users to regenerate subjects from any angle, swap backgrounds or clothing with just a click, and start generating results immediately without the need for additional training. By leveraging advanced image processing techniques, it ensures high accuracy without the need for extensive training data. The model's architecture is composed of lightweight sub-models, each fine-tuned on task-specific data to excel at a single job, resulting in a lean system that delivers sharper, more controllable results.
    Starting Price: Free
  • 36
    FLUX.2 [max]

    Black Forest Labs

    FLUX.2 [max] is the flagship image-generation and editing model in the FLUX.2 family from Black Forest Labs that delivers top-tier photorealistic output with professional-grade quality and unmatched consistency across styles, objects, characters, and scenes. It supports grounded generation that can incorporate real-time contextual information, enabling visuals that reflect current trends, environments, and detailed prompt intent while maintaining coherence and structure. It excels at producing marketplace-ready product photos, cinematic visuals, logo and brand assets, and high-fidelity creative imagery with precise control over colors, lighting, composition, and textures, and it preserves identity even through complex edits and multi-reference inputs. FLUX.2 [max] handles detailed features such as character proportions, facial expressions, typography, and spatial reasoning with high stability, making it suitable for iterative creative workflows.
  • 37
    SCANN3D

    SmartMobileVision

    Scann3D deploys patent-pending photogrammetry technology to enable true 3D model capture and reconstruction on smartphones and tablets. Your device becomes a standalone tool for turning images into 3D models: all your images are processed on the device itself. The resulting 3D models can be stored, shared, and edited in third-party applications, and can be used in augmented or virtual reality applications. Privacy guaranteed: Scann3D doesn't upload any of your images or models anywhere without prior consent. Capture now, reconstruct later: you can prepare many Image Sets for reconstruction in advance and process them later. Built-in model viewer: review your models on your phone. After uploading to Sketchfab, share easily on Facebook. Rich roadmap: many more features are under development, and even more are planned.
  • 38
    Qlone

    EyeCue Vision Technologies

    Scanning is fast and done in real time on your device, with no waiting time, even in 4K. Qlone includes an AR View with ARKit/ARCore, so you can beam your 3D models back into the real world. Scanning is easy: just place your object in the middle of the mat and the AR dome will guide you through the scanning process. Re-texture from your selected pose. You can even merge two different poses of the same object for a better overall result. Use the set of modifiers to clean and modify your 3D model: Texture, Art, Sculpt, Clean, and Resize. Export in a variety of formats for use in other 3D tools and projects: OBJ, STL, USDZ, GLB, PLY, and X3D. Share your models with friends through Facebook, Twitter, Instagram, WhatsApp, Line, Email, iCloud, and iMessage. Export directly to i.materialise, an online 3D printing service. Use the top flattening option to easily scan objects with a flat top. A new one-time purchase option provides unlimited exports.
  • 39
    3DFY.ai

    3DFY.ai

    3DFY.ai develops a complete framework that uses artificial intelligence to solve the 2D-to-3D problem across various domains, creating high-quality 3D models from just a few existing images. This is a challenging task because a small number of images leaves information missing and parts of the object self-occluded. 3DFY.ai addresses this by training its AI models on category-specific datasets, enabling them to complete the missing information in a plausible manner. This technology allows the company to offer a white-glove service that transforms 2D images into high-quality 3D models at an unprecedented speed and quality. 3DFY.ai Image drastically reduces the time and cost required for retailers to convert their product catalogs from 2D to 3D when used as part of their studio production pipeline.
  • 40
    Little Language Lessons
    Little Language Lessons (LLL) is an experimental AI language-learning experience from Google Labs designed to make everyday language practice more personal and contextual. Built with Google’s Gemini models, the project consists of bite-sized interactive tools that help users learn vocabulary, phrases, and real-world expressions in practical situations rather than through traditional textbook exercises. It includes Tiny Lesson, which delivers useful words, phrases, and grammar for specific scenarios; Slang Hang, which generates authentic conversations to teach idioms and regional slang; and Word Cam, which uses the camera to instantly identify objects and provide relevant vocabulary. The goal of LLL is to complement conventional study methods by helping learners build habits and integrate language learning into daily life moments, such as ordering food or describing their surroundings.
  • 41
    Meshroom

    AliceVision

    Meshroom is 3D reconstruction software based on the open source photogrammetric computer vision framework AliceVision. Its workflow creates a textured mesh from still images. Recent improvements include more robust SIFT feature extraction on challenging images (updated default values, new filtering, and a DSP-SIFT variation) and better mesh quality through a new post-processing step, in which the empty/full status of cells is filtered by solid angle ratio to favor smoothness. On some distributions (e.g., Ubuntu), you may have conflicts between native drivers and Mesa drivers, resulting in an empty black window. AliceVision does not collect or transmit any personal (or system-related) information without the user's consent. The AliceVision website uses Google Analytics, which collects session data about every page view. The data is used only to better understand public behavior towards the website and the AliceVision project.
  • 42
    CoppeliaSim

    Coppelia Robotics

    CoppeliaSim, developed by Coppelia Robotics, is a versatile and powerful robot simulation platform utilized for rapid algorithm development, factory automation simulations, fast prototyping and verification, robotics education, remote monitoring, safety double-checking, and digital twin creation. It features a distributed control architecture, allowing each object or model to be individually controlled via embedded scripts (Python or Lua), plugins (C/C++), remote API clients (Python, Lua, Java, MATLAB, Octave, C, C++, Rust), or custom solutions. The simulator supports five physics engines, MuJoCo, Bullet Physics, ODE, Newton, and Vortex Dynamics, for fast and customizable dynamics calculations, enabling realistic simulation of real-world physics and object interactions, including collision response, grasping, soft bodies, strings, ropes, and cloths. CoppeliaSim provides forward and inverse kinematics calculations for any type of mechanism.
    Starting Price: $2,380 per year
  • 43
    Roora

    Roora

    Roora provides high-quality data annotation services for machine learning, specializing in image, video, and text annotation across various industries such as healthcare, autonomous vehicles, and retail. With expertise in techniques like bounding boxes, semantic segmentation, and object detection, Roora helps businesses enhance AI models for better performance. The platform’s skilled team ensures that data labeling is accurate, scalable, and secure, improving AI systems' ability to recognize and classify visual elements in real-world applications like facial recognition, medical imaging, and autonomous navigation.
  • 44
    Express Animate
    Express Animate helps you create stunning animations using objects, images, illustrations, and videos. Choose from a wide array of effects and animation tools to add your creative flair to your project. Get creative by applying transformations and effects to image objects. Quickly convert a color object to black and white or sepia. Enhance an object by adjusting the color temperature or saturation. Use keyframes with the object properties to motion tween, zoom, rotate, and more. Add life to your characters and animated cartoons. Animate separate body parts or group multiple objects together to optimize the animation process. Use the timeline to move your character and create animations. Use multiple layers and keyframes to add special effects, audio, and more. Express Animate has advanced tools for experienced animators and graphic designers, including vector masks, onion skins, blending modes, and a graph editor for precise animation.
    Starting Price: $24.99 one-time payment
  • 45
    Wild Capture Digital Human Platform
    Stream Wild Capture's volumetric fashion demo straight to your mobile or desktop with a sharable URL. Pinch, scroll, and zoom your way through the early fashion demo. This early test shows how Wild Capture can take a pre-existing Alembic mesh input and a 2D pattern from Marvelous Designer or CLO3D and automatically pair it to the body for an automated result, combining volumetric capture with CG cloth automation. With a little touch-up, the result can be cleaned up for final delivery. In the new era of digital humans, it has become clear that successful pipelines are key to the mass adoption of volumetric video and to creating digital humans in spatial media. Wild Capture's Digital Human Platform is a suite of toolsets and technologies that work together to create high-quality digital human assets that conform to commonly used 3D platforms. This allows creators to access optimum volumetric assets quickly with minimal loss of quality or function.
  • 46
    Veo 3.1

    Google

    Veo 3.1 builds on the capabilities of the previous model to enable longer and more versatile AI-generated videos. With this version, users can create multi-shot clips guided by multiple prompts, generate sequences from three reference images, and use frames in video workflows that transition between a start and end image, both with native, synchronized audio. The scene extension feature extends the final second of a clip by up to a full minute of newly generated visuals and sound. Veo 3.1 supports editing of lighting and shadow parameters to improve realism and scene consistency, and offers advanced object removal that reconstructs backgrounds to remove unwanted items from generated footage. These enhancements make Veo 3.1 sharper in prompt adherence, more cinematic in presentation, and broader in scale compared to shorter-clip models. Developers can access Veo 3.1 via the Gemini API or through the Flow tool, which targets professional video workflows.
  • 47
    OpenCV

    OpenCV

    OpenCV (Open Source Computer Vision Library) is an open-source computer vision and machine learning software library. OpenCV was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in commercial products. Being a BSD-licensed product, OpenCV makes it easy for businesses to utilize and modify the code. The library has more than 2500 optimized algorithms, including a comprehensive set of both classic and state-of-the-art computer vision and machine learning algorithms. These algorithms can be used to detect and recognize faces, identify objects, classify human actions in videos, track camera movements, track moving objects, extract 3D models of objects, produce 3D point clouds from stereo cameras, stitch images together to produce a high-resolution image of an entire scene, find similar images in an image database, remove red eyes from images taken using flash, follow eye movements, recognize scenery, and more.
    Starting Price: Free
  • 48
    ReCap Pro

    Autodesk

    Reality capture software connecting the physical world to the digital. Use ReCap™ Pro 3D scanning software to create 3D models from imported photographs and laser scans. Deliver a point cloud or mesh in support of BIM processes. Collaborate across teams with design based on reality. ReCap Photo, a service included with ReCap Pro, processes drone photography to create 3D representations of current site conditions, objects, and more. It also supports the creation of point clouds, meshes, and ortho photos. Use solutions created with the ReCap Pro Software Development Kit (SDK) to quickly get reality data into Autodesk design and construction tools. Compare the scan view (RealView) and overhead map view side-by-side. Use the compass widget to set the XY axis for the user coordinate system in the overhead view. Use high-precision GPS technology to avoid costly prep work in setting ground control points and get survey-grade accuracy from photo reconstruction.
    Starting Price: $26 per month
  • 49
    Cheetah3D

    Cheetah3D

    Cheetah3D is a powerful and easy-to-learn 3D modeling, rendering, and animation package developed from the ground up for Mac. Jump right into the world of computer-generated imaging, create 3D artwork for your next iPhone game, or make your first animated character. With a full set of polygon, subdivision surface, and spline modeling tools, artists can focus on creating, safe in the knowledge that Cheetah3D has the breadth of features for the task. Cheetah3D makes modeling in 3D a breeze for new and experienced users alike. Character rigging is part of Cheetah3D's seamless animation system, in which just about every property of an object can be animated. Breathe life into a character for your next iPhone game or animate an architectural fly-through with the powerful animation system built into Cheetah3D. Cheetah3D smoothly integrates the industrial-strength Bullet physics engine to simulate rigid body and soft body dynamics.
    Starting Price: $99
  • 50
    Krut AI

    Krut AI

    Krut AI is an AI platform that integrates your products into high-quality custom brand images, with no need to be an expert prompter. Create visually appealing product photoshoots in seconds. Optimize your ads with human model images, saving cost and time. Remove backgrounds in seconds with automatic image recognition. Scale any image to 4K clarity with a click. Eliminate unwanted objects and fix imperfections instantly and accurately. Swap unwanted objects effortlessly with pixel-perfect clarity. Extend existing images to your desired specifications. Try on glasses, outfits, and more online in real time before buying. Just input a prompt or take a recommendation and get your image in seconds. Krut AI is a free AI image generator tool that offers a huge collection of AI tools to generate and alter your pictures.
    Starting Price: $18 per month