Showing 330 open source projects for "model-builder"

  • 1
    Easy Upscale

    A simple image upscaler application using EDSR, ESPCN, FSRCNN, etc.

    ...The main theme is queues: we implement circular queues for pooling and storing the list of images to be upscaled. The GUI is built by hand with the tkinter library. The upscaling itself uses the OpenCV library with an open source model. Checked using vermin: the minimum required Python version is 3.6, and Python 2 is incompatible.
    Downloads: 2 This Week
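
The circular queue mentioned in the description can be sketched as follows. This is a minimal illustration only, not Easy Upscale's actual code: the class name `CircularQueue`, its methods, and the file names are hypothetical.

```python
class CircularQueue:
    """Fixed-capacity FIFO queue backed by a preallocated list (hypothetical sketch)."""

    def __init__(self, capacity):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self._slots = [None] * capacity
        self._head = 0  # index of the oldest stored item
        self._size = 0  # number of items currently stored

    def __len__(self):
        return self._size

    def is_full(self):
        return self._size == len(self._slots)

    def enqueue(self, item):
        if self.is_full():
            raise OverflowError("queue is full")
        # The tail wraps around to the start of the list when it passes the end.
        tail = (self._head + self._size) % len(self._slots)
        self._slots[tail] = item
        self._size += 1

    def dequeue(self):
        if self._size == 0:
            raise IndexError("queue is empty")
        item = self._slots[self._head]
        self._slots[self._head] = None
        self._head = (self._head + 1) % len(self._slots)
        self._size -= 1
        return item


# Assumed usage: queue image paths that are waiting to be upscaled.
pending = CircularQueue(3)
for path in ("a.png", "b.png", "c.png"):
    pending.enqueue(path)
first = pending.dequeue()   # the oldest entry leaves first
pending.enqueue("d.png")    # reuses the slot freed by the dequeue
```

Because the head and tail indices wrap around with the modulo operator, the queue never shifts or reallocates its storage.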
  • 2
    UE Viewer

    Viewer and exporter for Unreal Engine 1-4 assets

    Unreal Engine resource viewer (formerly Unreal Model Viewer) is a program for viewing and extracting resources from various games made with Unreal Engine. The program is sometimes referred to as "umodel", short for "unreal" and "model viewer". The project was originally named "Unreal model viewer", but the name was changed in 2011 at the request of Epic Games. Please note that the official project name is "UE Viewer", and the short unofficial name is "umodel" (left over from the older name "Unreal MODEL viewer"). ...
    Downloads: 93 This Week
  • 3
    Robust Video Matting (RVM)

    Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX

    We introduce a robust, real-time, high-resolution human video matting method that achieves new state-of-the-art performance. Our method is much lighter than previous approaches and can process 4K at 76 FPS and HD at 104 FPS on an Nvidia GTX 1080Ti GPU. Unlike most existing methods that perform video matting frame-by-frame as independent images, our method uses a recurrent architecture to exploit temporal information in videos and achieves significant improvements in temporal coherence and...
    Downloads: 5 This Week
  • 4
    Svgbob

    Convert your ASCII diagram scribbles into happy little SVG

    Svgbob creates a graphical representation of your text diagrams. It provides a CLI that takes text as input and produces an SVG image as output. Svgbob is a diagramming model that uses a set of typing characters to approximate the intended shape.
    Downloads: 0 This Week
  • 5
    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with ground truth labeling. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for...
    Downloads: 0 This Week
  • 6
    Quan is designed to model physical quantities in C++ programs. Advantages include automated dimensional analysis checking, automatic unit conversions, and self-documenting code.
    Downloads: 0 This Week
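
The idea of dimensional analysis checking with automatic unit conversion can be illustrated with a small sketch. It is written in Python and checks dimensions at run time; it does not reflect Quan's C++ API, and every name in it (`Quantity`, `metres`, `kilometres`, `seconds`) is hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Quantity:
    value: float  # magnitude expressed in SI base units
    dims: tuple   # exponents of (length, mass, time)

    def __add__(self, other):
        # Adding quantities only makes sense when the dimensions match.
        if self.dims != other.dims:
            raise TypeError(f"cannot add dimensions {self.dims} and {other.dims}")
        return Quantity(self.value + other.value, self.dims)

    def __mul__(self, other):
        dims = tuple(a + b for a, b in zip(self.dims, other.dims))
        return Quantity(self.value * other.value, dims)

    def __truediv__(self, other):
        dims = tuple(a - b for a, b in zip(self.dims, other.dims))
        return Quantity(self.value / other.value, dims)


# Hypothetical unit constructors; each converts to SI base units on creation.
def metres(x):
    return Quantity(x, (1, 0, 0))


def kilometres(x):
    return Quantity(x * 1000.0, (1, 0, 0))


def seconds(x):
    return Quantity(x, (0, 0, 1))


distance = kilometres(1.5) + metres(500.0)  # units are converted before adding
speed = distance / seconds(100.0)           # dimensions become length / time
```

Adding `metres(1.0)` to `seconds(1.0)` raises a `TypeError` in this sketch, which is the kind of mistake dimensional checking is meant to catch.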
  • 7
    TimeSformer

    The official PyTorch implementation of our paper

    TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch provides configurations, pretrained models, and training scripts that make it straightforward to evaluate or fine-tune on video datasets. ...
    Downloads: 0 This Week
  • 8
    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained with machine learning techniques based on Baidu's Deep Speech research paper, and it uses Google's TensorFlow to simplify the implementation. A pre-trained English model is available and can be downloaded, along with other important inference material, from the DeepSpeech releases page by following the instructions in the usage docs.
    Downloads: 3 This Week
  • 9
    AviSynth AiUpscale v1.2.0

    AviSynth+ implementation of Super-Resolution Convolutional Neural

    ...The AiUpscale models used for all datasets were the "Photo" models, except for the Manga109 dataset for which the "LineArt" models were used. In the same way, the Waifu2x cunet model was used for the Manga109 dataset, and the upconv_7 model for the rest.
    Downloads: 0 This Week
  • 10
    Paper.js

    The swiss army knife of vector graphics scripting

    Paper.js is an open source vector graphics scripting framework that runs on top of the HTML5 Canvas. It offers a clean Scene Graph / Document Object Model and a lot of powerful functionality to create and work with vector graphics and bezier curves, all neatly wrapped up in a well designed, consistent and clean programming interface. Paper.js is based on and largely compatible with Scriptographer, a scripting environment for Adobe Illustrator with an active community of scripters and more than 10 years of development. ...
    Downloads: 0 This Week
  • 11
    XZVoice

    Free and open source text-to-speech software

    ...Technically, it accounts for multi-level rhythmic pauses to achieve a natural synthesis rhythm, and it combines acoustic and linguistic parameters to build multiple automatic prediction models based on deep learning. The pronunciation model is trained on massive audio data, so the synthesized voice sounds real, full, cadenced, and expressive, and the MOS score has reached the professional level in the industry.
    Downloads: 0 This Week
  • 12
    TTS

    Deep learning for text to speech

    ...TTS comes with pre-trained models and tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. Released models in PyTorch, TensorFlow and TFLite. Tools to curate Text2Speech datasets under dataset_analysis. Demo server for model testing. Notebooks for extensive model benchmarking. Modular (but not too much) code base enabling easy testing for new ideas. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings efficiently. Vocoder models (MelGAN, Multiband-MelGAN, GAN-TTS, ParallelWaveGAN, WaveGrad, WaveRNN). ...
    Downloads: 2 This Week
  • 13
    Mayo 3D Cad Viewer

    Free CAD viewer for 3D drawings in different formats

    A very nice 3D CAD viewer program. This software depends on the Opencascade library. To install opencascade V7.5.0 Beta on Linux: sudo dpkg -i opencascade.deb. To install mayo: sudo dpkg -i mayo-cad-viewer.deb
    Downloads: 2 This Week
  • 14
    Render32

    Command-line video compositing and audio mixing tools

    Render is a program for creating composite BMP image sequences. These images are composited as specified in a text configuration file. Mixer is a program for mixing film soundtracks. It accepts input files in WAV format and outputs a mixed soundtrack in WAV format. Each input channel can contain one or more audio files that are edited and mixed using a cue sheet. The maximum number of channels is a compile-time parameter.
    Downloads: 0 This Week
  • 15
    The COLLADA Document Object Model (DOM) is an application programming interface (API) that provides a C++ object representation of a COLLADA XML instance document. Contact a project admin to become a developer!
    Downloads: 17 This Week
  • 16
    YAPS - Yet Another Photo Sorter

    Photo sorting tool to auto-generate folders based on EXIF metadata.

    A flexible photo sorting tool that uses the EXIF information of your photos and videos to copy or move them into a structured folder hierarchy. It sorts your photos into a new folder structure generated automatically from the criteria you choose, for example: year/month/date_and_time.jpg. Happy sorting! Sources are available here: https://github.com/knopkem/yaps
    Downloads: 0 This Week
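
As a rough illustration of the year/month/date_and_time.jpg layout, the sketch below builds a destination path from a capture timestamp. It is not YAPS code: reading the EXIF data is left out, and the function name `destination_for` and the exact file name format are assumptions.

```python
from datetime import datetime
from pathlib import PurePosixPath


def destination_for(taken_at, extension=".jpg"):
    """Return a relative path of the form year/month/date_and_time.ext."""
    year = f"{taken_at.year:04d}"
    month = f"{taken_at.month:02d}"
    # Assumed file name format; the real tool lets you choose the criteria.
    filename = taken_at.strftime("%Y-%m-%d_%H-%M-%S") + extension
    return PurePosixPath(year) / month / filename


# Assumed input: a timestamp already extracted from the photo's EXIF data.
example = destination_for(datetime(2021, 3, 14, 15, 9, 26))
```

Here `str(example)` is `2021/03/2021-03-14_15-09-26.jpg`.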
  • 17
    VR Neuroanatomy

    Explore and dissect a complete brain model in VR

    Understanding the anatomy of complex organs such as the human brain can be a challenge for students, so there is a definite advantage given to students who can experience the structures in a spatial way through real or virtual experiences. Since 2016, the Marquette Visualization Lab has been utilizing a dataset of 3 and 7 Tesla MRI scans from the Structural Informatics Group at University of Washington to visualize the structures of the brain in their Cave, utilizing the Unity game engine...
    Downloads: 0 This Week
  • 18
    Relativty

    An open source VR headset with SteamVR support for $200

    ...The room-scaling AI can be used with any camera; it tracks your body based on video input. Precision and freedom of movement are still very far from those of dedicated sensors, but we believe that the model can be trained and improved by orders of magnitude. We designed an affordable motherboard that runs the Relativty Firmware. It is based on the Atmel SAM3X8E ARM Cortex-M3 processor and uses an MPU-6050 as its IMU. The bare PCB and the components to solder onto it cost about 25 dollars in total.
    Downloads: 0 This Week
  • 19
    Carnatic Music Guru / JRaaga

    Carnatic Music Guru - JRaaga

    Visit this page regularly, as more features are being added. If you have downloaded version 2.03 or above, use Help>Check Updates to download the latest version. If that does not update it, download CMGUpdater from here: update/CMGUpdater.jar, copy it to the update/ folder of your JRaaga installation path, and try Help>Check for updates again. If that still does not work, delete the existing installation, then download the latest version and try again. Carnatic Music Guru is a tutor/player/lesson generator. YOU NEED Java Runtime...
    Downloads: 1 This Week
  • 20
    Software Renderer

    A shader-based software renderer written from scratch in C89

    This is a shader-based software renderer written from scratch in C89 with minimal dependencies, available for Windows, macOS, and Linux. Pre-built binaries for all three platforms are available for download from the Releases page. To build the software renderer from source, a C89 compiler and development files for your window system are required. On Windows, install Visual Studio with C++ support and run build_win32.bat. If the software renderer is launched without arguments, one of the available...
    Downloads: 0 This Week
  • 21
    Consistent Depth

    We estimate dense, flicker-free, geometrically consistent depth

    ...The system builds upon traditional structure-from-motion (SfM) techniques to provide geometric constraints while integrating a convolutional neural network trained for single-image depth estimation. During inference, the model fine-tunes itself to align with the geometric constraints of a specific input video, ensuring stable and realistic depth maps even in less-constrained regions. This approach achieves improved geometric consistency and visual stability compared to prior monocular reconstruction methods. The project can process challenging hand-held video footage, including those with moderate dynamic motion, making it practical for real-world usage.
    Downloads: 0 This Week
  • 22
    Talkie

    Talking 3D character for your stream

    Full source code and 3D model for a talking 3D character that you can tweak as needed. Useful for streaming and videos when you don't want to show your face. It uses your mic input to move the mouth. You can provide your own 3D model with a jaw bone. This is not intended for end users, and no executable is provided. You will need a modeled 3D head with an armature attached and a labeled jaw bone (unless you just want to use the default green sphere).
    Downloads: 0 This Week
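
One plausible way to drive a jaw bone from microphone input is to map the loudness of each audio chunk to an opening angle. The sketch below is an assumption about how such a mapping could work, not Talkie's implementation; the function names and the `max_angle_deg`, `noise_floor`, and `full_scale` parameters are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square level of a chunk of samples in the range [-1.0, 1.0]."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def jaw_angle(samples, max_angle_deg=25.0, noise_floor=0.02, full_scale=0.5):
    """Map the loudness of a chunk to a jaw rotation in degrees."""
    level = rms(samples)
    if level <= noise_floor:
        return 0.0  # keep the mouth closed on background noise
    # Scale linearly between the noise floor and the assumed full-scale level.
    openness = min((level - noise_floor) / (full_scale - noise_floor), 1.0)
    return openness * max_angle_deg


# Assumed input: normalized samples from one chunk of the audio stream.
silence = jaw_angle([0.0] * 256)
loud = jaw_angle([0.5, -0.5] * 128)
```

In a real application the returned angle would be applied to the labeled jaw bone once per rendered frame, possibly with smoothing to avoid jitter.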
  • 23
    Super-résolution via CNN

    Super resolution using a CNN, based on the work of the DGtal team

    ...First of all, an Nvidia graphics card (not AMD, and not Intel integrated graphics) is highly recommended to parallelize the CNN. You will then need to install CUDA; without CUDA, the program runs dozens of times slower. This program will generate "model_epoch_ .pth" files corresponding to the model at epoch n, in a folder saved_model_u t_bs bs_tbs tbs_lr lr, where corresponds to the scale factor, bs to the size of the training batch, tbs to the size of the test batch, and lr to the learning rate. Low-res images should be located in a "dataset/input" folder and high-res targets in a "dataset/target" folder, with each image having the same name in both folders.
    Downloads: 0 This Week
  • 24
    DeepFaceLab

    The leading software for creating deepfakes

    ...DeepFaceLab is an open-source deepfake system that enables users to swap the faces on images and on video. It offers an imperative and easy-to-use pipeline that even those without a comprehensive understanding of the deep learning framework or model implementation can use; and yet also provides a flexible and loose coupling structure for those who want to strengthen their own pipeline with other features without having to write complicated boilerplate code. DeepFaceLab can achieve results with high fidelity that are indiscernible by mainstream forgery detection approaches. ...
    Downloads: 16,316 This Week
  • 25
    Affine Transformation of Virtual Object

    Transformation of a virtual 3D object using a finger gesture-based system

    Affine transformation of a virtual 3D object using a finger gesture-based interactive system in the virtual environment. A convolutional neural network (CNN) based thumb and index fingertip detection system is presented here for seamless interaction with a virtual 3D object in the virtual environment. First, a two-stage CNN is employed to detect the hand and fingertips, and using the information of the fingertip position, the scale, rotation, translation, and in general, the affine...
    Downloads: 0 This Week