39 projects for "training" with 2 filters applied:

  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    SCAIL

    SCAIL

    Towards Studio-Grade Character Animation via In-Context Learning of 3D

    ...Given its listing alongside other ZAI projects like speech recognition and text-to-speech systems, SCAIL likely emphasizes scalable, composable AI learning frameworks that support researchers and practitioners in experimenting with learning algorithms, datasets, and model components. The repository structure suggests a focus on flexibility and extensibility, with potential integration into other ZAI tooling for training or analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Habitat-Sim

    Habitat-Sim

    A flexible, high-performance 3D simulator for Embodied AI research

    Habitat-Sim is a high-performance 3D simulator for embodied AI research, designed to run photorealistic indoor environments at thousands of frames per second. It offers GPU-accelerated rendering and a flexible sensor suite—RGB, depth, semantic segmentation, and more—so agents can perceive and act in realistic scenes. The engine is written in C++ with Python bindings and integrates physics, navigation meshes, and shortest-path planners to support tasks like point-goal navigation,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    AnimateDiff

    AnimateDiff

    Plug-n-play module turning text-to-image models into animation

    AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. It supports various configurations to create animations with different visual styles, providing flexibility and ease of use for developers and artists interested in exploring dynamic, AI-generated animations.
    Leader badge
    Downloads: 45 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    Video Pre-Training

    Video Pre-Training

    Learning to Act by Watching Unlabeled Online Videos

    The Video PreTraining (VPT) repository provides code and model artifacts for a project where agents learn to act by watching human gameplay videos—specifically, gameplay of Minecraft—using behavioral cloning. The idea is to learn general priors of control from large-scale, unlabeled video data, and then optionally fine-tune those priors for more goal-directed behavior via environment interaction. The repository contains demonstration models of different widths, fine-tuned variants (e.g. for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    CWStudio
    CWStudio is a portable almost library-independent and computationally efficient generator of CW signals for telegraphy training purposes. It can create a sound with maximum similarity to the real air, simulating many difficulties.
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    VAD

    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with ground truth labeling. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    TimeSformer

    TimeSformer

    The official pytorch implementation of our paper

    ...Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch provides configurations, pretrained models, and training scripts that make it straightforward to evaluate or fine-tune on video datasets. TimeSformer was influential in showing that pure transformer architectures—without convolutional backbones—can perform strongly on video classification tasks. Its flexible attention design allows experimenting with different factoring (spatial-then-temporal, joint, etc.) to trade off compute, memory, and accuracy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Linux-Intelligent-Ocr-Solution

    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    ...A Tesseract Trainer GUI is also shipped with this package. Forum : https://groups.google.com/forum/#!forum/lios Video Tutorial : https://www.youtube.com/playlist?list=PLn29o8rxtRe1zS1r2-yGm1DNMOZCgdU0i Tesseract Training Tutorial (beta) : https://www.youtube.com/watch?v=qLpCld4cdtk Source Code Github : https://github.com/Nalin-x-Linux/lios-3 Gitlab : https://gitlab.com/Nalin-x-Linux/lios-3 User guide is available in download page
    Downloads: 13 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    tonopol-trainer

    tonopol-trainer

    Musik Trainer

    Analyse von Songs (mp3,wav) Unterteilung der Songelemente Training von Abschnitten langsam - schell Trainig ganzer Sets mittels Playlist
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Video Nonlocal Net

    Video Nonlocal Net

    Non-local Neural Networks for Video Classification

    ...This formulation improves action recognition and spatiotemporal reasoning, especially for classes requiring context beyond short temporal windows. The repo provides training recipes and models for standard datasets, as well as ablations that show how many non-local blocks to insert and at which stages. Efficient implementations keep memory and compute manageable so the blocks can be added without rewriting the entire backbone. The result is a practical, drop-in mechanism for upgrading purely local video models into context-aware networks with strong benchmark performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Training Image Operators from Samples

    Tools to train Image Operators automatically from a set of samples.

    TRIOS - Training Image Operators from Samples is a set of tools to bring Image Processing closer to scientists in general. It is capable of estimating an operator between two images using only pairs of samples that contain an input image and the desired output. The operator is saved to a file and can be applied to any image.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ScoreDate

    ScoreDate

    ScoreDate is a software to learn music reading and ear training

    ScoreDate is your date with the music ! It is an open source software written in Java that helps musicians to learn music reading. It also helps you with ear training. It is suitable for any skill, from beginners to professional users. From slow training to first sight reading.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    InPitch is a simple and easy-to-use program to aid in training your ear to recognize musical notes (i.e. to develop "perfect pitch" or "absolute pitch").
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    GNU Solfege
    GNU Solfege is *free* ear training software written in Python 3.4 using the Gtk+ 3 toolkit. The program is designed to be easily extended with lesson files (data files), so the user can create new exercises.
    Leader badge
    Downloads: 56 This Week
    Last Update:
    See Project
  • 16
    Gamera is a framework for the creation of structured document analysis applications by domain experts. It combines a programming library with GUI tools for the training and interactive development of recognition systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    MORSEPOD a morse training file generator for your portable audio file player.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    This is a MATLAB implementation of inverse compositional Active Appearance Models (AAMs), as described in the "Active Appearance Models Revisited" paper by Iain Matthews and Simon Baker.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Pace & Place is an application for managing and visualizing GPS data and sports training data. It empowers the user to track training events and combine them with geographic data. Map, waypoint, track visualizations and geo-referencing of photos.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    "Ugo Capeto's Ear Training Scale Note" is a java program that trains your ear to recognize a (mystery) note among the notes of a given scale (major or minor). Extract rar archive file and double-click the jar file (needs java).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    GRANULE is a flashcards program based on Leitner cardfile methodology for learning new words. It features long-term memory training capabilities with scheduling, integrated pictures, sound, and full-screen mode.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    "Ugo Capeto's Ear Training Lower Higher" is a java program that trains your ear to recognize if a note is lower or higher than the previous one in a two-note sequence. Extract rar archive file and double-click the jar file (needs java).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Foad (EKG Processing)
    Foad is an open source software which receive an EKG Signal from scanner, WFDB database or heart sensors. Finding patient disease started by taking Fourier transform (FFT) from input signal and extract a single cycle. Based on some heuristic algorithm the most important feature like P , Q , R , S , T captured and feed to trained neural network. and so the final decision made by CNN library. As mentioned before this software also capable do some image processing on scanned paper to lower...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    polycasso
    A java webstart application that attempts to create cubism style artwork through the layering of semi-transparent polygons via a training feedback loop. Produces increasingly realistic work though hill climbing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    GPU library for simple development of OpenGL-based GPGPU applications, offscreen rendering and shading techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB