Showing 48 open source projects for "motion"

View related business solutions
  • Outgrown Windows Task Scheduler? Icon
    Outgrown Windows Task Scheduler?

    Free diagnostic identifies where your workflow is breaking down—with instant analysis of your scheduling environment.

    Windows Task Scheduler wasn't built for complex, cross-platform automation. Get a free diagnostic that shows exactly where things are failing and provides remediation recommendations. Interactive HTML report delivered in minutes.
    Download Free Tool
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    vJEPA-2

    vJEPA-2

    PyTorch code and models for VJEPA2 self-supervised learning from video

    ...Instead of reconstructing pixels, it predicts the missing high-level embeddings of masked space-time regions using a context encoder and a slowly updated target encoder. This objective encourages the model to learn semantics, motion, and long-range structure without the shortcuts that pixel-level losses can invite. The architecture is designed to scale: spatiotemporal ViT backbones, flexible masking schedules, and efficient sampling let it train on long clips while remaining stable. Trained representations transfer well to downstream tasks such as action recognition, temporal localization, and video retrieval, often with simple linear probes or light fine-tuning. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    VideoCrafter2

    VideoCrafter2

    Overcoming Data Limitations for High-Quality Video Diffusion Models

    ...The system is optimized for generating videos from textual descriptions or still images, leveraging advanced diffusion models. VideoCrafter2, an upgraded version, improves on its predecessor by enhancing motion dynamics and concept combinations, especially in low-data scenarios. Users can explore a wide range of creative possibilities, producing cinematic videos that combine artistic styles and real-world scenes.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    Aphantasia

    Aphantasia

    CLIP + FFT/DWT/RGB = text to image/video

    This is a collection of text-to-image tools, evolved from the artwork of the same name. Based on CLIP model and Lucent library, with FFT/DWT/RGB parameterizes (no-GAN generation). Illustrip (text-to-video with motion and depth) is added. DWT (wavelets) parameterization is added. Check also colabs below, with VQGAN and SIREN+FFM generators. Tested on Python 3.7 with PyTorch 1.7.1 or 1.8. Generating massive detailed textures, a la deepdream, fullHD/4K resolutions and above, various CLIP models (including multi-language from SBERT), continuous mode to process phrase lists (e.g. illustrating lyrics), pan/zoom motion with smooth interpolation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    CoTracker

    CoTracker

    CoTracker is a model for tracking any point (pixel) on a video

    ...The repository ships with inference scripts, pretrained weights, and simple interfaces to seed points, run tracking, and export trajectories for downstream tasks. Typical uses include correspondence building, motion analysis, dynamic SLAM priors, video editing masks, and evaluation of geometric consistency in real scenes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 5
    hloc

    hloc

    Visual localization made easy with hloc

    This is hloc, a modular toolbox for state-of-the-art 6-DoF visual localization. It implements Hierarchical Localization, leveraging image retrieval and feature matching, and is fast, accurate, and scalable. This codebase won the indoor/outdoor localization challenges at CVPR 2020 and ECCV 2020, in combination with SuperGlue, our graph neural network for feature matching. We provide step-by-step guides to localize with Aachen, InLoc, and to generate reference poses for your own data using...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    FrankMocap

    FrankMocap

    A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

    ...Outputs include textured meshes, joint locations, and model parameters that can be exported to common DCC tools and game engines. The codebase offers pretrained models, clear inference scripts, and utilities to visualize results, making single-camera motion capture approachable on commodity hardware. Researchers and creators use it for motion studies, AR/VR prototyping, character animation, and human-in-the-loop editing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7

    avio

    Python version of ffplay with built-in AI

    See the Files tab above for installation instructions
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    TimeSformer

    TimeSformer

    The official pytorch implementation of our paper

    TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch provides configurations, pretrained models, and training scripts that make it straightforward to evaluate or fine-tune on video datasets. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    DensePose

    DensePose

    A real-time approach for mapping all human pixels of 2D RGB images

    ...It extends human pose estimation from predicting joint keypoints to providing dense correspondences between 2D images and a canonical 3D mesh (such as the SMPL model). This enables detailed understanding of human shape, motion, and surface appearance directly from images or videos. The repository includes the DensePose network architecture, training code, pretrained models, and dataset tools for annotation and visualization. DensePose is widely used in augmented reality, motion capture, virtual try-on, and visual effects applications because it enables real-time 3D human mapping from 2D inputs. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software. Icon
    Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software.

    Banks, lending institutions

    Founded in 2004, axefinance is a global market-leading software provider focused on credit risk automation for lenders looking to provide an efficient, competitive, and seamless omnichannel financing journey for all client segments (FI, Retail, Commercial, and Corporate.)
    Learn More
  • 10
    Consistent Depth

    Consistent Depth

    We estimate dense, flicker-free, geometrically consistent depth

    ...The project can process challenging hand-held video footage, including those with moderate dynamic motion, making it practical for real-world usage.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    VideoPose3D

    VideoPose3D

    Efficient 3D human pose estimation in video using 2D keypoint

    VideoPose3D is a deep learning framework that reconstructs 3D human poses from 2D keypoint sequences extracted from videos. It builds on top of convolutional and temporal networks that map 2D joint coordinates over time to consistent 3D skeletons, enabling robust motion capture without specialized sensors. The model is trained on large motion capture datasets and can generalize well to unseen environments by leveraging temporal context for smoothing and error correction. By using only 2D detections (such as those from OpenPose or Detectron), it enables markerless 3D pose estimation with relatively lightweight computational requirements. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Activity Recognition

    Activity Recognition

    Resources about activity recognition

    ...The repository includes links to code in MATLAB, Python, summaries of algorithms, datasets, and relevant research papers. Feature extraction method summaries (e.g. motion, sensor, vision). Deep learning for activity recognition references.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Simd

    Simd

    High performance image processing library in C++

    ...It provides many useful high performance algorithms for image processing such as: pixel format conversion, image scaling and filtration, extraction of statistic information from images, motion detection, object detection (HAAR and LBP classifier cascades) and classification, neural network. The algorithms are optimized with using of different SIMD CPU extensions. In particular the library supports following CPU extensions: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX-512 for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM. ...
    Leader badge
    Downloads: 25 This Week
    Last Update:
    See Project
  • 14

    IGVC IITK Data

    Data useful for testing autonomous navigation algorithms

    ...For the relevant code, see our GitHub repositories. (https://github.com/igvc-iitk). The recorded data is used for testing various algorithms related to Computer Vision, SLAM, Motion Planning etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
     Object detection is usually a software-based monitoring algorithm that will signal, for example in the surveillance camera to begin capturing the event when it detects motion. In object tracking, the object is located and the moving object is followed. One of the fundamental steps in many computer based vision systems for object tracking and motion detection is real-time segmentation of moving regions in the image sequences. Segmentation is done in order to detect the object accurately. Usually cameras are used as input sensors, for recording...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Hemera is a Virtual Intelligent System aggregating some more advanced Artificial Intelligence Technologies (speech, speech recognition, form recognition, motion recognition ...); with applications in daily tasks, domotics and robotics ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Porcupine2 is a small wearable sensory unit for logging motion data and doing low-level activity recognition. The project includes electronic schematics for the hardware, software for the microcontroller and client-side software for hardware-interaction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    This projects deals with different robotics algorithms such as motion planning (navigation) , vision SLAM, vision algorithms, and mapping.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Diglo is a Music Information Retrieval System based on Computer Vision and Audio Spectrum Analysis, using algorithmic operations to find emergent patterns in musical performance. Also it functions as a low-cost Motion Capture Analysis system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Mito-MAS-m is a simulator of the mitochondrial inner membrane and the enzymatic complexes embedded in it, implementing a coarse-grained (CG) model of the molecules using rigid structures and Dissipative Particules Dynamics (DPD) as motion equation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A C++ library of high level motion analysis/computer vision functions, coupled with GUIs that allow easy configuration and use. Current development focuses on color tracking, multiple camera calibration and triangulation, and 3D tracking algorithms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Catcher is a v4l motion-detector for KDE.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The Data Fusion Peer is a multitier computer vision internet application. The system provides image processing, motion tracking, and visualization information. Application will convert data into 3-Deminsional and other digital environments.
    Downloads: 0 This Week
    Last Update:
    See Project