Deep Learning-based Image Fusion: A Survey
A tool to snap pixels to a perfect grid
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Director, Screenwriter, Producer, and Video Generator All-in-One
Multimodal-Driven Architecture for Customized Video Generation
Fast inference engine for Transformer models
3D reconstruction software
Advanced techniques for RAG systems
Burn is a new comprehensive dynamic Deep Learning Framework
The Compute Library is a set of computer vision and machine learning
14-stage Fusion Pipeline for LLM token compression
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Omnilingual ASR Open-Source Multilingual SpeechRecognition
An implementation of a deep learning recommendation model (DLRM)
Foundational Models for State-of-the-Art Speech and Text Translation
Implementation of Make-A-Video, new SOTA text to video generator
Weld Optimization for Automatic Welding
TDSFT (Two-Dimensional Segmentation Fusion Tool)
Lightweight anchor-free object detection model
Code release for "Masked-attention Mask Transformer
Fast and user-friendly runtime for transformer inference
ColdFusion SDK for the VoiceShot API.
Pattern recognition for ADL events