Search Results for "video-player" - Page 5

Sort By:

Showing 968 open source projects for "video-player"

View related business solutions

Python Clear Filters & Widen Search

AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
1

Depth Anything 3

Recovering the Visual Space from Any Views

Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity.

Downloads: 2 This Week

Last Update: 2026-02-05
See Project
2

Hello Python

Comprehensive tutorial repository aimed at teaching the Python program

Hello-Python is a comprehensive tutorial repository aimed at teaching the Python programming language from scratch for beginners. It includes over 100 classes and about 44 hours of video instruction, combined with code samples, projects, and a chat community for support. The material covers the fundamentals—variables, data types, loops, functions—as well as intermediate topics like date handling, list comprehensions, file IO, regular expressions, modules, and packages. The course is designed to be accessible: no prior programming experience required, and the resources are freely available. ...

Downloads: 3 This Week

Last Update: 2025-10-28
See Project
3

RimSort

Mod manager for the video game RimWorld

RimSort is a free, open-source, multi-platform mod manager built specifically for RimWorld, designed to help players organize and maintain large mod collections without relying on in-game ordering alone. It focuses on reliability and community-driven maintenance, positioning itself as an alternative to other external RimWorld mod managers while keeping the workflow straightforward for everyday use. The app is intended to be used before launching the game so you can curate your active list,...

Downloads: 37 This Week

Last Update: 1 day ago
See Project
4

Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM

...It achieves state-of-the-art results: across 36 audio and audio-visual benchmarks, it hits open-source SOTA on 32 and overall SOTA on 22, outperforming or matching strong closed-source models such as Gemini-2.5 Pro and GPT-4o. To reduce latency, especially in audio/video streaming, Talker predicts discrete speech codecs via a multi-codebook scheme and replaces heavier diffusion approaches.

Downloads: 1 This Week

Last Update: 2026-01-08
See Project
Catch Bugs Before Your Customers Do
Real-time error alerts, performance insights, and anomaly detection across your full stack. Free 30-day trial.

Move from alert to fix before users notice. AppSignal monitors errors, performance bottlenecks, host health, and uptime—all from one dashboard. Instant notifications on deployments, anomaly triggers for memory spikes or error surges, and seamless log management. Works out of the box with Rails, Django, Express, Phoenix, Next.js, and dozens more. Starts at $23/month with no hidden fees.

Try AppSignal Free
5

Segmentation Models

Segmentation models with pretrained backbones. PyTorch

Segmentation models with pre trained backbones. High-level API (just two lines to create a neural network) 9 models architectures for binary and multi class segmentation (including legendary Unet) 124 available encoders (and 500+ encoders from timm) All encoders have pre-trained weights for faster and better convergence. Popular metrics and losses for training routines. All encoders have pretrained weights. Preparing your data the same way as during weights pre-training may give you better...

Downloads: 0 This Week

Last Update: 2025-04-17
See Project
6

supervision

We write your reusable computer vision tools

We write your reusable computer vision tools. Whether you need to load your dataset from your hard drive, draw detections on an image or video, or count how many detections are in a zone. You can count on us.

Downloads: 0 This Week

Last Update: 2026-02-06
See Project
7

Image-Editor

AI based photo editing website for changing image background

...With cv2, you can easily read, write, filter, and display images, and much more. Image-Editor uses Mediapipe's selfie_segmentation model for background removal in real-time video streams. This advanced model uses deep neural networks to detect and remove the background.

Downloads: 4 This Week

Last Update: 2024-06-06
See Project
8

Auto Synced & Translated Dubs

Automatically translates the text of a video based on a subtitle file

...The tool then time-stretches or compresses each TTS clip to match the original speech duration exactly, which preserves lip-sync and rhythm as closely as possible without manual editing. Finally, it combines all the clips into a single dubbed audio track that can be muxed with the original video, along with new translated subtitle files.

Downloads: 6 This Week

Last Update: 2025-11-28
See Project
9

BettaFish

Public opinion analysis system

...It also integrates multimodal processing, enabling it to parse images and video alongside text.

Downloads: 1 This Week

Last Update: 2026-02-17
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

Phantom Player

Smart video player and playlist manager.

Phantom Player offers a range of practical features, making it easy to play, organize, and manage videos from your hard drive. Whether you're watching individual videos or managing playlists, it provides a seamless experience. + Play Single Videos + Create Playlists from Torrents + Organize Videos on a Hard Drive

Downloads: 2 This Week

Last Update: 2025-10-03
See Project
11

MagicBox Player

Magic Box 🎶: The Open-Source Multimedia Player Magic Box is a versatile, custom-built media player for desktop environments, blending a classic interface with powerful, modern features. Developed in Python with PyQt5, it supports a wide range of audio and video formats. Key Features: Dynamic Visualizer: Features a real-time, custom FFT audio spectrum visualizer that monitors system loopback audio, providing vibrant, data-driven feedback (requires manual loopback setup like Stereo Mix/PulseAudio). ...

Downloads: 2 This Week

Last Update: 2025-10-05
See Project
12

GLM-4.6V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

...Its architecture supports a very large context window (on the order of 128K tokens during training), which lets it handle complex multimodal inputs like long documents, multi-page reports, or video transcripts, while maintaining coherence across extended content. In benchmarks and internal evaluations, GLM-4.6V achieves state-of-the-art (SoTA) performance among models of comparable parameter scale on multimodal reasoning.

Downloads: 5 This Week

Last Update: 2026-01-27
See Project
13

Tartube

Download videos/channels/playlists from YouTube and many other sites

Tartube is a GUI front-end for youtube-dl, yt-dlp and other compatible video downloaders. It is written in Python 3 / Gtk 3 and runs on MS Windows, Linux, BSD and MacOS.

Downloads: 1,533 This Week

Last Update: 2026-01-20
See Project
14

Bayesian Optimization

Python implementation of global optimization with gaussian processes

This is a constrained global optimization package built upon bayesian inference and gaussian process, that attempts to find the maximum value of an unknown function in as few iterations as possible. This technique is particularly suited for optimization of high cost functions, situations where the balance between exploration and exploitation is important. More detailed information, other advanced features, and tips on usage/implementation can be found in the examples folder. Follow the basic...

Downloads: 0 This Week

Last Update: 2025-12-27
See Project
15

Qwen-2.5-VL

Qwen2.5-VL is the multimodal large language model series

Qwen2.5 is a series of large language models developed by the Qwen team at Alibaba Cloud, designed to enhance natural language understanding and generation across multiple languages. The models are available in various sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B parameters, catering to diverse computational requirements. Trained on a comprehensive dataset of up to 18 trillion tokens, Qwen2.5 models exhibit significant improvements in instruction following, long-text generation...

Downloads: 5 This Week

Last Update: 2026-01-30
See Project
16

Paper2GUI

Convert AI papers to GUI

...让每个人都简单方便的使用前沿人工智能技术 Paper2GUI: An AI desktop APP toolbox for ordinary people. It can be used immediately without installation. It already supports 40+ AI models, covering AI painting, speech synthesis, video frame complementing, video super-resolution, object detection, and image stylization. , OCR recognition and other fields. Support Windows, Mac, Linux systems. Paper2GUI: 一款面向普通人的 AI 桌面 APP 工具箱，免安装即开即用，已支持 40+AI 模型，内容涵盖 AI 绘画、语音合成、视频补帧、视频超分、目标检测、图片风格化、OCR 识别等领域。支持 Windows、Mac、Linux 系统。

Downloads: 1 This Week

Last Update: 2024-09-20
See Project
17

new-pac

Scientific Internet access

This repository aggregates tools, guides, and configuration files aimed at enabling network access in restrictive environments across desktop and mobile platforms. It collects client applications, one-click browser bundles, configuration examples, and references for widely used proxy and tunneling technologies. The emphasis is on approachability: instructions, packaged builds, and links are organized so non-experts can find a workable setup for their device. Because endpoint reliability and...

Downloads: 0 This Week

Last Update: 2025-12-16
See Project
18

Recurrent Interface Network (RIN)

Implementation of Recurrent Interface Network (RIN)

Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in Pytorch. The author unawaredly reinvented the induced set-attention block from the set transformers paper. They also combine this with the self-conditioning technique from the Bit Diffusion paper, specifically for the latents. The last ingredient seems to be a new noise function based around the sigmoid, which the author claims is better than cosine scheduler for larger images. ...

Downloads: 1 This Week

Last Update: 2024-02-14
See Project
19

GLM-V

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning

...The repository provides both GLM-4.5V and GLM-4.1V models, designed to advance beyond basic perception toward higher-level reasoning, long-context understanding, and agent-based applications. GLM-4.5V builds on the flagship GLM-4.5-Air foundation (106B parameters, 12B active), achieving state-of-the-art results on 42 benchmarks across image, video, document, GUI, and grounding tasks. It introduces hybrid training for broad-spectrum reasoning and a Thinking Mode switch to balance speed and depth of reasoning. GLM-4.1V-9B-Thinking incorporates reinforcement learning with curriculum sampling (RLCS) and Chain-of-Thought reasoning, outperforming models much larger in scale (e.g., Qwen-2.5-VL-72B) across many benchmarks.

Downloads: 1 This Week

Last Update: 2 days ago
See Project
20

Qwen3-VL-Embedding

Multimodal embedding and reranking models built on Qwen3-VL

...The reranking model then precisely scores relevance between a given query and candidate documents, enhancing retrieval accuracy in complex multimodal tasks. Together, they support advanced information retrieval workflows such as image-text search, visual question answering (VQA), and video-text matching, while providing out-of-the-box support for more than 30 languages.

Downloads: 0 This Week

Last Update: 2026-02-02
See Project
21

CutLER

Code release for Cut and Learn for Unsupervised Object Detection

CutLER is an approach for unsupervised object detection and instance segmentation that trains detectors without human-annotated labels, and the repo also includes VideoCutLER for unsupervised video instance segmentation. The method follows a “Cut-and-LEaRn” recipe: bootstrap object proposals, refine them iteratively, and train detection/segmentation heads to discover objects across diverse datasets. The codebase provides training and inference scripts, model configs, and references to benchmarking results that report large gains over prior unsupervised baselines. ...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
22

Selkies-GStreamer

Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop

selkies-gstreamer is a GStreamer-based media streaming component used in the Selkies project, a cloud-native platform designed for interactive desktop and application streaming. This module acts as a high-performance media pipeline that captures video, encodes it with low latency, and streams it via WebRTC to client browsers. It is optimized for GPU-accelerated encoding and integrates with Kubernetes-based deployments to enable scalable, real-time remote desktop sessions. This component plays a critical role in delivering smooth, responsive experiences for cloud-based workstations, gaming, or visualization tools.

Downloads: 0 This Week

Last Update: 2025-03-27
See Project
23

mediapy

This Python library makes it easy to display images and videos

Read/write/show images and videos in an IPython/Jupyter notebook.

Downloads: 0 This Week

Last Update: 2026-02-03
See Project
24

Transparent Background

This is a background removing tool powered by InSPyReNet

This is a background-removing tool powered by InSPyReNet (ACCV 2022). You can easily remove the background from the image or video or bunch of other stuffs when you can make the background transparent! We basically follow the virtual camera settings from pyvirtualcam. If you do not choose to install virtual camera, it will visualize real-time output with cv2.imshow. Use another checkpoint file. Default is trained with composite dataset and will be automatically downloaded if not available.

Downloads: 3 This Week

Last Update: 2025-05-14
See Project
25

Fantasy PL MCP

Fantasy Premier League MCP Server

Fantasy Premier League MCP Server is a Model Context Protocol (MCP) server that provides access to Fantasy Premier League (FPL) data and tools. It allows interaction with FPL data in MCP-compatible clients, enabling users to manage their fantasy teams effectively.

Downloads: 0 This Week

Last Update: 2025-07-31
See Project