timing free download - SourceForge

Showing 18 open source projects for "timing"

View related business solutions

Artificial Intelligence Python Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
Cut Data Warehouse Costs by 54%
Easily migrate from Snowflake, Redshift, or Databricks with free tools.

BigQuery delivers 54% lower TCO with exabyte scale and flexible pricing. Free migration tools handle the SQL translation automatically.

Try Free
1

WhisperX

Automatic Speech Recognition with Word-level Timestamps

WhisperX is an advanced speech recognition system built on top of OpenAI’s Whisper model, designed to improve transcription accuracy and timing precision for long-form audio. It addresses key limitations of standard Whisper implementations by introducing voice activity detection and forced alignment techniques to produce word-level timestamps. The system enables batched inference, significantly increasing transcription speed while maintaining high accuracy. It is particularly effective for long recordings, where traditional approaches may suffer from drift, repetition, or misalignment. whisperx also supports speaker diarization, allowing identification of different speakers within a conversation. ...

Downloads: 137 This Week

Last Update: 2026-05-25
See Project
2

BruteForceAI

Advanced LLM-powered brute-force tool combining AI intelligence

BruteForceAI is an open-source security testing tool that applies large language models to the analysis of login forms and authentication flows in web applications. At a high level, the project uses AI to inspect HTML content, identify the relevant form elements, and automate selector discovery so that a tester does not need to hand-map every field before evaluation. It combines that analysis layer with automated credential testing workflows, framing itself as a more adaptive alternative to...

Downloads: 180 This Week

Last Update: 2026-03-09
See Project
3

LTX-2.3

Official Python inference and LoRA trainer package

...The model uses a diffusion-transformer-based architecture designed to generate high-fidelity visual frames while simultaneously producing corresponding audio elements such as speech, music, ambient sound, or effects. This unified approach allows creators to generate complete multimedia sequences where motion, timing, and sound are aligned automatically. LTX-2 is designed for both research and production workflows and can generate high-resolution video clips with precise control over structure, motion, and camera behavior.

Downloads: 129 This Week

Last Update: 2026-07-08
See Project
4

RealtimeSTT

A robust, efficient, low-latency speech-to-text library

RealtimeSTT is a Python-based realtime speech-to-text engine emphasizing low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, suitable for building voice assistants, meeting transcribers, or live caption systems.

Downloads: 6 This Week

Last Update: 2026-05-31
See Project
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
5

Story Flicks

Generate high-definition story short videos with one click using AI

Story Flicks is another open-source project in the AI-assisted video generation / editing space, focused on creating short, story-style videos from script or prompt inputs. It aims to let users generate high-definition short movies or video stories with minimal manual effort, using AI models under the hood to assemble visuals, timing, and possibly narration or subtitles. For creators who want to produce narrative short-form content — whether for social media, storytelling, or prototyping video ideas — story-flicks offers a lightweight, code-backed alternative to complex video editing suites. Because the project is open and modifiable, developers can customize the generation pipeline: adjust story structure, alter rendering parameters, tweak video quality or resolution, or integrate with other AI models (e.g. for audio, voice-over, or image-to-video). ...

Downloads: 1 This Week

Last Update: 2025-12-14
See Project
6

VideoCaptioner

AI-powered tool for generating, optimizing, and translating subtitles

...After transcription, large language models are used to intelligently restructure subtitles into natural sentences, correct wording, and improve readability for viewers. It can also translate subtitles into other languages while preserving the original timing, making it suitable for multilingual video publishing and accessibility. In addition to generating subtitles, it supports editing, formatting, and embedding subtitles into videos as either hard or soft subtitles.

Downloads: 17 This Week

Last Update: 2026-06-17
See Project
7

WhisperJAV

Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

WhisperJAV is an open-source speech transcription pipeline designed specifically for generating subtitles for Japanese adult video content. The project addresses challenges that standard speech recognition models face when transcribing this type of audio, which often includes low signal-to-noise ratios and large numbers of non-verbal vocalizations. Traditional automatic speech recognition systems can misinterpret these sounds as words, leading to inaccurate transcripts. WhisperJAV introduces...

Downloads: 16 This Week

Last Update: 2026-05-11
See Project
8

Auto Synced & Translated Dubs

Automatically translates the text of a video based on a subtitle file

Auto-Synced-Translated-Dubs is a toolchain that automatically translates and re-dubs videos using AI voices while keeping the new speech aligned to the original timing via subtitle files. It assumes you have a human-made SRT (or similar) subtitle file; the script then uses translation services such as Google Cloud or DeepL to generate translated subtitle tracks in one or more target languages. Using the timestamps of each subtitle line, it computes the required duration of each spoken segment and synthesizes audio via neural TTS services, producing one audio clip per subtitle entry. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
9

JoyAI-VL-Interaction

An Open Real-time Video-Language Interaction System

...It is designed to watch a webcam or livestream continuously and decide whether to speak, stay silent, or delegate a harder task. Unlike turn-based assistants, it focuses on event-driven interaction where timing matters as much as answer quality. The repository releases the model, training recipe, time-aligned interaction data, and deployable system together. Its system includes inference, WebUI, ASR, TTS, and background-agent services running on vLLM-based infrastructure. It is useful for real-time monitoring, live commentary, cooking guidance, game calling, visual alerts, and other scenarios where an AI should respond at the right moment.

Downloads: 1 This Week

Last Update: 2 days ago
See Project
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
10

Dia2

TTS model capable of streaming conversational audio in realtime

...It currently focuses on English generation and supports up to two minutes of generated audio. Its main value is enabling low-latency, dialogue-oriented TTS workflows where timing, turn-taking, and natural conversation matter.

Downloads: 0 This Week

Last Update: 2026-06-08
See Project
11

Violin

Open-source Video Translation Skill

...It transcribes the original speech, translates the text, generates natural-sounding speech in the target language, and remuxes the new audio back into the video. The project is designed to keep the generated speech aligned with the original timing so the final result feels closer to a real dubbed video. It can be used from the command line, through a FastAPI web app, or as a Claude Code skill. Violin supports multilingual workflows and is useful for creators, educators, localization teams, and developers building automated video translation pipelines. It is especially practical for turning lectures, tutorials, interviews, demos, and social videos into accessible content for wider audiences.

Downloads: 0 This Week

Last Update: 2026-05-19
See Project
12

SoniTranslate

Synchronized Translation for Videos

...It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets the generated dub track stay in sync with the original video structure. The project supports a wide range of languages for translation, spanning major world languages (English, Spanish, French, German, Chinese, Arabic, etc.) and many regional or less widely spoken languages, making it suitable for broad internationalization. ...

Downloads: 30 This Week

Last Update: 2025-11-28
See Project
13

Melodfy

✨:AI-Powered Piano Audio to MIDI Converter 🎶

Melodfy is an application that utilizes the power of artificial intelligence (developed by ByteDance) to seamlessly convert audio recordings of piano playing into playable MIDI files.

Downloads: 7 This Week

Last Update: 2025-04-04
See Project
14

Weather Cast

A desktop weather app powered by AI

Weather app is a desktop weather app for Windows OS that shows detailed weather information for the searched city. The dashboard shows the current temperature of the city, description of temperature, pressure, wind, humidity, dew point, uv index, local time, air pollution index.

Downloads: 2 This Week

Last Update: 2024-08-28
See Project
15

snntorch

Deep and online learning with spiking neural networks in Python

...This allows researchers to train spiking neural models using familiar deep learning workflows while taking advantage of GPU acceleration and automatic differentiation. snnTorch provides implementations of common spiking neuron models, surrogate gradient training methods, and utilities for handling temporal neural dynamics. Because spiking neural networks operate over time and encode information through spike timing, the library includes tools for simulating temporal behavior.

Downloads: 4 This Week

Last Update: 2026-03-12
See Project
16

SoftVC VITS Singing Voice Conversion

SoftVC VITS Singing Voice Conversion

SoftVC VITS Singing Voice Conversion is a deep learning project focused on singing voice conversion, allowing users to transform one voice into another while preserving melody and timing. Unlike traditional text-to-speech systems, it specializes specifically in singing scenarios and does not provide general TTS functionality. The project leverages neural network architectures derived from VITS and SoftVC research to achieve high-quality voice transformation. It is commonly used in creative audio workflows, especially in communities experimenting with synthetic singing and character voices. ...

Downloads: 2 This Week

Last Update: 2026-03-02
See Project
17

DiffSinger

Singing Voice Synthesis via Shallow Diffusion Mechanism

...The core idea is to view generation of a sung voice (mel-spectrogram) as a diffusion process: starting from noise, the model iteratively “denoises” while being conditioned on a music score (lyrics, pitch, musical timing). This avoids some of the typical problems of prior SVS models — like over-smoothing or unstable GAN training — and produces more realistic, expressive, and natural-sounding singing. The method introduces a “shallow diffusion” mechanism: instead of diffusing over many steps, generation begins at a shallow step determined adaptively, which leverages prior knowledge learned by a simple mel-spectrogram decoder and speeds up inference.

Downloads: 39 This Week

Last Update: 2025-11-28
See Project
18

travianfee

Yet another python powered bot for the browsergame "Travian". We want to make this bot more effective then any human, at construction defensing and farming.

Downloads: 0 This Week

Last Update: 2016-07-14
See Project