Search Results for "audio processing" - Page 13

Showing 308 open source projects for "audio processing"

  • 1
    maestream is a series of abstractions that make it easier to use PureData for multimedia purposes. maestream is recommended for Pd-extended users.
    Downloads: 0 This Week
  • 2
    Sensors to MIDI management system. This project creates MIDI output from sensor input. The output can then be used in MIDI processors such as GarageBand, Resolume, and other real-time multimedia software. The input can be taken from d
    Downloads: 0 This Week
  • 3
    pulsorclip

    Download videos from almost any website

    ...The application focuses on a controlled workflow instead of instant downloads. Users first provide a media URL, then select format, quality, and container before processing the file. It includes both a web interface built with Next.js and a Telegram bot that offers the same guided experience through chat. Both share a common backend in a monorepo structure. The system is fully self-hosted and designed to run in a single Docker deployment including both web and bot services. Users can download videos and audio in formats such as MP4, WebM, MKV, MP3, and M4A, with server-side processing and progress tracking.
    Downloads: 0 This Week
  • 4
    mms-300m-1130-forced-aligner

    CTC-based forced aligner for audio-text in 158 languages

    ...The alignment pipeline includes audio processing, emission generation, tokenization, and span detection, making it suitable for speech analysis, transcription syncing, and dataset creation. This model is especially useful for researchers and developers working with low-resource languages or building multilingual speech systems.
    Downloads: 0 This Week
  • 5

    KinectStreamer

    Streams data from 3D cameras over a network.

    This is an application that streams data from the Microsoft Kinect or similar cameras over a network. The program is intended for robotics applications where the controller cannot use such cameras directly due to hardware or software limitations, such as lacking USB ports or appropriate drivers, or where the camera is not in close proximity to the device that needs to access it. Given that the controller can accept data from over the network, another embedded controller...
    Downloads: 0 This Week
  • 6
    MiMo-V2.5

    Omnimodal AI model for agents, coding, and long-context tasks

    MiMo-V2.5 is a native omnimodal large language model developed by Xiaomi, designed for advanced agentic workflows, multimodal reasoning, and long-context processing. Built on a Mixture-of-Experts architecture with approximately 309B total parameters and around 15B activated per inference, it balances high capability with efficient execution. The model natively processes text, images, video, and audio within a unified system, enabling cross-modal understanding and complex task execution in a single pipeline. ...
    Downloads: 0 This Week
  • 7
    ArCoLIVE provides an open source, component-based toolkit for multimedia applications such as chat rooms, Peer-to-Peer (P2P) conversation, virtual whiteboards, audio/video conferencing, control connection containers, screen capture, and file sharing.
    Downloads: 0 This Week
  • 8
    Forked version of the abandoned project MPEG4IP (http://mpeg4ip.sf.net). The MPEG4IP project provides an MPEG and IETF standards-based system for encoding, streaming, and playing audio and video.
    Downloads: 0 This Week