352 projects for "python-bibtex" with 2 filters applied:

  • 1
    AudioCraft

    AudioCraft is a library for audio processing and generation

    ...Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 6 This Week
  • 2
    NovaSR

    A lightning-fast audio upsampler

    NovaSR is an extremely lightweight and high-performance audio upsampling model that transforms low-quality 16 kHz audio into clearer, high-fidelity 48 kHz audio with remarkable speed and efficiency. At only about 50 KB in size, the model is orders of magnitude smaller than typical audio super-resolution networks, yet it achieves high quality and realtime performance thanks to its compact architecture and efficient convolutional design. NovaSR is especially valuable for post-processing tasks...
    Downloads: 5 This Week
  • 3
    PersonaPlex

    PersonaPlex code

    PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional...
    Downloads: 2 This Week
  • 4
    LiveAvatar

    Streaming Real-time Audio-Driven Avatar Generation

    LiveAvatar is an open-source research and implementation project that provides a unified framework for real-time, streaming, interactive avatar video generation driven by audio and other control signals. It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions. The project co-designs algorithms and system...
    Downloads: 1 This Week
  • 5
    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and...
    Downloads: 0 This Week
  • 6
    EasyABC

    EasyABC is an open source ABC editor

    EasyABC allows the user to create, edit, view, play, and convert music written in the ABC music notation language. The program was originally written in Python 2.7 and wxPython by Nils Liberg and runs on Windows, OSX, and Linux. Jan Wybren de Jong has converted it to run on Python 3.8 or higher. Frédéric Aupépin has been supporting EasyABC on OSX. EasyABC depends on external programs such as abc2midi, abcm2ps, and fluidsynth. If you install the Windows or Mac executables, most of these programs are automatically included.
    Downloads: 225 This Week
  • 7
    QUI for MPlayer is a Qt-based MPlayer GUI. It aims to provide a simple but effective frontend for the command-line media player.
    Downloads: 4 This Week
  • 8
    The Sound Description Interchange Format (SDIF) is an established standard for the interchange of sound descriptions and analysis data. This project provides libraries, SDIF (in C) and Easdif (in C++), tools, and wrappers to read and write SDIF files.
    Downloads: 5 This Week
  • 9
    Signal Ranger DSP Linux Support Tools
    This project delivers Linux support (USB driver and tools) for Digital Signal Processor (DSP) cards of the Signal Ranger series (see http://www.softdb.com/product_dsp.shtml for details) connected to a Linux host via USB.
    Downloads: 0 This Week
  • 10
    LAME (Lame Aint an MP3 Encoder)

    A high quality MP3 encoder

    LAME is an educational tool to be used for learning about MP3 encoding. The goal of the LAME project is to improve the psychoacoustics, quality, and speed of MP3 encoding. Note: we provide source code only!
    Downloads: 21,447 This Week
  • 11
    CloudTunes

    Web-based music player for the cloud

    CloudTunes is a web application that turns cloud and online sources into an iTunes-like music experience you can access in the browser. It focuses on pulling tracks from services such as YouTube or storage backends and organizing them into searchable libraries and playlists. The interface is built to feel familiar to desktop music players, with queue management, dynamic search, and metadata views that make large collections manageable. Because playback runs in the browser, you can use it...
    Downloads: 0 This Week
  • 12
    Pyst consists of a set of interfaces and libraries to allow programming of Asterisk from Python. The library currently supports AGI, AMI, and the parsing of Asterisk configuration files. The library also includes debugging facilities for AGI. 2014-04-17: Moved the version control to Git. To check out, see the "Code" tab. Note that the whole history, including ancient CVS, then some time in monotone, then Subversion, was united into one Git repository thanks to ESR's reposurgeon. 2013-05-29: Maintainers of the GitHub fork "pyst2" contacted to join forces. ...
    Downloads: 0 This Week
  • 13

    StreamTuner2 ♪♬#

    Internet radio directory browser

    Streamtuner2 is an internet radio station and video browser. It simply lists stations in categories from different directories and launches your preferred media apps for playback. It's built in Python now, but retains UI similarity with the original StreamTuner 0.99.
    Downloads: 54 This Week
  • 14

    pydatascope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually.
    Downloads: 1 This Week
  • 15
    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    ...It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with manually annotated ground truth labels. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, and post-processing).
    Downloads: 2 This Week
  • 16
    OpenDAFF

    Directional Audio File Format

    OpenDAFF is a free, open-source software package for directional audio data, such as the directivity of microphones and speakers, as well as head-related transfer functions (HRTFs).
    Downloads: 3 This Week
  • 17

    MediaData

    Provides access to media file metadata

    A suite of Python packages to display/record media file metadata.
    Downloads: 0 This Week
  • 18
    GUIDOLib
    The GUIDOLib provides a powerful engine for the graphic rendering of music scores, based on the Guido Music Notation format. It supports Linux, Mac OS X, Windows, Android, and iOS operating systems. A Java JNI interface is available, as well as a JavaScript version of the library. A Web API has also been designed, allowing the engine to be deployed as a Web service.
    Downloads: 0 This Week
  • 19
    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staves, polyphony, MIDI playback of notes, chord markings, lyrics, and import/export filters for formats such as MIDI, MusicXML, ABC Music, MusiXTeX, and LilyPond.
    Downloads: 16 This Week
  • 20
    Swami Project
    A SoundFont editor and other software for editing, managing, and sharing sample-based MIDI instrument files for computer music composition. Support for other formats is planned.
    Downloads: 8 This Week
  • 21
    Advanced Audio Architecture is a server-based architecture to publish, manage, and distribute digital music through streaming and download to multiple, different types of clients. It aims for high-end reproduction and the most flexible and versatile system thinkable.
    Downloads: 0 This Week
  • 22
    Spatial Media

    Specifications and tools for 360° video and spatial audio

    spatial-media provides tools for working with spherical video and spatial audio metadata so players and platforms can correctly render immersive media. The utilities inject, inspect, and extract metadata in common container formats (MP4/WebM) to signal 360° projection type, stereoscopy mode, and spatial audio layout. Creators use it to prepare 360/VR180 assets for upload so services know whether a video is monoscopic, top-bottom stereo, or side-by-side, and whether ambisonic audio is...
    Downloads: 39 This Week
  • 23
    TunesViewer

    Easy podcast access in Linux, Android

    A small, easy-to-use program to access iTunes U media and podcasts in Linux and Android.
    Downloads: 9 This Week
  • 24
    Peyote
    Peyote is an audio player with an MC-like interface. It is designed specifically to make working with cue sheets easy. It supports wv (WavPack), wav, flac, ape, ogg, mp4, wma, and mp3 formats.
    Downloads: 0 This Week
  • 25
    DreamStudio by Volocian Studios
    Volocian believes first and foremost that financial status shouldn’t be a barrier to entry in multimedia content creation. Whether you want to design a product or website, record a band, remix a song, or produce a feature-length movie with professional post-production graphics and effects, Volocian™ wants to help, even if you’re using borrowed hardware with no previous experience. We provide affordable products and services for any budget, including cost-free solutions for education and demo...
    Downloads: 2 This Week