Showing 811 open source projects for "quality"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    HunyuanVideo-Foley

    HunyuanVideo-Foley

    Multimodal Diffusion with Representation Alignment

    ...It is designed to generate audio that matches both visual content and textual semantic cues, for use in video production, film, advertising, games, etc. The model architecture aligns audio, video, and text representations to produce realistic synchronized soundtracks. Produces high-quality 48 kHz audio output suitable for professional use. Hybrid architecture combining multimodal transformer blocks and unimodal refinement blocks. Temporal alignment via frame-level synchronization modules (e.g. Synchformer).
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    YouTube Music Downloader

    YouTube Music Downloader

    A simple app to get songs from YouTube in mp3 format with artist name

    YouTube Music Downloader is a command-line music downloader written in Python that retrieves audio from YouTube and enriches it with detailed metadata from external sources. It combines tools like yt-dlp and FFmpeg to extract high-quality audio while automatically tagging files with artist name, album, release date, and artwork. The application distinguishes itself by integrating metadata providers such as Spotify and iTunes, ensuring that downloaded tracks resemble properly organized music library entries. It supports downloading single songs, playlists, or batches of tracks using flexible command-line options. ytmdl also allows customization of output formats, directory structures, and metadata handling through configuration files. ...
    Downloads: 66 This Week
    Last Update:
    See Project
  • 3
    Miso TTS

    Miso TTS

    Miso TTS is an 8 billion, highly emotive text-to-speech model

    Miso TTS is an advanced 8-billion-parameter text-to-speech model developed by Miso Labs for generating highly expressive and natural-sounding conversational speech. Built on an RVQ Transformer architecture inspired by Sesame CSM, it combines a powerful Llama-based backbone with an autoregressive audio decoder to produce high-quality audio from text. The model supports both standard speech synthesis and voice-conditioned generation using optional audio prompts for voice cloning. Miso TTS generates Mimi audio codes and can leverage conversation history to create more contextually aware and realistic dialogue. Designed for local deployment, it offers watermarking by default to help promote responsible use of generated audio. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4
    ACE-Step 1.5

    ACE-Step 1.5

    The most powerful local music generation model

    ...It integrates cutting-edge generative techniques—such as diffusion-based synthesis combined with compressed autoencoders and lightweight transformer elements—to produce high-quality full-length music tracks with rapid inference times, capable of generating a complete song in seconds on modern GPUs while remaining efficient enough to run on consumer-grade hardware with minimal memory requirements. Beyond straightforward text-to-music synthesis, ACE-Step 1.5 enables flexible creative workflows, including tasks like cover generation, editing existing tracks, transforming vocals to background accompaniment, and stylistic personalization using low-rank adaptation from just a few example songs.
    Downloads: 59 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 5
    Toloka-Kit

    Toloka-Kit

    Toloka-Kit is a Python library for working with Toloka API

    ...For example, you can pass data between two related projects: one for data labeling, and another for its validation. AutoQuality feature which automatically finds the best fitting quality control rules for your project.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    CosyVoice

    CosyVoice

    Multi-lingual large voice generation model, providing inference

    ...CosyVoice 2.0 significantly improves on version 1.0 by boosting accuracy, stability, speed, and overall speech quality, making it more suitable for production environments. The repository contains training recipes, inference pipelines, deployment scripts, and integration examples, positioning it as a comprehensive toolkit rather than just a set of model weights.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    FLUX.1

    FLUX.1

    Official inference repo for FLUX.1 models

    FLUX.1 repository contains inference code and tooling for the FLUX.1 text-to-image diffusion models, enabling developers and researchers to generate and edit images from natural-language prompts using open-weight versions of the model on their own hardware or within custom applications. The project is part of a larger family of FLUX models developed by Black Forest Labs, designed to produce high-quality, detailed visuals from text descriptions with competitive prompt adherence and artistic fidelity. This repo focuses on running the open-source model variants efficiently, providing scripts, model loading logic, and examples for local installations, and supports integration with Python toolchains like PyTorch and popular generative pipelines. ...
    Downloads: 54 This Week
    Last Update:
    See Project
  • 8
    Best-of Python Developer Tools

    Best-of Python Developer Tools

    A ranked list of awesome python developer tools and libraries

    ...Updated weekly. This curated list contains 270 awesome open-source projects with a total of 810K stars grouped into 16 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    WeasyPrint

    WeasyPrint

    The awesome document factory

    ...WeasyPrint follows the widely used HTML and CSS specifications from the W3C. You can use your usual web tools, languages and frameworks, but for print. Creating high-quality digital documents requires features that you love to use as readers, tables of contents, links, annotations, optimized images, attachments, WeasyPrint provides many features out of the box, and even gives you the possibility to add your own ways to customize your PDF files. Digital fonts are finely tuned pieces of artwork. To give to your documents the subtle touch they deserve, carefully choose the options you want, kerning, ligatures, old-style numbers, tabular figures, ordinals, etc.
    Downloads: 51 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    reverse-SynthID

    reverse-SynthID

    Reverse engineering Gemini's SynthID detection

    ...Using this approach, it can detect SynthID watermarks with high accuracy and selectively reduce or remove them through frequency-domain manipulation. Unlike traditional image degradation methods, it performs targeted, minimally invasive adjustments that preserve image quality. Overall, Reverse-SynthID serves as a technical exploration of AI watermark robustness, detection, and removal strategies.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Loki Mode

    Loki Mode

    Multi-agent autonomous startup system for Claude Code

    ...By supporting multiple AI providers (like Claude Code, OpenAI Codex CLI, and Google Gemini CLI), loki-mode dynamically selects and spawns only the needed agents for a given project, optimizing computational resources and task throughput. Its Reason-Act-Reflect-Verify (RARV) cycle with self-verification loops emphasizes quality and resilience, automating end-to-end development lifecycles.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    LiveAvatar

    LiveAvatar

    Streaming Real-time Audio-Driven Avatar Generation

    ...The project co-designs algorithms and system optimizations, such as block-wise autoregressive processing and fast sampling strategies, to deliver real-time frame rates (e.g., ~45 FPS on appropriate GPU clusters) while handling non-stop generation without quality degradation. LiveAvatar focuses on delivering not just high-quality visuals but also the responsiveness necessary for immersive conversational experiences, making it suitable for advanced AI agents, virtual assistants, and interactive streaming contexts.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Datumaro

    Datumaro

    Dataset Management Framework, a Python library and a CLI tool to build

    ...It supports importing and exporting annotations and images across a wide variety of standards like COCO, PASCAL VOC, YOLO, ImageNet, Cityscapes, and many more, enabling easy integration with different training pipelines and tools. Datumaro makes it easy to merge datasets, split them into training/validation/test subsets, filter or transform annotations, and validate annotation quality — all while preserving metadata and supporting detailed statistics. It’s especially useful when you’re dealing with heterogeneous data sources or need to prepare complex datasets for machine learning workflows, freeing you from writing custom scripts for every format conversion.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Giskard

    Giskard

    Collaborative & Open-Source Quality Assurance for all AI models

    The testing framework dedicated to ML models, from tabular to LLMs. Giskard is an open-source testing framework dedicated to ML models, from tabular models to LLMs. Testing Machine Learning applications can be tedious. Since ML models depend on data, testing scenarios depend on the domain specificities and are often infinite. At Giskard, we believe that Machine Learning needs its own testing framework. Created by ML engineers for ML engineers, Giskard enables you to scan your model to find...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    plotly.py

    plotly.py

    The interactive graphing library for Python

    plotly.py is a browser-based, open source graphing library for Python that lets you create beautiful, interactive, publication-quality graphs. Built on top of plotly.js, it is a high-level, declarative charting library that ships with more than 30 chart types. Everything from statistical charts and scientific charts, through to maps, 3D graphs and animations, plotly.py lets you create them all. Graphs made with plotly.py can be viewed in Jupyter notebooks, standalone HTML files, or hosted online using Chart Studio Cloud.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 16
    AutoViz

    AutoViz

    Automatically Visualize any dataset, any size

    ...The primary goal of the project is to help data scientists and analysts quickly understand patterns, relationships, and anomalies within datasets without manually writing complex plotting code. With a single command, the library can automatically generate dozens of charts and graphs that reveal insights into the structure and quality of the data. AutoViz supports a wide range of visualization types including scatter plots, histograms, bar charts, and correlation plots, making it suitable for analyzing both structured and large datasets. The system also includes built-in tools for evaluating data quality and identifying potential issues such as missing values or unusual distributions. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    HY-Motion 1.0

    HY-Motion 1.0

    HY-Motion model for 3D character animation generation

    ...The training strategy for the HY-Motion series includes extensive pre-training on thousands of hours of varied motion data, fine-tuning on curated high-quality datasets, and reinforcement learning with human feedback, which improves both the plausibility and adaptability of generated motion sequences.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    npm-clang-format-node

    npm-clang-format-node

    Node repackaging(wrapping) of the LLVM Clang's clang-format

    A Node.js package that integrates the clang-format tool for formatting C, C++, and JavaScript code, enhancing code quality and consistency.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19

    Cascadia Code

    A fun, new monospaced font that includes programming ligatures

    ...Cascadia was designed to enhance the look and feel of the Windows Terminal, with three variants: Cascadia Code, the standard version of the font; Cascadia Mono, which is the version without Ligatures; and Cascadia PL, the version embedded with Powerline symbols. For Windows or any other OS that employs TrueType hinting, it is recommended to use the TTF version to achieve optimal rendering quality, though the OTF version is also provided.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 20
    Manim Python

    Manim Python

    Animation engine for explanatory math videos

    Manim is a Python library and animation engine designed for creating precise, programmatic mathematical visuals—famously used by 3Blue1Brown. It enables developers and educators to script animations using code and produce high-quality explanatory math videos.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    NovaSR

    NovaSR

    A lightning fast audio upsampler

    NovaSR is an extremely lightweight and high-performance audio upsampling model that transforms low-quality 16 kHz audio into clearer, high-fidelity 48 kHz audio with remarkable speed and efficiency. At only about 50 KB in size, the model is orders of magnitude smaller than typical audio super-resolution networks, yet it achieves high quality and realtime performance thanks to its compact architecture and efficient convolutional design.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    PPTAgent

    PPTAgent

    PPTAgent: Generating and Evaluating Presentations

    ...It follows a two-stage, edit-based workflow: first it analyzes reference presentations to infer slide roles and structure, then it drafts an outline and iteratively performs editing actions to produce new slides. The project includes both the generation agent and an evaluation framework, PPTEval, to score content quality, design, and coherence. The repository highlights the EMNLP 2025 paper and provides links to resources for replication and study. The approach reflects human presentation practice—plan, draft, then refine with edits—yielding more coherent decks than direct one-shot generation. Community interest and stars suggest strong uptake for research and tooling around presentation automation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    GLM-4.6

    GLM-4.6

    Agentic, Reasoning, and Coding (ARC) foundation models

    ...Its reasoning capabilities have been strengthened, including improved tool usage during inference and more effective integration within agent frameworks. GLM-4.6 also enhances writing quality, producing outputs that better align with human preferences and role-playing scenarios. Benchmark evaluations demonstrate that it not only outperforms GLM-4.5 but also rivals leading global models such as DeepSeek-V3.1-Terminus and Claude Sonnet 4.
    Downloads: 53 This Week
    Last Update:
    See Project
  • 24
    Ideogram 4

    Ideogram 4

    Open image model at the forefront of design

    Ideogram 4 is an open-weight text-to-image model focused on high-quality visual generation, design control, and accurate text rendering inside images. It is built for users who need more than generic image generation, especially when layout, typography, composition, color, and language understanding matter. The project introduces a structured JSON prompting workflow that gives creators more explicit control over scene details and visual constraints.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 25
    ffmpeg-normalize

    ffmpeg-normalize

    Audio Normalization for Python/ffmpeg

    ...It can process multiple files in batch mode, making it suitable for large media libraries or production workflows. ffmpeg-normalize also preserves metadata and supports a wide range of input and output formats. Its design emphasizes accuracy and compliance with broadcasting standards. Overall, it provides a reliable solution for achieving consistent audio quality in multimedia content.
    Downloads: 17 This Week
    Last Update:
    See Project
Auth0 Logo