Showing 63 open source projects for "smooth"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    Fabulously Optimized

    Fabulously Optimized

    A simple Minecraft modpack focusing on performance and graphics

    A simple Minecraft modpack focusing on performance and graphics enhancements. A Minecraft modpack focused on performance, providing a smooth experience with multiple optimization mods.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 2
    AIMr

    AIMr

    The best AI Aimbot for Fortnite, Valorant, CS2, R6, COD, Apex, & more

    AIMr is an advanced AI aimbot designed to enhance gameplay by providing automated aiming assistance for games like Fortnite, Valorant, CS2, R6, COD, Apex, and more. Written in Python, it uses cutting-edge AI technologies to ensure undetected, efficient aimbot functionality with customizable features. The software includes various aiming enhancements, such as recoil control, silent aim, and prediction capabilities, aimed at making gameplay smoother and more competitive. AIMr also provides...
    Downloads: 308 This Week
    Last Update:
    See Project
  • 3
    NetEase-MusicBox

    NetEase-MusicBox

    NetEase cloud music command line version

    The high-quality command line version of NetEase Cloud Music is simple, elegant, silky and smooth, and is written based on Python. 320kbps high-quality music. Song, artist, album search. NetEase 22 song charts. Netease new disc recommendation. NetEase Featured Playlist. NetEase Anchor Radio. Private playlist, recommended daily. DJing, local collection, add at any time. Play progress and play mode display. Now playing and desktop lyrics display.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Step-Video-T2V

    Step-Video-T2V

    State-of-the-art (SoTA) text-to-video pre-trained model

    ...Under the hood it uses a compressed latent representation (a Video-VAE) to reduce spatial and temporal redundancy, and a denoising diffusion (or similar) process over that latent space to generate smooth, plausible motion and visuals. The model handles bilingual input (e.g. English and Chinese) thanks to dual encoders, and supports end-to-end text-to-video generation without requiring external assets. Its training and generation pipeline includes techniques like flow-matching, full 3D attention for temporal consistency, and fine-tuning approaches (e.g. video-based DPO) to improve fidelity and reduce artifacts. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    ComfyUI-LivePortraitKJ

    ComfyUI-LivePortraitKJ

    ComfyUI nodes for LivePortrait

    ...It allows fine control over animation parameters, enabling customization of movement intensity and style. By leveraging diffusion and motion transfer techniques, it produces smooth and coherent animations. Overall, it provides an accessible way to generate portrait animations within a node-based pipeline.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    AlohaMini

    AlohaMini

    Open-Source Dual-Arm Mobile Robot with Motorized Lift

    ...It combines hierarchical notebooks, rich text editing, and tagging, allowing users to structure information the way they think and refine that structure over time. The platform focuses on speed and responsiveness, with a UI that loads instantly and feels smooth even on modest hardware, making it ideal for both desktop and small server deployments. AlohaMini also supports internal linking between notes, enabling a web-like connected knowledge space that grows organically as users create and connect ideas. Collaboration features are integrated so teams can share notebooks, leave comments, and work simultaneously on a shared knowledge base without leaving the platform.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    OpenDAN

    OpenDAN

    OpenDAN is an open source Personal AI OS

    OpenDAN is an open-source Personal AI OS , that consolidates various AI modules in one place for your personal use. The goal of OpenDAN (Open and Do Anything Now with AI) is to create a Personal AI OS , which provides a runtime environment for various Al modules as well as protocols for interoperability between them. With OpenDAN, users can securely collaborate with various AI modules using their private data to create powerful personal AI agents, such as butlers, lawyers, doctors, teachers,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Edit Banana

    Edit Banana

    Edit Banana: A framework for converting statistical figures

    Edit Banana is an innovative web application designed to simplify image editing by merging intuitive user interfaces with powerful generative AI capabilities, enabling users to quickly enhance, manipulate, or transform photos without needing advanced design skills. It provides a smooth, browser-based experience where users can upload images, make precise edits such as background removal or inpainting, and apply stylistic transformations or corrections through AI prompts. The tool focuses on accessibility, giving hobbyists, content creators, and small teams a way to produce polished visuals without downloading heavyweight software or managing local compute resources. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    ...It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. It also supports saving generated audio as downloadable files so users can reuse the speech outputs in other projects, presentations, or media content.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    LatentSync

    LatentSync

    Taming Stable Diffusion for Lip Sync

    LatentSync is an open-source framework from ByteDance that produces high-quality lip-synchronization for video by using an audio-conditioned latent diffusion model, bypassing traditional intermediate motion representations. In effect, given a source video (with masked or reference frames) and an audio track, LatentSync directly generates frames whose lip motions and expressions align with the audio, producing convincing talking-head or animated lip-sync output. The system leverages a U-Net...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    pwndbg

    pwndbg

    Exploit Development and Reverse Engineering with GDB Made Easy

    Pwndbg is a fast, simple and lightweight tool for modern debugging. It improves debugging experience with the strength of GDB for low-level software developers, hardware hackers, reverse engineers, and exploit developers. It provides features crucial for efficient debugging in the world of low-level programming. Vanilla GDB is terrible to use for reverse engineering and exploit development. Typing x/g30x $esp is not fun, and does not confer much information. The year is 2024 and GDB still...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Neuroglancer

    Neuroglancer

    WebGL-based viewer for volumetric data

    ...Neuroglancer operates entirely client-side, fetching data over HTTP in a variety of supported formats including Neuroglancer precomputed, N5, Zarr, and NIfTI, among others. The viewer is built with a multi-threaded architecture, separating rendering and data processing to ensure smooth performance even with massive datasets. Extensively used in neuroscience research, Neuroglancer supports integration with tools.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Fun Audio Chat

    Fun Audio Chat

    Large Audio Language Model built for natural interactions

    Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. The system supports dynamic audio input and output, meaning it can handle different voices, tones, and conversational contexts without forcing users into typed interactions. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Selkies-GStreamer

    Selkies-GStreamer

    Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop

    ...It is optimized for GPU-accelerated encoding and integrates with Kubernetes-based deployments to enable scalable, real-time remote desktop sessions. This component plays a critical role in delivering smooth, responsive experiences for cloud-based workstations, gaming, or visualization tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    AI Chatbot Framework

    AI Chatbot Framework

    Python chatbot framework with Natural Language Understanding

    ...AI Chatbot Framework is an AI powered conversational dialog interface built in Python. With this tool, it’s easy to create Natural Language conversational scenarios with no coding efforts whatsoever. The smooth UI makes it effortless to create and train conversations to the bot and it continuously gets smarter as it learns from conversations it has with people. AI Chatbot Framework can live on any channel of your choice (such as Messenger, Slack etc.) by integrating it’s API with that platform. You don’t need to be an expert at artificial intelligence to create an awesome chatbot that has AI capabilities. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    PyQt-SiliconUI

    PyQt-SiliconUI

    A powerful and artistic UI library based on PyQt5

    ...The library includes a wide range of refactored widgets such as buttons, containers, editors, menus, sliders, and progress bars, all structured to work seamlessly with Qt’s layout system. It also provides core modules for animations, event handling, and custom painting, enabling developers to create smooth, interactive desktop experiences beyond standard PyQt capabilities. A key aspect of the project is its ongoing refactoring effort, which aims to modernize components, improve performance, and replace older implementations with more stable and maintainable versions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ChatGLM3

    ChatGLM3

    ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat

    ChatGLM3 is ZhipuAI & Tsinghua KEG’s third-gen conversational model suite centered on the 6B-parameter ChatGLM3-6B. It keeps the series’ smooth dialog and low deployment cost while adding native tool use (function calling), a built-in code interpreter, and agent-style workflows. The family includes base and long-context variants (8K/32K/128K). The repo ships Python APIs, CLI and web demos (Gradio/Streamlit), an OpenAI-format API server, and a compact fine-tuning kit. Quantization (4/8-bit), CPU/MPS support, and accelerator backends (TensorRT-LLM, OpenVINO, chatglm.cpp) enable lightweight local or edge deployment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Textual

    Textual

    Textual is a TUI (Text User Interface) framework for Python

    ...Textual adds interactivity to Rich with a Python API inspired by modern web development. On modern terminal software (installed by default on most systems), Textual apps can use 16.7 million colors with mouse support and smooth flicker-free animation. A powerful layout engine and re-usable components makes it possible to build apps that rival the desktop and web experience. Textual runs on Linux, macOS, and Windows. Textual requires Python 3.7 or above. The addition of [dev] installs Textual development tools. See the docs if you need help getting started. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    HunyuanWorld-Voyager

    HunyuanWorld-Voyager

    RGBD video generation model conditioned on camera input

    HunyuanWorld-Voyager is a next-generation video diffusion framework developed by Tencent-Hunyuan for generating world-consistent 3D scene videos from a single input image. By leveraging user-defined camera paths, it enables immersive scene exploration and supports controllable video synthesis with high realism. The system jointly produces aligned RGB and depth video sequences, making it directly applicable to 3D reconstruction tasks. At its core, Voyager integrates a world-consistent video...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    fvcore

    fvcore

    Collection of common code shared among different research projects

    fvcore is a lightweight utility library that factors out common performance-minded components used across Facebook/Meta computer-vision codebases. It provides numerics and loss layers (e.g., focal loss, smooth-L1, IoU/GIoU) implemented for speed and clarity, along with initialization helpers and normalization layers for building PyTorch models. Its common modules include timers, logging, checkpoints, registry patterns, and configuration helpers that reduce boilerplate in research code. A standout capability is FLOP and activation counting, which analyzes arbitrary PyTorch graphs to report cost by operator and by module for precise profiling. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    GLM-4-Voice

    GLM-4-Voice

    GLM-4-Voice | End-to-End Chinese-English Conversational Model

    GLM-4-Voice is an open-source speech-enabled model from ZhipuAI, extending the GLM-4 family into the audio domain. It integrates advanced voice recognition and generation with the multimodal reasoning capabilities of GLM-4, enabling smooth natural interaction via spoken input and output. The model supports real-time speech-to-text transcription, spoken dialogue understanding, and text-to-speech synthesis, making it suitable for conversational AI, virtual assistants, and accessibility applications. GLM-4-Voice builds upon the bilingual strengths of the GLM architecture, supporting both Chinese and English, and is designed to handle long-form conversations with context retention. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    DGL

    DGL

    Python package built to ease deep learning on graph

    ...We want to make it easy to implement graph neural networks model family. We also want to make the combination of graph based modules and tensor based modules (PyTorch or MXNet) as smooth as possible. DGL provides a powerful graph object that can reside on either CPU or GPU. It bundles structural data as well as features for a better control. We provide a variety of functions for computing with graph objects including efficient and customizable message passing primitives for Graph Neural Networks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Bailing

    Bailing

    Bailing is a voice dialogue robot similar to GPT-4o

    ...The project is modular: each core function — ASR, VAD, LLM, TTS — exists as a separately replaceable component, which allows flexibility in picking your preferred models depending on resources or languages. It aims to be light enough to run without a GPU, making it usable on modest hardware or edge devices, while still maintaining low latency and smooth interaction. Bailing includes a memory system, giving the assistant the ability to remember user preferences and context across sessions, which enables more personalized and context-aware conversations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    FLAML

    FLAML

    A fast library for AutoML and tuning

    ...It supports both classical machine learning models and deep neural networks. It is easy to customize or extend. Users can find their desired customizability from a smooth range: minimal customization (computational resource budget), medium customization (e.g., scikit-style learner, search space, and metric), or full customization (arbitrary training and evaluation code). It supports fast automatic tuning, capable of handling complex constraints/guidance/early stopping. FLAML is powered by a new, cost-effective hyperparameter optimization and learner selection method invented by Microsoft Research.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    VideoChat

    VideoChat

    Real-time voice interactive digital human

    VideoChat is a real-time voice-interactive “digital human” system that combines automatic speech recognition, large language models, text-to-speech, and talking-head generation into a single conversational pipeline. It supports both pure end-to-end voice solutions based on multimodal large language models (GLM-4-Voice feeding directly into talking-head generation) and a more traditional cascaded pipeline using ASR → LLM → TTS → talking head. It is built as a Gradio Python demo, exposing a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Auth0 Logo