Search Results for "input-leap" - Page 7

Sort By:

Showing 574 open source projects for "input-leap"

View related business solutions

Python Clear Filters & Widen Search

Go from Code to Production URL in Seconds
Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.

Try it free
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
1

Paper2Slides

From Paper to Presentation in One Click

...It is designed to replace the repetitive work of turning dense technical documents into presentation-friendly structure by extracting key points, figures, and data into a coherent visual narrative. The system supports multiple input formats, so you can process PDFs and common office documents rather than being locked to a single file type. It uses an extraction approach intended to capture critical insights comprehensively, including important visuals and data points that often get missed in naive summarization. A major focus is traceability: generated slide content is designed to remain linked back to the source material so you can verify accuracy and reduce information drift. ...

Downloads: 2 This Week

Last Update: 2026-03-15
See Project
2

Mesh R-CNN

code for Mesh R-CNN, ICCV 2019

...Unlike voxel-based or point-based approaches, Mesh R-CNN uses a differentiable mesh representation, allowing it to efficiently refine surface geometry while maintaining high spatial detail. The system combines 2D detection from Mask R-CNN with 3D reasoning modules that output full mesh reconstructions aligned with the input image. It has been evaluated on datasets such as Pix3D, where it demonstrates state-of-the-art performance in reconstructing real-world object geometry.

Downloads: 2 This Week

Last Update: 11 hours ago
See Project
3

OpenAI-Compatible Edge-TTS API

Free, high-quality text-to-speech API endpoint to replace OpenAI

...The project emulates the /v1/audio/speech endpoint used by OpenAI, so any client that can talk to the OpenAI TTS API can be redirected to this service with minimal changes. It exposes parameters for input text, voice selection, audio format, and playback speed, mirroring the OpenAI interface while mapping popular OpenAI voice names to equivalent Edge voices. Because it relies on Edge’s TTS, the audio generation itself is free, and the project essentially acts as a smart proxy that handles formatting and streaming. The server supports Server-Sent Events (SSE) for streaming audio, enabling low-latency playback in chat UIs and other interactive tools. ...

Downloads: 4 This Week

Last Update: 2025-11-28
See Project
4

TorchMetrics

Machine learning metrics for distributed, scalable PyTorch application

...Metric arithmetic. Similar to torch.nn, most metrics have both a module-based and a functional version. The functional versions are simple python functions that as input take torch.tensors and return the corresponding metric as a torch.tensor.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
Stop Storing Third-Party Tokens in Your Database
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.

Try Auth0 for Free
5

Autograd

Efficiently computes derivatives of numpy code

...For more information, check out the tutorial and the examples directory. We can continue to differentiate as many times as we like, and use numpy's vectorization of scalar-valued functions across many different input values.

Downloads: 0 This Week

Last Update: 2025-05-02
See Project
6

yq JSON

Command-line YAML, XML, TOML processor

Before using yq, you also have to install its dependency, jq. See the jq installation instructions for details and directions specific to your platform. On macOS, yq is also available on Homebrew use brew install python-yq.

Downloads: 5 This Week

Last Update: 2024-04-27
See Project
7

AG-UI

The Agent-User Interaction Protocol

...Instead of treating an AI agent as a black-box chat endpoint, AG-UI defines structured events for messages, tool calls, state changes, lifecycle updates, and user interactions. This makes it easier for developers to build agent-powered apps that stream progress, request human input, update UI state, and coordinate complex workflows. The project is especially useful for teams building copilots, workflow assistants, multi-agent products, or custom AI interfaces. Overall, AG-UI provides a shared communication layer between autonomous systems and the interfaces where people actually use them.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
8

OpenVINO Notebooks

Jupyter notebook tutorials for OpenVINO

...The tutorials also illustrate how OpenVINO integrates with models from frameworks like PyTorch, TensorFlow, and ONNX to accelerate inference workloads. Many notebooks include end-to-end examples that show how to prepare input data, load optimized models, run inference, and visualize results. The project is particularly useful for developers who want to learn how to optimize machine learning inference pipelines for production environments.

Downloads: 1 This Week

Last Update: 4 days ago
See Project
9

verl-agent

Designed for training LLM/VLM agents via RL

verl-agent is an open-source reinforcement learning framework designed to train large language model agents and vision-language model agents for complex interactive environments. Built as an extension of the veRL reinforcement learning infrastructure, the project focuses on enabling scalable training for agents that perform multi-step reasoning and decision-making tasks. The framework supports multi-turn interactions between agents and their environments, allowing the system to receive...

Downloads: 1 This Week

Last Update: 2026-03-10
See Project
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
10

Ring

Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI

Ring is a reasoning Mixture-of-Experts (MoE) large language model (LLM) developed by inclusionAI. It is built from or derived from Ling. Its design emphasizes reasoning, efficiency, and modular expert activation. In its “flash” variant (Ring-flash-2.0), it optimizes inference by activating only a subset of experts. It applies reinforcement learning/reasoning optimization techniques. Its architectures and training approaches are tuned to enable efficient and capable reasoning performance....

Downloads: 1 This Week

Last Update: 2025-09-30
See Project
11

JC

CLI tool and python library

CLI tool and python library that converts the output of popular command-line tools and file types to JSON or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts. jc JSONifies the output of many CLI tools and file types for easier parsing in scripts. This allows further command-line processing of output with tools like jq or jello by piping commands. The JC parsers can also be used as python modules. In this case, the output will be a python...

Downloads: 1 This Week

Last Update: 2025-10-13
See Project
12

Douyin TikTok Download API

Douyin TikTok Download API

...You can deploy or transform this project yourself to achieve more functions, or you can directly call scraper.py in your project or install an existing pip package as a parsing library to easily crawl data, etc. Support input Douyin|TikTokuser homepage to crawl the author [homepage video data (remove watermark link, liked video list (permission must be public), video comment data, background music video list data, etc...).

Downloads: 2 This Week

Last Update: 2025-03-16
See Project
13

PyLivestream

Pure Python FFmpeg-based live video / audio streaming to YouTube

PyLivestream is a Python-based tool that enables real-time video streaming from various input sources to platforms such as YouTube and Twitch. It acts as a wrapper around FFmpeg, allowing users to stream video from cameras, files, or screen capture devices with minimal configuration. The tool supports cross-platform operation and integrates easily into Python workflows, making it suitable for automation and scripting. It provides options for controlling streaming parameters such as bitrate, resolution, and codecs. ...

Downloads: 0 This Week

Last Update: 2026-04-27
See Project
14

LLM Telegram Bot

A Telegram bot for Large Language Models

LLM Telegram Bot is a self-hosted Telegram chatbot that connects messaging interactions with large language models, typically powered by Ollama or similar backends. The project is designed to provide a customizable AI assistant that can operate within Telegram conversations, supporting dynamic responses based on user input and configurable parameters. It includes features such as conversation memory, allowing the bot to maintain context across multiple messages and provide more coherent responses. The system supports multiple modes or personas, enabling users to switch between different conversational styles or use cases. It also allows fine-tuning of generation parameters such as temperature and token limits, giving users control over response behavior. ...

Downloads: 0 This Week

Last Update: 2026-04-20
See Project
15

ComfyUI-HunyuanVideoWrapper

ComfyUI wrapper nodes for HunyuanVideo

The ComfyUI-HunyuanVideoWrapper project is a ComfyUI extension that integrates Hunyuan-based multimodal video generation models into node-based workflows. It allows users to generate or manipulate video content by combining text prompts with one or more input images, enabling flexible conditioning of outputs. The system introduces specialized nodes such as text-image encoders that allow multiple image inputs to be referenced directly within prompts. This makes it possible to guide generation using both visual and textual context simultaneously. The wrapper is designed to fit seamlessly into ComfyUI pipelines, enabling chaining with other nodes for advanced workflows. ...

Downloads: 0 This Week

Last Update: 2026-04-16
See Project
16

Cr3dOv3r

Know the dangers of credential reuse attacks

Cr3dOv3r is a penetration testing and security auditing tool designed to demonstrate and analyze the risks associated with credential reuse across multiple online services. It allows users to input an email address and automatically checks for data breaches and leaked credentials using public databases. The tool then attempts to verify whether those credentials are reused across various popular platforms, highlighting potential vulnerabilities. It is primarily intended for educational and ethical security testing purposes, helping security professionals understand how attackers might exploit reused passwords. ...

Downloads: 0 This Week

Last Update: 2026-03-25
See Project
17

AI-Researcher

AI-Researcher: Autonomous Scientific Innovation

AI-Researcher is an open-source system designed to automate complex research tasks end-to-end using large language models and structured workflows, aiming to replicate parts of a human research assistant’s capabilities. It lets users input high-level research goals or questions in natural language and then automatically plans, decomposes, and executes tasks such as literature surveying, summarization, synthesis, experiment design, and draft generation. The system integrates retrieval mechanisms to pull in external knowledge sources, contextually analyze documents and papers, and build structured representations of ideas and arguments that can later be turned into coherent reports or drafts. ...

Downloads: 0 This Week

Last Update: 2026-02-03
See Project
18

NitroGen

A Foundation Model for Generalist Gaming Agents

NitroGen is a foundation model for generalist gaming agents developed under the MineDojo initiative, aimed at training a vision-action AI that can play and interact with a wide variety of games by taking pixel inputs and predicting gamepad actions. As an open research model, NitroGen is trained on extensive gameplay data spanning thousands of hours and hundreds of games to instill broad, generalizable gaming competency rather than skill at a single title. This approach enables the model to...

Downloads: 0 This Week

Last Update: 2026-01-29
See Project
19

Moondream

Tiny vision language model

...While the exact nature can vary by commit or branch, Moondream’s work often blends geometry, color theory, and motion to create immersive visuals that can be interactive, animated, or reactive to input. It serves as both a playground for the author’s artistic curiosity and a resource for other creative coders interested in generative art techniques. The repository may include shaders, canvas/WebGL code, visual demos, and utilities that demonstrate how mathematical functions or noise patterns can be harnessed for compelling visuals.

Downloads: 0 This Week

Last Update: 2026-04-20
See Project
20

MAI-UI

Real-World Centric Foundation GUI Agents

...Developed by Tongyi-MAI (Alibaba’s research initiative), the MAI-UI models are multimodal agents trained to understand user instructions and corresponding screenshots, grounding those instructions to on-screen elements and generating sequences of GUI actions such as taps, swipes, text input, and system commands. Unlike traditional UI frameworks, MAI-UI emphasizes realistic deployment by supporting agent–user interaction (clarifying ambiguous instructions), integration with external tool APIs using MCP calls, and a device–cloud collaboration mechanism that dynamically routes computation to on-device or cloud models based on task state and privacy constraints.

Downloads: 1 This Week

Last Update: 2026-04-20
See Project
21

AI YouTube Shorts Generator

A python tool that uses GPT-4, FFmpeg, and OpenCV

AI-YouTube-Shorts-Generator is a Python-based tool that automates the creation of short-form vertical video clips (“shorts”) from longer source videos — ideal for adapting content for platforms like YouTube Shorts, Instagram Reels, or TikTok. It analyzes input video (whether a local file or a YouTube URL), transcribes audio (with optional GPU-accelerated speech-to-text), uses an AI model to identify the most compelling or engaging segments, and then crops/resizes the video and applies subtitle overlays, producing a polished short video without manual editing. The tool streamlines multiple steps of the tedious short-form video workflow: highlight detection, clipping, subtitle generation, cropping to vertical 9:16 format, and final rendering — reducing hours of editing to a mostly automated pipeline. ...

Downloads: 2 This Week

Last Update: 2026-05-03
See Project
22

Kubespider

A global resource download orchestration system

We are a community of individuals who share a passion for life and have come together based on shared interests and needs. In our free time, we collaborated to develop Kubespider. Kubespider is developed to utilize an idle server in a local area network as a NAS, enabling automatic downloads of TV series, triggering downloads from a local laptop, and adapting to various websites such as YouTube and BiliBili, as well as different types of resources such as TV series, movies, music and more....

Downloads: 1 This Week

Last Update: 2024-06-27
See Project
23

website-to-gif

Turn your website into a GIF

...In your GitHub repo, create a workflow file or extend an existing one. You have to also include a step to checkout and commit to the repo. You can use the following example gif.yml. Make sure to modify the url value and add any other input you want to use. WebP rendering will take a lot of time to benefit from lossless quality and file size optimization.

Downloads: 3 This Week

Last Update: 2024-05-04
See Project
24

Kronos

A Foundation Model for the Language of Financial Markets

Kronos is a specialized open-source foundation model designed for analyzing and predicting financial market data using time-series representations of candlestick patterns. It is built as a decoder-only Transformer model trained specifically on K-line data, which captures open, high, low, close, and volume information across multiple global exchanges. The system introduces a novel tokenization approach that converts continuous financial data into discrete tokens, enabling the model to process...

Downloads: 0 This Week

Last Update: 2026-04-13
See Project
25

SimpleHTR

Handwritten Text Recognition (HTR) system implemented with TensorFlow

...The system uses a combination of convolutional neural networks and recurrent neural networks to extract visual features and model sequential character patterns in handwriting. It also employs connectionist temporal classification (CTC) to align predicted character sequences with input images without requiring character-level segmentation. The repository provides code for training models, performing inference on handwritten text images, and evaluating recognition accuracy. SimpleHTR is commonly used as an educational example for understanding how modern handwriting recognition systems operate.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project