Showing 61 open source projects for "upload"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 1
    Animated Drawings

    Animated Drawings

    Code to accompany "A Method for Animating Children's Drawings"

    ...Users can provide rough keyframes or control constraints (pose anchors), and the system fills intermediate frames with fluid animation. The repository includes demonstration apps and notebooks where you can upload or draw shapes and watch animations play. Because the approach is data-driven, it generalizes to new drawings even with varying proportions or stylizations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    HunyuanWorld-Voyager

    HunyuanWorld-Voyager

    RGBD video generation model conditioned on camera input

    HunyuanWorld-Voyager is a next-generation video diffusion framework developed by Tencent-Hunyuan for generating world-consistent 3D scene videos from a single input image. By leveraging user-defined camera paths, it enables immersive scene exploration and supports controllable video synthesis with high realism. The system jointly produces aligned RGB and depth video sequences, making it directly applicable to 3D reconstruction tasks. At its core, Voyager integrates a world-consistent video...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Deep Lake

    Deep Lake

    Data Lake for Deep Learning. Build, manage, and query datasets

    ...It can be deployed locally or in the cloud, and it enables you to store all of your data in one place, ranging from simple annotations to large videos. Deep Lake is used by Google, Waymo, Red Cross, Omdena, Yale, & Oxford. Use one API to upload, download, and stream datasets to/from AWS S3/S3-compatible storage, GCP, Activeloop cloud, or local storage. Store images, audios and videos in their native compression. Deeplake automatically decompresses them to raw data only when needed, e.g., when training a model. Treat your cloud datasets as if they are a collection of NumPy arrays in your system's memory. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    EasyVoice

    EasyVoice

    Open source text-to-speech tool, supports extra-long text

    easyVoice is an open-source text-to-speech platform aimed at turning long-form text and novels into high-quality audio, with a strong focus on usability and scalability. It provides a web interface where users can paste or upload large texts and generate speech and subtitles in a single workflow, even for works exceeding 100,000 characters. The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure parameters such as rate, pitch, and volume per role. It offers streaming playback so audio starts almost immediately, even for very long inputs, and automatically generates subtitle files suitable for video production or translation workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    marqo

    marqo

    Tensor search for humans

    A tensor-based search and analytics engine that seamlessly integrates with your applications, websites, and workflows. Marqo is a versatile and robust search and analytics engine that can be integrated into any website or application. Due to horizontal scalability, Marqo provides lightning-fast query times, even with millions of documents. Marqo helps you configure deep-learning models like CLIP to pull semantic meaning from images. It can seamlessly handle image-to-image, image-to-text and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    h2oGPT

    h2oGPT

    Private chat with local GPT with document, images, video, etc.

    h2oGPT is an open-source platform that allows users to interact with local GPT models in a completely private environment. It supports a variety of document types, including PDFs, Word files, images, video frames, and even audio, enabling users to query and analyze their documents or engage in a private chat with AI. The platform is designed to be secure and offline, ensuring that all data remains private and under the user's control. h2oGPT supports several AI models, including oLLaMa and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    SoniTranslate

    SoniTranslate

    Synchronized Translation for Videos

    SoniTranslate is a video translation and dubbing system that produces synchronized target-language audio tracks for existing video content. It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets the generated dub track stay in sync with the original video structure. ...
    Downloads: 27 This Week
    Last Update:
    See Project
  • 8
    tgState

    tgState

    Using Telegram as a stored file chain system

    A file chain system with Telegram as a storage. No limit to file size and format. It can be used as a telegram drawing bed or as a telegram net. Support web upload files and telegram upload directly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ChonOS

    ChonOS

    A specifical-purpose GNU/Linux distribution for Embedded MAS

    ChonOS (Cognitive Hardware on Network - Operational System) is a specifical-purpose GNU/Linux distribution that seeks to facilitate the development of an Embedded MultiAgent System (MAS). It enables, without the need to turn off the device or stop the MAS: the deployment of reasoning to the robot; firmware deployment for microcontrollers; the transfer of the MAS from the development environment to the production environment; and the transfer of new agents to the MAS running using the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 10
    Hathi Download Helper

    Hathi Download Helper

    Download books from the hathitrust website in a fast and easy manner

    2025-05-08 ====================== PLEASE NOTE ======================= Due to changes to the API of the hathirtust homepage, the HDH is no longer functional!! Please check the project Wiki for alternative methods. https://sourceforge.net/p/hathidownloadhelper/alternative/ ---------------------------------------------------------------------------------------------- Hathi Download Helper was a tool for downloading public domain books from hathitrust.org. E-Mail contact:...
    Leader badge
    Downloads: 23 This Week
    Last Update:
    See Project
  • 11
    PoseidonQ  - AI/ML Based QSAR Modeling

    PoseidonQ - AI/ML Based QSAR Modeling

    ML based QSAR Modelling And Translation of Model to Deployable WebApps

    - This Software was made with an intention to make QSAR/QSPR development more efficient and reproducible. - Published in ACS, Journal of Chemical Information and Modeling . Link : https://pubs.acs.org/doi/10.1021/acs.jcim.4c02372 - Simple to use and no compromise on essential features necessary to make reliable QSAR models. - From Generating Reliable ML Based QSAR Models to Developing Your Own QSAR WebApp. For any feedback or queries, contact kabeermuzammil614@gmail.com - Available on...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 12
    QOTD Discord Bot

    QOTD Discord Bot

    Simple Question Of The Day (QOTD) Discord Bot for your server

    What is this bot for? QOTD stands for Question Of The Day and is usually used to keep servers active and members talking. What does it do? This bot allows for daily (or custom timed) QOTD to be sent in a specific channel. It allows users to add their own QOTD to the bot's queue, and QOTD managers to manage the queue. How to set it up? Simply follow a tutorial to create a Discord bot and run the code. Pass in the bot's token and configure the bot through the commands in qotd...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13

    Liveliness and Face Identification

    Leading free and open-source liveliness check &face recognition system

    ...The description guides you to adjust the settings and click the "Start" button to begin face detection. If user's pic is in DB, it will show the matching name or else you can upload your pic with name to do detection. Application has many uses like door lock, attendance system or any similar identification usages. Face Recognition is highly accurate and simplest application
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    RoomGPT

    RoomGPT

    Upload a photo of your room to generate your dream room with AI

    RoomGPT is an open-source app that lets you upload a photo of your room and generate redesigned versions of it using AI. It uses a model such as ControlNet to condition the generation on the original room layout, producing realistic variations while preserving structure like walls, windows, and furniture placement. The app is built on Next.js and exposes a simple web interface where users can upload images, choose styles, and view generated outputs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    GPT-Code UI

    GPT-Code UI

    An open source implementation of OpenAI's ChatGPT Code interpreter

    An open source implementation of OpenAI's ChatGPT Code interpreter. Simply ask the OpenAI model to do something and it will generate & execute the code for you. You can put a .env in the working directory to load the OPENAI_API_KEY environment variable. For Azure OpenAI Services, there are also other configurable variables like deployment name. See .env.azure-example for more information. Note that model selection on the UI is currently not supported for Azure OpenAI Services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    AI Code Translator

    AI Code Translator

    Use AI to translate code from one language to another

    AI Code Translator is a web-based tool that leverages AI to translate source code from one programming language to another, making cross-language porting or code conversion significantly easier — useful when migrating codebases, experimenting across languages, or learning how code patterns map across languages. The UI is built with Next.js + TypeScript, plus modern tooling like Tailwind CSS, giving a clean frontend experience where you paste or upload code, select target language, and get output quickly. Because it uses AI under the hood, the translation can handle not just syntax but also adapt idioms or patterns appropriately for the target language (though with the usual caveats around AI output correctness). The project is open-source under MIT license, so you can self-host it, integrate into internal tools, or customize language-options and prompts to match your team’s style or language preferences.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Dynacover

    Dynacover

    Dynamic Twitter images and banners

    Dynacover is a PHP GD + TwitterOAuth CLI app to dynamically generate Twitter header images and upload them via the API. This enables you to build cool little tricks, like showing your latest followers or GitHub sponsors, your latest content created, a qrcode to something, a progress bar for a goal, and whatever you can think of. You can run Dynacover in three different ways. As a GitHub action: the easiest way to run Dynacover is by setting it up in a public repository with GitHub Actions, using repository secrets for credentials. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    VideoSrt

    VideoSrt

    Windows-GUI

    ...Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. Video recognition does not need to upload the original video, which is convenient, fast and time-saving.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 19
    Userge

    Userge

    Userge, Durable as a Serge

    UserGe is a Powerful, Pluggable Telegram UserBot written in Python using Pyrogram by which you can Automate your Telegram account to work as you want. It comes with salient and descriptive features that help you to manage your task with some easy command.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Next Generation Programming

    Next Generation Programming

    Compose Software Without Writing Any Programing Code

    "Next Generation Programming - Programming Without Coding Software" is a drag-drop wizard for creating simple or complex applications without writing any programming language code The Software is coded/designed with "Java Programming Language" for novice/expert programmers; Programmers can write softwares with visual tools : drag-drop components;visual editors... Programmers can use the software to compose of simple/complex applications : Database programs, circuit design, generate code and upload to chip for designed circuits (ESP8266, ESP32 chips) The Software in question is much simpler to use than PWCT (https://sourceforge.net/projects/doublesvsoop/) software. The Software has more features than PWCT software such as SCADA. Please start by looking at examples from the website first. In this way, you can learn the features of the software and how to use the software in a very short time. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    DownloadBot

    DownloadBot

    A distributed cross-platform Telegram Bot

    A distributed cross-platform Telegram Bot that can control your Aria2 server, control server files and also upload to OneDrive / Google Drive. This project is mainly to use a small hard disk server for offline downloading, for large BitTorrent files to be downloaded in sections according to the size of the hard disk, each time downloading a part, then uploading the network disk, deleting and then downloading the other parts, until all the files are downloaded.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    aiode

    aiode

    Discord bot that plays Spotify tracks and YouTube videos or any URL

    ...Create custom command presets as shortcuts for your most used commands. Adjustable properties for even deeper customization. Sign in to Spotify to play your own playlists or upload aiode playlists. Manage what roles can access which commands. Customize how you want to summon your bot by using a custom prefix or giving your bot a name. Advanced admin commands such as updating and rebooting the bot or cleaning up the database available to bot administrators. Capable scripting sandbox that enables running and storing custom Groovy scripts and modifying command behavior through interceptors.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    XZVoice

    XZVoice

    Free and open source text-to-speech software

    Text-to-speech software developed by Electron + vue + ElementUI + js. The high-fidelity and flexible configuration of speech synthesis products opens up the closed loop of human-computer interaction and enables applications to sound realistically. A variety of timbres are available, and functions such as adjusting speech rate, intonation, and volume are provided. Technically, multi-level rhythmic pauses are taken into account to achieve the purpose of natural synthesizing rhythm, and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Universal Data Tool

    Universal Data Tool

    Collaborate & label any type of data, images, text, or documents etc.

    ...Import from your S3 buckets easily with IAM or Cognito authentication. Working together, we can accomplish more. The Universal Data Tool was built to bring together the best ideas from different machine learning communities. Upload your dataset to Courses to create a training course. Testing and exercises validate that your workforce knows exactly how the data should be labeled. Get started in less than a minute. Courses uses administrator links. No sign up needed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Cloud Annotations

    Cloud Annotations

    A fast, easy and collaborative open source image annotation tool

    Learn computer vision & AI by building real-world applications. Learn to build and train computer vision models—then show off your skills in an interactive web application. Build impressive applications and learn coveted skills. The examples below were created by the Skills Network Team—right here in CV Studio. Create your own project dataset by uploading images and videos. Coming soon, you'll be able to use a pre-compiled dataset so you can hit the ground running. Creating image annotations...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo