Page 3 | file-studio free download

Showing 637 open source projects for "file-studio"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
1

Basic Pitch

A lightweight audio-to-MIDI converter with pitch bend detection

...basic-pitch is efficient and easy to use, and its multi pitch support, its ability to generalize across instruments, and its note accuracy compete with much larger and more resource-hungry AMT systems. Provide a compatible audio file and a basic-pitch will generate a MIDI file, complete with pitch bends. The basic pitch is instrument-agnostic and supports polyphonic instruments, so you can freely enjoy transcription of all your favorite music, no matter what instrument is used. Basic pitch works best on one instrument at a time.

Downloads: 33 This Week

Last Update: 2024-08-16
See Project
2

Hunyuan3D-1

A Unified Framework for Text-to-3D and Image-to-3D Generation

Hunyuan3D-1 is an earlier version in the same 3D generation line (the unified framework for text-to-3D and image-to-3D tasks) by Tencent Hunyuan. It provides a framework combining shape generation and texture synthesis, enabling users to create 3D assets from images or text conditions. While less advanced than version 2.1, it laid the foundations for the later PBR, higher resolution, and open-source enhancements. (Note: less detailed public documentation was found for Hunyuan3D-1 compared to...

Downloads: 3 This Week

Last Update: 2025-11-19
See Project
3

Upscayl

Free and Open Source AI Image Upscaler for Linux, MacOS and Windows

...Upscayl does not work without a GPU, sorry. You'll need a Vulkan-compatible GPU to upscale images. CPU or iGPU won't work. You can also download the flatpak version and double-click the flatpak file to install via Store but wait for the full release, we'll be pushing it to Flathub for easy access. Upscayl uses AI models to enhance your images by guessing what the details could be. It uses Real-ESRGAN (and more in the future) model to achieve this. The CLI tool is called real-esrgan-ncnn-vulkan and it's available on the Real-ESRGAN repository.

1 Review

Downloads: 165 This Week

Last Update: 2025-01-15
See Project
4

MESHROOM

3D reconstruction software

...Fusion of Multi-bracketing LDR images into HDR. Alignment of panorama images. Support for fisheye optics. Automatically estimate fisheye circle or manually edit it. Take advantage of motorized-head file. Easy to integrate in your Renderfarm System. Add specific rules to select the most suitable machines regarding CPU, RAM, GPU requirements of each Node.

1 Review

Downloads: 141 This Week

Last Update: 2025-08-19
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

AutoAgent AI

Autonomous harness engineering

AutoAgent is an experimental AI framework focused on autonomous agent engineering, where a meta-agent iteratively improves another agent’s architecture without direct human intervention. Instead of manually tuning prompts or workflows, developers define high-level goals in a configuration file, and the system continuously modifies its own tools, orchestration, and logic based on benchmark performance. It operates through a loop of testing, analyzing failures, and refining the agent’s configuration to maximize a scoring metric. The framework uses a single-file agent harness combined with structured tasks and evaluation suites to guide optimization. ...

Downloads: 11 This Week

Last Update: 4 days ago
See Project
6

NOFX

Open source AI trading OS for autonomous multi-model trading systems

NOFX is an open source AI-powered trading operating system designed to automate financial trading workflows using autonomous AI agents. It acts as an infrastructure layer that transforms market data into AI-driven trade decisions and execution. Instead of requiring users to manually configure machine learning models, data sources, and API integrations, the system allows AI components to perceive market conditions, select models, and perform trading actions automatically. It supports running...

Downloads: 5 This Week

Last Update: 3 hours ago
See Project
7

AI File Sorter

Local AI file organization with categorization and rename suggestions

AI File Sorter is a cross-platform desktop application that uses AI (local LLMs run on your computer) to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze images locally and propose descriptive rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg).

Downloads: 1,016 This Week

Last Update: 2026-04-07
See Project
8

Nerfstudio

A collaboration friendly studio for NeRFs

Nerfstudio provides a simple API that allows for a simplified end-to-end process of creating, training, and testing NeRFs. The library supports a more interpretable implementation of NeRFs by modularizing each component. With more modular NeRFs, we hope to create a more user-friendly experience in exploring the technology. This is a contributor-friendly repo with the goal of building a community where users can more easily build upon each other’s contributions. Nerfstudio initially launched...

Downloads: 3 This Week

Last Update: 2024-11-11
See Project
9

NeuralNote

Audio Plugin for Audio to MIDI transcription using deep learning

NeuralNote is an open-source audio software tool designed to convert recorded audio into MIDI data using modern machine learning techniques. The software functions as an audio plugin that can be used inside digital audio workstations as well as a standalone application for music production and analysis. Its main purpose is to perform audio-to-MIDI transcription, allowing musicians to record a performance and automatically transform it into editable MIDI notes. NeuralNote supports polyphonic...

Downloads: 107 This Week

Last Update: 2026-03-12
See Project
8 Monitoring Tools in One APM. Install in 5 Minutes.
Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.

Start Free
10

Obsidian Skills

Agent skills for Obsidian

...These skills are markdown-driven specifications that teach Claude Code (or similar agents) how to perform context-aware tasks within Obsidian’s unique environment, such as interpreting different file types and workflows, automating workflows tied to notes, or enhancing agent responses with structured knowledge. By providing formal descriptions of patterns, conventions, and workflows common to Obsidian users, the skills empower AI tools to give more relevant suggestions, generate content that adheres to user conventions, or execute complex multi-step operations that respect the knowledge graph and file relationships.

Downloads: 19 This Week

Last Update: 2026-04-02
See Project
11

autoresearch

AI agents autonomously run and improve ML experiments overnight

...Each experiment runs for a fixed five-minute training window, enabling rapid iteration and consistent comparison across architectural or hyperparameter changes. The system centers on a simple workflow where the agent modifies a single training file while human researchers guide the process through a program.md instruction file. Designed to run on a single GPU, it keeps the research loop minimal and self-contained to make autonomous experimentation practical. Over time, the agent logs experiments, evaluates improvements, and gradually evolves the model through automated trial-and-error.

Downloads: 5 This Week

Last Update: 2026-03-26
See Project
12

website-to-gif

Turn your website into a GIF

This Github Action automatically creates an animated GIF or WebP from a given web page to display on your project README (or anywhere else). In your GitHub repo, create a workflow file or extend an existing one. You have to also include a step to checkout and commit to the repo. You can use the following example gif.yml. Make sure to modify the url value and add any other input you want to use. WebP rendering will take a lot of time to benefit from lossless quality and file size optimization.

Downloads: 1 This Week

Last Update: 2024-05-04
See Project
13

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. ...

Downloads: 19 This Week

Last Update: 2026-04-24
See Project
14

Crush

The glamourous AI CLI coding agent for your favourite terminal 💘

Crush is a next-generation, terminal-based AI coding assistant developed by Charm, designed to seamlessly integrate with your tools, workflows, and preferred LLMs. It provides developers with an intuitive, session-based experience where multiple contexts can be managed across projects. With flexible model switching, Crush allows you to change providers mid-session while retaining conversation history. It enhances productivity by combining LSP (Language Server Protocol) support with...

1 Review

Downloads: 4 This Week

Last Update: 1 day ago
See Project
15

Perplexica

Perplexica is an AI-powered answering engine.

...Perplexica offers multiple search modes—Speed, Balanced, and Quality—so you can trade off latency and depth depending on the task. It also enhances the experience with widgets and rich search types like images, videos, and domain-limited queries. With local search history, file uploads, and an easy Docker-based setup, it’s built for everyday research without sacrificing control or privacy.

Downloads: 20 This Week

Last Update: 2026-04-09
See Project
16

Mistral Vibe CLI

Minimal CLI coding agent by Mistral

...Behind the scenes, it leverages Mistral’s coding-optimized LLM stack (including models tuned for code understanding and generation), with project-wide context awareness: it scans your file structure, Git status, and recent history to inform suggestions so that generated code aligns with existing context.

Downloads: 12 This Week

Last Update: 2 days ago
See Project
17

Magika

Fast and accurate AI powered file content types detection

Magika is an AI-powered file-type detector that uses a compact deep-learning model to classify binary and textual files with high accuracy and very low latency. The model is engineered to be only a few megabytes and to run quickly even on CPU-only systems, making it practical for desktop apps, servers, and security pipelines. Magika ships as a command-line tool and a library, providing drop-in detection that improves on traditional “magic number” and heuristic approaches, especially for ambiguous or short files. ...

Downloads: 5 This Week

Last Update: 2026-04-24
See Project
18

ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages

ebook2audiobook is a tool to convert legally obtained eBooks (non-DRM) into fully narrated audiobooks, complete with chapters and metadata. It automates the pipeline: it reads the eBook file, splits it into appropriate segments (chapters, paragraphs), uses text-to-speech (TTS) models to synthesize audio, optionally applies voice cloning, and outputs a final audiobook — ideal for people who prefer listening over reading, or for accessibility purposes. The tool supports a wide array of underlying TTS backends (XTTSv2, Bark, VITS, Fairseq, Tacotron2, YourTTS and more), which gives flexibility depending on hardware availability, voice preference, and language. ...

Downloads: 16 This Week

Last Update: 3 days ago
See Project
19

files-to-prompt

Concatenate a directory full of files into a single prompt

...The output format is flexible: you can emit plain text, Markdown with fenced code blocks, or a Claude-XML style format designed for structured multi-file prompts. It can read file paths from stdin (including NUL-separated paths), which makes it easy to combine with find, rg, or other shell tools.

Downloads: 2 This Week

Last Update: 2025-11-27
See Project
20

Auto Claude

Autonomous multi-session AI coding

...The project aims to make “agentic software engineering” feel like running a small virtual dev team by giving you an opinionated process for turning goals into scoped tasks and then driving those tasks to completion. It includes guardrails intended to keep automation safer, such as restricting file operations to the project workspace and controlling which commands can be run based on the detected tech stack.

Downloads: 12 This Week

Last Update: 2026-02-20
See Project
21

OpenCode

The open source coding agent

...It functions as a locally running assistant that can analyze codebases, execute development tasks, and interact with external tools while remaining highly configurable. The system is built around an agent model that can perform actions such as file manipulation, shell execution, and web access with user confirmation. Its provider-agnostic design allows developers to connect different large language model services rather than being locked into a single vendor. OpenCode aims to streamline everyday development workflows by combining automation with human oversight in a developer-first interface. ...

1 Review

Downloads: 56 This Week

Last Update: 5 hours ago
See Project
22

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 36 This Week

Last Update: 2025-12-05
See Project
23

Desktop Commander MCP

AI-powered MCP server for desktop file and terminal automation

Desktop Commander MCP is an advanced Model Context Protocol server designed to extend AI assistants with direct control over a user’s local machine, including the file system and terminal. It integrates with clients like Claude Desktop to enable AI-driven workflows such as editing files, executing commands, and automating development tasks from a single conversational interface. Desktop Commander MCP builds on top of an MCP filesystem server and enhances it with powerful search, replace, and code editing capabilities tailored for real-world development environments. ...

Downloads: 4 This Week

Last Update: 5 days ago
See Project
24

008

Open-source event-driven AI powered Softphone

...If your provider does not offer this feature, consider using a SIP proxy such as Kamailio, Opensip or Routr. The softphone is internally configured using a JSON definition. The configuration file can be loaded from either a server or a local file. 008 reads the file only once.

Downloads: 4 This Week

Last Update: 2025-01-06
See Project
25

Tiny CUDA Neural Networks

Lightning fast C++/CUDA neural network framework

This is a small, self-contained framework for training and querying neural networks. Most notably, it contains a lightning-fast "fully fused" multi-layer perceptron (technical paper), a versatile multiresolution hash encoding (technical paper), as well as support for various other input encodings, losses, and optimizers. We provide a sample application where an image function (x,y) -> (R,G,B) is learned. The fully fused MLP component of this framework requires a very large amount of shared...

Downloads: 2 This Week

Last Update: 2025-07-08
See Project