Page 2 | modes free download

Showing 118 open source projects for "modes"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
1

Handy STT

A free, open source, and extensible speech-to-text application

Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active...

Downloads: 40 This Week

Last Update: 2026-04-27
See Project
2

RealtimeSTT

A robust, efficient, low-latency speech-to-text library

RealtimeSTT is a Python-based realtime speech-to-text engine emphasizing low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, suitable for building voice assistants, meeting transcribers, or live caption systems.

Downloads: 3 This Week

Last Update: 3 days ago
See Project
3

MiniCPM4.1

Achieving 3+ generation speedup on reasoning tasks

MiniCPM4.1 is an enhanced iteration of the MiniCPM4 architecture, introducing improvements in reasoning capabilities, inference speed, and hybrid operation modes that allow dynamic switching between deep reasoning and standard generation. It builds upon the same efficiency-focused philosophy but further optimizes decoding performance, achieving substantial speed gains in reasoning-intensive tasks while maintaining high-quality outputs. One of its key innovations is the hybrid reasoning mode, which allows developers to control whether the model engages in deeper reasoning processes or faster responses depending on the use case. ...

Downloads: 0 This Week

Last Update: 2026-04-13
See Project
4

Shadcn UI v4 MCP Server

A mcp server to allow LLMS gain context about shadcn ui component

...It includes smart caching and efficient GitHub API usage to optimize performance and handle rate limits during component retrieval. The system also supports multiple transport modes such as standard input/output and Server-Sent Events, enabling both local and distributed deployments.

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
Go from Code to Production URL in Seconds
Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.

Try it free
5

LLMChat

Unified interface for AI chat, Agentic workflows and more

...One of its primary goals is to support sophisticated research workflows that combine conversational AI with information retrieval and reasoning tools. The platform includes specialized interaction modes such as deep research analysis and enhanced search capabilities that help users explore complex topics more effectively. It also incorporates agent-style workflows that allow the system to orchestrate multiple steps of reasoning or data retrieval during a conversation.

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
6

slime LLM

slime is an LLM post-training framework for RL Scaling

...It offers a flexible architecture that connects high-throughput training (e.g., via Megatron-LM) with a customizable data generation pipeline, enabling researchers and engineers to iterate on new RL training paradigms effectively. The framework is designed to support a wide range of training modes, allowing both synchronous and asynchronous RL workflows and programmable rollout interfaces that simplify experimentation with custom environments and reward signals. Because it integrates tightly with SGLang and other training engines, slime can improve scalability and efficiency while providing maintainability and adaptability for developing new models and training algorithms.

Downloads: 0 This Week

Last Update: 2026-03-29
See Project
7

Sopro TTS

A lightweight text-to-speech model with zero-shot voice cloning

...The model is designed to work with a small set of dependencies and to be accessible for developers who want offline TTS with customizable voice style, including options for streaming or non-streaming generation modes. Users can install it with standard Python tools, run a demo server locally, and experiment with CLI or Python API usage for producing synthetic speech.

Downloads: 0 This Week

Last Update: 2026-02-06
See Project
8

FiftyOne

The open-source tool for building high-quality datasets

...FiftyOne supercharges your machine learning workflows by enabling you to visualize datasets and interpret models faster and more effectively. Improving data quality and understanding your model’s failure modes are the most impactful ways to boost the performance of your model. FiftyOne provides the building blocks for optimizing your dataset analysis pipeline. Use it to get hands-on with your data, including visualizing complex labels, evaluating your models, exploring scenarios of interest, identifying failure modes, finding annotation mistakes, and much more! ...

Downloads: 0 This Week

Last Update: 2026-05-02
See Project
9

Steel Browser

Open Source Browser API for AI Agents & Apps

Steel Browser is a privacy-focused web browser built with security and performance optimizations, designed to minimize tracking and enhance user control.

Downloads: 2 This Week

Last Update: 2026-04-24
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

LangGraph Studio

Desktop app for prototyping and debugging LangGraph applications

...With visual graphs and the ability to edit state, you can better understand agent workflows and iterate faster. LangGraph Studio integrates with LangSmith so you can collaborate with teammates to debug failure modes. While in Beta, LangGraph Studio is available for free to all LangSmith users on any plan tier. LangGraph Studio requires docker-compose version 2.22.0+ or higher. Please make sure you have Docker installed and running before continuing. When you open LangGraph Studio desktop app for the first time, you need to login via LangSmith. ...

Downloads: 22 This Week

Last Update: 2025-03-06
See Project
11

WhisperJAV

Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

WhisperJAV is an open-source speech transcription pipeline designed specifically for generating subtitles for Japanese adult video content. The project addresses challenges that standard speech recognition models face when transcribing this type of audio, which often includes low signal-to-noise ratios and large numbers of non-verbal vocalizations. Traditional automatic speech recognition systems can misinterpret these sounds as words, leading to inaccurate transcripts. WhisperJAV introduces...

Downloads: 21 This Week

Last Update: 2026-05-11
See Project
12

AI Coding Dictionary

AI coding jargon, explained in plain English

...It is designed for developers who use coding agents but feel slowed down by unclear vocabulary, hidden assumptions, confusing billing concepts, and inconsistent model behavior. The dictionary organizes terms across models, context windows, tools, environments, failure modes, handoffs, and related AI coding workflows. Its tone is intentionally direct and accessible, avoiding dense academic or vendor-heavy language. The project helps users name what is happening when prompts fail, context degrades, tools behave unexpectedly, or agent sessions become hard to manage. Its main value is turning AI coding jargon into practical vocabulary that developers can understand quickly and use in real work.

Downloads: 6 This Week

Last Update: 5 days ago
See Project
13

Claude Code Haha

Claude Code leaked source - locally runnable version

...Despite its informal tone, it still provides insight into how coding agents can be structured and extended. It is particularly useful for understanding limitations, failure modes, and creative applications of AI-driven development tools.

Downloads: 7 This Week

Last Update: 4 days ago
See Project
14

AlphaFold 3

AlphaFold 3 inference pipeline

AlphaFold 3, developed by Google DeepMind, is an advanced deep learning system for predicting biomolecular structures and interactions with exceptional accuracy. This repository provides the complete inference pipeline for running AlphaFold 3, though access to the model parameters is restricted and must be obtained directly from Google under specific terms of use. The system is designed for scientific research applications in structural biology, biochemistry, and bioinformatics, enabling...

Downloads: 6 This Week

Last Update: 2026-04-28
See Project
15

Qwen2-Audio

Repo of Qwen2-Audio chat & pretrained large audio language model

...It is trained to accept various audio signal inputs (including speech, sounds, etc.) and perform both voice chat and audio analysis, producing textual responses. It supports two major modes: Voice Chat (interactive voice only input) and Audio Analysis (audio + text instructions), with both base and instruction-tuned models. It is evaluated on many benchmarks (speech recognition, translation, sound classification, emotion, etc.), and offers pretrained models (e.g. 7B) released via ModelScope and Hugging Face. Code & examples provided with Hugging Face transformers, and usage via AutoProcessor, model classes etc. ...

Downloads: 0 This Week

Last Update: 2025-09-23
See Project
16

ZeroClaw

Fast, small, and fully autonomous AI assistant infrastructure

ZeroClaw is a Rust-native autonomous AI agent framework engineered for teams and developers who need highly efficient, secure, and modular AI automation infrastructure that can run reliably in both production and self-hosted environments. It is designed around a trait-based architecture so that model providers, communication channels, memory systems, and tooling integrations can be swapped or extended without rewriting core components, giving engineers flexibility and long-term...

Downloads: 12 This Week

Last Update: 2026-05-08
See Project
17

ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages

ebook2audiobook is a tool to convert legally obtained eBooks (non-DRM) into fully narrated audiobooks, complete with chapters and metadata. It automates the pipeline: it reads the eBook file, splits it into appropriate segments (chapters, paragraphs), uses text-to-speech (TTS) models to synthesize audio, optionally applies voice cloning, and outputs a final audiobook — ideal for people who prefer listening over reading, or for accessibility purposes. The tool supports a wide array of...

Downloads: 12 This Week

Last Update: 3 days ago
See Project
18

llmfit

157 models, 30 providers, one command to find what runs on hardware

llmfit is a terminal-based utility that helps developers determine which large language models can realistically run on their local hardware by analyzing system resources and model requirements. The tool automatically detects CPU, RAM, GPU, and VRAM specifications, then ranks available models based on performance factors such as speed, quality, and memory fit. It provides both an interactive terminal user interface and a traditional CLI mode, enabling flexible workflows for different user...

Downloads: 11 This Week

Last Update: 2 days ago
See Project
19

SAM 3D Objects

Models for object and human mesh reconstruction

SAM 3D Objects is a foundation model that reconstructs full 3D geometry, texture, and spatial layout of objects and scenes from a single image. Given one RGB image and object masks (for example, from the Segment Anything family), it can generate a textured 3D mesh for each object, including pose and approximate scene layout. The model is specifically designed to be robust in real-world images with clutter, occlusions, small objects, and unusual viewpoints, where many earlier 3D-from-image...

Downloads: 15 This Week

Last Update: 2026-01-07
See Project
20

Velocity server

The modern, next-generation Minecraft server proxy

...Acting as an intermediary between players and backend servers, Velocity manages player connections and routes them to different game servers within a network. This architecture allows large Minecraft communities to run multiple servers for different game modes while presenting them as a unified system to players. The software is designed with a focus on performance, scalability, and modern architecture, allowing it to handle thousands of simultaneous players efficiently. Velocity also includes a plugin API that allows developers to extend the proxy with custom functionality and integrate it with existing server tools. ...

Downloads: 7 This Week

Last Update: 3 days ago
See Project
21

MiMo-V2-Flash

MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation

MiMo-V2-Flash is a large Mixture-of-Experts language model designed to deliver strong reasoning, coding, and agentic-task performance while keeping inference fast and cost-efficient. It uses an MoE setup where a very large total parameter count is available, but only a smaller subset is activated per token, which helps balance capability with runtime efficiency. The project positions the model for workflows that require tool use, multi-step planning, and higher throughput, rather than only...

Downloads: 10 This Week

Last Update: 2026-01-08
See Project
22

SoniTranslate

Synchronized Translation for Videos

...The project supports a wide range of languages for translation, spanning major world languages (English, Spanish, French, German, Chinese, Arabic, etc.) and many regional or less widely spoken languages, making it suitable for broad internationalization. It offers multiple usage modes, including a Colab notebook for cloud-based experimentation, a Hugging Face Space demo for quick trials, and instructions.

Downloads: 41 This Week

Last Update: 2025-11-28
See Project
23

Qwen3

Qwen3 is the large language model series developed by Qwen team

Qwen3 is a cutting-edge large language model (LLM) series developed by the Qwen team at Alibaba Cloud. The latest updated version, Qwen3-235B-A22B-Instruct-2507, features significant improvements in instruction-following, reasoning, knowledge coverage, and long-context understanding up to 256K tokens. It delivers higher quality and more helpful text generation across multiple languages and domains, including mathematics, coding, science, and tool usage. Various quantized versions,...

1 Review

Downloads: 11 This Week

Last Update: 2026-01-09
See Project
24

PyGPT

Open source personal AI Assistant for Linux, Windows and Mac

PyGPT is a desktop application that allows you to talk to OpenAI's LLM models such as GPT4 and GPT3 using your own computer and OpenAI API. It allows you to talk in chat mode and in completion mode, as well as generate images using DALL-E 2. PyGPT also adds access to the Internet for GPT via Google Custom Search API and Wikipedia API and includes voice synthesis using Microsoft Azure Text-to-Speech API. Moreover, the application has implemented context memory support, context storage,...

Downloads: 9 This Week

Last Update: 2026-02-06
See Project
25

Prompt Optimizer

A prompt word optimizer to help write high-quality prompt words

...It focuses on automating and streamlining the iterative refinement of prompts by analyzing examples, comparing original and optimized text, and guiding users through multi-round improvements that surface clarity, structure, and specificity. With support for different deployment modes including web apps, desktop apps, Chrome plugins, and Docker containers, Prompt-Optimizer offers flexibility that suits both individual developers and teams working in diverse environments. It also includes advanced capabilities like multi-model integration, context testing, and real-time comparison of prompt outputs, helping users to see exactly how prompt changes influence results.

Downloads: 4 This Week

Last Update: 2026-05-16
See Project