Search Results for "speed-dreeams" - Page 5

Sort By:

Showing 376 open source projects for "speed-dreeams"

View related business solutions

Python Clear Filters & Widen Search

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

PyTorch Lightning

The lightweight PyTorch wrapper for high-performance AI research

Scale your models, not your boilerplate with PyTorch Lightning! PyTorch Lightning is the ultimate PyTorch research framework that allows you to focus on the research while it takes care of everything else. It's designed to decouple the science from the engineering in your PyTorch code, simplifying complex network coding and giving you maximum flexibility. PyTorch Lightning can be used for just about any type of research, and was built for the fast inference needed in AI research and...

Downloads: 3 This Week

Last Update: 2026-01-30
See Project
2

DINOv2

PyTorch code and models for the DINOv2 self-supervised learning

...The repository includes code for training, evaluating, and feature extraction, with utilities to run k-NN or linear evaluation baselines to assess representation quality. Pretrained checkpoints cover multiple model sizes so practitioners can trade accuracy for speed and memory depending on their deployment constraints.

Downloads: 2 This Week

Last Update: 2026-02-24
See Project
3

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system

clip-retrieval is an open-source toolkit designed to build large-scale semantic search systems for images and text by leveraging CLIP embeddings to enable multimodal retrieval. It allows developers to compute embeddings for both images and text efficiently and then index them for fast similarity search across massive datasets. The system is optimized for performance and scalability, capable of processing tens or even hundreds of millions of embeddings using GPU acceleration. It includes...

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
4

how-to-optim-algorithm-in-cuda

How to optimize some algorithm in cuda

how-to-optim-algorithm-in-cuda is an open educational repository focused on teaching developers how to optimize algorithms for high-performance execution on GPUs using CUDA. The project combines technical notes, code examples, and practical experiments that demonstrate how common computational kernels can be optimized to improve speed and memory efficiency. Instead of presenting only theoretical explanations, the repository includes hand-written CUDA implementations of fundamental operations such as reductions, element-wise computations, softmax, and attention mechanisms. These examples show how different optimization techniques influence performance on modern GPU hardware and allow readers to experiment with real implementations. ...

Downloads: 1 This Week

Last Update: 4 days ago
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Minigrid

Simple and easily configurable grid world environments

...It provides a suite of simple 2D grid-based tasks (e.g., navigating mazes, unlocking doors, carrying keys) where an agent moves in discrete steps and interacts with objects. The design emphasizes speed (agents can run thousands of steps per second), low dependency overhead, and high customizability — making it easy to define new maps, new tasks, or wrappers. It supports the Gymnasium-style environment API so that RL researchers can plug it into their existing frameworks and algorithms with minimal adaptation. Because of its simplicity, it is often used for rapid prototyping, analytic experiments, curriculum learning, or pedagogical tutorials. ...

Downloads: 1 This Week

Last Update: 2025-11-25
See Project
6

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM

...It upgrades the base model with GLM’s hybrid pretraining objective, 1.4 TB bilingual data, and preference alignment—delivering big gains on MMLU, CEval, GSM8K, and BBH. The context window extends up to 32K (FlashAttention), and Multi-Query Attention improves speed and memory use. The repo includes Python APIs, CLI & web demos, OpenAI-style/FASTAPI servers, and quantized checkpoints for lightweight local deployment on GPUs or CPU/MPS.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
7

Hunyuan3D-1

A Unified Framework for Text-to-3D and Image-to-3D Generation

Hunyuan3D-1 is an earlier version in the same 3D generation line (the unified framework for text-to-3D and image-to-3D tasks) by Tencent Hunyuan. It provides a framework combining shape generation and texture synthesis, enabling users to create 3D assets from images or text conditions. While less advanced than version 2.1, it laid the foundations for the later PBR, higher resolution, and open-source enhancements. (Note: less detailed public documentation was found for Hunyuan3D-1 compared to...

Downloads: 1 This Week

Last Update: 2025-11-19
See Project
8

Mosec

A high-performance ML model serving framework, offers dynamic batching

Mosec is a high-performance and flexible model-serving framework for building ML model-enabled backend and microservices. It bridges the gap between any machine learning models you just trained and the efficient online service API.

Downloads: 1 This Week

Last Update: 5 days ago
See Project
9

PyTorch Ignite

Library to help with training and evaluating neural networks

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently. Less code than pure PyTorch while ensuring maximum control and simplicity. Library approach and no program's control inversion. Use ignite where and when you need. Extensible API for metrics, experiment managers, and other components. The cool thing with handlers is that they offer unparalleled flexibility (compared to, for example, callbacks). Handlers can be any function: e.g....

Downloads: 1 This Week

Last Update: 2026-03-27
See Project
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
10

TextDistance

Compute distance between sequences

Python library for comparing the distance between two or more sequences by many algorithms. For main algorithms, text distance try to call known external libraries (fastest first) if available (installed in your system) and possible (this implementation can compare this type of sequences). Install text distance with extras for this feature. Textdistance use benchmark results for algorithm optimization and try to call the fastest external lib first (if possible). TextDistance show benchmarks...

Downloads: 0 This Week

Last Update: 2024-07-16
See Project
11

Step-Audio

Open-source framework for intelligent speech interaction

Step-Audio is a unified, open-source framework aimed at building intelligent speech systems that combine both comprehension and generation: it integrates large language models (LLMs) with speech input/output to handle not only semantic understanding but also rich vocal characteristics like tone, style, dialect, emotion, and prosody. The design moves beyond traditional separate-component pipelines (ASR → text model → TTS), instead offering a multimodal model that ingests speech or audio and...

Downloads: 4 This Week

Last Update: 2026-03-16
See Project
12

Audiblez

Generate audiobooks from e-books

Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained...

Downloads: 3 This Week

Last Update: 2025-11-30
See Project
13

Masonite

The Modern And Developer Centric Python Web Framework

...Masonite is perfect for beginners getting their first web app deployed or advanced developers and businesses that need to reach for the full fleet of features available. Mail support for sending emails quickly. Queue support to speed your application up by sending jobs to run on a queue or asynchronously. Notifications for sending notifications to your users simply and effectively. Task scheduling to run your jobs on a schedule (like everyday at midnight) so you can set and forget your tasks. Events you can listen for to execute listeners that perform your tasks when certain events happen in your app. ...

Downloads: 3 This Week

Last Update: 2025-03-01
See Project
14

NGINX Admin’s Handbook

How to improve NGINX performance, security, and other important things

...The handbook spans fundamentals and advanced topics alike, from HTTP and SSL/TLS basics to reverse proxy patterns, performance tuning, debugging workflows, and hardening strategies. A centerpiece is its prioritized checklist of 79 rules, grouped by criticality, helping readers focus on what most impacts security, reliability, and speed. Instead of copy-paste snippets in isolation, it emphasizes understanding trade-offs, avoiding common pitfalls, and balancing security with usability. Designed for system administrators and web application engineers, it aims to be a living companion that encourages experimentation, measurement, and continuous improvement of NGINX configurations

Downloads: 2 This Week

Last Update: 1 day ago
See Project
15

The SpeechBrain Toolkit

A PyTorch-based Speech Toolkit

SpeechBrain is an open-source and all-in-one conversational AI toolkit. It is designed to be simple, extremely flexible, and user-friendly. Competitive or state-of-the-art performance is obtained in various domains. SpeechBrain supports state-of-the-art methods for end-to-end speech recognition, including models based on CTC, CTC+attention, transducers, transformers, and neural language models relying on recurrent neural networks and transformers. Speaker recognition is already deployed in a...

Downloads: 2 This Week

Last Update: 2026-03-30
See Project
16

Insanely Fast Whisper

An opinionated CLI to transcribe Audio files w/ Whisper on-device

Insanely Fast Whisper is a high-performance command-line tool designed to dramatically accelerate speech-to-text transcription using OpenAI’s Whisper models on local hardware. It leverages modern optimizations such as batch processing, mixed precision, and advanced attention mechanisms like Flash Attention to significantly reduce inference time while maintaining high transcription accuracy. The project is built on top of the Transformers ecosystem and integrates with libraries such as...

Downloads: 1 This Week

Last Update: 2026-03-26
See Project
17

MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models

...The framework integrates several architectural innovations—SwiGLU activation, deep and thin network design, embedding sharing, and grouped-query attention (GQA)—to achieve a superior trade-off between model size, inference speed, and accuracy. MobileLLM demonstrates remarkable performance, with the 125M and 350M variants outperforming previous state-of-the-art models of the same scale by up to 4.3% on zero-shot commonsense reasoning tasks.

Downloads: 1 This Week

Last Update: 2 days ago
See Project
18

Tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models

...It handles encoding and decoding text to token IDs efficiently, with minimal overhead. Because tokenization is a fundamental step in preparing text for models, tiktoken is optimized for speed, memory, and correctness in model contexts (e.g. matching OpenAI’s internal tokenization). The repo supports multiple encodings (e.g. “cl100k_base”) and lets users switch encoding names to match different model contexts. It also offers extension mechanisms so that custom encodings can be registered. Internally, it includes the core tokenizer logic (often implemented in Rust or efficient lower-level code), APIs for encoding, decoding, and counting tokens, and binding layers to Python (and sometimes other languages) for easy use.

Downloads: 1 This Week

Last Update: 2025-10-06
See Project
19

MiniCPM-o

A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming

MiniCPM-o 2.6 is a cutting-edge multimodal large language model (MLLM) designed for high-performance tasks across vision, speech, and video. Capable of running on end-side devices such as smartphones and tablets, it provides powerful features like real-time speech conversation, video understanding, and multimodal live streaming. With 8 billion parameters, MiniCPM-o 2.6 surpasses its predecessors in versatility and efficiency, making it one of the most robust models available. It supports...

Downloads: 1 This Week

Last Update: 2025-05-15
See Project
20

Mem0

The Memory layer for AI Agents

...Key features include enhancing future conversations by building smarter AI that learns from every interaction, reducing LLM costs by up to 80% through intelligent data filtering, delivering more accurate and personalized AI outputs by leveraging historical context, and offering easy integration compatible with platforms like OpenAI and Claude. Mem0 is perfect for projects such as customer support, where chatbots remember past interactions to reduce repetition and speed up resolution times; personal AI companions that recall preferences and past conversations for more meaningful interactions; AI agents that learn from each interaction to become more personalized and effective over time.

Downloads: 1 This Week

Last Update: 5 hours ago
See Project
21

asciinema

Open source terminal session recorder

asciinema is a free and open source terminal session recorder. It lets you easily record and play back terminal sessions in the terminal or in a web browser. Forget old screen recording methods and resulting blurry videos. asciinema lets you record your terminal sessions the right way, which is right where you work, in the terminal. Recording is as easy as running one command, and since it’s purely text-based you can copy and paste any content you want, simply pause the recording! You...

Downloads: 1 This Week

Last Update: 2026-03-01
See Project
22

alive-progress

A new kind of Progress Bar, with real-time throughput, ETA

alive-progress is an advanced Python progress bar library that introduces a highly animated and adaptive approach to tracking long-running tasks. Unlike traditional static progress indicators, it dynamically adjusts spinner speed and visual feedback based on actual throughput, giving users a more intuitive sense of activity. The library is designed with performance efficiency in mind, using multithreaded updates that minimize CPU overhead and terminal noise. It includes sophisticated ETA estimation powered by exponential smoothing algorithms, improving prediction accuracy for variable workloads. ...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
23

Pruna AI

Pruna is a model optimization framework built for developers

...Built with performance and developer ergonomics in mind, Pruna simplifies inference workflows by enabling multi-model orchestration, autoscaling, GPU resource allocation, and compatibility with popular open-source models. It is ideal for companies or teams looking to reduce reliance on external APIs while maintaining speed, cost-efficiency, and full control over their data and AI stack. With a focus on extensibility and observability, Pruna empowers engineers to scale LLM applications from prototype to production securely and reliably.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
24

PyTensor

Python library for defining and optimizing mathematical expressions

PyTensor is a fork of Aesara, a Python library for defining, optimizing, and efficiently evaluating mathematical expressions involving multi-dimensional arrays. PyTensor is based on Theano, which has been powering large-scale computationally intensive scientific investigations since 2007. A hackable, pure-Python codebase. Extensible graph framework is suitable for rapid development of custom operators and symbolic optimizations. Implements an extensible graph transpilation framework that...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
25

Stable Diffusion WebUI Forge

Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion

...It targets heavy users and researchers who push large models, control nets, and high-resolution pipelines where default settings can become bottlenecks. The fork typically introduces toggles for scheduler behavior, attention implementations, caching, and precision modes to reach better speed or quality on given hardware. It also focuses on stability during long sessions, aiming to reduce out-of-memory failures and provide clearer diagnostics when they occur. The UI surfaces advanced options in a way that remains recognizable to WebUI users, so migration costs are low while gaining experimental features. In practice, Forge serves as a proving ground for ideas that may later influence upstream tools, giving power users early access to cutting-edge techniques.

Downloads: 1 This Week

Last Update: 2025-10-21
See Project