Structure-from-Motion and Multi-View Stereo
Audio Plugin for Audio to MIDI transcription using deep learning
⚡ Building applications with LLMs through composability ⚡
TTS with kokoro and onnx runtime
Browser userscript that enhances ChatGPT reliability and usability
Central interface to connect your LLM's with external data
Focus on creating classic Python small examples and cases
Writing AI Conference Papers: A Handbook for Beginners
The common language for platforms, agents and businesses.
This project is a common knowledge point and code implementation
AI Code Security Anti-Patterns distilled from 150+ sources
Agent skills for Obsidian
The official Meta Llama 3 GitHub site
Repository containing notebooks of my posts on Medium
How to optimize some algorithm in cuda
LTX-Video Support for ComfyUI
Claude Code Subagents & Commands Collection + CLI Tool
DeepSeek Coder: Let the Code Write Itself
Access to Anthropic's safety-first language model APIs
Hosting, Registry, Gateway, and Chat Client
Use Microsoft Edge's online text-to-speech service from Python
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
A TTS that fits in your CPU (and pocket)
Practical productivity tools for Claude Code, Codex-CLI
A Powerful Native Multimodal Model for Image Generation