Structure-from-Motion and Multi-View Stereo
Audio Plugin for Audio to MIDI transcription using deep learning
⚡ Building applications with LLMs through composability ⚡
TTS with kokoro and onnx runtime
Browser userscript that enhances ChatGPT reliability and usability
Central interface to connect your LLM's with external data
Focus on creating classic Python small examples and cases
The common language for platforms, agents and businesses.
Writing AI Conference Papers: A Handbook for Beginners
Agent skills for Obsidian
The official Meta Llama 3 GitHub site
This project is a common knowledge point and code implementation
AI Code Security Anti-Patterns distilled from 150+ sources
Repository containing notebooks of my posts on Medium
How to optimize some algorithm in cuda
LTX-Video Support for ComfyUI
Claude Code Subagents & Commands Collection + CLI Tool
DeepSeek Coder: Let the Code Write Itself
Access to Anthropic's safety-first language model APIs
Use Microsoft Edge's online text-to-speech service from Python
Hosting, Registry, Gateway, and Chat Client
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
A TTS that fits in your CPU (and pocket)
Practical productivity tools for Claude Code, Codex-CLI
A Powerful Native Multimodal Model for Image Generation