Collection of useful data science topics along with articles
Cloud-native open source data warehouse for analytics and AI queries
PostHog provides open-source web & product analytics
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
AI-data warehouse to enrich, transform and analyze unstructured data
Machine Learning automation and tracking
Toloka-Kit is a Python library for working with Toloka API
Data integration platform for ELT pipelines from APIs, databases
Training data (data labeling, annotation, workflow) for all data types
Codes/Notebooks for AI Projects
Big Model Application Development Practice 1
The open-source data curation platform for LLMs
Firebase Admin Python SDK
Common solutions and tools developed by Google Cloud
Edit PDF files with Nano Banana
MLOps simplified. From ML Pipeline ⇨ Data Product without the hassle
A collective list of free APIs
Efficient Triton Kernels for LLM Training
MCP server that integrates Confluence and Jira
Redis Python client
Simple crossplatform IDE for NASM, MASM, GAS and FASM languages
Comprehensive tutorial repository aimed at teaching the Python program
Self-hosted platform to unify wearable health data
Unified open dataset enabling cross-embodiment learning for robotics
Bibtex parser for Python 3