Collection of useful data science topics along with articles
Cloud-native open source data warehouse for analytics and AI queries
AI-data warehouse to enrich, transform and analyze unstructured data
Machine Learning automation and tracking
Toloka-Kit is a Python library for working with Toloka API
Training data (data labeling, annotation, workflow) for all data types
Codes/Notebooks for AI Projects
Big Model Application Development Practice 1
The open-source data curation platform for LLMs
MLOps simplified. From ML Pipeline ⇨ Data Product without the hassle
Efficient Triton Kernels for LLM Training
MCP server that integrates Confluence and Jira
E2M converts various file types (doc, docx, epub, html, htm, url
Examples of using E2B
Language-model investigation agent with a terminal UI
Explainability and Interpretability to Develop Reliable ML models
text and image to video generation: CogVideoX (2024) and CogVideo
kaldi-asr/kaldi is the official location of the Kaldi project
Open source multimodal creative AI assistant with infinite canvas tool
The Memory layer for AI Agents
machine learning tutorials (mainly in Python3)
A collection of scientific methods, processes, algorithms
Flowly is 100x faster than OpenClaw
The Python Code Tutorials
Persistent AI memory using local Markdown knowledge graphs