819 projects for "ekho-data" with 2 filters applied:

  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Profile Data

    Profile Data

    Analyze computation-communication overlap in V3/R1

    profile-data is a repository that publishes profiling traces and metrics from DeepSeek’s training and inference infrastructure (especially during DeepSeek-V3 / R1 experiments). The profiling data targets insights into computation-communication overlap, pipeline scheduling (e.g. DualPipe), and how MoE / EP / parallelism strategies interact in real systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Data Version Control

    Data Version Control

    Git-based data version control for machine learning workflows

    DVC (Data Version Control) is an open source tool designed to bring version control principles to machine learning and data science workflows. It enables developers and data scientists to track datasets, machine learning models, and experiment results in a way that integrates with existing Git repositories. Instead of storing large datasets directly in Git, DVC keeps lightweight metadata in the repository while storing the actual data in external storage systems. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    Synthetic Data Generator

    Synthetic Data Generator

    SDG is a specialized framework

    ...It also includes a data processing module capable of handling different data types, preprocessing columns, managing missing values, and converting formats automatically before model training.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    DATA SCIENCE ROADMAP

    DATA SCIENCE ROADMAP

    Data Science Roadmap from A to Z

    DATA SCIENCE ROADMAP is an educational repository designed to guide learners through the process of becoming proficient in data science and machine learning. The project presents a structured roadmap that outlines the knowledge and skills required for different stages of a data science career. Topics typically include programming with Python, statistics, mathematics, machine learning algorithms, data visualization, and big data technologies. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 5
    Agentic Data Scientist

    Agentic Data Scientist

    An end-to-end Data Scientist

    Agentic Data Scientist is an experimental AI-driven research framework that orchestrates data science workflows through autonomous agents that can reason, plan, and execute complex analytics tasks. Unlike traditional scripted pipelines, this project lets AI agents break down high-level research goals into sub-tasks such as data acquisition, cleaning, modeling, evaluation, and reporting, with minimal human direction.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Data Science Articles from CodeCut

    Data Science Articles from CodeCut

    Collection of useful data science topics along with articles

    The Data-science repository from CodeCutTech is a curated collection of educational content focused on practical tools and workflows used in modern data science projects. Instead of providing a single software package, the repository aggregates articles, tutorials, and examples covering many topics within the data science ecosystem. The materials address areas such as MLOps, data management, project organization, testing practices, visualization techniques, and productivity tools used by data scientists. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Data-Science-Interview-Questions-Answers

    Data-Science-Interview-Questions-Answers

    Curated list of data science interview questions and answers

    Data-Science-Interview-Questions-Answers is a curated educational repository designed to help data science candidates prepare for technical interviews by organizing a large bank of questions and answers in one place. It began as a daily interview question initiative and was later consolidated into GitHub so learners could review the material more easily and revisit it over time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Data Science Interviews

    Data Science Interviews

    Data science interview questions and answers

    Data Science Interviews is an open-source repository that collects common data science interview questions along with community-provided answers and explanations. The project serves as a preparation resource for students, job seekers, and professionals who want to review the technical knowledge required for data science roles. The repository organizes questions into different categories including theoretical machine learning concepts, technical programming questions, and probability or statistics problems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Dash Data Agent

    Dash Data Agent

    Self-learning data agent that grounds its answers in layers of content

    Dash is a self-learning data agent built by the Agno AI community that generates grounded answers to English queries over structured data by synthesizing SQL and reasoning based on six layers of context, improving automatically with each run. It sidesteps common limitations of simple text-to-SQL agents by incorporating multiple context layers — including schema structure, human annotations, known query patterns, institutional knowledge from docs, machine-discovered error patterns, and live runtime context — to generate SQL queries that are both technically correct and semantically meaningful. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 10
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 253 This Week
    Last Update:
    See Project
  • 11
    Book6_First-Course-in-Data-Science

    Book6_First-Course-in-Data-Science

    From Addition, Subtraction, Multiplication, and Division to ML

    Book6_First-Course-in-Data-Science is an open-source educational project that serves as part of the “Iris Book” series focused on teaching data science and machine learning concepts through a combination of mathematics, programming, and visualization. The repository contains draft chapters, supporting Python code, and visual materials designed to guide readers from basic mathematical operations toward practical machine learning understanding.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    WeChatMsg

    WeChatMsg

    Project aimed at extracting, exporting, and analyzing chat records

    ...Although the repository has seen periods of inactivity and may not receive frequent updates, its widespread use indicates community interest in preserving chat logs and understanding conversation data outside of the WeChat interface.
    Downloads: 2,965 This Week
    Last Update:
    See Project
  • 13
    cracking-the-data-science-interview

    cracking-the-data-science-interview

    A Collection of Cheatsheets, Books, Questions, and Portfolio

    Cracking the Data Science Interview is an open educational repository that collects study materials, resources, and reference links for preparing for data science interviews. The project organizes content across many fundamental areas of data science, including statistics, probability, SQL, machine learning, and deep learning. It includes cheat sheets that summarize important technical concepts commonly discussed during technical interviews.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Machine Learning and Data Science Apps

    Machine Learning and Data Science Apps

    A curated list of applied machine learning and data science notebooks

    ...Most examples are written in Python and frequently use Jupyter notebooks to present practical implementations and experiments. The project encourages contributions from data scientists and domain experts who want to share applied analytics projects and techniques that address real business challenges.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    graphify

    graphify

    AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)

    ...Overall, graphify serves as a bridge between raw data and visual insight.
    Downloads: 33 This Week
    Last Update:
    See Project
  • 16
    Claude Code

    Claude Code

    Claude Code is an agentic coding tool that lives in your terminal

    ...Claude Code integrates seamlessly into your terminal, IDE, or GitHub by tagging @claude to interact with your code context. The tool is designed to simplify development by automating repetitive work and providing instant clarifications on code behavior. User feedback and usage data are collected responsibly, with strict privacy safeguards and limited retention, ensuring no feedback is used to train generative models. Claude Code is open and actively maintained with community-driven bug reporting and feature requests. Its natural language interface makes advanced coding workflows accessible without leaving your coding environment.
    Downloads: 707 This Week
    Last Update:
    See Project
  • 17
    Mooncake

    Mooncake

    Mooncake is the serving platform for Kimi

    Mooncake is an open-source infrastructure platform designed to optimize large language model serving by focusing on efficient management and transfer of model data and KV cache. The platform was originally developed as part of the serving infrastructure for the Kimi large language model system. Its architecture centers on a high-performance transfer engine that provides unified data transfer across different storage and networking technologies. This engine enables efficient movement of tensors and model data across heterogeneous environments such as GPU memory, system memory, and distributed storage systems. ...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 18
    Apache Hamilton

    Apache Hamilton

    Helps data scientists define testable self-documenting dataflows

    ...This approach encourages modular, testable, and maintainable data pipelines because each transformation is isolated and easily unit tested. The framework also automatically tracks lineage and metadata about how data is produced, which improves debugging, reproducibility, and transparency in data workflows.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    Aix-DB

    Aix-DB

    Based on the LangChain/LangGraph framework

    ...The platform supports multiple types of data sources and provides an end-to-end pipeline that includes intent recognition, SQL generation, database execution, and visual presentation of results. Its architecture includes multiple layers such as a web interface, API gateway, AI service layer, and data storage layer that support relational databases, vector stores, graph databases, and file systems.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    Pixeltable

    Pixeltable

    Data Infrastructure providing an approach to multimodal AI workloads

    ...Developers define data transformations and AI operations using computed columns on tables, allowing pipelines to evolve incrementally as new data or models are added. The framework supports multimodal content including images, video, text, and audio, enabling applications such as retrieval-augmented generation systems, semantic search, and multimedia analytics.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 21
    OpenHealth

    OpenHealth

    AI health assistant for private, local data-driven insights mgmt

    ...A strong emphasis is placed on privacy, as the platform can run entirely locally, ensuring that sensitive medical data does not need to leave the user’s environment. OpenHealth also includes a data parsing layer that transforms raw medical inputs into structured datasets, making them usable for analysis and AI-driven insights. OpenHealth separates data ingestion, processing, and AI interaction, enabling flexibility in integrating different models and data sources.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Eidos

    Eidos

    An extensible framework for Personal Data Management

    Eidos is an extensible personal data management platform designed to help users organize and interact with their information using a local-first architecture. The system transforms SQLite into a flexible personal database that can store structured and unstructured information such as notes, documents, datasets, and knowledge resources. Its interface is inspired by tools like Notion, allowing users to create documents, databases, and custom views to organize personal information. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 23
    NeuralNote

    NeuralNote

    Audio Plugin for Audio to MIDI transcription using deep learning

    NeuralNote is an open-source audio software tool designed to convert recorded audio into MIDI data using modern machine learning techniques. The software functions as an audio plugin that can be used inside digital audio workstations as well as a standalone application for music production and analysis. Its main purpose is to perform audio-to-MIDI transcription, allowing musicians to record a performance and automatically transform it into editable MIDI notes.
    Downloads: 96 This Week
    Last Update:
    See Project
  • 24
    Superduper

    Superduper

    Superduper: Integrate AI models and machine learning workflows

    ...This allows developers to completely avoid implementing MLOps, ETL pipelines, model deployment, data migration, and synchronization. Using Superduper is simply "CAPE": Connect to your data, apply arbitrary AI to that data, package and reuse the application on arbitrary data, and execute AI-database queries and predictions on the resulting AI outputs and data.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    AI-Crawler

    AI-Crawler

    Crawl a website starting from a URL, find relevant pages

    ...Users can define their data requirements in plain English, and the system will interpret those instructions to crawl a domain and extract structured data. The tool supports output formats such as JSON and Markdown, and it can generate or accept schemas to ensure that extracted data is structured according to application needs. It is designed as a low-code solution, reducing the complexity of building and maintaining custom scraping pipelines.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB