Data science on data without acquiring a copy
Tool for producing high quality forecasts for time series data
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
airda(Air Data Agent
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Always know what to expect from your data
Benchmarking synthetic data generation methods
Create HTML profiling reports from pandas DataFrame objects
Accurately Locate Smartphones using Social Engineering
The open standard for data logging
CLI tool to filter JSON and JSON Lines data with Python syntax
A Model Context Protocol (MCP) server that enables AI assistants
WebGL-based viewer for volumetric data
Uncover insights, surface problems, monitor, and fine tune your LLM
ExtractThinker is a Document Intelligence library for LLMs
LLM based data scientist, AI native data application
Detecting silent model failure. NannyML estimates performance
LaTeX CV generator from a YAML/JSON input file
A file based wiki that uses markdown
Code for running inference and finetuning with SAM 3 model
Situational Awareness Server compatible with TAK clients
TikZ figures for concepts in physics/chemistry/ML
Open-Source Python3 tool for recognizing layouts, tables, and math
Free open source tool for real-time PC hardware sensor monitoring
Efficient Triton Kernels for LLM Training