A high-quality tool for converting PDF to Markdown and JSON
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
A framework for real-life data science
A real-time visualisation of the CO2 emissions of electricity
Project structure for doing and sharing data science work
Privacy- and security-focused Segment alternative, in Golang
Scalable and Flexible Gradient Boosting
GPU DataFrame Library
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Exploratory analysis of Bayesian models with Julia
Collaborative forensic timeline analysis
Scale your Pandas workflows by changing a single line of code
Uncover insights, surface problems, monitor, and fine-tune your LLM
The open-source tool for building high-quality datasets
AutoGluon: AutoML for Image, Text, and Tabular Data
Tokenization for Julia source code
An optimized graphs package for the Julia programming language
Julia DataFrames serialization format
Package to make C++ libraries available in Julia
Python Stream Processing
Build, run, and manage data pipelines for integrating data
A toolkit to run Ray applications on Kubernetes
A more accurate representation of Jupyter notebooks
High-Performance Symbolic Regression in Python and Julia
Monitor the stability of a Pandas or Spark DataFrame