Showing 27 open source projects for "cleaning"

  • 1
    NYC Taxi Data

    Import public NYC taxi and for-hire vehicle (Uber, Lyft)

    ...It collects and preprocesses large-scale trip datasets (fares, pickup/dropoff, timestamps, locations, passenger counts) to enable data analysis, modeling, and visualization efforts. The project includes scripts and notebooks for cleaning and filtering the raw data, memory-efficient processing for large CSV/Parquet files, and aggregation workflows (e.g. trips per hour, heatmaps of pickups/dropoffs). It also contains example analyses—spatial and temporal visualizations like maps, time-series plots, and hotspot detection—highlighting insights such as patterns of demand, peak times, and geospatial distributions. ...
    Downloads: 2 This Week
  • 2
    Laravel Chunk Upload

    The basic implementation for chunk upload with multiple providers

    Laravel Chunk Upload simplifies chunked uploads with support for multiple JavaScript libraries atop Laravel's file upload system, designed with a minimal memory footprint. Features include cross-domain request support, automatic cleaning, and intuitive usage.
    Downloads: 0 This Week
  • 3
    ExpressJS.com

    The Express.js Website

    This repository contains the source for the Express web site and documentation, the canonical reference for developers using the Express framework. It organizes material into API reference, getting-started guides, best practices, and topic-focused explanations such as routing, middleware, error handling, templating, and performance. Versioned docs help readers match API behavior to the framework version they’re running, reducing confusion during upgrades. The site showcases common patterns...
    Downloads: 3 This Week
  • 4
    LLM Datasets

    Curated list of datasets and tools for post-training

    LLM Datasets curates and standardizes datasets commonly used to train and fine-tune large language models, reducing the overhead of hunting down sources and normalizing formats. The repository aims to make datasets easy to inspect and transform, with scripts for downloading, deduping, cleaning, and converting to formats like JSONL that slot into training pipelines. It highlights instruction-tuning and conversation-style corpora while also pointing to code, math, or domain-specific sets for targeted capabilities. Quality is a recurring theme: examples and utilities help filter low-value samples, enforce length limits, and split train/validation consistently so results are comparable. ...
    Downloads: 5 This Week
  • 5
    The Data Engineering Handbook

    Links to everything you'd ever want to learn about data engineering

    ...Rather than being a code project itself, it’s a learning handbook that links to books, articles, tutorials, community groups, boot camps, and real-world project examples that collectively form a roadmap to mastering data engineering skills. It includes beginner and intermediate boot camps, interview guides, data cleaning and transformation resources, and curated lists of newsletters and industry communities, making it useful both for self-study and technical interview preparation. The repository is actively maintained and widely starred, reflecting its role as a go-to reference for newcomers and experienced practitioners alike.
    Downloads: 0 This Week
  • 6
    Buffalo

    Rapid Web Development w/ Go

    A Go web development ecosystem, designed to make your life easier. If you’re familiar with Rake tasks from Ruby, you’ll be right at home using Grift. Seeding a database and running a cleaning job are now at hand! Running background tasks is easy too, using the Background Workers. Writing a web application module always begins with the same tasks. Buffalo provides you a simple toolbox, the buffalo command, to generate many parts of your app and run the usual tasks. Extend the toolbox with plugins, using the language you want! ...
    Downloads: 1 This Week
  • 7
    Hotkeys JS

    A robust JavaScript library for capturing keyboard input

    ...Because it has no external dependencies and a small footprint, it drops easily into existing codebases. Its focus on developer ergonomics makes defining, managing, and cleaning up shortcuts straightforward.
    Downloads: 0 This Week
  • 8
    Branchless workflow for Git

    High-velocity, monorepo-scale workflow for Git

    ...Prototyping and experimenting workflows: strong support for prototyping and experimental work via "divergent" development. git sync: to rebase all local commit stacks and branches without having to check them out first. git move: The ability to move subtrees rather than "sticks" while cleaning up old branches, not touching the working copy, etc. Anonymous branching: reduces the overhead of branching for experimental work. In-memory operations: to modify the commit graph without having to check out the commits in question. git next/prev: to quickly jump between commits and branches in a commit stack.
    Downloads: 0 This Week
  • 9
    Java Tablesaw

    Java dataframe and visualization library

    Tablesaw is a dataframe and visualization library that supports loading, cleaning, transforming, filtering, and summarizing data. If you work with data in Java, it may save you time and effort. Tablesaw also supports descriptive statistics and can be used to prepare data for machine learning libraries such as Smile, Tribuo, H2O.ai, and DL4J. Import data from RDBMS, Excel, CSV, TSV, JSON, HTML, or fixed-width text files, whether they are local or remote (HTTP, S3, etc.).
    Downloads: 0 This Week
  • 10
    FengNiao

    A command line tool for cleaning unused resources in Xcode

    The FengNiao project is a lightweight command-line utility designed to help iOS/macOS developers clean up their Xcode projects by detecting and removing unused image resource files. As projects evolve, image assets often accumulate — many of which may no longer be referenced in code, storyboards, or asset catalogs; FengNiao scans the project directory (and subfolders) to identify those orphaned resources and optionally delete them, helping to reduce project bloat and potential app size. The...
    Downloads: 0 This Week
  • 11
    Vacuum Card

    Vacuum cleaner card for Home Assistant Lovelace UI

    Vacuum Card is a custom UI component for Home Assistant (specifically its Lovelace dashboard) that provides a user-friendly control panel for robot vacuum cleaners, something the stock Home Assistant UI doesn’t supply. With vacuum-card installed, users get a visual card on their dashboard showing the vacuum’s state (e.g. cleaning, docked, battery level) and controls to start, stop, pause, or send the robot back to dock, directly from their Home Assistant interface. The card aims to support as many robot models as possible by relying on standard vacuum service calls (start, pause, stop, return_to_base, etc.), allowing broad compatibility across brands. Installation can be done manually or via the Home Assistant Community Store (HACS), making setup accessible even for non-developers. ...
    Downloads: 0 This Week
  • 12
    Git Tips

    Compact knowledge base of Git command tips and workflows

    git-tips is a compact knowledge base of Git command tips and workflows designed to be quickly searchable and easy to memorize. It favors short, actionable examples that solve common problems like amending commits, cleaning branches, rewriting history, bisecting, stashing, or recovering lost work. Each tip shows the exact command and a brief explanation, reducing time spent digging through manual pages. The collection is useful for daily development, code reviews, and release management where precise Git operations matter. Because entries are concise and independent, you can dip in at any point and learn something useful in a minute or two. ...
    Downloads: 0 This Week
  • 13
    CSSO

    CSS minifier with structural optimizations

    CSSO (CSS Optimizer) is a CSS minifier. It performs three sorts of transformations: cleaning (removing redundancies), compression (replacing values with shorter forms), and restructuring (merging declarations, rules, and so on). As a result, the output CSS becomes much smaller. It can be loaded from a CDN service such as unpkg or jsDelivr. By default (with the short path), the ESM version is exposed. For the IIFE version, the full path to a bundle must be specified.
    Downloads: 0 This Week
  • 14
    codemaid

    CodeMaid is an open source Visual Studio extension to clean up

    CodeMaid is an open source Visual Studio extension to clean up and simplify our C#, C++, F#, VB, PHP, PowerShell, JSON, XAML, XML, ASP, HTML, CSS, LESS, SCSS, JavaScript and TypeScript coding. Clean up random white space into a simple standard order. Add unspecified access modifiers. Utilize Visual Studio’s built-in formatting capabilities. Remove and sort using statements. And do it all automatically on save or on demand, from an individual file to the entire solution. Visualize and navigate...
    Downloads: 0 This Week
  • 15
    Tidy

    The granddaddy of HTML tools, with support for modern standards

    ...The HTML Tidy library, libtidy, is used and incorporated into many applications and projects. It offers an extensive API to read in and parse HTML from a file or buffer into a DOM-like node tree, has cleaning and diagnostic services, etc.
    Downloads: 0 This Week
  • 16
    MappingGenerator

    Code fix provider that generates mapping code at design time

    MappingGenerator was initially created as a design-time alternative to AutoMapper. Now it is evolving into a coding assistant to which you can delegate the most mundane coding tasks. It offers a code action that generates an implementation for different mapping methods based on the method signature, a code action that automatically completes a method invocation with locally available variables, and a code action that automatically completes an object initialization block with locally available variables. Code action...
    Downloads: 0 This Week
  • 17
    MLBox

    MLBox is a powerful Automated Machine Learning Python library

    MLBox is a powerful Automated Machine Learning Python library. It offers fast reading and distributed data preprocessing, cleaning, and formatting; highly robust feature selection and leak detection; accurate hyper-parameter optimization in high-dimensional space; state-of-the-art predictive models for classification and regression (Deep Learning, Stacking, LightGBM, ...); and prediction with model interpretation. MLBox has been developed and used by many active community members.
    Downloads: 0 This Week
  • 18
    Dr0p1t-Framework

    A framework that creates an advanced stealthy dropper

    ...The framework includes features such as antivirus evasion, privilege escalation, and system persistence, enabling it to maintain access on compromised systems. It also incorporates techniques to avoid forensic analysis, such as self-deletion and cleaning traces after execution. The generated executables are optimized to be small and efficient, improving their ability to bypass security controls.
    Downloads: 1 This Week
  • 19
    fooltrader

    Quant framework for stock

    Build a standard data schema, then implement connectors that import data from the systems you are familiar with for analysis. fooltrader is a quantitative analysis trading system designed using big data technology, covering data capture, cleaning, structuring, calculation, display, backtesting, and trading. Its goal is to provide a unified framework for the whole market (stocks, futures, bonds, foreign exchange, digital currency, macroeconomics, etc.) for research, backtesting, forecasting, and trading. Its intended users include quantitative traders, teachers and students majoring in finance, people interested in economic data, programmers, and people who like freedom and the spirit of exploration. ...
    Downloads: 0 This Week
  • 20
    vinegar.vim

    Combine with netrw to create a delicious salad dressing

    ...It simplifies entering and leaving netrw directories, maps - to open the parent directory, and streamlines buffer management between file views. The plugin also integrates better with splits and registers, allowing you to edit, copy, and preview files directly from netrw. By cleaning up clutter and aligning behaviors with Vim’s philosophy, vinegar effectively turns netrw into a lightweight project explorer without adding bloat. Its power lies in subtle polish—no new UI, just smarter defaults that make everyday navigation feel natural. For users who prefer staying entirely within Vim, vinegar transforms file browsing into a frictionless experience.
    Downloads: 0 This Week
  • 21
    DataScienceR

    A curated list of R tutorials for Data Science, NLP

    The DataScienceR repository is a curated collection of tutorials, sample code, and project templates for learning data science using the R programming language. It includes an assortment of exercises, sample datasets, and instructional code that cover the core steps of a data science project: data ingestion, cleaning, exploratory analysis, modeling, evaluation, and visualization. Many of the modules demonstrate best practices in R, such as using the tidyverse, R Markdown, modular scripting, and reproducible workflows. The repository also shows examples of linking R with external resources — APIs, databases, and file formats — and integrating into larger pipelines. ...
    Downloads: 0 This Week
  • 22
    Ansistrano

    Ansible role to deploy scripting applications like PHP, Python, Ruby

    ...We support deploying by pulling your code from a version control system (Git, Mercurial, or Subversion), downloading it from a repository (S3 or a ZIP file), or pushing it (copying or rsyncing). Ansistrano offers a deployment flow with hooks. Attach any specific tasks before or after every step to tune your workflow (restarting servers, cleaning caches, etc.). Capistrano is a remote server automation tool, currently at Version 3. Version 2.0 was originally designed to deploy Ruby on Rails (RoR) applications. With additional plugins, you could also deploy non-Rails applications such as PHP and Python.
    Downloads: 0 This Week
  • 23
    neobundle.vim

    Next generation Vim package manager

    NeoBundle is a legacy plugin manager by Shougo designed to make installing, updating, and organizing Vim plugins straightforward and scriptable. It lets you declare plugins in your vimrc and handles fetching them from Git repositories, local paths, or archives, then loads them into your runtimepath automatically. One of its core ideas is lazy loading: you can defer plugin initialization until a command, mapping, filetype, or event is triggered, which helps keep startup fast even with many...
    Downloads: 0 This Week
  • 24
    Source code for a virus removal tool. You are welcome to combine it with your own code or develop it further. Thanks.
    Downloads: 0 This Week
  • 25
    XMUltra is a Java- and XML-based feed-processing framework. It features utilities for many common tasks, such as FTPing files, zipping files, watching directories, XSL transformation, and cleaning out archives.
    Downloads: 0 This Week