Showing 60 open source projects for "csv"

View related business solutions
  • Context for your AI agents Icon
    Context for your AI agents

    Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

    Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
    Try for free
  • Run applications fast and securely in a fully managed environment Icon
    Run applications fast and securely in a fully managed environment

    Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.

    Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
    Try for free
  • 1
    CSV

    CSV

    Utility library for working with CSV and other delimited files

    Welcome to CSV.jl! A pure-Julia package for handling delimited text data, be it comma-delimited (csv), tab-delimited (tsv), or otherwise. A fast, flexible delimited file reader/writer for Julia.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    CSV Lint

    CSV Lint

    CSV Lint plug-in for Notepad++ for syntax highlighting

    CSV Lint plug-in for Notepad++ for syntax highlighting, csv validation, automatic column and datatype detecting fixed width datasets, change datetime format, decimal separator, sort data, count unique values, convert to xml, json, sql etc. A plugin for data cleaning and working with messy data files. Use CSV Lint for metadata discovery, technical data validation, and reformatting on tabular data files.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 3
    DuckDB

    DuckDB

    DuckDB is an in-process SQL OLAP Database Management System

    ...DuckDB supports arbitrary and nested correlated subqueries, window functions, collations, complex types (arrays, structs), and more. For more information on the goals of DuckDB, please refer to the Why DuckDB page on our website. Processing and storing tabular datasets, e.g. from CSV or Parquet files. Interactive data analysis, e.g. Joining & aggregate multiple large tables. Concurrent large changes, to multiple large tables, e.g. appending rows, adding/removing/updating columns. Large result set transfer to client. For development, DuckDB requires CMake, Python3 and a C++11 compliant compiler. Run make in the root directory to compile the sources. ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 4
    PlotJuggler

    PlotJuggler

    The Time Series Visualization Tool that you deserve

    Fast, intuitive, and extensible time series visualization tool. Its Drag & Drop interface is designed to maximize both simplicity and speed. PlotJuggler is perfect for visualizing logs, offline and real-time data, and it can be used in multiple fields. PlotJuggler can be connected to an external application using any inter-process communication and display data in real time. Thanks to its plugin-based architecture, it is easy to add new data sources and functionalities. If needed, you can...
    Downloads: 58 This Week
    Last Update:
    See Project
  • Free and Open Source HR Software Icon
    Free and Open Source HR Software

    OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.

    Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
    Learn More
  • 5
    Java Tablesaw

    Java Tablesaw

    Java dataframe and visualization library

    ...Tablesaw also supports descriptive statistics and can be used to prepare data for working with machine learning libraries like Smile, Tribuo, H20.ai, DL4J. Import data from RDBMS, Excel, CSV, TSV, JSON, HTML, or Fixed Width text files, whether they are local or remote (http, S3, etc.) Tablesaw supports data visualization by providing a wrapper for the Plot.ly JavaScript plotting library. Here are a few examples of the new library in action. Descriptive stats: mean, min, max, median, sum, product, standard deviation, variance, percentiles, geometric mean, skewness, kurtosis, etc. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    AWS SDK for pandas

    AWS SDK for pandas

    Easy integration with Athena, Glue, Redshift, Timestream, Neptune

    aws-sdk-pandas (formerly AWS Data Wrangler) bridges pandas with the AWS analytics stack so DataFrames flow seamlessly to and from cloud services. With a few lines of code, you can read from and write to Amazon S3 in Parquet/CSV/JSON/ORC, register tables in the AWS Glue Data Catalog, and query with Amazon Athena directly into pandas. The library abstracts efficient patterns like partitioning, compression, and vectorized I/O so you get performant data lake operations without hand-rolling boilerplate. It also supports Redshift, OpenSearch, and other services, enabling ETL tasks that blend SQL engines and Python transformations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Explorer

    Explorer

    Series (one-dimensional) and dataframes (two-dimensional)

    Explorer brings series (one-dimensional) and data frames (two-dimensional) to Elixir for fast data exploration.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Quadratic

    Quadratic

    Data science spreadsheet with Python & SQL

    ...Quadratic is a Web-based spreadsheet application that runs in the browser and as a native app (via Electron). Our goal is to build a spreadsheet that enables you to pull your data from its source (SaaS, Database, CSV, API, etc) and then work with that data using the most popular data science tools today (Python, Pandas, SQL, JS, Excel Formulas, etc). Quadratic has no environment to configure. The grid runs entirely in the browser with no backend service. This makes our grids completely portable and very easy to share. Quadratic has Python library support built-in. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    sparklyr

    sparklyr

    R interface for Apache Spark

    sparklyr is an R package that provides seamless interfacing with Apache Spark clusters—either local or remote—while letting users write code in familiar R paradigms. It supplies a dplyr-compatible backend, Spark machine learning pipelines, SQL integration, and I/O utilities to manipulate and analyze large datasets distributed across cluster environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Software Defined Storage Icon
    Software Defined Storage

    The layered architecture of QuantaStor provides solution engineers with unprecedented flexibility and application design options.

    QuantaStor is a unified Software-Defined Storage platform designed to scale up and out to make storage management easy while reducing overall enterprise storage costs.
    Learn More
  • 10
    AWS Data Wrangler

    AWS Data Wrangler

    Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.

    ...Easy integration with Athena, Glue, Redshift, Timestream, OpenSearch, Neptune, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON, and EXCEL). Built on top of other open-source projects like Pandas, Apache Arrow and Boto3, it offers abstracted functions to execute usual ETL tasks like load/unload data from Data Lakes, Data Warehouses, and Databases. Convert the column name to be compatible with Amazon Athena and the AWS Glue Catalog. Run a query against AWS CloudWatchLogs Insights and convert the results to Pandas DataFrame. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    SafeUtils

    SafeUtils

    110+ developer tools as native MacOS, Linux & Windows desktop apps.

    Tools: https://safeutils.com/barcode-generator https://safeutils.com/color-picker https://safeutils.com/qr-code-generator https://safeutils.com/qr-code-scanner https://safeutils.com/word-counter https://safeutils.com/base-64-decoder https://safeutils.com/diff-checker https://safeutils.com/hex-to-ascii https://safeutils.com/json-formatter https://safeutils.com/lorem-ipsum-generator https://safeutils.com/random-generator https://safeutils.com/time-converter https://safeutils.com/xml-formatter https://safeutils.com/ascii-to-binary https://safeutils.com/ascii-to-hex https://safeutils.com/base-64-encoder https://safeutils.com/binary-to-ascii https://safeutils.com/case-converter https://safeutils.com/csv-to-json https://safeutils.com/decimal-to-ascii https://safeutils.com/html-formatter https://safeutils.com/html-preview https://safeutils.com/html-to-markdown https://safeutils.com/id-generator https://safeutils.com/json-to-csv https://safeutils.com/json-to-xml
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    GeoTools, the Java GIS toolkit

    GeoTools, the Java GIS toolkit

    Toolkit for working with and mapping geospatial data

    GeoTools is an open source (LGPL) Java code library which provides standards compliant methods for the manipulation of geospatial data. GeoTools is an Open Source Geospatial Foundation project. The GeoTools library data structures are based on Open Geospatial Consortium (OGC) specifications.
    Leader badge
    Downloads: 224 This Week
    Last Update:
    See Project
  • 13

    Graph Plotter

    GP is a tool to analyse numerical data

    GP is aimed to be simple and fast tool to analyse numerical data. It does not claim to replace all other plotters.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15
    PlateEditor

    PlateEditor

    PlateEditor, a free web application to work with multi-well plates

    PlateEditor is a free web application to work with multi-well plates, from creation of layouts to data visualization and aggregation. It is primarily designed as a tool to help scientists working in the field of biology to simplify and fasten the process of data analysis. It is fully client-side: files are never transferred to the server, ensuring complete security and privacy of research data. It doesn't require any installation. - PlateEditor is available at this...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16

    GETL

    ETL engine based on Groovy

    P.S. Dear friends. Repository migration to https://github.com/ascrus/getl . You can download jar file from this site or maven. GETL - based package in Groovy, which automates the work of loading and transforming data. His name is an acronym for «Groovy ETL». GETL is a set of libraries of pre-built classes and objects that can be used to solve problems unpacking, transform and load data into programs written in Groovy, or Java, as well as from any software that supports the work with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    hui

    hui

    hewies user interface - 3D scientific visualisation tool

    Python project with goal to provide FOSS library to extract, analyse and visualise data in a 3D fashion. The instance will connect to a data source, ods sheet, csv, sql DB, pyodbc the instance will analyse and/or transform the data to be presented to the visualisation functionality the instance will visualise the data in a 3D fashion, likely using third party FOSS
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    HiPlot

    HiPlot

    HiPlot makes understanding high dimensional data easy

    ...Its core view is a parallel-coordinates plot that lets you brush, filter, and highlight runs to spot trade-offs, correlations, and Pareto fronts at a glance. You can load results from simple CSV/JSON logs or programmatically push “experiments” with typed fields, metrics, and tags. The UI supports dynamic filtering, color mapping, and tooltip details so you can iteratively narrow to the most promising configurations. Because it renders as self-contained HTML, you can embed the visualization in notebooks, export it, or serve it as a lightweight web app for teammates. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Metl ETL Data Integration

    Metl ETL Data Integration

    Simple message-based, web-based ETL integration

    Metl is a simple, web-based ETL tool that allows for data integrations including database, files, messaging, and web services. Supports RDBMS, SOAP, HTTP, FTP, SFTP, XML, FIXLEN, CSV, JSON, ZIP, and more. Metl implements scheduled integration tasks without the need for custom coding or heavy infrastructure. It can be deployed in the cloud or in an internal data center, and it was built to allow developers to extend it with custom components.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Query.jl

    Query.jl

    Query almost anything in julia

    ...It can filter, project, join and group data from any iterable data source, including all the sources supported in IterableTables.jl. One can for example query any of the following data sources: any array, DataFrames, DataStreams (including CSV, Feather, SQLite, ODBC), DataTables, IndexedTables, TimeSeries, Temporal, TypedTables and DifferentialEquations (any DESolution). The package currently provides working implementations for in-memory data sources, but will eventually be able to translate queries into e.g. SQL. There is a prototype implementation of such a "query provider" for SQLite in the package, but it is experimental at this point and only works for a very small subset of queries. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    JuliaDB.jl

    JuliaDB.jl

    Parallel analytical database in pure Julia

    JuliaDB is a package for working with large persistent data set. JuliaDB provides distributed table and array datastructures with convenient functions to load data from CSV. JuliaDB is Julia all the way down. This means queries can be composed with Julia code that may use a vast ecosystem of packages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Data Science at the Command Line

    Data Science at the Command Line

    Data science at the command line

    Command Line by Jeroen Janssens, published by O’Reilly Media in October 2021. Obtain, scrub, explore, and model data with Unix Power Tools. This repository contains the full text, data, and scripts used in the second edition of the book Data Science at the Command Line by Jeroen Janssens. This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small yet powerful...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Crystalsim -  XRD hkl simulation

    Crystalsim - XRD hkl simulation

    X-ray diffraction (XRD) analysis for hkl simulation of any crystal.

    ...Crystalline lattice parameters such as ‘a’, ‘b’, ‘c’ as well as interfacial angles such as alpha, beta, gamma can also be entered manually. Processed data can be saved as .csv file format. Designed by M Kanagasabapathy, Assistant Professor, Department of Chemistry, Rajus' College, Affiliated to Madurai Kamaraj University Rajapalayam (TN) India email: rrcmks(at)gmail.com
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25

    OpenData

    Open source coronavirus 2019 sample application using open data

    Open source software using open data with template code to demonstrate getting open government data such as the coronavirus 2019 (COVID-19). It takes data in feed format such as CSV, JSON, etc. to process maps, pie charts, line/bar graphs, HTML table that can be copied and pasted to spreadsheets such as Excel. The purpose is to provide template code to access open data readily available from official government websites useful for regional data analysis, latest information at a glance with data visualisation, etc. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next