Search Results for "data transformation" - Page 2

Showing 53 open source projects for "data transformation"

  • 1
    Kornia

    Open Source Differentiable Computer Vision Library

    Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer vision problems. At its core, the package uses PyTorch as its main backend, both for efficiency and to take advantage of reverse-mode auto-differentiation to define and compute the gradient of complex functions. Inspired by existing packages, this library is composed of a subset of packages containing operators that can be inserted within...
    Downloads: 2 This Week
  • 2
    Streamline Analyst

    AI agent that streamlines the entire process of data analysis

    Streamline Analyst is a cutting-edge, open-source application powered by Large Language Models (LLMs) designed to revolutionize data analysis. This Data Analysis Agent effortlessly automates all the tasks such as data cleaning, preprocessing, and even complex operations like identifying target objects, partitioning test sets, and selecting the best-fit models based on your data. With Streamline Analyst, results visualization and evaluation become seamless.
    Downloads: 0 This Week
  • 3
    Datapipe

    Real-time, incremental ETL library for ML with record-level depend

    Datapipe is a real-time, incremental ETL library for Python with record-level dependency tracking. Datapipe is designed to streamline the creation of data processing pipelines. It excels in scenarios where data is continuously changing, requiring pipelines to adapt and process only the modified data efficiently. This library tracks dependencies for each record in the pipeline, ensuring minimal and efficient data processing.
    Downloads: 75 This Week
  • 4
    sketch

    AI code-writing assistant that understands data content

    Sketch is an open-source AI-powered data analysis assistant designed specifically for pandas users, enabling natural language interaction with tabular datasets to generate code, insights, and transformations. It works by summarizing the structure and statistical properties of a dataset and providing that context to a language model, allowing it to generate highly relevant and accurate responses tailored to the data. The tool integrates directly into pandas dataframes through an extension,...
    Downloads: 0 This Week
  • 5
    Ubix Linux

    The Pocket Datalab

    Ubix stands for Universal Business Intelligence Computing System. Ubix Linux is an open-source, Debian-based Linux distribution geared towards data acquisition, transformation, analysis and presentation. Ubix Linux's purpose is to offer a tiny but versatile datalab. Ubix Linux is easily accessible, resource-efficient and completely portable on a simple USB key. Ubix Linux is a perfect toolset for learning data analysis and artificial intelligence basics on small to medium datasets. ...
    Downloads: 2 This Week
  • 6
    Mara Pipelines

    A lightweight opinionated ETL framework, halfway between plain scripts

    This package contains a lightweight data transformation framework with a focus on transparency and complexity reduction. Data integration pipelines as code: pipelines, tasks and commands are created using declarative Python code. PostgreSQL as a data processing engine. Extensive web UI. The web browser as the main tool for inspecting, running and debugging pipelines. GNU make semantics.
    Downloads: 0 This Week
  • 7
    towhee

    Framework that is dedicated to making neural data processing

    Towhee is an open-source machine-learning pipeline that helps you encode your unstructured data into embeddings. You can use our Python API to build a prototype of your pipeline and use Towhee to automatically optimize it for production-ready environments. From images to text to 3D molecular structures, Towhee supports data transformation for nearly 20 different unstructured data modalities. We provide end-to-end pipeline optimizations, covering everything from data decoding/encoding, to model inference, making your pipeline execution 10x faster. ...
    Downloads: 0 This Week
  • 8
    ChatFred

    Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting

    Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more. Access ChatGPT, DALL·E 2, and other OpenAI models. Language models often give wrong information. Verify answers if they are important. Talk with ChatGPT via the cf keyword. Answers will show as Large Type. Alternatively, use the Universal Action, Fallback Search, or Hotkey. To generate text with InstructGPT models and see results in-line, use the cft keyword. ⤓ Install on the Alfred Gallery or...
    Downloads: 1 This Week
  • 9
    macara

    A converter for seamless transformation of files, data, and media ...

    This application consolidates various scripts, including an AI feature (rembg), into a singular platform. The design of this software is evolutionary, allowing for the seamless integration of additional scripts, menus, or windows as needed. Serving as a versatile tool, it facilitates efficient file management, especially when handling a substantial volume of images, whether sorting by name or other attributes. These scripts are crafted to complement generative art AI technologies like Dall-e...
    Downloads: 0 This Week
  • 10
    hui

    hewies user interface - 3D scientific visualisation tool

    A Python project that aims to provide a FOSS library to extract, analyse and visualise data in 3D. An instance will connect to a data source (ODS sheet, CSV, SQL database, pyodbc), analyse and/or transform the data for the visualisation functionality, and visualise the data in 3D, likely using third-party FOSS.
    Downloads: 0 This Week
  • 11
    Feature-engine

    Feature engineering package with sklearn-like functionality

    Feature-engine is a Python library with multiple transformers to engineer and select features for use in machine learning models. Feature-engine's transformers follow Scikit-learn's functionality with fit() and transform() methods to learn the transforming parameters from the data and then transform it.
    Downloads: 0 This Week
  • 12
    automl-gs

    Provide an input CSV and a target field to predict, generate a model

    Give an input CSV file and a target field you want to predict to automl-gs, and get a trained high-performing machine learning or deep learning model plus native Python code pipelines allowing you to integrate that model into any prediction workflow. No black box: you can see exactly how the data is processed, and how the model is constructed, and you can make tweaks as necessary. automl-gs is an AutoML tool which, unlike Microsoft's NNI, Uber's Ludwig, and TPOT, offers a zero code/model definition interface to getting an optimized model and data transformation pipeline in multiple popular ML/DL frameworks, with minimal Python dependencies (pandas + scikit-learn + your framework of choice). automl-gs is designed for citizen data scientists and engineers without a deep statistical background under the philosophy that you don't need to know any modern data preprocessing and machine learning engineering techniques to create a powerful prediction workflow.
    Downloads: 0 This Week
  • 13
    Django REST Pandas

    Serves up Pandas dataframes via the Django REST Framework

    Django REST Pandas (DRP) provides a simple way to generate and serve pandas DataFrames via the Django REST Framework. The resulting API can serve up CSV (and a number of other formats) for consumption by a client-side visualization tool like d3.js. The design philosophy of DRP enforces a strict separation between data and presentation. This keeps the implementation simple, but also has the nice side effect of making it trivial to provide the source data for your visualizations. This...
    Downloads: 0 This Week
  • 14
    base64io

    A stream implementation for Python that provides transparent base64

    A stream implementation for Python that provides transparent base64 encoding and decoding of an underlying stream. This project is designed to develop a class, base64io.Base64IO, that implements a streaming interface for Base64 encoding. Python has supported native Base64 encoding since version 2.4. However, there is no streaming interface for Base64 encoding, and none is available from the community. The legacy base64.encode and base64.decode interface lets you shuffle data between...
    Downloads: 1 This Week
  • 15
    QAL

    Query Abstraction Layer

    Project has moved to: https://github.com/OptimalBPM/qal QAL is a collection of libraries for mining, transforming and writing data from and to a number of places. Sources and destinations include different SQL and NoSQL backends, file formats like .csv, XML and Excel, and even untidy HTML web pages. It has a database abstraction layer that supports connectivity to Postgres, MySQL, DB2, Oracle and MS SQL Server. JSON and MongoDB are coming. It uses XML/JSON formats (self-generated SQL schemas) for representing queries, transformation and merging, making it scriptable. ...
    Downloads: 0 This Week
  • 16
    dbis

    DBIS Reference Implementation

    Reference implementation of Directory-Based Information Services, described in IETF Internet Drafts, intended to replace NIS and RFC2307.
    Downloads: 0 This Week
  • 17
    2DMED

    Phase Sensitive Detection and Modulated Enhanced Diffraction Software

    2DMED is the second of two programs that allow implementation of Phase Sensitive Detection (PSD) and/or Modulated Enhanced Diffraction (MED) techniques on in situ data. This software takes normalized and averaged in situ data in flat file format (FLT) and performs the PSD transformation described in Urakawa, A. et al. Chem. Eng. Science 2008, 63, 4902. The user can choose the demodulation index k and perform several demodulation calculations. This is extremely important since demodulation calculations at k = 2 on in situ powder diffraction data represent an implementation of the MED method, which has been described in Chernyshov, D. et al. ...
    Downloads: 0 This Week
  • 18
    Optimal BPM

    The beat of your business

    [See the files section for installation instructions] Optimal BPM™ is a project that aims to create an easy-to-use Business Process Management (BPM) system and tools. So far, only one of the tools, the data replication tool Optimal Sync, has been released (available from the "files" page), but more is coming. It incorporates many different open source projects that relate to interoperability, transformation, context and process control. It has no business engine; instead, it generates Python code and leaves the engine mechanics to its scripting engine. ...
    Downloads: 6 This Week
  • 19
    We aim to develop a transformative computational method for automatic discovery of a set of dynamical rules that best captures both state transition and topological transformation in the data of spatio-temporal evolution of a complex network.
    Downloads: 0 This Week
  • 20
    GWCNV is a genome-wide algorithm for detecting CNV associations with diseases. It works directly on a transformation of intensity data. It is powerful and sensitive in detecting small CNV associations, and retains high power for large CNVs.
    Downloads: 0 This Week
  • 21

    qenqote

    Clipboard Quote Encoder

    ...In its prototypical application, qenqote accepts URLs that have been copied or cut to the clipboard from a browser bar. qenqote applies a user-selected quoting function and writes the result back to the clipboard. Besides its URL transformation applications, qenqote can be given a variable name from which it produces text of "for (...) { }" or "while (...) { }" structures; can produce formatted dates or strings according to clipboard and other data; can put HTML tags around clipped text; can apply regular-expression substitutions to text; and can output any simple function one cares to program.
    Downloads: 0 This Week
  • 22
    The csvdatamix project aims to randomize CSV input data files in order to conceal the original state of the data, similar to data masking or data transformation. It also has mapping abilities to translate back to the original state of the data.
    Downloads: 3 This Week
  • 23
    Liegkat-Archiv delivers ALK converters for spatial data formats (EDBS, BGRUND, SQD, DFK, HK) and coordinate systems used in German cadastral agencies before ALKIS into KML and JML. It delivers Python packages and XSL files.
    Downloads: 5 This Week
  • 24
    Wets is a very flexible ETL tool that can handle data coming from text files, XML, MySQL, PostgreSQL and Zope servers. Because of its associated language, there is no limitation on the transformation process. See the drop project for a GUI.
    Downloads: 0 This Week
  • 25
    The Financial Data Access Library provides an abstraction of market data sources. It verifies data consistency and provides transformation to different time frames, e.g. daily to weekly. Includes SQL, CSI, ASCII and online data sources such as Yahoo!
    Downloads: 0 This Week
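
Several entries above describe a transformation step that first learns parameters from data and then applies them; the Feature-engine entry names this as the fit() and transform() convention. The following is a minimal sketch of that convention in plain Python. The MeanImputer class and its mean_ attribute are hypothetical names chosen for illustration; they are not taken from Feature-engine or scikit-learn, and the real libraries operate on DataFrames or arrays rather than lists.

```python
from statistics import mean


class MeanImputer:
    """Hypothetical transformer illustrating the fit()/transform() convention.

    Not part of Feature-engine or scikit-learn; a sketch only.
    """

    def __init__(self):
        # Learned parameter; stays None until fit() has been called.
        self.mean_ = None

    def fit(self, values):
        # Learn the transforming parameter (the mean) from observed values only.
        observed = [v for v in values if v is not None]
        if not observed:
            raise ValueError("cannot fit on data with no observed values")
        self.mean_ = mean(observed)
        return self

    def transform(self, values):
        # Apply the learned parameter: replace missing entries with the mean.
        if self.mean_ is None:
            raise RuntimeError("call fit() before transform()")
        return [self.mean_ if v is None else v for v in values]


# Fit on training data, then reuse the same learned mean on new data.
train = [1.0, None, 3.0]
imputer = MeanImputer().fit(train)
filled_train = imputer.transform(train)
filled_new = imputer.transform([None, 10.0])
```

Keeping fit() and transform() separate means the parameter learned from the training data is reused unchanged on new data, instead of being recomputed from it.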