Showing 70 open source projects for "sql data generator"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    Korvus

    Korvus

    Korvus is a search SDK that unifies the entire RAG pipeline

    Korvus is an open-source retrieval-augmented generation (RAG) pipeline designed to run entirely inside PostgreSQL, allowing developers to build AI search and knowledge systems directly within a database environment. The project consolidates the typical steps of a RAG pipeline—including embedding generation, document retrieval, reranking, and text generation—into a single query executed within the Postgres ecosystem. By leveraging PostgresML and vector extensions such as pgvector, Korvus...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Lightweight' GAN

    Lightweight' GAN

    Implementation of 'lightweight' GAN, proposed in ICLR 2021

    Implementation of 'lightweight' GAN proposed in ICLR 2021, in Pytorch. The main contribution of the paper is a skip-layer excitation in the generator, paired with autoencoding self-supervised learning in the discriminator. Quoting the one-line summary "converge on single gpu with few hours' training, on 1024 resolution sub-hundred images". Augmentation is essential for Lightweight GAN to work effectively in a low data setting. You can test and see how your images will be augmented before they pass into a neural network (if you use augmentation). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    DB-GPT-Hub

    DB-GPT-Hub

    A repository that contains models, datasets, and fine-tuning

    DB-GPT-Hub is an open-source repository that provides datasets, models, and training tools designed to improve large language models for database interaction tasks, particularly Text-to-SQL. The project serves as a specialized extension of the broader DB-GPT ecosystem, focusing on the preparation and evaluation of models capable of translating natural language questions into structured database queries. It offers a modular framework that supports data preparation, model fine-tuning, benchmarking, and inference for Text-to-SQL systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    PyTorch Implementation of SDE Solvers

    PyTorch Implementation of SDE Solvers

    Differentiable SDE solvers with GPU support and efficient sensitivity

    ...The example trains an SDE as the generator of a GAN, whilst using a neural CDE [4] as the discriminator.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    Byzer-lang

    Byzer-lang

    A low-code open-source programming language for data pipeline

    Byzer (former MLSQL) is a low-code, open-sourced, and distributed programming language for data pipeline, analytics, and AI in a cloud-native way. Design protocol: Everything is a table. Byzer is a SQL-like language, to simplify data pipeline, analytics, and AI, combined with built-in algorithms and extensions. We believe that everything is a table, a simple and powerful SQL-like language can significantly reduce human efforts of data development without switching different tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    EZStacking

    EZStacking is Jupyter notebook generator for machine learning

    EZStacking is Jupyter notebook generator for supervised learning problems using Scikit-Learn pipelines and stacked generalization. EZStacking handles classification and regression problems for structured data. It can also be viewed as a development tool, because a notebook generated with EZStacking contains: -an exploratory data analysis (EDA) used to assess data quality - a modelling producing a reduced-size stacked estimator - a server returning a prediction, a measure of the quality of input data and the execution time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Synthetic Mixed Data Generator
    A Synthetic Data Generator for producing mixed datasets described by relevant, irrelevant, and redundant features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    BlazingSQL

    BlazingSQL

    BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python

    BlazingSQL is a GPU-accelerated SQL engine built on top of the RAPIDS ecosystem. RAPIDS is based on the Apache Arrow columnar memory format, and cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data. BlazingSQL is a SQL interface for cuDF, with various features to support large-scale data science workflows and enterprise datasets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    SQLFlow

    SQLFlow

    SQL compiler bridging databases and machine learning workflows

    SQLFlow is an open source project designed to bridge the gap between traditional SQL-based data processing and modern machine learning workflows by extending SQL syntax with AI capabilities. It acts as a compiler that translates SQL programs into executable workflows, enabling users to train, evaluate, and deploy machine learning models directly from SQL statements. It integrates with multiple database engines such as MySQL, Hive, and MaxCompute, while also supporting machine learning frameworks like TensorFlow and XGBoost. ...
    Downloads: 11 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    awesome-TS-anomaly-detection

    awesome-TS-anomaly-detection

    List of tools & datasets for anomaly detection on time-series data

    All lists are in alphabetical order. In the lists, maintained projects are prioritized vs not mantained. A repository is considered "not maintained" if the latest commit is > 1 year old, or explicitly mentioned by the authors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    NASH OS

    NASH OS

    Nash Operating System for Modern Ecommerce

    The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12

    Persica-A new Persian corpus for NLP

    This project presents a new corpus for NEWS text analysis in Persian

    Lack of multi-application text corpus despite of the surging text data is a serious bottleneck in the text mining and natural language processing especially in Persian language. This project presents a new corpus for NEWS articles analysis in Persian called Persica. NEWS analysis includes NEWS classification, topic discovery and classification, category classification and many more procedures. Dealing with NEWS has special requirements and first of all a valid and reliable corpus to perform...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    pyIRDG

    pyIRDG

    IMDb Relational Dataset Generator

    pyIRDG is a program written in Python to generate relational datasets in Prolog format. It uses data from the Internet Movie Database in combination with IMDbPY as backend. A graphical user interface written in pyQt allows the user to link multiple entities together as model for the generation process. The big four entities are Title, Person, Company and Character. Many attributes can be chosen for adding to the output .pl file. Three types of constraints on attributes are available to limit...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ACLAnalyser is a powerful tool to validate, verify and debug a MAS (Multi Agent Systems). ACLAnalyser sniff all messages exchanged between agents in a MAS, store them in a data base, and recover them later to get interesting information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Idea of ​​merging rules from data and expert knowledge of the author Zlatan Mur is made in RULEDIT. Zlatan Mur's idea of ​​creating rules is unique way to connect (PMML) all sources of the rules. Ruledit is protected by copyright of Zlatan Mur Graphical rule editor for JBoss Drools rules. Can be easily extended to parse rules for any other rule engine. Also includes parser for HQL/SQL for rule testing on a database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A platform for security data analysis.It focus on the quantitative analysis for securities,and research these problem:What and How decided to a trading,and can be believe.So,it include technical analysis,financial model,other to help to profit making.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Shadows will be a mmo with a very unique spell and crafting system. The IrrLicht graphics engine (http://irrlicht.sourceforge.net) shall be utilized to serve as a cross platform engine. I'm oping to create the feel of a graphical mud more than an mmo.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The aim of MIEX (Metadata and Information Extractor from small XML documents) is to create a wrapper for the Stanford Parser, to extract and store metadata (syntactic structures, relationships among words...) from simple XML documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    SenseRank Sys: - builds the dictionaries (multidim matrices) of words’ values; - for the set utterance in certain language builds a figure in multidimensional space (in the matrix space) of values (visual schema), which is topological view of sense
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This intelligence Service uses data mining and search-engine techniques to get interesting information out of the internet. The information may be about politicians or companies e.g. and covers longer time periods to create a press review for instance.
    Downloads: 0 This Week
    Last Update:
    See Project