Showing 75 open source projects for "sql data generator"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 1
    WhisperSpeech

    WhisperSpeech

    An Open Source text-to-speech system built by inverting Whisper

    WhisperSpeech is an open-source text-to-speech system created by “inverting” OpenAI’s Whisper, reusing its strengths as a semantic audio model to generate speech instead of only transcribing it. The project aims to be for speech what Stable Diffusion is for images: powerful, hackable, and safe for commercial use, with code under Apache-2.0/MIT and models trained only on properly licensed data. Its architecture follows a token-based, multi-stage pipeline inspired by AudioLM and SPEAR-TTS:...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    DB-GPT-Hub

    DB-GPT-Hub

    A repository that contains models, datasets, and fine-tuning

    DB-GPT-Hub is an open-source repository that provides datasets, models, and training tools designed to improve large language models for database interaction tasks, particularly Text-to-SQL. The project serves as a specialized extension of the broader DB-GPT ecosystem, focusing on the preparation and evaluation of models capable of translating natural language questions into structured database queries. It offers a modular framework that supports data preparation, model fine-tuning, benchmarking, and inference for Text-to-SQL systems. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    PyTorch Implementation of SDE Solvers

    PyTorch Implementation of SDE Solvers

    Differentiable SDE solvers with GPU support and efficient sensitivity

    ...The example trains an SDE as the generator of a GAN, whilst using a neural CDE [4] as the discriminator.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Byzer-lang

    Byzer-lang

    A low-code open-source programming language for data pipeline

    Byzer (former MLSQL) is a low-code, open-sourced, and distributed programming language for data pipeline, analytics, and AI in a cloud-native way. Design protocol: Everything is a table. Byzer is a SQL-like language, to simplify data pipeline, analytics, and AI, combined with built-in algorithms and extensions. We believe that everything is a table, a simple and powerful SQL-like language can significantly reduce human efforts of data development without switching different tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    EZStacking

    EZStacking is Jupyter notebook generator for machine learning

    EZStacking is Jupyter notebook generator for supervised learning problems using Scikit-Learn pipelines and stacked generalization. EZStacking handles classification and regression problems for structured data. It can also be viewed as a development tool, because a notebook generated with EZStacking contains: -an exploratory data analysis (EDA) used to assess data quality - a modelling producing a reduced-size stacked estimator - a server returning a prediction, a measure of the quality of input data and the execution time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Synthetic Mixed Data Generator
    A Synthetic Data Generator for producing mixed datasets described by relevant, irrelevant, and redundant features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    BlazingSQL

    BlazingSQL

    BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python

    BlazingSQL is a GPU-accelerated SQL engine built on top of the RAPIDS ecosystem. RAPIDS is based on the Apache Arrow columnar memory format, and cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data. BlazingSQL is a SQL interface for cuDF, with various features to support large-scale data science workflows and enterprise datasets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    SQLFlow

    SQLFlow

    SQL compiler bridging databases and machine learning workflows

    SQLFlow is an open source project designed to bridge the gap between traditional SQL-based data processing and modern machine learning workflows by extending SQL syntax with AI capabilities. It acts as a compiler that translates SQL programs into executable workflows, enabling users to train, evaluate, and deploy machine learning models directly from SQL statements. It integrates with multiple database engines such as MySQL, Hive, and MaxCompute, while also supporting machine learning frameworks like TensorFlow and XGBoost. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    awesome-TS-anomaly-detection

    awesome-TS-anomaly-detection

    List of tools & datasets for anomaly detection on time-series data

    All lists are in alphabetical order. In the lists, maintained projects are prioritized vs not mantained. A repository is considered "not maintained" if the latest commit is > 1 year old, or explicitly mentioned by the authors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    NASH OS

    NASH OS

    Nash Operating System for Modern Ecommerce

    The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12

    HRCloud2

    A full-featured home hosted Cloud Drive, Personal Assistant, App Launc

    HRCloud2 - A fully featured home-hosted Cloud drive and personal assistant that allows users to create and manage user accounts for friends and family, access files from anywhere, convert files and archives to other formats, perform simple image resizing and editing, stream media, create playlists, search for files, OCR images and PDF's, share files with friends and more! Building off the observed industry standards for commercial Cloud storage, HRCloud2 protects server permission levels,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    libVMR

    VMR - machine learning library

    libVMR is a class library written in Java which implements code generator for group method of data handling - GMDH. The library is intended for users, with machine learning skills. libVMR provides an effective framework for the research and development of data mining and predictive analytics. libVMR is based on the most popular neural network model with a higher generalization ability from kernel tricks - vector machine by Reshetov (VMR).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14

    Persica-A new Persian corpus for NLP

    This project presents a new corpus for NEWS text analysis in Persian

    Lack of multi-application text corpus despite of the surging text data is a serious bottleneck in the text mining and natural language processing especially in Persian language. This project presents a new corpus for NEWS articles analysis in Persian called Persica. NEWS analysis includes NEWS classification, topic discovery and classification, category classification and many more procedures. Dealing with NEWS has special requirements and first of all a valid and reliable corpus to perform...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Flamingo Project

    Flamingo Project

    Workflow Designer, Hive Editor, Pig Editor, File System Browser

    Flamingo is a open-source Big Data Platform that combine a Ajax Rich Web Interface + Workflow Engine + Workflow Designer + MapReduce + Hive Editor + Pig Editor. 1. Easy Tool for big data 2. Use comfortable in Hadoop EcoSystem projects 3. Based GPL V3 License Supporting Pig IDE, Hive IDE, HDFS Browser, Scheduler, Hadoop Job Monitoring, Workflow Engine, Workflow Designer, MapReduce.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    A Data Generator

    A tool to generate synthetic test data useful to Record matchers

    ...The data generation process is done in many steps like org data creation, data grouping, pair generation, data mutation and matching data patterns. Data generator also mangles field values of generated test data to achieve data errors and co-relate them in real life contexts like Family, Households, Organizations etc
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    pyIRDG

    pyIRDG

    IMDb Relational Dataset Generator

    pyIRDG is a program written in Python to generate relational datasets in Prolog format. It uses data from the Internet Movie Database in combination with IMDbPY as backend. A graphical user interface written in pyQt allows the user to link multiple entities together as model for the generation process. The big four entities are Title, Person, Company and Character. Many attributes can be chosen for adding to the output .pl file. Three types of constraints on attributes are available to limit...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    ACLAnalyser is a powerful tool to validate, verify and debug a MAS (Multi Agent Systems). ACLAnalyser sniff all messages exchanged between agents in a MAS, store them in a data base, and recover them later to get interesting information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Projeny (Probablistic Networks Generator in Java) is a graphical (Java SWT) front-end to BNT (Bayes Net Toolbox for Matlab). Projeny requires BNT, JMatLink and a Matlab back-end. There is no installable release package, but source code is available on SVN - please check out from SVN to use Projeny. Projeny was started with BNJ as the base.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Idea of ​​merging rules from data and expert knowledge of the author Zlatan Mur is made in RULEDIT. Zlatan Mur's idea of ​​creating rules is unique way to connect (PMML) all sources of the rules. Ruledit is protected by copyright of Zlatan Mur Graphical rule editor for JBoss Drools rules. Can be easily extended to parse rules for any other rule engine. Also includes parser for HQL/SQL for rule testing on a database.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    A platform for security data analysis.It focus on the quantitative analysis for securities,and research these problem:What and How decided to a trading,and can be believe.So,it include technical analysis,financial model,other to help to profit making.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Shadows will be a mmo with a very unique spell and crafting system. The IrrLicht graphics engine (http://irrlicht.sourceforge.net) shall be utilized to serve as a cross platform engine. I'm oping to create the feel of a graphical mud more than an mmo.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The aim of MIEX (Metadata and Information Extractor from small XML documents) is to create a wrapper for the Stanford Parser, to extract and store metadata (syntactic structures, relationships among words...) from simple XML documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    SenseRank Sys: - builds the dictionaries (multidim matrices) of words’ values; - for the set utterance in certain language builds a figure in multidimensional space (in the matrix space) of values (visual schema), which is topological view of sense
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    This intelligence Service uses data mining and search-engine techniques to get interesting information out of the internet. The information may be about politicians or companies e.g. and covers longer time periods to create a press review for instance.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB