hive free download - SourceForge

Showing 47 open source projects for "hive"

View related business solutions

Linux Clear Filters & Widen Search

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

Aden Hive

Outcome driven agent development framework that evolves

...The framework also includes human-in-the-loop nodes, credential management, cost and budget controls, and real-time observability so teams can monitor execution and intervene as needed. Hive is designed for production environments and supports a wide range of large language models, local models, and business system connectivity.

Downloads: 1 This Week

Last Update: 7 days ago
See Project
2

Apache Iceberg

Apache Iceberg

Iceberg is a high-performance format for huge analytic tables. Iceberg brings the reliability and simplicity of SQL tables to big data while making it possible for engines like Spark, Trino, Flink, Presto, Hive, and Impala to safely work with the same tables, at the same time. The core Java library that tracks table snapshots and metadata is complete, but still evolving. Current work is focused on adding row-level deletes and upserts, and integration work with new engines like Flink and Hive. The Iceberg format specification is being actively updated and is open for comment. ...

Downloads: 2 This Week

Last Update: 2025-12-22
See Project
3

ggraph

Grammar of Graph Graphics

ggraph adapts the Grammar of Graphics from ggplot2 for network and graph visualizations. It integrates with tidygraph/igraph data structures, providing a wide range of geoms, layouts (e.g. hive plots, circle packing), and layering methods tailored to hierarchical or relational data.

Downloads: 0 This Week

Last Update: 2025-08-24
See Project
4

Apache Phoenix

Mirror of Apache Phoenix

...The power of standard SQL and JDBC APIs with full ACID transaction capabilities and the flexibility of late-bound, schema-on-read capabilities from the NoSQL world by leveraging HBase as its backing store. Apache Phoenix is fully integrated with other Hadoop products such as Spark, Hive, Pig, Flume, and Map Reduce. Become the trusted data platform for OLTP and operational analytics for Hadoop through well-defined, industry-standard APIs. Apache Phoenix takes your SQL query, compiles it into a series of HBase scans, and orchestrates the running of those scans to produce regular JDBC result sets. Direct use of the HBase API, along with coprocessors and custom filters, results in performance on the order of milliseconds.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Claude-Flow

The leading agent orchestration platform for Claude

Claude-Flow v2 Alpha is an advanced AI orchestration and automation framework designed for enterprise-grade, large-scale AI-driven development. It enables developers to coordinate multiple specialized AI agents in real time through a hive-mind architecture, combining swarm intelligence, neural reasoning, and a powerful set of 87 Modular Control Protocol (MCP) tools. The platform supports both quick swarm tasks and persistent multi-agent sessions known as hives, facilitating distributed AI collaboration with persistent contextual memory. At its core, Claude-Flow integrates Dynamic Agent Architecture (DAA) for self-organizing agent management, neural pattern recognition accelerated by WebAssembly SIMD, and a SQLite-based memory system for context retention and knowledge persistence across tasks. ...

Downloads: 2 This Week

Last Update: 2 days ago
See Project
6

Liquibase

Main Liquibase Source

...Liquibase Core works with the following databases: Apache Derby, CockroachDB, Firebird, H2, HSQL, Informix, InterBase, MariaDB, MSSQL, MySQL, Oracle, PostgreSQL, SQLite, Sybase Anywhere, Sybase Enterprise. The databases that require extensions are: Azure Cosmos DB, Cassandra, Cache, DB2i, Hibernate, Impala/Hive, MaxDB, MongoDB, Redshift, SAP HANA, SQLFire, Snowflake, Teradata, Vertica, VoltDB. See Liquibase Database Tutorials. Liquibase can be integrated with Maven, Ant, Gradle, Spring Boot, and other CI/CD tools.

Downloads: 34 This Week

Last Update: 2026-03-04
See Project
7

Apache Impala

Apache Impala

...Utilize the same file and data formats and metadata, security, and resource management frameworks as your Hadoop deployment, with no redundant infrastructure or data conversion/duplication. For Apache Hive users, Impala utilizes the same metadata and ODBC driver. Like Hive, Impala supports SQL, so you don't have to worry about reinventing the implementation wheel. With Impala, more users, whether using SQL queries or BI applications, can interact with more data through a single repository and metadata stored from source through analysis.

Downloads: 0 This Week

Last Update: 2025-03-07
See Project
8

Genie

Distributed Big Data Orchestration Service

Genie is a completely open source distributed job orchestration engine developed by Netflix. Genie provides REST-ful APIs to run a variety of big data jobs like Hadoop, Pig, Hive, Spark, Presto, Sqoop and more. It also provides APIs for managing the metadata of many distributed processing clusters and the commands and applications which run on them.

Downloads: 0 This Week

Last Update: 2025-08-05
See Project
9

SQL Formatter

A whitespace formatter for different query languages

SQL Formatter is a JavaScript library for pretty-printing SQL queries. It started as a port of a PHP Library, but has since considerably diverged. It supports various SQL dialects: GCP BigQuery, IBM DB2, Apache Hive, MariaDB, MySQL, Couchbase N1QL, Oracle PL/SQL, PostgreSQL, Amazon Redshift, SingleStoreDB, Snowflake, Spark, SQL Server Transact-SQL, Trino/Presto. See language option docs for more details. The CLI tool will be installed under sql-formatter and may be invoked via npx sql-formatter. If you don't use a module bundler, clone the repository, run npm install and grab a file from /dist directory to use inside a script tag. ...

Downloads: 0 This Week

Last Update: 2026-02-17
See Project
8 Monitoring Tools in One APM. Install in 5 Minutes.
Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.

Start Free
10

ANTLR

Parser generator to read, process, or translate structured text

...It’s widely used in academia and industry to build all sorts of languages, tools, and frameworks. Twitter search uses ANTLR for query parsing, with over 2 billion queries a day. The languages for Hive and Pig, the data warehouse and analysis systems for Hadoop, both use ANTLR. Lex Machina uses ANTLR for information extraction from legal texts. Oracle uses ANTLR within SQL Developer IDE and their migration tools. NetBeans IDE parses C++ with ANTLR. The HQL language in the Hibernate object-relational mapping framework is built with ANTLR.

Downloads: 7 This Week

Last Update: 2024-08-03
See Project
11

Apache Bigtop

Bigtop is an Apache Foundation project for Infrastructure Engineers

Apache Bigtop is a project focused on building and packaging the Hadoop ecosystem and related big data components. It provides a consistent framework for testing, packaging, and deploying Hadoop distributions, including tools like HDFS, YARN, Spark, Hive, HBase, and more. By maintaining cross-platform builds (RPMs, DEBs, Docker images, and Kubernetes support), Bigtop makes it easier for organizations to deploy big data stacks in different environments. It also includes a set of integration tests and smoke tests to ensure compatibility and stability between ecosystem components. Developers and operators can use Bigtop to assemble customized Hadoop distributions tailored to their infrastructure and workloads. ...

Downloads: 0 This Week

Last Update: 2025-09-03
See Project
12

GraphQL Inspector

Validate schema, get schema change notifications, validate operations

Validate schemas and detect changes. Receive schema change notifications. Keep Operations and Fragments consistent. Start using our GitHub Application, setup everything up within few clicks. Using GitHub workflows? Try out the GitHub Action. GraphQL Inspector can be used in any Continuous Integration service. Use our modularized, CI suited version of CLI. GraphQL Inspector offers a CLI that lets you analyze your GraphQL API but also client-side applications. Our codebase is publicly...

Downloads: 0 This Week

Last Update: 2026-02-27
See Project
13

Dolphin Scheduler

A distributed and extensible workflow scheduler platform

...All process definition operations are visualized, Visualization process defines key information at a glance, One-click deployment. Support multi-tenant. Support many task types e.g., spark,flink,hive, mr, shell, python, sub_process. Support custom task types, Distributed scheduling, and the overall scheduling capability will increase linearly with the scale of the cluster.

Downloads: 0 This Week

Last Update: 2026-03-01
See Project
14

Luigi

Python module that helps you build complex pipelines of batch jobs

...You can build pretty much any task you want, but Luigi also comes with a toolbox of several common task templates that you use. It includes support for running Python mapreduce jobs in Hadoop, as well as Hive, and Pig, jobs. It also comes with file system abstractions for HDFS, and local files that ensures all file system operations are atomic.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
15

Hive

Lightweight and blazing fast key-value database written in pure Dart

A lightweight, fast NoSQL database for Flutter and Dart applications, optimized for offline-first mobile and desktop development.

Downloads: 0 This Week

Last Update: 2025-02-17
See Project
16

Spark JobServer

REST job server for Apache Spark

...Rather than requiring every application to embed Spark or manage Spark contexts manually, this server abstracts a long-lived service where clients can upload JARs, start and stop contexts, submit jobs synchronously or asynchronously, and manage named objects (RDDs / DataFrames) across job executions. It supports multiple modes (transient jobs, persistent contexts for reuse, streaming, SQL/Hive, etc.), and can be integrated with authentication/authorization systems (e.g. via Apache Shiro). The architecture isolates Spark contexts (optionally in separate JVMs), isolates job dependencies, and persists job / jar metadata via pluggable DAOs. It supports deployment across cluster managers (YARN, Mesos, etc.) and aims to simplify Spark-as-a-service scenarios.

Downloads: 0 This Week

Last Update: 2025-09-18
See Project
17

Steem Ruby Tutorial

Downloads: 0 This Week

Last Update: 2021-05-20
See Project
18

spatial-framework-for-hadoop

The Spatial Framework for Hadoop allows developers

The Spatial Framework for Hadoop allows developers and data scientists to use the Hadoop data processing system for spatial data analysis. For tools, samples, and tutorials that use this framework, head over to GIS Tools for Hadoop. At the root level of this repository, you can build a single jar with everything in the framework using Apache Ant. Alternatively, you can build a jar at the root level of each framework component. Custom MapReduce jobs that use the Esri Geometry API require that...

Downloads: 0 This Week

Last Update: 2023-06-12
See Project
19

SZT-bigdata

SZT‑bigdata is an open source project

SZT‑bigdata is an open-source project analyzing real Shenzhen metro (subway) card usage data using big‑data frameworks like Spark, Hadoop, Hive, Kafka, Flink, ClickHouse, HBase, and Elasticsearch. Aimed at exploring transit passenger flow patterns and system optimization using a variety of Scala-based technologies.

Downloads: 0 This Week

Last Update: 2025-08-04
See Project
20

Open Source Data Quality and Profiling

World's first open source data quality & data preparation project

...This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data Creation, Meta Data Discovery, Anomaly Discovery, Data Cleansing, Reporting and Analytic. It also had Hadoop ( Big data ) support to move files to/from Hadoop Grid, Create, Load and Profile Hive Tables. This project is also known as "Aggregate Profiler" Resful API for this project is getting built as (Beta Version) https://sourceforge.net/projects/restful-api-for-osdq/ apache spark based data quality is getting built at https://sourceforge.net/projects/apache-spark-osdq/

8 Reviews

Downloads: 2 This Week

Last Update: 2021-01-20
See Project
21

SQLFlow

SQL compiler bridging databases and machine learning workflows

...It acts as a compiler that translates SQL programs into executable workflows, enabling users to train, evaluate, and deploy machine learning models directly from SQL statements. It integrates with multiple database engines such as MySQL, Hive, and MaxCompute, while also supporting machine learning frameworks like TensorFlow and XGBoost. By embedding machine learning operations into SQL, it removes the need for users to switch between programming languages such as Python or R, simplifying the overall workflow. SQLFlow also supports model training, prediction, and explanation tasks, allowing data practitioners to work entirely within a familiar query interface.

Downloads: 1 This Week

Last Update: 6 days ago
See Project
22

geometry-api-java

The Esri Geometry API for Java enables developers to write apps

The Esri Geometry API for Java can be used to enable spatial data processing in 3rd-party data-processing solutions. Developers of custom MapReduce-based applications for Hadoop can use this API for spatial processing of data in the Hadoop system. The API is also used by the Hive UDF’s and could be used by developers building geometry functions for 3rd-party applications such as Cassandra, HBase, Storm and many other Java-based “big data” applications.

Downloads: 0 This Week

Last Update: 2023-06-12
See Project
23

Tadpole DB Hub

Web based Database collboration tools

Original home : https://github.com/hangum/TadpoleForDBTools It is a tool to manage Altibase, Amazon RDS, Apache Hive,Apache Cassandra, Apache Tajo, MongoDB, CUBRID, MariaDB, MySQL, Oracle, MSSQL, SQLite, Tibero, PostgreSQL in Web Browser.

1 Review

Downloads: 1 This Week

Last Update: 2020-05-27
See Project
24

Custom Apache Big data Distribution

A Custom Apache Distribution including Spark and Hadoop, for Windows.

This Distribution has been customized to work out of the box. So, just download it, and unzip it. Set the Path variables for bin folders, HADOOP_HOME, SPARK_HOME, and JAVA_HOME. That's it..! use Hadoop and Spark natively on Windows.

Downloads: 0 This Week

Last Update: 2020-03-11
See Project
25

ChunJun

A data integration framework

...The SQL script is compatible with Flink SQL syntax. Supports a variety of heterogeneous data sources, and supports synchronization and calculation of more than 20 data sources such as MySQL, Oracle, SQLServer, Hive, Kudu, etc. Easy to expand, highly flexible, newly expanded data source plugins can integrate with existing data source plugins instantly, plugin developers do not need to care about the code logic of other plugins.

Downloads: 0 This Week

Last Update: 2022-11-18
See Project