Showing 144 open source projects for "hadoop"

View related business solutions
  • SKUDONET Open Source Load Balancer Icon
    SKUDONET Open Source Load Balancer

    Take advantage of Open Source Load Balancer to elevate your business security and IT infrastructure with a custom ADC Solution.

    SKUDONET ADC, operates at the application layer, efficiently distributing network load and application load across multiple servers. This not only enhances the performance of your application but also ensures that your web servers can handle more traffic seamlessly.
  • Business Continuity Solutions | ConnectWise BCDR Icon
    Business Continuity Solutions | ConnectWise BCDR

    Build a foundation for data security and disaster recovery to fit your clients’ needs no matter the budget.

    Whether natural disaster, cyberattack, or plain-old human error, data can disappear in the blink of an eye. ConnectWise BCDR (formerly Recover) delivers reliable and secure backup and disaster recovery backed by powerful automation and a 24/7 NOC to get your clients back to work in minutes, not days.
  • 1

    WebCorpus

    Hadoop framework for scalable processing of large web corpora

    WebCorpus is a Hadoop-based framework that enables you to calculate statistics on large web corpora extracted from web crawls.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    hmrjp-maven-plugin

    hmrjp-maven-plugin

    Hadoop mapreduce maven plugin

    hmrjp-maven-plugin is a maven plugin which helps creating, running and verifying hadoop mapreduce jobs remotely just like any other java project which is built using maven.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    CliqueSquare

    CliqueSquare

    Distributed RDF Processing over Hadoop

    CliqueSquare is a system for storing and querying large RDF graphs relying on Hadoop’s distributed file system (HDFS) and Hadoop’s MapReduce open-source implementation. It provides a novel partitioning and storage scheme that permits 1-level joins to be evaluated locally using efficient map-only joins. In addition, CliqueSquare is equipped with a unique optimization algorithm based on graphs and cliques capable of generating highly parallelizable flat query plans relying on n-ary equality joins.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Downloads: 0 This Week
    Last Update:
    See Project
  • HRSoft Compensation - Human Resources Software Icon
    HRSoft Compensation - Human Resources Software

    HRSoft is the only unified, purpose-built SaaS platform designed to transform your complex HR processes into seamless digital ones

    Manage your enterprise’s compensation lifecycle and accurately recognize top performers with a digitized, integrated system. Keep employees invested and your HR team in control while preventing compensation chaos.
  • 5
    drbl-hadoop
    DRBL-hadoop is a plugin for Diskless Remote Boot in Linux (DRBL). It will help you to setup and deploy a Hadoop Cluster in few steps. You can also use this Live CD as an teaching environment of Hadoop. It already have Cloudera CDH2 installed inside.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    glue

    BigData Workflow Engine for Hadoop, Hbase, Netezza, Pig, Hive ...

    Glue is a job execution engine, written in Java and Groovy. workflows are written in Groovy DSL (simple statements) and use pre-developed modules to interact with external resources e.g. DBs, Hadoop, Netezza, FTP etc. In Glue one of the main features and design goals is to always abstract configuration away from functionality. This means that no more hardcoded IPs, UserIDs and Passwords spread over 10s of hundreds of bash/python scripts. Scripts written for one environment can be easily...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    Volt

    Volt is pure JAVA NGS mapping soft which run on Hadoop 2.0 env

    The project move to VoltMR http://voltmr.sourceforge.net
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    CSVTOHIVE

    Generate Hive Scripts Automatically from CSV Files

    Generates Hive Scripts Automatically from a CSV Files. 1. Script copies csv files to Hadoop Files System. 2. Generates CREATE statements to create tables. 3. Generates .hive files in the same folder as that of csv folder and also generates run.sh with all consolidated files. So just switch to the folder where .hive scripts are residing and run run.sh (./run.sh). This tool will also set execute permissions on .hive and run.sh scripts so you can directly execute run.sh.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    OpenCrowbar

    OpenCrowbar

    Data Center Bare Metal configuration platform

    The principal motivation for creation of OpenCrowbar is the transition a from bare metal installer into a tool that manages ongoing operations. OpenCrowbar enables upgrade and continuous deployment automation. This capability is important for large scale deployments of evolving complex projects like OpenStack, Hadoop, and Ceph. OpenCrowbar provides the foundation for operations automation. OpenCrowbar is an open reference implementation that can be reliably deployed in large-scale, multi...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Manage your IT department more effectively Icon
    Manage your IT department more effectively

    Streamline your business from end to end with ConnectWise PSA

    ConnectWise PSA (formerly Manage) allows you to stop working in separate systems, and helps you build a more profitable business. No more duplicate data entries, inefficient employees, manual invoices, and the inability to accurately track client service issues. Get a behind the scenes look into the award-winning PSA that automates processes for each area of business: sales, help desk, support, finance, and HR.
  • 10
    Flamingo Project

    Flamingo Project

    Workflow Designer, Hive Editor, Pig Editor, File System Browser

    Flamingo is a open-source Big Data Platform that combine a Ajax Rich Web Interface + Workflow Engine + Workflow Designer + MapReduce + Hive Editor + Pig Editor. 1. Easy Tool for big data 2. Use comfortable in Hadoop EcoSystem projects 3. Based GPL V3 License Supporting Pig IDE, Hive IDE, HDFS Browser, Scheduler, Hadoop Job Monitoring, Workflow Engine, Workflow Designer, MapReduce.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    MPJ Express: Parallel Computing for Java
    MPJ Express is an implementation of an MPI-like API—standardized by the Java Grande forum—used to write parallel Java applications, which can execute on a variety of parallel platforms ranging from multicore processors to compute clusters/clouds.
    Leader badge
    Downloads: 32 This Week
    Last Update:
    See Project
  • 12
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Aspose for Hadoop

    Aspose for Hadoop

    This project holds source code for Aspose for Hadoop project.

    Aspose for Hadoop project enables Apache Hadoop / MapReduce developers to work with various binary file formats. The developers can create and convert binary sequence files into text sequence files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Failure Aware Hadoop Scheduler

    Failure aware Hadoop scheduler

    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    UI To the Hadoop HBase Project
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16

    HDFSFileTransfer

    File transfer from local FS to HDFS

    The HDFSFileTransfer project was created and developed to ease Hadoop users quickly copying varied files such as: flat, structured, unstructured, big and small from linux to Hadoop File System (HDFS). It allows users to transfer files: - within the same physical machine - from local file system (linux) into HDFS - between two physical machines - copy files from local file system (linux) with HDFS cluster installed to another HDFS cluster. Sample - one can have two single clustered Hadoop...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    BIRT Report Designer

    BIRT Report Designer

    Open Source Reporting & Data Visualization Platform

    .... With a flexible Open Data Access framework, developers can write custom data drivers to access data from any source, including Big Data sources like Apache Hadoop, Cassandra, and MongoDB, along with all traditional relational databases, Flat Files, XML data streams, and data stored in proprietary systems. Built for embedding, BIRT includes APIs for data access, chart generation, output formats, content execution, and integration within larger applications.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 18
    This project aims to reduce the data read redundancy in the Apache Hadoop.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Crowbar

    Crowbar

    A complete operations platform to deploy, maintain and scale clusters.

    The Crowbar Project is an effort to build a complete, easy to use operational platform for everyone. It allows for any number of physical nodes to be moved from bare-metal to production cluster within hours. Specific applications include (but are not limited to) Hadoop and OpenStack.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    Pydoop is a Python MapReduce and HDFS API for Hadoop.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    A Hadoop based genome assembler for assembling large genomes in the clouds
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Seal

    Seal

    A toolkit for distributed processing of HT sequencing data.

    Seal is a Hadoop-based distributed short read alignment and analysis toolkit. Currently Seal includes tools for: read demultiplexing, read alignment, duplicate read removal, sorting read mappings, and calculating statistics for empirical base quality recalibration. Seal scales, easily handling TB of data.
    Downloads: 0 This Week
    Last Update:
    See Project