Showing 144 open source projects for "hadoop"

View related business solutions
  • SKUDONET Open Source Load Balancer Icon
    SKUDONET Open Source Load Balancer

    Take advantage of Open Source Load Balancer to elevate your business security and IT infrastructure with a custom ADC Solution.

    SKUDONET ADC, operates at the application layer, efficiently distributing network load and application load across multiple servers. This not only enhances the performance of your application but also ensures that your web servers can handle more traffic seamlessly.
  • SysAid multi-layered ITSM solution Icon
    SysAid multi-layered ITSM solution

    For organizations spanning all industries and sizes from SMBs to Fortune 500 corporations

    SysAid is an ITSM, Service Desk and Help Desk software solution that integrates all of the essential IT tools into one product. Its rich set of features include a powerful Help Desk, IT Asset Management, and other easy-to-use tools for analyzing and optimizing IT performance.
  • 1

    Distributed advertising system

    hadoop and storm-based distributed advertising system

    hadoop and storm-based distributed advertising system,Provides real-time bidding and pay per click advertising function
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    PureHadoop

    pHd - Pure Apache Hadoop Distribution

    A pure build of Apache Hadoop 2.2 from the source. This represents the purest form of Hadoop available. Canned CentOS 6.5 Single node VM available for fast start and quick sandbox. This is NOT a vendor distro from Cloudera, Hortonworks, or MapR. No junk! Just pure Apache. VM is 64 bit native build.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    This is a toolkit to help developer to install hadoop cluster on hiCloud VMs. Note: hiCloud is the Amazon EC2 like service provided by CHT, Taiwan.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    A next gen sequencing analysis pipeline designed to run on hadoop/hdfs written in java and PIG. For more info, contact Zack Ramjan at USC
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gain insights and build data-powered applications Icon
    Gain insights and build data-powered applications

    Your unified business intelligence platform. Self-service. Governed. Embedded.

    Chat with your business data with Looker. More than just a modern business intelligence platform, you can turn to Looker for self-service or governed BI, build your own custom applications with trusted metrics, or even bring Looker modeling to your existing BI environment.
  • 5

    Uragan

    Custom Search Engine based on Apache Hadoop platform

    Uragan is the Custom Search Engine build on Apache Hadoop architecture. It allows to fetch 1 Tb of data in a daily basis. 100% relevance achived by custom configurable list of sources (urls or sites). It also applies custom extraction templates to point which information blocks needed for extraction. Its architecure designed on inspiration of Apache Nutch search engine. But more less work in a different two cycles process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Jxtadoop

    Jxtadoop

    This project aims to provide P2P capabilities with Hadoop DFS.

    Hadoop is designed to work in large datacenters with thousands of servers connected to each others in the Hadoop cloud. This project focuses on the Distributed File System part of Hadoop (HDFS). The goal of this project is to provide an alternative to direct IP connectivity required for Hadoop. Instead, the DFS layer has been modified to use a Peer-2-Peer framework which allows direct connectivity in datacenters as well as indirect connectivity to bypass firewall constraints. The typical use...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    testHadoop

    Project to work with binary files on hadoop

    Aspose for Hadoop will enable hadoop developers to work with binary file formats on Hadoop by converting binary sequence files into text sequence files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    DynamicMR

    A Dynamic Slot Allocation and Scheduling System for MapReduce Clusters

    DynamicMR is a dynamic slot allocation and scheduling framework aiming to improve the performance of Hadoop under Hadoop Fair Scheduler (HDFS) by maximizing the slots utilization while guaranteeing the fairness across pools. It consists of three levels of scheduling components, namely, Dynamic Hadoop Fair Scheduler (DHFS), Dynamic Speculative Task Scheduler (DSTS), and Data Locality Maximization Scheduler (DLMS).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Downloads: 0 This Week
    Last Update:
    See Project
  • Vivantio IT Service Management Icon
    Vivantio IT Service Management

    Your service operation isn’t one-size-fits all, so your IT service management solution shouldn’t be either

    The Vivantio Platform allows you to focus on the IT service management tools that make sense for your organization’s unique service model: from incident, problem and change requests, to service requests, client knowledge and asset management
  • 10

    Hadoop

    Integration of Virtualization with Hadoop tools.

    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    DHFS

    A dynamic slot allocation technique to improve performance for HFS.

    Dynamic Hadoop Fair Scheduler (DHFS) is an optimized Hadoop Fair Scheduler that improves the performance of Hadoop by maximizing the slots utilization while guarantees the fairness across pools. It is based on the observation that at different period of time there may be idle map (or reduce) slots, as the job proceeds from map phase to reduce phase. We can use the unused map slots for those overloaded reduce tasks to improve the performance of the MapReduce workload, and vice versa, by breaking...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    JUMMP

    JUMMP

    JUMMP: Job Uninterrupted Maneuverable MapReduce Platform

    JUMMP is an automated scheduling platform that provides a customized Hadoop environment within a batch-scheduled cluster environment. JUMMP enables an interactive pseudo-persistent MapReduce platform within the existing administrative structure of an academic high performance computing center by “jumping” between nodes with minimal administrative effort. Jumping is implemented by the synchronization of stopping and starting daemon processes on different nodes in the cluster. Use...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    R Hadoop for Big Data

    R Hadoop for Big Data

    Download Free Associated R open source script files for big data analy

    Download Free Associated R open source script files for big data analysis with Hadoop and R These are R script source file from Ram Venkat from a past Meetup we did at http://www.meetup.com/R-Matlab-Users/events/85160532/ Also, there is a long video and Powerpoint presentation slide PDF with R files at: http://quantlabs.net/blog/2012/11/how-to-use-hadoop-and-r-for-big-data-parallel-processing-free-download-pdf/ Download source files from http://quantlabs.net/blog/2012/11/download-free...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    eneningSearch

    Based on hadoop search engine

    Based hadoop vertical search system
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    hadoop4win
    Hadoop for Windows using Cygwin
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    distmap

    A toolkit for distributed short read mapping

    DistMap is a user-friendly pipeline designed to map short reads in a MapReduce framework on a local Hadoop cluster. It is designed to be easily implemented by researchers who do not have expert knowledge of bioinformatics. As it does not have any dependencies, DistMap provides full flexibility and control to the user. The user can use any version of a compatible mapper and any reference genome assembly. There is no need to maintain the mapper, reference or DistMap source code on each...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    HadStat

    HadStat is service on cloud,for data analysis using Hadoop MapReduce.

    HadStat is service on the cloud, allow you to analysis the data on the cloud and return the result in nice graph,this service is free, you can redistribute it and/or modify it under the terms of the GNU General Public License. this service using many technologies , like Hadoop mapreduce, HTML, PHP, Web Service applications, linux server, java, eclipse IDE, with many indicators:Simple moving average (SMA),Exponential moving average (EMA),Smoothed simple moving average (SMMA),Linear weighted...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Standalone HDFS
    Hadoop is a great project for deep analytics based on the MapReduce features. It also includes a powerful distributed file system designed to ensure that the analytics workloads can locally access the data to be processed to minimize the network bandwidth impact. I found this filesystem very useful to leverage storage from all my PCs and even from some of my online storage such as S3. However i did not want to deploy the full hadoop stack. Hence my decidion to create a standalone distribution...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    HadoopFileManager

    Console File Manager for Hadoop, written on java.

    Console File Manager for Hadoop, written on java. For Linux only. Left panel contains local files, right - files from HDFS. For run execute: hadoop jar HadoopFileManager-0.1.0-DEMO.jar Lanterna library as UI. For avoid additional classpath, included into main jar. Current version is just demo, for check display possibility.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    OozieWorkflowViewer

    Eclipse plugin for view Appache Oozie workflow structure

    Eclipse plugin for view Appache Hadoop Oozie workflow structure. Plugin contains one view in Oozie category. Put plugin in eclipse/plugins directory. Run Eclipse, open workflow in editor. Open view from menu Window>Show View>Other>Oozie. Workflow structure will be displayed in view as tree.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Hadoop Network Topology

    Network Topology Discovery and Hardware Failure Detection in Hadoop

    The project involves detection of hardware failures and discovery of the network topology within the Hadoop cluster.The application developed in this project, col- lects various network and hardware components information from all the Datanodes and analyses them to detect the network failures, CPU and Harddisk failures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    oozie-workflow-checker

    Validation of complex Apache Oozie Hadoop workflow

    Library validated complex Oozie workflows (http://oozie.apache.org/). Two usage scenarios: 1) Execute workflow with specified parameters, and as result get list of passed nodes. Sample in WorkflowDirProcessorIntegrationTest Note: from all workflow functions only "wf:conf" is supported now. 2) Check called actions exists or build full call tree in xml format Sample in OozieWorkflowCheckerTest: You can override properties from "config-default.xml" and "job.properties" by file with name...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    A web-based interface for the Hadoop MapReduce framework that simplifies the process of writing and running MapReduce jobs. Aimed at introducing parallelism concepts in introductory computer science courses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    MR-plus

    MR+ advocates a departure from a fixed two-stage process of MapReduce.

    The implementation of MR+ is derived from Hadoop MapReduce. However, unlike Hadoop, MR+ enables both map and reduce to interleave, as well as the same key to be reduced by different reduce workers in parallel – permitting multi- level reduces. In the MR+ implementation, although the role of the JobTracker and Task- Tracker is identical to that in Hadoop, the details of task scheduling are completely different. To achieve interleaving of maps and reduces, MR+ tasks are allocated according...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    SOAPstreaming

    Hadoop-based Framework for Analyzing Large Scale NGS Data

    SOAPstreaming, a flexible hadoop-based framework for systematically analyzing large scale NGS data. SOAPstreaming uses an enhanced Hadoop streaming strategy of MapReduce model to parallelize tools, which offers great scalability and efficiently reduces running time of large NGS datasets. Moreover, it has nice expansion capability to integrate new tools for different analysis needs, which more and more tools will be added in this framework.
    Downloads: 0 This Week
    Last Update:
    See Project