Search Results for "kettle data integration"

Showing 17 open source projects for "kettle data integration"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI Studio. Switch between models without switching platforms.
    Start Free
  • 1
    Cassandra Spark Connector

    Cassandra Spark Connector

    Apache Spark to Apache Cassandra connector

    The Apache Cassandra Spark Connector allows Spark jobs (RDDs or DataFrames/Datasets) to read from and write to Cassandra tables. Compatible with Apache Cassandra (v2.1+), Spark 1.0–3.5, and Scala 2.11–2.13, it supports mapping Cassandra rows to Scala case classes, saving results back to Cassandra, and executing arbitrary CQL within Spark applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Scio

    Scio

    A Scala API for Apache Beam and Google Cloud Dataflow

    Scio is a Scala API developed by Spotify that builds on Apache Beam to enable expressive batch and streaming data pipelines, optimized for running on Google Cloud Dataflow. Inspired by Spark and Scalding, it provides scalable, type‑safe, and production-grade data processing, with built-in support for BigQuery, Pub/Sub, Cassandra, Elasticsearch, Redis, TensorFlow IO, and more.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Airframe

    Airframe

    Essential Building Blocks for Scala

    Airframe is an essential building block for developing applications in Scala, including logging, object serialization using JSON or MessagePack, dependency injection, HTTP server/client with RPC support, functional testing with AirSpec, etc. Airframe RPC supports seamless integration of servers and clients using Scala as RPC interfaces. AirSpec is a simple unit testing framework for Scala and Scala.js. You can use public methods in your classes as test cases. There is no need to remember...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    ZIO JSON

    ZIO JSON

    Fast, secure JSON library with tight ZIO integration

    ZIO Json is a fast and secure JSON library with tight ZIO integration. The goal of this project is to create the best all-round JSON library for Scala. Extreme performance is achieved by decoding JSON directly from the input source into business objects (docs/inspired by plokhotnyuk). Although not a requirement, the latest advances in Java Loom can be used to support arbitrarily large payloads with near-zero overhead. Best-in-class security is achieved with an aggressive early exit strategy...
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    Chimney

    Chimney

    Scala library for boilerplate-free, type-safe data transformations

    Chimney is a Scala library that facilitates boilerplate-free, type-safe data transformations between different data types. It enables developers to define mappings between source and target types, ensuring that transformations are checked at compile time, thereby reducing runtime errors and enhancing code reliability.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Monocle

    Monocle

    Optics library for Scala

    Monocle is a pure functional, optics library for Scala providing immutable data access and transformation tools — including Lens, Prism, Iso, Optional, and Traversal. It enables composable, declarative modifications of deeply nested immutable structures in a concise and type-safe fashion.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    GitBucket

    GitBucket

    A Git platform powered by Scala

    A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility. GitBucket is a Git web platform powered by Scala offering, easy installation, intuitive UI, high extensibility by plugins, API compatibility with GitHub. You can also deploy gitbucket.war to a servlet container which supports Servlet 3.0 (like Jetty, Tomcat, JBoss, etc). To upgrade GitBucket, replace gitbucket.war with the new version, after stopping GitBucket. All GitBucket data is stored...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Synapse Machine Learning

    Synapse Machine Learning

    Simple and distributed Machine Learning

    ...SynapseML builds on Apache Spark and SparkML to enable new kinds of machine learning, analytics, and model deployment workflows. SynapseML adds many deep learning and data science tools to the Spark ecosystem, including seamless integration of Spark Machine Learning pipelines with the Open Neural Network Exchange (ONNX), LightGBM, The Cognitive Services, Vowpal Wabbit, and OpenCV. These tools enable powerful and highly-scalable predictive and analytical models for a variety of data sources. SynapseML also brings new networking capabilities to the Spark Ecosystem. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ScalaCheck

    ScalaCheck

    Property-based testing for Scala

    ScalaCheck is a library for property-based testing in Scala (and Java), inspired by Haskell’s QuickCheck. It automatically generates test inputs based on specifications, validating that properties hold across randomized scenarios, thereby enabling robust, declarative testing of edge cases and invariants.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Powerful App Monitoring Without Surprise Bills Icon
    Powerful App Monitoring Without Surprise Bills

    AppSignal starts at $23/month with all features included. No overages, no hidden fees. 30-day free trial.

    Tired of monitoring tools that punish you for scaling? AppSignal offers transparent, predictable pricing with every feature unlocked on every plan. Track errors, monitor performance, detect anomalies, and manage logs across Ruby, Python, Node.js, and more. Trusted by developers since 2012 with free dev-to-dev support. No credit card required to start your 30-day trial.
    Try AppSignal Free
  • 10
    Scalaz

    Scalaz

    Principled Functional Programming in Scala

    Scalaz is a foundational functional-programming library for Scala that provides type classes, data types, and syntax to write pure, composable code. It implements classic abstractions such as Functor, Applicative, Monad, Monoid, Foldable, and Traverse, along with powerful transformers (ReaderT, StateT, WriterT, OptionT, and more) to structure effects. The library offers rich data structures—\/ (disjunction), Validation, NonEmptyList, IList, and Free—that help model errors, invariants, and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Elastiknn

    Elastiknn

    Elasticsearch plugin for nearest neighbor search

    Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity searches using exact and approximate algorithms. Methods like word2vec and convolutional neural nets can convert many data modalities (text, images, users, items, etc.) into numerical vectors, such that pairwise distance computations on the vectors correspond to semantic similarity of the original data. Elasticsearch is a ubiquitous search solution, but its support for vectors is limited. This plugin fills the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Gatling

    Gatling

    Modern Load Testing as Code

    Gatling is a high-performance load testing tool built on the JVM that emphasizes realism, scalability, and developer ergonomics. Test scenarios are scripted in a concise Scala-based DSL, allowing you to model user journeys with think times, feeders (dynamic data), checks, and assertions all in code. Its asynchronous, non-blocking engine (backed by Netty) can drive very high concurrency from a single injector, reducing the need for large injector farms. Gatling supports HTTP out of the box as...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    Scala 2

    Scala 2

    Scala 2 compiler and standard library

    Scala combines object-oriented and functional programming in one concise, high-level language. Scala's static types help avoid bugs in complex applications, and its JVM and JavaScript runtimes let you build high-performance systems with easy access to huge ecosystems of libraries. Scastie is Scala + sbt in your browser! You can use any version of Scala, or even alternate backends such as Dotty, Scala.js, Scala Native, and Typelevel Scala. You can use any published library. You can save and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Spark NLP

    Spark NLP

    State of the Art Natural Language Processing

    Experience the power of large language models like never before, unleashing the full potential of Natural Language Processing (NLP) with Spark NLP, the open source library that delivers scalable LLMs. The full code base is open under the Apache 2.0 license, including pre-trained models and pipelines. The only NLP library built natively on Apache Spark. The most widely used NLP library in the enterprise. Spark ML provides a set of machine learning applications that can be built using two main...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Algebird

    Algebird

    Abstract Algebra for Scala

    Algebird is Twitter’s Apache‑licensed Scala library providing abstract algebra data structures and algorithms, especially for online/streaming aggregation. It includes Monoid, Approximate, HyperLogLog, CMS, BloomFilter, Min/Max, Averaged Value types, supporting efficient distributed aggregation and approximate analytics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Feathr

    Feathr

    A scalable, unified data and AI engineering platform for enterprise

    Feathr is a data and AI engineering platform that is widely used in production at LinkedIn for many years and was open sourced in 2022. It is currently a project under LF AI & Data Foundation. Define data and feature transformations based on raw data sources (batch and streaming) using Pythonic APIs. Register transformations by names and get transformed data(features) for various use cases including AI modeling, compliance, go-to-market and more. Share transformations and data(features)...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Summingbird

    Summingbird

    Streaming MapReduce with Scalding and Storm

    Summingbird is a streaming + batch hybrid computation framework developed by Twitter. Its aim is to let developers express data aggregation pipelines in a unified way, where the same logic can run either in real time (stream) or in batch mode, and the results can be merged or reconciled. In effect, Summingbird abstracts over multiple execution engines (such as Storm, Scalding, etc.) to provide one high-level program that composes transformations and aggregations, and then executes them in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB