Showing 3911 open source projects for "source"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Diffgram

    Diffgram

    Training data (data labeling, annotation, workflow) for all data types

    ...Diffgram is a single application that will improve your data labeling and bring all aspects of training data under a single roof. Diffgram is world’s first truly open source training data platform that focuses on giving its users an unlimited experience. This is aimed to reduce your data labeling bills and increase your Training Data Quality. Training Data is the art of supervising machines through data. This includes the activities of annotation, which produces structured data; ready to be consumed by a machine learning model. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Dagster

    Dagster

    An orchestration platform for the development, production

    Dagster is an orchestration platform for the development, production, and observation of data assets. Dagster as a productivity platform: With Dagster, you can focus on running tasks, or you can identify the key assets you need to create using a declarative approach. Embrace CI/CD best practices from the get-go: build reusable components, spot data quality issues, and flag bugs early. Dagster as a robust orchestration engine: Put your pipelines into production with a robust...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Timesketch

    Timesketch

    Collaborative forensic timeline analysis

    Timesketch is a collaborative forensic timeline analysis platform used to investigate security incidents by turning diverse evidence into a single, searchable chronology. Analysts ingest logs and artifacts from many sources—endpoints, servers, cloud services—and Timesketch normalizes them into events on a unified timeline. Powerful search, aggregations, and saved views help you pivot quickly, highlight anomalies, and preserve investigative steps for later review. The system supports tagging,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Gorse Recommender System Engine

    Gorse Recommender System Engine

    An open source recommender system service written in Go

    ...Analyze online recommendation performance from recently inserted feedback. Provide GUI for data management, system monitoring, and cluster status checking. Gorse is an open-source recommendation system written in Go. Gorse aims to be a universal open-source recommender system that can be easily introduced into a wide variety of online services. By importing items, users and interaction data into Gorse, the system will automatically train models to generate recommendations for each user.
    Downloads: 1 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    plotly

    plotly

    An interactive graphing library for R

    This part of the book teaches you how to leverage the plotly R package to create a variety of interactive graphics. There are two main ways to creating a plotly object: either by transforming a ggplot2 object (via ggplotly()) into a plotly object or by directly initializing a plotly object with plot_ly()/plot_geo()/plot_mapbox(). Both approaches have somewhat complementary strengths and weaknesses, so it can pay off to learn both approaches. Moreover, both approaches are an implementation of...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Apache HBase

    Apache HBase

    Get random, realtime read/write access to your Big Data

    Use Apache HBase™ when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables, billions of rows X millions of columns, atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable. A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, Apache HBase provides Bigtable-like capabilities on top of Hadoop and HDFS. Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Akka

    Akka

    Build concurrent, distributed, and resilient message-driven apps

    Build powerful reactive, concurrent, and distributed applications more easily. Akka is a toolkit for building highly concurrent, distributed, and resilient message-driven applications for Java and Scala. Actors and Streams let you build systems that scale up, using the resources of a server more efficiently, and out, using multiple servers. Building on the principles of The Reactive Manifesto Akka allows you to write systems that self-heal and stay responsive in the face of failures. Up to...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Space Radar Electron

    Space Radar Electron

    Disk And Memory Space Visualization App built with Electron & d3.js

    Space Radar Electron is an application that offers an interactive and comprehensive visualization of disk space and memory usage of your computer. Built with Electron & d3.js, it currently offers visualizations in the form of Sunburst, Treemap and Flamegraph charts. As it scans the contents of your disk, it produces a preview visualization so you can already see what's been scanned. It allows for drilldown of directories, breadcrumbs and navigation. Space Radar works fast, and is...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Apache RocketMQ

    Apache RocketMQ

    Distributed messaging and streaming platform with low latency

    Apache RocketMQ is a distributed messaging and streaming platform with low latency, high performance and reliability, trillion-level capacity and flexible scalability. Messaging patterns including publish/subscribe, request/reply and streaming. Financial grade transactional message. Built-in fault tolerance and high availability configuration options base on DLedger. A variety of cross language clients, such as Java, C/C++, Python, Go. Pluggable transport protocols, such as TCP, SSL, AIO....
    Downloads: 2 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 10
    DataChain

    DataChain

    AI-data warehouse to enrich, transform and analyze unstructured data

    Datachain enables multimodal API calls and local AI inferences to run in parallel over many samples as chained operations. The resulting datasets can be saved, versioned, and sent directly to PyTorch and TensorFlow for training. Datachain can persist features of Python objects returned by AI models, and enables vectorized analytical operations over them. The typical use cases are data curation, LLM analytics and validation, image segmentation, pose detection, and GenAI alignment. Datachain...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Infiltrator.jl

    Infiltrator.jl

    No-overhead breakpoints in Julia

    This package provides the @infiltrate macro, which acts as a breakpoint with negligible runtime performance overhead. Note that you cannot access other function scopes or step into further calls. Use an actual debugger if you need that level of flexibility. Running code that ends up triggering the @infiltrate REPL mode via inline evaluation in VS Code or Juno can cause issues, so it's recommended to always use the REPL directly. When the infiltration point is hit, it will drop you into an...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Personal Management System

    Personal Management System

    Your web application for managing personal data

    It's easier to understand this web application when you think about a CMS (WordPress) or CRM (SugarCRM); the logic behind this system is very similar to those two. My PMS may offer fewer possibilities than those systems above, but it just does what I want it to do. Additionally, writing extensions is not too hard, depending on the logic required. Anyone with development knowledge can pretty much write their own extensions for personal needs. Keep a track of your personal goals. You can use...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    XGBoost

    XGBoost

    Scalable and Flexible Gradient Boosting

    XGBoost is an optimized distributed gradient boosting library, designed to be scalable, flexible, portable and highly efficient. It supports regression, classification, ranking and user defined objectives, and runs on all major operating systems and cloud platforms. XGBoost works by implementing machine learning algorithms under the Gradient Boosting framework. It also offers parallel tree boosting (GBDT, GBRT or GBM) that can quickly and accurately solve many data science problems....
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    KernelDensity.jl

    KernelDensity.jl

    Kernel density estimators for Julia

    Kernel density estimators for Julia.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    city-roads

    city-roads

    Visualization of all roads within any city

    city-roads is a data visualization and mapping project that renders street networks of cities as vector paths, offering an interactive, zoomable experience that highlights how cities are stitched together by their road infrastructure. It typically fetches open map data (such as from OpenStreetMap) and then processes that data into geometry suited for rendering in the browser, allowing users to explore intricate road layouts at various scales. The visual style often abstracts away labels and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    see

    see

    Visualisation toolbox for beautiful and publication-ready figures

    see is an R package that serves as the visualization component of the easystats ecosystem, providing plotting utilities to produce publication-ready visualizations of statistical model parameters, diagnostics, predictions, and performance metrics. It works in conjunction with other easystats packages (such as parameters, performance, modelbased, bayestestR, etc.) to convert model outputs or summary objects into visual forms (dot-and-whisker plots, diagnostic plots, residual plots, etc.). It...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    DearPyGui

    DearPyGui

    Graphical User Interface Toolkit for Python with minimal dependencies

    Dear PyGui is an easy-to-use, dynamic, GPU-Accelerated, cross-platform graphical user interface toolkit(GUI) for Python. It is “built with” Dear ImGui. Features include traditional GUI elements such as buttons, radio buttons, menus, and various methods to create a functional layout. Additionally, DPG has an incredible assortment of dynamic plots, tables, drawings, debuggers, and multiple resource viewers. DPG is well suited for creating simple user interfaces as well as developing complex...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    MIRT.jl

    MIRT.jl

    MIRT: Michigan Image Reconstruction Toolbox (Julia version)

    MIRT.jl is a collection of Julia functions for performing image reconstruction and solving related inverse problems. It is very much still under construction, although there are already enough tools to solve useful problems like compressed sensing MRI reconstruction. Trying the demos is a good way to get started. The documentation is even more still under construction.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Vespa

    Vespa

    The open big data serving engine

    Make AI-driven decisions using your data, in real-time. At any scale, with unbeatable performance. Vespa is a full-featured text search engine and supports both regular text search and fast approximate vector search (ANN). This makes it easy to create high-performing search applications at any scale, whether you want to use traditional techniques or a modern vector-based approach. You can even combine both approaches efficiently in the same query, something no other engine can do....
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Genie

    Genie

    Distributed Big Data Orchestration Service

    Genie is a completely open source distributed job orchestration engine developed by Netflix. Genie provides REST-ful APIs to run a variety of big data jobs like Hadoop, Pig, Hive, Spark, Presto, Sqoop and more. It also provides APIs for managing the metadata of many distributed processing clusters and the commands and applications which run on them.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    janitor

    janitor

    Simple tools for data cleaning in R

    janitor provides simple, convenient tools for data cleaning, formatting, and exploration in R. It is especially useful for cleaning messy data frames, removing duplicates, formatting column names, and producing frequency tables in a tidy workflow.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Reactor Core

    Reactor Core

    Non-Blocking Reactive Foundation for the JVM

    Reactor Core is a foundational library for building reactive applications in Java, providing a powerful API for asynchronous, non-blocking programming.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    UnROOT.jl

    UnROOT.jl

    Native Julia I/O package to work with CERN ROOT files objects

    UnROOT.jl is a reader for the CERN ROOT file format written entirely in Julia, without any dependence on ROOT or Python.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Latexify.jl

    Latexify.jl

    Convert julia objects to LaTeX equations, arrays or other environments

    This is a package for generating LaTeX maths from Julia objects. This package utilizes Julia's homoiconicity to convert expressions to LaTeX-formatted strings. Latexify.jl supplies functionalities for converting a range of different Julia objects.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    ClearML

    ClearML

    Streamline your ML workflow

    ...The ClearML Server storing experiment, model, and workflow data, and supports the Web UI experiment manager, and ML-Ops automation for reproducibility and tuning. It is available as a hosted service and open source for you to deploy your own ClearML Server. The ClearML Agent for ML-Ops orchestration, experiment and workflow reproducibility, and scalability.
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB