Showing 97 open source projects for "data science"

View related business solutions
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • Save Up to 91% on Cloud Compute With Spot VMs Icon
    Save Up to 91% on Cloud Compute With Spot VMs

    Automatic sustained-use discounts. One free VM per month. No negotiation needed.

    Run batch jobs at 60-91% off with Spot VMs. Long-running workloads get automatic discounts with sustained use.
    Try Free
  • 1
    Hibernate

    Hibernate

    An object relational-mapping (ORM) library for Java

    The Hibernate projects offer a suite of powerful Java libraries to work with data. It is best known for Hibernate ORM, which provides relational persistence for Java models and is an implementation of the Jakarta Persistence specification. Hibernate projects do not consistently release binaries or documentation to SourceForge anymore. For up-to-date information, refer to the Hibernate website: * Hibernate ORM: https://hibernate.org/orm/ * Hibernate Validator:...
    Leader badge
    Downloads: 3,446 This Week
    Last Update:
    See Project
  • 2
    GeoNetwork opensource - Metadata Catalog
    A web based Metadata Catalog Server for data description and discovery. Supports both generic and geospatial data discovery. It implements international standards (e.g. ISO19115/19139/19119, ISO19115-3, DCAT-AP, CSW 2.0, OGC API Records). It originates from the United Nations and is used by many governments as geoportal software. Active development and discussion takes place on GitHub and OSGeo Discourse. You can also connect directly with the companies supporting the...
    Leader badge
    Downloads: 181 This Week
    Last Update:
    See Project
  • 3
    AMBIT:Chemical Structure DB&Web Service

    AMBIT:Chemical Structure DB&Web Service

    Chemical structures database & machine learning with web services API

    AMBIT offers a cheminformatics data management for chemical substances, structures and nanomaterials. Flexible structure, similarity and study queries storage, descriptor calculation and predictive models building via REST web services.User interface for creating read across assessment and generating reports, aligned with regulatory and industrial requirements. Command line applications also available. Integration with third party tools and databases.
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    Treebolic is a Java suite (including library) whose purpose is to give a hyperbolic rendering of hierarchical data. A tree is rendered with nodes and edges but display space is subject to a particular curvature (hence the name).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    The Lemur Project

    The Lemur Project

    Search engine and data mining applications and ClueWeb datasets.

    The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
    Leader badge
    Downloads: 25 This Week
    Last Update:
    See Project
  • 6
    JAMon API

    JAMon API

    Monitor Java applications - SQL, HTTP, Methods, Exceptions and more.

    JAMon API is a free, simple, high performance, thread safe, Java API that allows developers to easily monitor the performance and scalability of production applications. JAMon tracks hits, execution times (total, avg, min, max, std dev), and more. * JAMon Users Manual: For more on the JAMon, including installing, configuring, and using it, see http://jamonapi.sourceforge.net/. * Support: If you have any questions about usage please post a question on the forum at ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 7
    Linha do Texto is a web based game for classificate semiotic text content from user input data with 1 to 4 parameters predifined in each section. It can be used to discuss discrete and continuous semantic categories in a scientific and educational fied.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Software, information, data sets and documentation for the Web as Corpus community.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    libwlocate is a shared library that can be used for location-based services. It checks for WLAN-networks that are available near to a user and retrieves the current geographic position out of them. Thus no GPS is needed for positioning. This WLAN-based geolocation happens without using any of the commercial services that do the same but track the users position and data. libwlocate instead bases on the free OpenWLANMap project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    panFMP
    panFMP is a generic framework suitable for harvested XML metadata that is searchable through Apache Lucene without any additional RDBMS. Fields can be defined by XPath allowing for full text queries on all types of fields including numerical ranges. The code was moved to Github: https://github.com/pangaea-data-publisher/panfmp
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    XML:DB Initiative for XML Databases
    XML:DB provides a community for collaborative development of specifications for XML databases and data manipulation technologies. Along with each specification an open source reference implementation will be developed to validate the ideas.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    jDHTUQ
    ATTENTION: For more info and latest version, refer to https://github.com/estigma88/jdhtuq. jDHTUQ is a peer-to-peer DHT system based in Chord algorithm, but built like API to generalize the implementation of peer-to-peer DHT systems. jDHTUQ have two fundamental services , put and get of resource.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Object Oriented Distributed Systems

    Simplified Architecture for Distributed Systems

    OODVS is a Java framework which goal is to increase the Java runtime support for Object Oriented Distributed Virtual Systems providing local access to distributed resources via proxies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Ycad is a library of CAD functions in Java. Currently only DXF is supported for reading, viewing and writing. The DXF drawing may be rendered to a Graphics object for printing or imaging.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Seldon Server

    Seldon Server

    Machine learning platform and recommendation engine on Kubernetes

    ...Seldon Core is a progression of the goals of the Seldon-Server project but also a more restricted focus to solving the final step in a machine learning project which is serving models in production. Seldon Server is a machine learning platform that helps your data science team deploy models into production. It provides an open-source data science stack that runs within a Kubernetes Cluster. You can use Seldon to deploy machine learning and deep learning models into production on-premise or in the cloud (e.g. GCP, AWS, Azure).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    DSTK - DataScience ToolKit is an opensource free software for statistical analysis, data visualization, text analysis, and predictive analytics. Newer version and smaller file size can be found at: https://sourceforge.net/projects/dstk3/ It is designed to be straight forward and easy to use, and familar to SPSS user. While JASP offers more statistical features, DSTK tends to be a broad solution workbench, including text analysis and predictive analytics features. Of course you may specify...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Makumba

    Makumba

    Moved to GitHub: https://github.com/makumba

    Makumba helps you rapidly develop data driven web applications. Provides a custom JSP taglib as a main interface, but leaves API open for advanced access. It is implemented in Java. Moved to GitHub: https://github.com/makumba
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18
    Generic Model Organism Database Project
    GMOD is a set of interoperable open source software components for visualizing, annotating, and managing biological data. See http://gmod.org for more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Identifiers.org is a system providing resolvable and persistent identifiers, in the form of URIs, used to identify data for the scientific community, with a current focus on the Life Sciences domain. The provision of resolvable identifiers (URLs) fits well with the Semantic Web vision, and the Linked Data initiative. The resolving services of Identifiers.org rely on the information stored in the MIRIAM Registry.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Implementation of an OGC Web Map Service, primarily for serving images dynamically generated from NetCDF data files (although other file formats can also be handled). Also an AJAX web mapping interface based on OpenLayers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    *The Fedora repository project is no longer being maintained here. All current information can be found on the project wiki (https://wiki.lyrasis.org/display/FF/Fedora+Repository+Home)* Fedora is an open-source software to ensure durability and integrity of digital content, use semantics to contextualize and inter-relate content from many sources, and to enable the creation of innovative, collaborative information spaces.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 22
    Toolboxes for SPM (http://www.fil.ion.ucl.ac.uk/spm/) developed at Freiburg Brain Imaging (http://fbi.uniklinik-freiburg.de/)
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23

    Immutable Sparse Wave Trees (WaveTree)

    Realtime bigdata tool for bit strings up to 2^63 based on AVL forest

    Realtime bigdata tool at the bit level based on immutable AVL forest which can be run in memory or, in future versions, as a merkle forest like a blockchain. Main object is a sparse bit string (Bits) that efficiently scales up to 2^63 bits normally compressed as forest has duplicated substrings. Bits objects support reading bit, byte, short, int, or long (Java primitives) at any bit index in 64 bit range. Example: instead of building a class to hold a header and then data, represent all of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    bio2rdf
    The Bio2RDF project aims to transforms silos of life science data into a globally distributed network of linked data for biological knowledge discovery. Bio2RDF creates and provides machine understandable descriptions of biological entities using the RDF/RDFS/OWL Semantic Web languages. Using both syntactic and semantic data integration techniques, Bio2RDF seamlessly integrates diverse biological data and enables powerful new SPARQL-based services across its globally distributed knowledge bases.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Geoportal Server
    Geoportal Server is a standards-based, open source product that enables discovery and use of geospatial resources including data and services.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
Auth0 Logo