Showing 22 open source projects for "feature selection"

View related business solutions
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    River ML

    River ML

    Online machine learning in Python

    River is a Python library for online machine learning. It aims to be the most user-friendly library for doing machine learning on streaming data. River is the result of a merger between creme and scikit-multiflow.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    caret

    caret

    caret (Classification And Regression Training) R package

    The caret (Classification And Regression Training) R package streamlines the process of building predictive machine learning models. It provides uniform interfaces for model training, tuning, evaluation, preprocessing, and variable importance. With support for over 200 models, caret is foundational for R workflows in modeling and machine learning.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Kaggle Solutions

    Kaggle Solutions

    Collection of Kaggle Solutions and Ideas

    Kaggle Solutions is an open-source repository that compiles winning solutions, insights, and educational resources from hundreds of Kaggle data science competitions. The repository acts as a knowledge base for competitive machine learning by collecting solution write-ups, discussion threads, code notebooks, and tutorial resources shared by top Kaggle participants. Each competition entry typically includes information about the dataset, evaluation metrics, modeling strategies, and techniques...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Smile

    Smile

    Statistical machine intelligence and learning engine

    Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers the state-of-art performance. Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM...
    Downloads: 8 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    MLJAR Studio

    MLJAR Studio

    Python package for AutoML on Tabular Data with Feature Engineering

    We are working on new way for visual programming. We developed a desktop application called MLJAR Studio. It is a notebook-based development environment with interactive code recipes and a managed Python environment. All running locally on your machine. We are waiting for your feedback. The mljar-supervised is an Automated Machine Learning Python package that works with tabular data. It is designed to save time for a data scientist. It abstracts the common way to preprocess the data,...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    handson-ml2

    handson-ml2

    Jupyter notebooks that walk you through the fundamentals of ML

    This repository contains the Jupyter notebooks and code for the second edition of a popular hands-on machine learning book that teaches both classical ML and deep learning using modern tooling. The notebooks emphasize end-to-end workflows: data preparation, model selection, tuning, and reliable evaluation. Deep learning sections use the contemporary Keras/TensorFlow 2 ecosystem, highlighting clean APIs and eager execution to make experiments easier to reason about. Traditional ML topics remain central, with scikit-learn pipelines, feature engineering, and cross-validation patterns that transfer to real projects. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Practical Machine Learning with Python

    Practical Machine Learning with Python

    Master the essential skills needed to recognize and solve problems

    ...It centralizes example code, datasets, model pipelines, and explanatory notebooks that teach users how to approach problems from data ingestion and cleaning all the way through feature engineering, model selection, evaluation, tuning, and production-ready deployment patterns. The repository emphasizes end-to-end workflows rather than isolated code snippets, showing how to handle common challenges like class imbalance, overfitting, hyperparameter optimization, and interpretability. By leveraging popular Python libraries such as pandas, scikit-learn, XGBoost, and visualization tools, it illustrates how to build reproducible and robust solutions that scale beyond small demos.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    PoseidonQ  - AI/ML Based QSAR Modeling

    PoseidonQ - AI/ML Based QSAR Modeling

    ML based QSAR Modelling And Translation of Model to Deployable WebApps

    - This Software was made with an intention to make QSAR/QSPR development more efficient and reproducible. - Published in ACS, Journal of Chemical Information and Modeling . Link : https://pubs.acs.org/doi/10.1021/acs.jcim.4c02372 - Simple to use and no compromise on essential features necessary to make reliable QSAR models. - From Generating Reliable ML Based QSAR Models to Developing Your Own QSAR WebApp. For any feedback or queries, contact kabeermuzammil614@gmail.com - Available on...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 9
    scikit-learn-videos

    scikit-learn-videos

    Jupyter notebooks from the scikit-learn video series

    scikit-learn-videos repository accompanies a video tutorial series designed to teach machine learning using Python’s scikit-learn library. It provides the Jupyter notebooks used in each lesson so learners can reproduce the demonstrations and experiment with the code themselves. The series introduces fundamental machine learning concepts such as classification, regression, model evaluation, feature engineering, and cross-validation using clear examples and real datasets. Each video...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 10
    fe4ml-zh

    fe4ml-zh

    Feature Engineering for Machine Learning

    fe4ml-zh is an open-source project that provides a Chinese translation and structured documentation of the book Feature Engineering for Machine Learning. The repository aims to make advanced feature engineering concepts accessible to a broader audience by translating the content and organizing it into readable documentation and code examples. Feature engineering is a critical component of machine learning pipelines because it determines how raw data is transformed into features that...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    OmicSelector

    OmicSelector

    Feature selection and deep learning modeling for omic biomarker study

    OmicSelector is an environment, Docker-based web application, and R package for biomarker signature selection (feature selection) from high-throughput experiments and others. It was initially developed for miRNA-seq (small RNA, smRNA-seq; hence the name was miRNAselector), RNA-seq and qPCR, but can be applied for every problem where numeric features should be selected to counteract overfitting of the models. Using our tool, you can choose features, like miRNAs, with the most significant diagnostic potential (based on the results of miRNA-seq, for validation in qPCR experiments).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Yellowbrick

    Yellowbrick

    Visual analysis and diagnostic tools to facilitate ML selection

    Yellowbrick extends the Scikit-Learn API to make model selection and hyperparameter tuning easier. Under the hood, it’s using Matplotlib. Yellowbrick is a suite of visual diagnostic tools called "Visualizers" that extend the scikit-learn API to allow human steering of the model selection process. In a nutshell, Yellowbrick combines scikit-learn with matplotlib in the best tradition of the scikit-learn documentation, but to produce visualizations for your machine learning workflow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SGX-Full-OrderBook-Tick-Data-Trading

    SGX-Full-OrderBook-Tick-Data-Trading

    Providing the solutions for high-frequency trading (HFT) strategies

    SGX-Full-OrderBook-Tick-Data-Trading-Strategy is an open-source research project focused on modeling high-frequency financial market behavior using machine learning techniques. The repository analyzes tick-level order book data from the Singapore Exchange and attempts to capture the dynamics of limit order book movements. By extracting features such as order depth ratios and price movement indicators, the system trains machine learning models to predict short-term market changes. Several...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    CUDA-JMI

    Tool for feature selection using the JMI metric and multiple GPUs

    CUDA-JMI is a parallel tool to accelerate the feature selection process using Joint Mutual Information as metric. This tool receives as input a file with ARFF, CVS or LIBSVM extensions that contais the values of m individuals and n features and returns a file with those features that provide more non-rendundant information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    MLBox

    MLBox

    MLBox is a powerful Automated Machine Learning python library

    MLBox is a powerful Automated Machine Learning python library. Fast reading and distributed data preprocessing/cleaning/formatting. Highly robust feature selection and leak detection. Accurate hyper-parameter optimization in high-dimensional space. State-of-the-art predictive models for classification and regression (Deep Learning, Stacking, LightGBM,...) Prediction with model interpretation. MLBox has been developed and used by many active community members. Your help is very valuable to make it better for everyone.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    EBCS for Feature Selection

    Enhanced Binary Cuckoo Search with Frequent Values and RST (EBCS)

    This Filter Feature Selection approach (EBCS) with other tasks developed by PHP Programing language. Initial parameters for EBCS and FS-BCS as follows: Maximum number of iteration is 20. Population size is 20. Probability (P) is 0.25. Alpha is 0.1. After Downloading and copying the EBCS directory to directory root, and request the EBCS/index.php page to show home page which contains the following tasks: 1.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    fscaret_shiny

    UI for fscaret

    User Interface (ui) application which implements the automated feature selection provided by the 'fscaret' package of R-environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    OWL Machine Learning

    Machine learning algorithm using OWL

    Feature construction and selection are two key factors in the field of Machine Learning (ML). Usually, these are very time-consuming and complex tasks because the features have to be manually crafted. The features are aggregated, combined or split to create features from raw data. This project makes use of ontologies to automatically generate features for the ML algorithms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    lonestar

    A feature selection and classification algorithm based on L1 Norm SVM

    A feature selection and classification algorithm. It is based on L1 Norm Support Vector Machine with t-test and Recursive Feature Elimination.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    mlpy

    mlpy

    Machine Learning Python

    mlpy is a Python module for Machine Learning built on top of NumPy/SciPy and of GSL. mlpy provides high-level functions and classes allowing, with few lines of code, the design of rich workflows for classification, regression, clustering and feature selection. mlpy is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License version 3. mlpy is available both for Python >=2.6 and Python 3.X.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21
    RapidMiner Feature Selection Extension
    This RapidMiner-plugin consists of operators for feature selection and classification - mainly on high-dimensional (microarray-) data - and some helper-classes/operators.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Leark is a Data Mining library developed in C#.NET. It contains several methods for ranking web documents described with a set of normalized features, and a feature selection algorithm. The methods are based on perceptron and clustering.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo