Logo-bi
Bioimpacts. 2025;15: 30340.
doi: 10.34172/bi.30340
  Abstract View: 79
  PDF Download: 9

Special issue | AI & Bioinformatics

Original Article

An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection

Jamshid Pirgazi 1* ORCID logo, Mohammad Mehdi Pourhashem Kallehbasti 1 ORCID logo, Ali Ghanbari Sorkhi 1 ORCID logo, Ali Kermani 1 ORCID logo

1 Department of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, Iran
*Corresponding Author: Jamshid Pirgazi, Email: j.pirgazi@mazust.ac.ir

Abstract

Introduction: High-dimensional datasets often contain an abundance of features, many of which are irrelevant to the subject of interest. This issue is compounded by the frequently low number of samples and imbalanced class samples. These factors can negatively impact the performance of classification algorithms, necessitating feature selection before classification. The primary objective of feature selection algorithms is to identify a minimal subset of features that enables accurate classification.
Methods: In this paper, we propose a two-stage hybrid method for the optimal selection of relevant features. In the first stage, a filter method is employed to assign weights to the features, facilitating the removal of redundant and irrelevant features and reducing the computational cost of classification algorithms. A subset of high-weight features is retained for further processing in the second stage. In this stage, an enhanced Harris Hawks Optimization algorithm and GRASP, augmented with crossover and mutation operators from genetic algorithms, are utilized based on the weights calculated in the first stage to identify the optimal feature set.
Results: Experimental results demonstrate that the proposed algorithm successfully identifies the optimal subset of features.
Conclusion: The two-stage hybrid method effectively selects the optimal subset of features, improving the performance of classification algorithms on high-dimensional datasets. This approach addresses the challenges posed by the abundance of features, low number of samples, and imbalanced class samples, demonstrating its potential for application in various fields.
First Name
Last Name
Email Address
Comments
Security code


Abstract View: 80

Your browser does not support the canvas element.


PDF Download: 9

Your browser does not support the canvas element.

Submitted: 11 Feb 2024
Revision: 17 Apr 2024
Accepted: 30 Apr 2024
ePublished: 01 Oct 2024
EndNote EndNote

(Enw Format - Win & Mac)

BibTeX BibTeX

(Bib Format - Win & Mac)

Bookends Bookends

(Ris Format - Mac only)

EasyBib EasyBib

(Ris Format - Win & Mac)

Medlars Medlars

(Txt Format - Win & Mac)

Mendeley Web Mendeley Web
Mendeley Mendeley

(Ris Format - Win & Mac)

Papers Papers

(Ris Format - Win & Mac)

ProCite ProCite

(Ris Format - Win & Mac)

Reference Manager Reference Manager

(Ris Format - Win only)

Refworks Refworks

(Refworks Format - Win & Mac)

Zotero Zotero

(Ris Format - Firefox Plugin)