University of Illinois Chicago
Browse

Advancements in Active Learning: Strategies for Imbalanced Class Settings

Download (3.56 MB)
thesis
posted on 2024-05-01, 00:00 authored by Francesco Bolognesi
Active learning (AL) is a machine learning technique that selects the most informative samples from a large pool of unlabeled data for annotation, thus reducing the labeling cost and improving the learning performance. However, conventional AL approaches often neglect the intricate issue of class imbalance, where certain classes are either overrepresented or underrepresented in the dataset distribution. This disparity can introduce bias in sampling and compromise the overall generalization ability of the classifier. In this work, we introduce a novel threshold-based strategy for AL designed to navigate the challenges of class imbalance. This strategy dynamically adjusts to the degree of class imbalance, ensuring the selection of samples that are both informative and well-representative of minority classes. Our approach is rigorously tested on a variety of imbalanced datasets and benchmarked against state-of-the-art AL methods. Empirical results demonstrate that our proposed method significantly enhances classifier performance, especially in scenarios characterized by imbalanced class labels.

History

Advisor

hadis anahideh

Department

mechanical and industrial engineering department (MIE)

Degree Grantor

University of Illinois Chicago

Degree Level

  • Masters

Degree name

MS, Master of Science

Committee Member

Abolfazl Asudeh Federica Ciccullo Roberto Cigolini

Thesis type

application/pdf

Language

  • en

Usage metrics

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC