ADVERTISEMENT

Home|Journals|Articles by Year|Audio Abstracts
 

Original Article

JJCIT. 2025; 11(2): 151-169


HAML-IRL: Overcoming the Imbalanced Record Linkage Problem Using Hybrid Active Machine Learning

Mourad Jabrane, Mouad JBEL, Imad HAFIDI, Yassir ROCHD.



Abstract
Download PDF Post

Traditional active machine learning (AML) methods employed in Record Linkage (RL) or Entity Resolution (ER) tasks often struggle with model stability, slow convergence, and handling imbalanced data. Our study introduces a novel hybrid Active Machine Learning approach to address RL, overcoming the challenges of limited labeled data and imbalanced classes. By combining and balancing informativeness, which selects record pairs to reduce model uncertainty, and representativeness, which ensures the chosen pairs reflect the overall dataset patterns, our hybrid approach, called Hybrid Active Machine Learning for Imbalanced Record Linkage (HAML-IRL), demonstrates significant advancements.HAML-IRL achieves an average 12% improvement in F1-scores across eleven real-world datasets, including structured, textual, and dirty data, when compared to state-of-the-art AML methods. Our approach also requires up to 60% - 85% fewer labeled samples dependening on the datasets, accelerates model convergence, and offers superior stability across iterations, making it a robust and efficient solution for real-world record linkage tasks.

Key words: Record Linkage, Entity Resolution, Active Machine Learning, Hybrid Query.







Bibliomed Article Statistics

46
34
18
10
15
22
11
R
E
A
D
S

108

25

27

19

21

37

28
D
O
W
N
L
O
A
D
S
06070809101112
2025

Full-text options


Share this Article


Online Article Submission
• ejmanager.com




ejPort - eJManager.com
Author Tools
About BiblioMed
License Information
Terms & Conditions
Privacy Policy
Contact Us

The articles in Bibliomed are open access articles licensed under Creative Commons Attribution 4.0 International License (CC BY), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.