Improving the performance of machine learning algorithms using conceptual models: a case study of auto insurance

Moosavi, Maedeh (2025) Improving the performance of machine learning algorithms using conceptual models: a case study of auto insurance. Masters thesis, Memorial University of Newfoundland.

[img] [English] PDF - Accepted Version
Available under License - The author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission.

Download (1MB)

Abstract

The integration of domain knowledge into machine learning models has been proposed as a means to address the limitations of purely data-driven approaches. Traditional machine learning techniques often rely on pre-defined, fixed data structures, which can overlook valuable context-specific insights that domain knowledge provides. This study investigates the impact of incorporating domain knowledge into the preprocessing and feature engineering stages of machine learning models, specifically focusing on decision tree algorithms and Support Vector Machines (SVM) within the insurance sector. To evaluate the effectiveness of this integration, this study compares the performance of models trained on a pre-defined dataset (A) with models trained on the same dataset after it was enhanced with domain-specific knowledge (Revised A). The results demonstrate that the integration of domain-specific guidelines into the feature engineering process significantly improved the accuracy and reliability of the predictive models, particularly in complex scenarios such as predicting customer profitability. In scenarios where domain knowledge played a crucial role in refining features that capture relationships within the insurance data, the enhanced models outperformed the original ones. Conversely, for tasks where the domain knowledge had less influence, the performance improvement was marginal. These findings suggest that integrating domain knowledge into machine learning processes can provide a meaningful boost in model effectiveness, but the benefits are context-dependent.

Item Type: Thesis (Masters)
URI: http://research.library.mun.ca/id/eprint/16883
Item ID: 16883
Additional Information: Includes bibliographical references (pages 54-60)
Keywords: machine learning, domain knowledge, feature engineering, decision trees, SVM, insurance industry, predictive modeling
Department(s): Business Administration, Faculty of
Date: February 2025
Date Type: Submission
Library of Congress Subject Heading: Machine learning; Industrial life insurance--Data processing; Decision trees; Computational intelligence; Expert systems (Computer science)

Actions (login required)

View Item View Item

Downloads

Downloads per month over the past year

View more statistics