A data-driven framework for modeling and optimization of industrial hydrocracking units

Rezaei, Arian (2025) A data-driven framework for modeling and optimization of industrial hydrocracking units. Masters thesis, Memorial University of Newfoundland.

[img] [English] PDF - Accepted Version
Available under License - The author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission.

Download (17MB)

Abstract

This thesis presents a comprehensive framework for modelling and multi-objective optimization of industrial hydrocracking and hydroprocessing units that e↵ectively addresses the challenges associated with complex process dynamics and data quality. The proposed framework combines advanced machine learning techniques, explainability methods and multi-objective optimization algorithms to improve the operational performance and efficiency of hydrocracking units. The framework begins with data preparation based on cycles and modes of operation. Data preprocessing involves addressing missing data using the iterative imputation method, followed by detection and removal of anomalies using the 3-! rule and Isolation Forest algorithm. An extensive comparative analysis was performed for product yield prediction using a diverse set of machine learning algorithms, including Decision Trees, Support Vector Regression, Random Forests, Deep Neural Networks, XGBoost, LightGBM and CatBoost. Through comprehensive performance assessment, CatBoost emerged as the most robust predictive algorithm, demonstrating exceptional potential in mitigating overfitting and significantly enhancing predictive accuracy across datasets. Subsequently, SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) analyses were employed to explain model predictions and identify key influencing input variables to manipulate for optimization. The multi-objective optimization is then performed using the Non-dominated Sorting Genetic Algorithm II (NSGA-II), exploring various scenarios to maximize desired products’ yields or volume swell during cracking. The framework is implemented and validated through two case studies, achieving higher prediction accuracy and providing optimization results that significantly enhance operational performance and profitability. This research highlights the potential of integrating machine learning and optimization techniques to advance industrial processes. Future work could explore using reinforcement learning and digital twins for real-time optimization and control.

Item Type: Thesis (Masters)
URI: http://research.library.mun.ca/id/eprint/16943
Item ID: 16943
Additional Information: Includes bibliographical references (pages 150-155)
Keywords: hydrocracking unit, data-driven modeling, multi-objective optimization, machine learning
Department(s): Engineering and Applied Science, Faculty of
Date: May 2025
Date Type: Submission
Library of Congress Subject Heading: Hydrocracking; Machine learning

Actions (login required)

View Item View Item

Downloads

Downloads per month over the past year

View more statistics