Author(s) | Collection number | Pages | Download abstract | Download full text |
---|---|---|---|---|
Чорняк В. О., Tymchenko O. V. | № 1 (70) | 60-71 |
![]() |
![]() |
The article explores modeling the probability of survival for Titanic passengers using machine learning and multicriteria decision analysis (MCDA) methods. The aim of the study is to identify key factors that influenced survival outcomes and to develop a predictive model with high classification accuracy.
The analysis is based on an open dataset from the Kaggle platform, which contains detailed information on passenger age, gender, ticket class, fare, family ties, and other relevant features. A complete data preprocessing cycle was carried out, including outlier detection, normalization, encoding of categorical variables, and splitting the dataset into training and testing sets.
The study applied a range of machine learning algorithms: logistic regression, decision trees, support vector machines (SVM), random forest, and gradient boosting. These models were evaluated based on standard metrics to determine their predictive performance. To interpret model outputs and assess feature importance, the LIME and TOPSIS methods were used, allowing for a more transparent evaluation of which variables most influenced survival.
The findings confirm the critical role of socio-demographic factors in survival likelihood, particularly gender, age, and ticket class. Women, children, and first-class passengers had notably higher chances of survival. These insights mirror historical realities and have practical implications for modern risk modeling, especially in high-stakes environments where quick and accurate decision-making is essential.
Integrating MCDA with machine learning enhances both interpretability and reliability of predictive models. The approach can be valuable in designing decision-support systems for transport safety, disaster response, and other areas where forecasting human outcomes is crucial.
Keywords: multicriteria analysis, machine learning, data analysis, forecasting, support vector method, random forest.
doi: 10.32403/1998-6912-2025-1-70-49-59