Skip to main content

Examining the empathy levels of medical students using CHAID analysis

Abstract

Background

Empathy is a key factor in the medical field as it strengthens doctor-patient relationships, enhances communication, and leads to improved patient outcomes. This study aims to investigate the empathy levels of medical students, providing insights into the factors that influence these levels and using advanced analytical methods for accurate predictions.

Methods

The study was conducted with 322 medical students from a public university in Turkey. A relational screening model was applied, using a “Personal Information Form” and an “Empathy Scale” to gather data. CHAID analysis was employed to identify the key predictors influencing empathy levels, whereas machine learning algorithms were utilized to classify and predict individuals' empathy levels.

Results

The analysis revealed that 41.3% of students displayed high empathy, 44.7% moderate empathy, and 14.0% low empathy. Factors such as parental education, maternal occupation, and gender were significant in determining empathy levels, with gender being the most influential. The machine learning models achieved an 80.1% accuracy in predicting empathy levels.

Conclusions

The findings indicate that targeted educational and social interventions, especially those addressing gender differences, could improve empathy in medical students, potentially leading to better patient care.

Trial registration

Not applicable, as this study does not report results from a health care intervention involving human participants.

Peer Review reports

Introduction

Empathy is a complex and multifaceted psychological process that plays a critical role in understanding others and forming social bonds. It is a multidimensional concept defined as the ability to feel what another person is feeling, to imagine oneself in another’s situation, or to envision oneself as the other person in that context [1]. This skill not only strengthens emotional connections but also significantly impacts various domains such as social cohesion, communication, and cooperation. Empathy allows individuals to better understand one another, resolve conflicts, develop a sense of social belonging, and enables them to coordinate their activities effectively in interpersonal interactions [2].

Empathy has been extensively studied in various medical disciplines such as psychology and neuroscience and is recognized as a fundamental component of human relationships and social interactions. Particularly in medical education, empathy is an indispensable element, playing a vital role in communication between physicians and their patients, as it fosters understanding and trust. For this reason, medical students require not only clinical knowledge and skills but also social and emotional competencies such as empathy to enhance interpersonal connections [3, 4]. Empathy, which plays an important role in the educational process of medical students, is shaped by various factors. A review of the literature reveals that different studies have addressed the factors influencing empathy. Capdevilla-Gaudens et al. [5] stated that medical students’ empathy scores are at the higher levels of the scale, and female have a higher percentage within the highest scoring group. Furthermore, Shi and Du [6] found that female students have higher levels of empathic concern compared to male students. This finding highlights the impact of gender on empathy, suggesting that social and cultural factors may also shape these relationships. Dyrbye et al. [7] expressed that positive student-faculty interactions are associated with higher empathy scores, and older female students generally score higher in empathy compared to younger male students. In another study, Silva and Junior [8] found that empathy levels were influenced by factors such as students’ year of study and their parents’ education level. These findings demonstrate that the factors shaping empathy are multidimensional. These factors provide us with insight into the developmental process of medical students’ empathetic skills.

Empathy strengthens trust in doctor-patient relationships, promotes treatment adherence, and enhances patient satisfaction [9]. Additionally, empathetic communication is regarded as an ethical imperative in patient care and positively shapes physicians’ professional identity [10]. Numerous studies in the literature support the idea that empathic doctor-patient communication positively influences clinical outcomes [11,12,13,14]. For example, a meta-analysis conducted by Roter et al. [12] found that empathic communication skills not only improved psychological treatment processes but also enhanced the effectiveness of pharmacological treatments. Additionally, Hojat et al. [14] demonstrated that physicians with higher levels of empathy are less prone to errors, directly influencing patient safety. Several studies have associated empathic doctor-patient communication with better outcomes in both psychological and pharmacological treatments [15, 16]. However, recent observations suggest an increasing number of physicians in medical schools struggling to establish empathic relationships with their patients, having difficulty understanding them, or even failing to make efforts in this regard [17]. One of the main reasons for this is the demanding medical curricula and the emotional burdens medical students face [5]. Some studies have emphasized that empathy levels decline over time, becoming more pronounced during clinical training years [18, 19]. This situation highlights the necessity of educational strategies aimed at increasing empathy during the clinical years. Examining general empathy levels in medical students will provide a deeper understanding of the emotional competencies that influence doctor-patient interactions and overall clinical outcomes, making a significant contribution to the existing literature.

Another limitation in the current research is the predominant focus on medical empathy, often measured through scales specifically designed for clinical contexts. While these studies provide valuable insights into the application of empathy in healthcare, they tend to overlook the evaluation of general empathy, which serves as the basis for developing medical empathy. Clinical empathy, a concept specifically measured by scales applied in doctor-patient relationships and clinical contexts, encompasses emotional and cognitive processes that form the foundation of this relationship [20, 21]. In this context, measuring general empathy levels is crucial for a comprehensive assessment of medical students’ empathy skills. In this study, the Toronto Empathy Questionnaire (TEQ), which has proven validity and is reliable in capturing a multidimensional structure, was used to measure participants’ general empathy levels. TEQ, by addressing empathy with both its emotional and cognitive components, facilitates understanding the key elements that enhance clinical empathy [21]. Thus, an attempt has been made to provide a more comprehensive view of the emotional and cognitive skills that shape empathic behaviors in medical practice. Despite its importance, there remains a lack of research employing general empathy scales to assess the empathy levels of medical students. Addressing this gap is essential for enhancing our understanding of empathy as a multidimensional construct and for developing more holistic strategies to cultivate this vital skill in medical education.

Medical students are expected to possess high levels of empathy, as this quality is crucial for doctors who will solve health problems and provide support to patients. Measuring and evaluating the empathy levels of medical students is important for supporting the development of these skills. Examining the empathy levels of future doctors using predictive models can help identify the variables affecting empathy and understand their significance.

The aim of this study is to determine the empathy levels of medical students, reveal the relationships between the variables influencing empathy, and develop models that can predict empathy levels using machine learning techniques. While various statistical methods have been employed to explore empathy in medical education, this study is unique in applying CHAID analysis to uncover interaction patterns and subgroup structures that are often missed by traditional analyses. To achieve this goal, the current empathy levels were first evaluated, and then the relationships between the variables influencing empathy were revealed using CHAID (Chi-squared Automatic Interaction Detection) analysis. A literature review indicates that there are no studies examining the empathy levels of medical students using CHAID analysis. CHAID is a decision tree algorithm based on chi-squared tests that identifies statistically significant relationships between variables [22]. By using CHAID, rule sets can be developed to classify outcomes, and groups can be formed based on predictive variables [23]. This allows for a better understanding of the factors influencing empathy levels. Additionally, machine learning techniques have been used in this study to develop models that can predict empathy levels. Measuring and assessing the empathy levels of future physicians with machine learning algorithms can provide valuable insights for improving educational processes. Therefore, this study aims to conduct an analysis that highlights the importance of educational and social strategies that promote empathy. By evaluating empathy skills in a multifaceted way, the study aims to contribute to the development of this critical competency. The findings will help make effective and efficient planning in medical education and contribute to filling an important gap in the literature.

Empathy

Rogers described empathy as the process of entering another person’s perceptual world, understanding that person’s inner structure along with its emotional components, and engaging in the thought process of “if I were that person” [24, 25]. Feshbach [26] interpreted empathy as an activity that involves the ability to recognize another individual’s perspective and, at the same time, share the emotional response that person is experiencing. Wispé [27] defined empathy as “the effort to understand another self’s positive and negative experiences in a non-judgmental way by someone who knows themselves”. Kohut [28] in his early writings on empathy, defined empathy as “vicarious introspection” and considered it one of the most important tools for exploring another person’s inner world.

The ability to feel another person’s emotions, thoughts, and experiences as if they were our own can be defined as empathy. Empathy is a fundamental component of an effective therapeutic relationship [29]. Empathy for medical students is not only a professional competency but also a fundamental part of effective patient care and ethical decision-making processes [30]. Defined as giving an emotional response that is appropriate to another’s emotional state and independent of one’s own situation, empathy is a complex, multidimensional concept with moral, cognitive, emotional, and behavioral components [31, 32]. Today, the most widely accepted approach posits that empathy has two sub-dimensions: cognitive and emotional [33, 34]. Cognitive empathy is the ability to understand and predict others’ behaviors, particularly their epistemic mental states such as beliefs, knowledge, pretense, and expectations [35]. According to Hoffman [32], emotional empathy is an emotional response that is more appropriate to someone else’s situation than one’s own. Emotional empathy refers to an emotional response in one person that arises from and parallels the emotional state of another [35]. The development of emotional empathy in medical students is important for establishing a strong emotional connection with patients and ensuring effective communication, especially in sensitive situations [36]. The cognitive aspect of empathy involves the ability to effectively grasp distressing situations, recognize another person’s emotions, and adopt that person’s perspective. The emotional aspect of empathy requires an individual to experience an indirect emotional response to the emotions expressed by others [37]. The cognitive and affective dimensions of empathy complement each other and are said to be in constant interaction. The simultaneous development of these two dimensions in medical students ensures the integration of both technical skills and the human aspect in patient care [38].

Empathy is a complex psychological process influenced by both biological predispositions and environmental factors. Our genetic makeup, brain structure, and neurological functions provide the biological foundation that affects our capacity to understand and share others’ emotions. However, environmental factors such as family environment, education, social interactions, and life experiences shape this biological foundation, significantly impacting our empathy skills. These environmental factors encompass all experiences, social interactions, and learning processes to which an individual is exposed from birth. They play a critical role in the development of empathy and continue to influence an individual’s level of empathy throughout their life.

Empathy in medical domain

Empathy is considered one of the cornerstones of the medical profession and is one of the most critical factors in determining the quality of the relationship between a physician and a patient. This dynamic process reflects a physician’s effort to deeply understand the patient’s perspective, concerns, and experiences. Empathy in the medical context involves not only being sensitive to the patient’s current emotions but also effectively communicating this understanding back to the patient [39]. Medically, empathy is understood as the non-judgmental recognition of a patient’s inner world as a separate individual [20]. According to Jackson [40], empathy is a crucial element in healing. A physician with empathy is better able to understand their patient’s problems and is more effective in addressing those issues [17].

In the field of healthcare, empathy has been examined under various terms, including clinical empathy and medical empathy. Medical empathy is defined as the ability to understand the patient’s perspective and emotions, convey this understanding to the patient, and act therapeutically based on this understanding [31]. Physician empathy is characterized as a multidimensional construct encompassing cognitive (the ability to understand and reflect another’s perspective), emotional (the subjective ability to perceive another’s inner experiences and natural emotions), and behavioral (the ability to effectively communicate this understanding) components [41]. The cognitive aspect of physician empathy involves accurately understanding the patient’s mental state (the ability to take another’s perspective) and effectively communicating this perspective to patients. The emotional aspect of physician empathy is defined as the physician’s ability to respond to and ameliorate the patient’s emotional state [11].

Recent research in medical education increasingly emphasizes the importance of empathy skills in clinical practice and examines various intervention programs aimed at developing this skill among medical students. Targeted programs such as communication skills training [42], narrative-based medical practices [43], and role-playing or simulation-based activities [44] have been shown to be effective in enhancing students’ levels of empathy. In addition, structured practices such as Balint group discussions, narrative workshops, and mindfulness-based training have been found to improve emotional awareness and strengthen empathic communication [19, 45]. Practices such as reflective writing and patient shadowing, which aim to foster perspective-taking and compassion, also contribute to this process [44, 46]. These findings support the growing pedagogical trend in medical curricula that approaches empathy as both a teachable and measurable competence.

Empathy plays a critical role in the communication between a physician and their patient. Studies have shown that empathy strengthens the communication between patient and physician [47, 48]. Empathetic communication supports the patient and creates a perception in the patient regarding the physician’s clinical competence. When a patient feels that their physician understands their emotions, thoughts, and concerns, they are more likely to trust their physician and express themselves more comfortably [49]. In empathy between a patient and physician, it is not enough for the physician to understand the patient; the physician must also have the ability to reflect this understanding back to the patient. Because even if physicians accurately understand their patients’ perspectives and emotions, they may not be perceived as sufficiently empathetic by their patients if they fail to convey this understanding [50].

Purpose of the study and research questions

The purpose of this study is to analyze the genel empathy levels of medical students using CHAID analysis and to develop a predictive model utilizing machine learning algorithms. The Toronto Empathy Questionnaire is used as the data collection tool to assess these empathy levels. Accordingly, the research seeks answers to the following sub-problems:

  1. 1.

    What are the results of the two-step cluster analysis of the empathy scale scores of medical students?

  2. 2.

    What is the order of importance of the predictor variables that have a significant effect on the empathy levels of medical students?

  3. 3.

    How is the decision tree obtained through CHAID analysis for the empathy levels of medical students?

  4. 4.

    What are the rule sets derived from CHAID analysis regarding the empathy levels of medical students?

  5. 5.

    What are the results related to the prediction of medical students’ empathy levels using machine learning methods?

Methodology

Research design

A correlational survey model is adopted in this study. In a correlational survey model, the relationship presented indicates that a portion of the change observed in one variable may be attributed to another variable [51]. Correlational studies are employed to identify relationships between two or more variables and to determine the effects of these relationships on causality [52].

Participants

The participants of this study consist of 322 medical students studying at a public university in Turkiye during the fall semester of the 2023–2024 academic year, who volunteered to participate in the study. The study employed a convenience sampling method, and data were collected through an online survey. General information about the participants is provided in Table 1.

Table 1 General information about the participants of the study

The sample size used in this study is considered sufficient to ensure the reliability of the analyses. The literature suggests that a sample size greater than 200 is generally sufficient to ensure adequate statistical power for data analysis [53, 54]. Additionally, a sample of 300 cases has also been suggested [55].

In decision tree models, the adequacy of the sample size is evaluated based on factors such as the number of variables, the number of terminal nodes, and the depth of the tree. These factors influence the complexity of the model and the statistical power required to detect meaningful relationships. For robust decision trees, it is essential that each terminal node contains enough observations to avoid overfitting and ensure the model’s generalizability [56]. In current study, a decision tree with a depth of 4 was created using 5 variables, resulting in a total of 9 terminal nodes. Each terminal node contains a sufficient number of observations, averaging 35, which ensures reliable interpretation and eliminates the risk of overfitting.

Data collection tool

The Turkish version of the Toronto Empathy Questionnaire (TEQ) was used to measure the general empathy levels of the participants. Originally developed by Spreng et al. [21], the TEQ was adapted and tested for reliability and validity by Totan et al. [57] on Turkish university students. The reliability of the scale was examined using test-retest and Cronbach’s internal consistency methods. For the test-retest study, a correlation of 0.73 was found between the two applications. The Cronbach’s internal consistency coefficient of the questionnaire was 0.79.

The Turkish TEQ consists of 13 questions that test empathy. The survey questions are scored between 1 and 5 (1: not at all appropriate, 2: not appropriate, 3: somewhat appropriate, 4: appropriate, and 5: completely appropriate). The items numbered 1, 3, 5, 7, 8, 9, 11, and 12 in the scale are reverse-scored. The total score ranges from 13 to 65, with higher scores indicating higher levels of empathy.

The primary aim of the study is to assess the empathy levels of medical students in a general sense; therefore, a more general empathy scale has been used instead of one specific to medical empathy. This approach offers the opportunity to address empathy not only in a medical context but also in a broader human and social context. Furthermore, the TEQ is widely accepted in the literature for its measurement validity and reliability. Some items from the scale are as follows:

  • Other people’s misfortunes do not disturb me a great deal.

  • I can tell when others are sad even when they do not say anything.

  • I do not feel sympathy for people who cause their own serious illnesses.

  • I am not really interested in how other people feel.

Data analysis

The data obtained from medical students were first subjected to a two-step cluster analysis to divide the dependent variable into homogeneous subgroups. Afterward, CHAID analysis was performed to identify the predictor variables and their order of importance.

The key feature of decision trees is their method of recursively dividing a target variable based on the values of input variables or predictors. This process creates partitions and resulting subsets of data, known as leaves or nodes. These subsets become more homogenous in terms of the target variable within each leaf or node, while being increasingly different from each other across different leaves or nodes at each level of the tree [58]. In decision trees, classification is performed using the feature values of the samples. Each node in the decision tree represents a feature of the classified sample [59].

The reason for choosing the CHAID decision tree algorithm in this study is to take advantage of its benefits, which allow for the quick and clear understanding of complex relationships and interactions. CHAID comprehensively investigates the independent variables that show the most differentiation in the dependent variable. It uses a systematic algorithm to identify the strongest relationship among these variables [60, 61]. CHAID analysis stands out as a superior method compared to traditional regression models in identifying interactions between variables and classifying groups. Regression analyses are typically used to understand linear relationships and main effects; however, they may be limited in capturing complex interactions and multiple group differences [62, 63]. CHAID analysis is particularly powerful in identifying interactions among multiple variables and demonstrating how these interactions form distinct groups.

When using the CHAID algorithm, the predicted (dependent) variable, empathy levels, was utilized as a three-class categorical variable: low, medium, and high. As independent variables, the predictive factors with the greatest impact on students’ empathy levels, namely gender, mothers level of education, fathers level of education, and mother’s occupation.

Machine learning algorithms used in classification

There are existing studies that utilize machine learning in analyzing the education of medical students, particularly in predicting various behavioral and cognitive outcomes [64,65,66]. A variety of machine learning algorithms were utilized in this study, including random forest, AdaBoost, decision tree, and Multi-Layer Perceptron (MLP), to predict the empathy levels of medical students. These algorithms were chosen based on their distinct advantages in handling different types of data patterns and predictive tasks. Random forest and decision tree were included due to their interpretability and ability to capture non-linear relationships. Decision tree was selected for its simplicity and high interpretability, allowing clear insights into the structure of the predictive model [58]. Random forest, on the other hand, was included to enhance predictive accuracy and reduce overfitting by aggregating the results of multiple decision trees trained on different data subsets [67]. AdaBoost was included because of its robustness in improving the accuracy of weak learners through boosting and its proven performance in similar prediction tasks. Compared to other ensemble techniques such as Gradient Boosting or XGBoost, AdaBoost was preferred due to its computational efficiency and simpler hyperparameter tuning in the context of the dataset used [68]. MLP, as a neural network-based model, was incorporated to capture complex, non-linear patterns that traditional tree-based methods might overlook. As the fundamental form of feedforward neural networks, it offers a balance between modeling capacity and relatively low computational complexity, making it suitable for structured data [69]. By employing a diverse set of algorithms, the study aimed to balance interpretability, robustness, and predictive accuracy in assessing empathy levels.

Metrics used in the study

Precision, recall, F1-score, and accuracy metrics are employed in this study. These metrics are calculated using the counts of TP (True Positive), representing correctly predicted positive classes, TN (True Negative), representing correctly predicted negative classes, FP (False Positive), representing incorrectly predicted positive classes, and FN (False Negative), representing incorrectly predicted negative classes.

Softwares used in the study

SPSS Modeler software is used in the decision tree modeling phase of this study. Python (v3.10) was used for the implementation of machine learning algorithms. The machine learning concepts were realized using the Scikit-learn library (v1.2.2). Correlation matrices are plotted using the Seaborn library (v0.12.2).

Findings

In this section, the findings obtained from the analysis of the research data are presented.

Results of the two-step cluster analysis of the empathy scale scores of medical students

The results of the cluster quality analysis for the two-step clustering of the average scores on the empathy scale are presented in Fig. 1 and Table 2.

Fig. 1
figure 1

Cluster quality

Table 2 Results of the two-step clustering analysis for the empathy scale of medical students

The clustering performed in this study indicates that the cluster structures have strong evidence and fall into the “good” category (Average Silhouette = 0.7).

Examining Table 2, it is observed that the average scores for medical students are as follows: 3.31 for 45 students (14%) in the first cluster (low level), 4.05 for 144 students (44.7%) in the second cluster (medium level), and 4.63 for 133 students (41.3%) in the third cluster (high level). Consequently, a three-category dependent variable has been established. These results indicate that a significant majority, approximately 86%, of the medical students exhibit empathy levels at or above the medium level.

Importance ranking of predictor variables affecting medical students’ empathy levels

Predictor importance refers to the process of determining which variables have the most significant impact on the model’s predictions. Variables with high importance provide the most relevant information for predicting the target outcome, helping the model make more accurate and meaningful predictions. On the other hand, variables with low importance can unnecessarily increase the model’s complexity and potentially lead to statistical errors. An initial CHAID analysis was performed to assess predictor importance. Mother’s level of education was found to be the variable with most impact on students’ empathy levels. Mother’s level of education, gender, father’s level of education, and mother’s occupation were selected for conducting the actual CHAID analysis. By identifying and focusing on the most influential predictors, the model’s clarity and accuracy were improved. The importance ranking of predictor variables that significantly affect the empathy levels of medical students is shown in Fig. 2.

Fig. 2
figure 2

The ranking of predictor variables based on their significant impact on medical students’ empathy levels

Decision tree obtained from CHAID analysis for medical students’ empathy levels

The decision tree algorithm for students’ empathy levels is presented in Fig. 3.

Fig. 3
figure 3

Decision tree obtained from CHAID analysis of students’ empathy levels. *Education levels were recoded as 1: Illiterate, 2: Literate, 3: Elementary School Graduate, 4: Middle School Graduate, 5: High School Graduate, 6: University Graduate. *Occupation categories were recoded as 1: Civil Servant, 2: Worker, 3: Self-Employed, 4: Housewife, 5: Other

In this study, CHAID analysis was applied to examine the empathy levels of medical students. The root node of the decision tree represents the entire student sample and provides a general distribution of empathy levels. In this node, it is observed that 41.30% of the students have high, 44.72% have medium, and 13.98% have low empathy levels.

The variable that best explains students’ empathy levels is gender. A significant majority, 58.11%, of students identifying as female have high empathy levels. The variable that best explains the cluster of students identifying as female is mother’s occupation. For students whose mother’s occupation is classified as civil servant, worker, housewife, or other, a majority of 60.14% have been found to have high empathy levels. For students whose mother’s occupation is classified as self-employed, a majority of 80.00% have been found to have low empathy levels.

For students identifying as female and whose mother’s occupation is civil servant, worker, housewife, or other, the variable mother’s level of education best explains the empathy levels. Among students with mother’s level of education classified as illiterate, literate, middle school graduate, or high school graduate, 73.75% have high empathy levels. Students whose mother’s level of education is elementary school graduate or university graduate show a 50.79% rate of medium empathy levels.

Among students identifying as male, 52.87% have been found to have medium empathy levels. The variable that best explains the cluster of students identifying as male is father’s level of education. For students whose father’s level of education is classified as literate or high school graduate, 40.91% have medium empathy levels. Students whose father’s level of education is classified as elementary school graduate or middle school graduate have been found to have high empathy levels, with 44.44% in this category. For students whose father’s level of education is classified as university graduate, 69.74% have been found to have medium empathy levels.

The variable that best explains the cluster of students identifying as male and whose father’s level of education is classified as literate or high school graduate is mother’s level of education. Under this cluster, 48.57% of students whose mother’s level of education is classified as illiterate, elementary school graduate, middle school graduate, or high school graduate have been found to have low empathy levels, while all students whose mother’s level of education is classified as literate or university graduate have been found to have medium empathy levels.

The variable that best explains the cluster of students identifying as male with a father’s level of education of university graduate is mother’s occupation. Under this cluster, among students whose mother’s occupation is classified as civil servant, worker, self-employed, or housewife, 73.24% have been found to have medium empathy levels, while 80.00% of those whose mother’s occupation is classified as other have been found to have low empathy levels.

The variable that best explains the cluster of students identifying as male with a father’s level of education of university graduate and mother’s occupation as civil servant, worker, self-employed, or housewife is mother’s level of education. Under this cluster, among students whose mother’s level of education is classified as illiterate, middle school graduate, or high school graduate, 50.00% have been found to have medium empathy levels, while 85.11% of those whose mother’s level of education is classified as literate, elementary school graduate, or university graduate have also been found to have medium empathy levels.

Rule sets from CHAID analysis on medical students’ empathy levels

The rule sets presented in Fig. 4, derived from the CHAID analysis, highlight the factors influencing medical students’ empathy levels. These rules outline the probabilities of students exhibiting specific empathy levels under various conditions.

Fig. 4
figure 4

Rule sets obtained through CHAID analysis. *Education levels were recoded as 1: Illiterate, 2: Literate, 3: Elementary School Graduate, 4: Middle School Graduate, 5: High School Graduate, 6: University Graduate. *Occupation categories were recoded as 1: Civil Servant, 2: Worker, 3: Self-Employed, 4: Housewife, 5: Other

The rule sets can be interpreted as follows: Female students are more likely to have higher empathy levels compared to male students (RS1). Female students whose mothers are employed as civil servants, workers, housewives, or in other roles are more likely to exhibit higher empathy levels (RS2). On the other hand, female students whose mothers are self-employed have a higher probability of having lower empathy levels (RS3). Male students generally have a medium level of empathy (RS4). Male students with fathers who are literate or high school graduates tend to have medium empathy levels (RS5). However, male students with fathers who have completed elementary or middle school are more likely to have high empathy levels (RS6). Finally, male students whose fathers are university graduates generally have medium empathy levels (RS7).

Prediction results of medical students’ empathy levels using machine learning methods

One of the aims of the study is to predict medical students’ empathy levels using machine learning methods. Within this scope, classification algorithms including random forest, decision tree, AdaBoost and MLP were employed. The following parameters are used as input for the classification algorithms: gender, age, grade, marital status, number of siblings, graduation, GPA, daily internet usage time, primary purpose of internet use, mother’s level of education, father’s level of education, mother’s occupation, father’s occupation, family income status, living with elderly, and residence. The target variable is defined as empathy levels of medical students.

Experiments were conducted using the fivefold cross-validation method. In this approach, the dataset was randomly divided into five equal subsets. In each iteration, 80% of the data were used for training and 20% for testing. This procedure was repeated five times, with each subset serving as the test set once. This method helps prevent overfitting by ensuring that all observations are used for both training and testing across different iterations, thereby providing a more reliable estimate of the model’s generalization performance. The findings related to the prediction of computational thinking skills using machine learning methods are presented in Table 3. Accuracy was the primary metric for evaluating model performances, and the results include the accuracy of each fold, along with the mean accuracy and standard deviations across folds. Furthermore, to aid in the proper interpretation of classification performance, average precision, recall, and F1 score metrics of all folds are also included.

Table 3 Performance evaluation of machine learning algorithms

It is found that the random forest algorithm to have the highest performance, achieving a mean accuracy of 0.801. Following this, decision tree, AdaBoost, and MLP algorithms achieved mean accuracies of 0.778, 0.770, and 0.739, respectively. The parameters used for different machine learning algorithms employed in the study are provided in Table 4.

Table 4 Parameter configurations for used machine learning models

Discussion

Empathy, at the core of doctor-patient communication, is not only a social skill but also a fundamental component of providing effective and efficient healthcare. Numerous studies have shown that empathy is directly related to positive patient outcomes. This not only enhances the quality of healthcare services but also supports patient satisfaction and treatment adherence. As demonstrated by Hojat et al. [14], an empathetic doctor-patient relationship offers several advantages, including increased mutual trust and the development of more accurate diagnoses and treatment plans. This can result in significant cost savings for both the patient’s well-being and the healthcare system. Findings by Nightingale [70] also support this view, showing that empathetic doctors tend to avoid unnecessary tests and procedures, thereby reducing costs and performing fewer invasive interventions.

In this study, 41.30% of medical students were evaluated as having high, 44.72% as having medium, and 13.978% as having low levels of empathy. According to these results, 86.02% of medical students demonstrate a moderate or high level of empathy. This finding suggests that medical education generally has a positive impact on fostering empathy. However, considering the proportion of students with low empathy levels, developing targeted training programs to support these individuals would be beneficial. Empathy is an indispensable part of medical education because it is necessary for doctors to understand their patients’ emotional worlds and communicate effectively with them. Similar findings have been reported by Ardenhgi et al. [71], who suggested that medical students’ empathy levels could be rated as medium to high. Another study found that the empathy level score for resident doctors was 73.1 ± 8.9 [17]. In another study by Teke et al. [72], the average empathy level score for doctors was found to be 79.37. However, Demir Karabulut et al. [73] found that students at a foundation university’s medical school had empathy scores below the medium level. Another study found that medical students had low affective and cognitive empathy scores [74]. These differences may arise because empathy’s affective, cognitive, and behavioral dimensions interact in complex ways, and empathy levels are influenced by numerous variables such as age, gender, and cultural background. Additionally, different research methods (survey, observation, experimental study, etc.) and different empathy measurement tools may have been used, leading to varying results.

In this study, the factors affecting the empathy levels of medical students were examined using CHAID analysis. CHAID analysis is an effective decision tree algorithm used to identify statistically significant relationships and classify data [22]. By examining the interactions between dependent and independent variables, the data was divided into meaningful subgroups. As a result of the analysis, factors influencing the students’ empathy levels were identified. The findings suggest that CHAID analysis is a valuable tool in creating strategies for developing empathy skills. In the decision tree generated by the study, each node specifies the number and percentage of students classified into high, medium, and low empathy levels based on the relevant variables. These numerical values are crucial for understanding which variables have a more dominant impact on empathy levels in the decision tree and how these variables explain the differences in empathy among student groups. This analysis provides a detailed insight into the factors determining medical students’ empathy levels.

The study found that female students have higher empathy levels compared to male students. This finding aligns with many studies in the literature. Maximiano-Barreto et al. [75] showed that female gender, being married, having siblings, and having children are associated with higher empathy levels. Numerous studies indicate that gender differences favor women concerning attitudes that value empathy and humanitarian qualities [76, 77]. Regarding individual factors affecting empathy development in health professions, findings from many studies show that women are generally more empathetic than men and score higher on empathy measures [78].

These findings support the general belief that women are typically more sensitive to emotional cues and better at understanding others’ emotional states [30, 79]. This can be explained by a combination of factors such as biological differences, social learning experiences, and cultural expectations. For example, women’s typically greater involvement in caregiving roles and the encouragement of empathy in family life may contribute to the development of their empathy skills. Additionally, societal expectations regarding gender roles may also shape women’s empathy abilities. However, this is a generalization and does not imply that all women and men behave in the same way. Indeed, a different study found no statistically significant difference in empathy levels between male and female resident doctors [17]. An Italian study on empathy levels among doctors also found no significant difference based on gender [80]. These differing findings indicate that empathy is influenced by many variables and that gender is not the sole determining factor. Variables such as age, culture, education level, and familial factors also impact empathy levels. Individual differences, personality traits, and life experiences are also significant factors affecting empathy levels.

It is important to consider that the mechanisms of gender differences in empathy are not solely social. Studies have shown that gender differences play a significant role in the neural mechanisms underlying emotional processes [81]. Another study shows that in women, the interaction between verbal working memory and negative emotion is associated with relative hyperactivation in emotion-related areas, whereas in men, regions associated with cognitive control and cognition are activated [82]. These neurological differences may explain why women are better at recognizing and responding to others'emotional states.

Additionally, although men are neurologically less sensitive to emotional cues, they can develop cognitive empathy, such as understanding others'perspectives; this involves understanding others'emotions without experiencing emotional empathy [83]. Research also indicates that societal norms may limit men's expression of emotional empathy, reinforcing the differences in empathetic responses between genders [84]. Therefore, both social learning experiences and neurological tendencies interact as factors shaping the empathy differences between genders.

Empathy is shaped not only by genetic predispositions but also by environmental factors such as family environment and education [37]. Early childhood family bonds significantly shape individuals’ social relationships later in life [85]. This study confirms this situation. It is found that the education level of mothers, the profession of mothers, and the fathers’ education levels are important factors determining children’s empathy levels. This finding indicates that parents’ education levels and professional roles affect empathy development in their children through the social learning opportunities and modeling they provide. This finding is consistent with previous research showing that parental education levels can affect children’s cognitive and emotional development [86,87,88].

This study shows that the occupation of mothers affects students’ empathy levels. Daughters of mothers working as civil servants, workers, housewives, or in other professions have higher empathy levels. In contrast, daughters of self-employed mothers have lower empathy levels. The higher empathy levels of daughters whose mothers are civil servants, workers, or housewives may be interpreted as these professions providing more social interaction and requiring more empathy. Conversely, the lower empathy levels of children of self-employed mothers may be attributed to the more individualistic and competitive nature of this profession, which may limit time spent with children. Individuals’ relationships and experiences with their parents can influence the development of their empathy skills.

According to the study results, a complex relationship was observed between the education level of fathers and empathy levels in male children. The fact that children with fathers who are elementary or middle school graduates exhibit higher levels of empathy can be interpreted as these fathers having warmer and more supportive relationships with their children. However, the observation that children with university-educated fathers have medium levels of empathy suggests that a higher level of education is not always associated with higher empathy. This indicates that, in addition to education level, other factors such as fathers’ parenting styles, professional lives, and personality traits may also play a significant role in the development of empathy.

The scale used in the study focuses on the affective dimension of empathy. However, the findings of this study appear to align with those of previous studies that also assess cognitive empathy. Despite the tool used in this study focusing solely on affective empathy, the alignment of the findings with prior research incorporating cognitive empathy can be explained by the potential relationship between affective and cognitive empathy. The literature suggests that these two dimensions of empathy may function in a complementary or interconnected manner. For instance, Decety and Jackson [89] propose that while affective and cognitive empathy rely on different processes, the empathy mechanism may integrate these two components. Additionally, Shamay-Tsoory [90] highlights that affective empathy might facilitate cognitive empathy. In this context, the findings of this study, which focuses on measuring affective empathy, may indirectly reflect outcomes related to cognitive empathy. Furthermore, it has been emphasized in the literature that different instruments tend to measure overlapping aspects of the empathy construct [91]. Therefore, the consistency of the findings with the literature might stem from the interrelation of the different dimensions of empathy.

The algorithms used in this study achieved prediction accuracies ranging from 73.9% to 80.1%, indicating a reliable level of performance. The highest accuracy was obtained with the Random Forest algorithm (accuracy = 80.1%; precision = 0.808, recall = 0.656, f1-score = 0.724). This can be attributed to the nature of the dataset, which primarily consists of categorical and ordinal independent variables (e.g., gender, marital status, number of siblings, graduation status, parental education and occupation, family income status, and primary purpose of internet use). Decision tree-based models, such as random forest and decision dree, are particularly well-suited for handling such variables as they create rule-based splits that effectively capture categorical distinctions. Furthermore, random forest’s ensemble learning approach enhances model robustness by reducing overfitting through the aggregation of multiple decision trees, while its feature selection capability ensures that the most informative predictors contribute to the final decision-making process.

In contrast, the inferior performance of AdaBoost and MLP can be explained by their inherent limitations with categorical data. AdaBoost, which relies on iterative weighting mechanisms, may struggle with potential class imbalances and complex interactions, leading to cumulative errors. Similarly, MLP models, which generally perform better with large datasets containing continuous variables, may have been less effective due to the categorical nature of the dataset and the absence of highly nonlinear relationships. It could be said that tree-based models may have an advantage in this context.

Inherently, bias is one of the major drawbacks of machine learning algorithms, as they are prone to certain biases, such as overfitting and feature importance. Overfitting occurs when the model learns patterns too specific to the training data, which reduces its ability to generalize to new data, leading to high variance and unreliable predictions. Similarly, feature importance can cause models to place undue weight on variables that may correlate with the target but lack true causal significance, potentially leading to misleading conclusions. These biases can increase the margin of error, reduce model interpretability, and affect fairness in decision-making. Addressing them requires careful feature selection, regularization, and model validation to ensure robustness and fairness.

Given the reliability and high accuracy of these results, educational institutions can leverage such predictive models to identify student empathy levels earlier, enabling the development of strategies tailored to individual developmental needs. Using the developed prediction models, it is aimed to plan strategies to enhance empathy skills among medical students.

Conclusion

Healthcare sector is one of the fields where interpersonal communication is crucial, thus high levels of empathy are expected from physicians. This study shows that medical students generally have high levels of empathy. The findings indicate that empathy is a complex construct influenced by both genetic predispositions and environmental factors. In our study, variables such as parental education levels, mother’s occupation, and gender have been found to have significant effects on empathy levels.

The finding that women generally have higher empathy levels than men is consistent with previous research and is supported in our study as well. This can be explained by various factors, including women’s biological structures, social learning experiences, and cultural expectations. However, it should be noted that this is a generalization and may not apply to every individual.

It is found that the effect of parental education levels on children’s empathy levels is noteworthy. This finding indicates that parents’ educational levels affect empathy development through the social learning opportunities and modeling they provide to their children.

The impact of the mother’s occupation on empathy levels is another important point emphasized in our study. It was observed that children of mothers working in professions that require social interaction and empathy have higher empathy levels.

The predictive models developed using machine learning algorithms can estimate the empathy levels of medical students with a certain degree of accuracy. These models can be used in the future to plan and evaluate interventions aimed at improving students’ empathy skills in medical schools.

In conclusion, this study reveals multiple factors affecting the empathy levels of medical students. The findings suggest that empathy training should be an integral part of medical education and that such training should be tailored considering students’ personal characteristics and family backgrounds.

Limitations and future directions

This study has several limitations. The research was conducted on a relatively small sample. Studies with larger samples may provide more generalizable results about medical students’ empathy levels. The research only examined medical students in Turkey. Further research is needed to examine the empathy levels of medical students in different countries. Future studies could test the generalizability of these findings with research conducted in different cultures and socioeconomic levels. Different studies could help us better understand the relationships between mother’s occupation, parental education levels, and medical students’ empathy levels. Additionally, researchers should investigate other factors that could affect empathy levels (e.g., personality traits, family environment). Future research could explore the effects of different dimensions of empathy (cognitive, emotional, behavioral), cultural differences, socioeconomic status, and longitudinal studies to examine changes in empathy development over time.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

References

  1. Jeffrey D, Downie R. Empathy – can it be taught? J R Coll Physicians Edinb. 2016;46(2):107–12.

    Article  Google Scholar 

  2. Hodges SD, Klein KJK. Regulating the costs of empathy: The price of being human. J Socio Econ. 2001;30(5):437–52.

    Article  Google Scholar 

  3. Hojat M. Empathy in Patient Care: Antecedents, Development, Measurement, and Outcomes. New York: Springer; 2007.

    Google Scholar 

  4. Halpern J. From Detached Concern to Empathy. New York: Oxford University Press; 2001.

    Book  Google Scholar 

  5. Capdevila-Gaudens P, García-Abajo JM, Flores-Funes D, García-Barbero M, García-Estañ J. Depression, anxiety, burnout and empathy among Spanish medical students. PLoS One. 2021;16(12):e0260359 Sattar K, editor.

    Article  Google Scholar 

  6. Shi M, Du T. Associations of emotional intelligence and gratitude with empathy in medical students. BMC Med Educ. 2020;20(1):116.

    Article  Google Scholar 

  7. Dyrbye LN, Satele D, West CP. Association of characteristics of the learning environment and US medical student burnout, empathy, and career regret. JAMA Netw Open. 2021;4(8):e2119110.

    Article  Google Scholar 

  8. Silva JTN, Jr Toledo A. Association between emotional intelligence and empathy among medical students: a single center cross-sectional study, Brazil, 2019. Rev Bras Educ Med. 2021;45(1). https://doiorg.publicaciones.saludcastillayleon.es/10.1590/1981-5271v45.1-20200053.ING.

  9. Derksen F, Bensing J, Lagro-Janssen A. Effectiveness of empathy in general practice: a systematic review. Br J Gen Pract. 2013;63(606):e76.

    Article  Google Scholar 

  10. Neumann M, Edelhäuser F, Tauschel D, Fischer MR, Wirtz M, Woopen C, et al. Empathy decline and its reasons: A systematic review of studies with medical students and residents. Acad Med. 2011;86(8):996–1009.

    Article  Google Scholar 

  11. Kim SS, Kaplowitz S, Johnston MV. The effects of physician empathy on patient satisfaction and compliance. Eval Heal Prof. 2004;27(3):237–51.

    Article  Google Scholar 

  12. Roter DL, Hall JA, Merisca R, Nordstrom B, Cretin D, Svarstad B. Effectiveness of interventions to improve patient compliance a meta-analysis. Med Care. 1998;36(8):1138–61.

    Article  Google Scholar 

  13. Staudenmayer H, Lefkowitz MS. Physician-patient psychosocial characteristics influencing medical decision-making. Soc Sci Med Part E Med Psychol. 1981;15(1):77–81.

    Article  Google Scholar 

  14. Hojat M, Louis DZ, Markham FW, Wender R, Rabinowitz C, Gonnella JS, et al. Physicians’ empathy and clinical outcomes for diabetic patients. Acad Med. 2011;86(3):359–64.

    Article  Google Scholar 

  15. Krupnick JL, Sotsky SM, Simmens S, George JM, Atlanta JW, Elkin I, et al. The role of the therapeutic alliance in psychotherapy and pharmacotherapy outcome: Findings in the National Institute of Mental Health Treatment of Depression Collaborative Research Program. J Consult Clin Psychol. 1996;64(3):532–9.

    Article  Google Scholar 

  16. Horvath AO, Symonds BD. Relation between working alliance and outcome in psychotherapy: a meta-analysis. J Couns Psychol. 1991;38(2):139–49.

    Article  Google Scholar 

  17. Aslan E. Covid-19 Pandemisi Sürecinde İstanbul Haseki Eğitim Ve Araştırma Hastanesi’nde Görev Yapan Asistan Hekimlerin Empati Düzeylerinin Yaşam Kalitesi Ve Tükenmişlik Durumu Üzerindeki Etkisi. [Master’s Thesis, T.C Sağlık Bilimleri Üniversitesi]; 2022.

  18. Neuman WL. Basics of Social Research: Qualitative and Quantitative Approaches. 3rd ed. New Jersey: Pearson; 2011. 406 p.

  19. Chen D, Lew R, Hershman W, Orlander J. A cross-sectional measurement of medical student empathy. J Gen Intern Med. 2007;22(10):1434–8.

    Article  Google Scholar 

  20. Hojat M, Nasca TJ, Cohen MJM, Gonnella JS, Erdmann JB, Veloski J, et al. The Jefferson scale of physician empathy: development and preliminary psychometric data. Educ Psychol Meas. 2001;61(2):349–65.

    Article  Google Scholar 

  21. Spreng RN, McKinnon MC, Mar RA, Levine B. The Toronto empathy questionnaire: Scale development and initial validation of a factor-analytic solution to multiple empathy measures. J Pers Assess. 2009;91(1):62–71.

    Article  Google Scholar 

  22. Kass G V. An exploratory technique for investigating large quantities of categorical data. Appl Stat. 1980;29(2):119. Available from: https://www.jstor.org/stable/10.2307/2986296?origin=crossref

  23. McArdle JJ, Ritschard G, editors. Contemporary issues in exploratory data mining in the behavioral sciences. Contemporary Issues in Exploratory Data Mining in the Behavioral Sciences. Routledge; 2013. 70–96 p.

  24. Rogers CR. Empathic: an unappreciated way of being. Couns Psychol. 1975;5(2):2–10.

    Article  Google Scholar 

  25. Rogers CR. The necessary and sufficient conditions of therapeutic personality change. J Consult Psychol. 1957;21(2):95–103.

    Article  Google Scholar 

  26. Feshbach ND. Empathy in children: some theoretical and empirical considerations. Couns Psychol. 1975;5(2):25–30.

    Article  Google Scholar 

  27. Wispé L. The distinction between sympathy and empathy: To call forth a concept, a word is needed. J Pers Soc Psychol. 1986;50(2):314–21. https://doiorg.publicaciones.saludcastillayleon.es/10.1037/0022-3514.50.2.314.

    Article  Google Scholar 

  28. Kohut H. Introspection, empathy, and psychoanalysis an examination of the relationship between mode of observation and theory. J Am Psychoanal Assoc. 1959;7(3):459–83.

    Article  Google Scholar 

  29. Yu J, Kirk M. Evaluation of empathy measurement tools in nursing: systematic review. J Adv Nurs. 2009;65(9):1790–806.

    Article  Google Scholar 

  30. Hojat M, Gonnella JS, Mangione S, Nasca TJ, Veloski JJ, Erdmann JB, et al. Empathy in medical students as related to academic performance, clinical competence and gender. Med Educ. 2002;36(6):522–7.

    Article  Google Scholar 

  31. Mercer SW, Reynolds WJ. Empathy and quality of care. Br J Gen Pract. 2002;52(SUPPL):S9–12.

    Google Scholar 

  32. Hoffman ML. The contribution of empathy to justice and moral judgment. In: Eisenberg N, Strayer J, editors. Empathy and its development. Cambridge University Press; 1987. p. 47–80.

    Google Scholar 

  33. Eisenberg N, Strayer J. Empathy and its development. Cambridge: Cambridge University Press; 1990.

    Google Scholar 

  34. Jolliffe D, Farrington DP. Development and validation of the Basic Empathy Scale. J Adolesc. 2006;29(4):589–611.

    Article  Google Scholar 

  35. Smith A. The empathy imbalance hypothesis of autism: a theoretical approach to cognitive and emotional empathy in autistic development. Psychol Rec. 2009;59(2):273–94.

    Article  Google Scholar 

  36. Stepien KA, Baernstein A. Educating for empathy. J Gen Intern Med. 2006;21(5):524–30.

    Article  Google Scholar 

  37. Knafo A, Zahn-Waxler C, Van Hulle C, Robinson JAL, Rhee SH. The developmental origins of a disposition toward empathy: genetic and environmental contributions. Emotion. 2008;8(6):737–52.

    Article  Google Scholar 

  38. Hojat M. Empathy in Health Professions Education and Patient Care. Cham: Springer International Publishing; 2016.

    Book  Google Scholar 

  39. Levinson W. Physician-patient communication. JAMA. 1994;272(20):1619.

    Article  Google Scholar 

  40. Jackson SW. The listening healer in the history of psychological healing. Am J Psychiatry. 1992;149(12):1623–32.

    Article  Google Scholar 

  41. O’Tuathaigh CMP, Nadhirah Idris A, Duggan E, Costa P, Costa MJ. Medical students’ empathy and attitudes towards professionalism: Relationship with personality, specialty preference and medical programme. PLoS One. 2019;14(5):e0215675 Arrieta A, editor.

    Article  Google Scholar 

  42. Stepien KA, Baernstein A. Educating for empathy: A review. J Gen Intern Med. 2006;21(5):524–30. Available from: http://link.springer.com/10.1111/j.1525-1497.2006.00443.x

  43. Charon R. Narrative medicine: A model for empathy, reflection, profession, and trust. Jama. 2001;286(15):1897–902. Available from: http://jama.jamanetwork.com/article.aspx?doi=10.1001/jama.286.15.1897

  44. Batt-Rawden SA, Chisolm MS, Anton B, Flickinger TE. Teaching empathy to medical students: An updated, systematic review. Acad Med. 2013;88(8):1171–7. Available from: http://journals.lww.com/00001888-201308000-00037

  45. Shapiro J, Morrison EH, Boker JR. Teaching empathy to first year medical students: Evaluation of an elective literature and medicine course. Educ Heal. 2004;17(1):73–84. Available from: http://www.informaworld.com/openurl?genre=article&doi=10.1080/13576280310001656196&magic=crossref%7C%7CD404A21C5BB053405B1A640AFFD44AE3

  46. Kelm Z, Womer J, Walter JK, Feudtner C. Interventions to cultivate physician empathy: A systematic review. BMC Med Educ. 2014;14(1):219. Available from: https://biomedcentral-bmcmededuc.publicaciones.saludcastillayleon.es/articles/10.1186/1472-6920-14-219

  47. Levinson W. Physician-patient communication: the relationship with malpractice claims among primary care physicians and surgeons. JAMA J Am Med Assoc. 1997;277(7):553.

    Article  Google Scholar 

  48. Levinson W. A study of patient clues and physician responses in primary care and surgical settings. JAMA. 2000;284(8):1021.

    Article  Google Scholar 

  49. Adler HM. The history of the present illness as treatment: who’s listening, and why does it matter? J Am Board Fam Pract. 1997;10(1):28–35.

    Google Scholar 

  50. Bylund CL, Makoul G. Examining empathy in medical encounters: an observational study using the empathic communication coding system. Health Commun. 2005;18(2):123–40.

    Article  Google Scholar 

  51. Christensen LB, Johnson RB, Turner LA. Research methods, design, and analysis. 11th ed. Boston: Allyn and Bacon; 2010.

    Google Scholar 

  52. Fraenkel JR, Wallen NE, Hyun HH. How to Design and Evaluate Research in Education. New York: McGraw-Hill; 2012.

    Google Scholar 

  53. Siu Loon Hoe. Issues and procedures in adopting structural equation modeling technique. J Appl Quant Methods. 2008;3(1):83.

    Google Scholar 

  54. Singh K, Junnarkar M, Kaur J. Measures of positive psychology: Development and validation. Springer. New Delhi: Springer India; 2016. 1–215 p. Available from: https://link.springer.com/10.1007/978-81-322-3631-3

  55. Tabachnick BG, Fidell LS. Using Multivariate Statistics Title: Using multivariate statistics. 6th ed. Vol. 5, Pearson Education. 2019. 481–498 p. Available from: https://www.pearsonhighered.com/assets/preface/0/1/3/4/0134790545.pdf%0Ahttps://www.pearsonhighered.com/assets/preface/0/1/3/4/0134790545.pdf%0Ahttps://lccn.loc.gov/2017040173

  56. Kotsiantis SB. Decision trees: A recent overview. Artif Intell Rev. 2013;39(4):261–83. Available from: http://link.springer.com/10.1007/s10462-011-9272-4

  57. Totan T, Dogan T, Sapmaz F. The Toronto empathy questionnaire: evaluation of psychometric properties among Turkish University Students. Egit Arastirmalari-Eurasian J Educ Res. 2012;12(46):179–98.

    Google Scholar 

  58. de Ville B. Decision trees. WIREs Comput Stat. 2013;5(6):448–55.

    Article  Google Scholar 

  59. Pandya R, Pandya J. C5. 0 Algorithm to improved decision tree with feature selection and reduced error pruning. Int J Comput Appl. 2015;117(16):18–21.

    Google Scholar 

  60. Chan F, Cheing G, Chan JYC, Rosenthal DA, Chronister J. Predicting employment outcomes of rehabilitation clients with orthopedic disabilities: A CHAID analysis. Disabil Rehabil. 2006;28(5):257–70.

    Article  Google Scholar 

  61. van Diepen M, Franses PH. Evaluating chi-squared automatic interaction detection. Inf Syst. 2006;31(8):814–31.

    Article  Google Scholar 

  62. Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and regression trees. Routledge; 2017. Available from: https://www.taylorfrancis.com/books/9781351460491

  63. Lemon SC, Roy J, Clark MA, Friedmann PD, Rakowski W. Classification and regression tree analysis in public health: Methodological review and comparison with logistic regression. Ann Behav Med. 2003;26(3):172–81. Available from: https://academic.oup.com/abm/article/26/3/172-181/4631556

  64. Iqbal Malik S, Mathew R, Moufaq Tawafak R, Khan I. Gender Difference In Perceiving Algorithmic Thinking In An Introductory Programming Course. In 2019. p. 8246–54. Available from: http://library.iated.org/view/IQBALMALIK2019GEN

  65. Monteverde-Suárez D, González-Flores P, Santos-Solórzano R, García-Minjares M, Zavala-Sierra I, de la Luz VL, et al. Predicting students’ academic progress and related attributes in first-year medical students: an analysis with artificial neural networks and Naïve Bayes. BMC Med Educ. 2024;24(1):74. Available from: https://biomedcentral-bmcmededuc.publicaciones.saludcastillayleon.es/articles/10.1186/s12909-023-04918-6

  66. Mastour H, Dehghani T, Moradi E, Eslami S. Early prediction of medical students’ performance in high-stakes examinations using machine learning approaches. Heliyon. 2023;9(7):e18248.

    Article  Google Scholar 

  67. Smith C, Koning M. Decision Trees and Random Forests: A Visual Introduction for Beginners. Independently Published. 2017.

  68. Ferreira AJ, Figueiredo MAT. Boosting Algorithms: A Review of Methods, Theory, and Applications. In: Ensemble Machine Learning. New York, NY: Springer New York; 2012. p. 35–85. Available from: https://link.springer.com/10.1007/978-1-4419-9326-7_2

  69. Kruse R, Mostaghim S, Borgelt C, Braune C, Steinbrecher M. Multi-layer Perceptrons. In: Computational Intelligence. 2022. p. 53–124. Available from: https://link.springer.com/10.1007/978-3-030-42227-1_5

  70. Nightingale SD, Yarnold PR, Greenberg MS. Sympathy, empathy, and physician resource utilization. J Gen Intern Med. 1991;6(5):420–3.

    Article  Google Scholar 

  71. Ardenghi S, Rampoldi G, Bani M, Strepparava MG. Attachment styles as predictors of self-reported empathy in medical students during pre-clinical years. Patient Educ Couns. 2020;103(5):965–70.

    Article  Google Scholar 

  72. Teke AK, Cengiz E, Demir C. Hekimlerin Empatik Özelliklerinin Ölçümü ve Bu Ölçümlerin Demografik Değişkenlere Göre Değişimi. Çukurova Üniversitesi Sos Bilim Enstitüsü Derg. 2010;19(3):505–16.

    Google Scholar 

  73. Demir Karabulut S, Zengin HY, Yıldırım RV, Yılmaz ÇZ. Tıp Fakültesi Dönem 1-2-3 Öğrencilerinin İletişim ve Empati Becerilerinin Analizi. Tıp Eğitimi Dünyası. 2020;19(58):42–53.

    Article  Google Scholar 

  74. Dehning S, Gasperi S, Krause D, Meyer S, Reiß E, Burger M, et al. Emotional and cognitive empathy in first-year medical students. ISRN Psychiatry. 2013;2013:1–6.

    Article  Google Scholar 

  75. Maximiano-Barreto MA, de Fabrício DM, Luchesi BM, Chagas MHN. Factors associated with levels of empathy among students and professionals in the health field: A systematic review. Trends Psychiatry Psychother. 2020;42(2):207–15.

    Article  Google Scholar 

  76. Hojat M, Gonnella JS, Mangione S, Nasca TJ, Magee M. Physician empathy in medical education and practice: Experience with the Jefferson scale of physician empathy. Semin Integr Med. 2003;1(1):25–41.

    Article  Google Scholar 

  77. Boztarla S. Ondokuz Mayıs Üniversitesi Tıp Fakültesinde görevli asistan hekimlerin genel empati ve hekimlik empati becerisinin karşılaştırılması (Masters Thesis, Ondokuz Mayıs Üniversitesi Tıp Fakültesi); 2021.

  78. Hojat M, DeSantis J, Shannon SC, Speicher MR, Bragan L, Calabrese LH. Empathy as related to gender, age, race and ethnicity, academic background and career interest: A nationwide study of osteopathic medical students in the United States. Med Educ. 2020;54(6):571–81.

    Article  Google Scholar 

  79. Robert T. Parental Investment and Sexual Selection. In: Campbell B, editor. Sexual Selection and the Descent of Man. Aldine: Chicago; 1972. p. 1871–971.

    Google Scholar 

  80. Soncini F, Silvestrini G, Poscia A, Ciorba V, Conti A, Murru C, et al. Public health physicians and empathy. Are we really empathic? The Jefferson Scale applied to Italian resident doctors in Public Health. Eur J Public Health. 2013;23(suppl_1):264–5.

  81. Whittle S, Simmons JG, Allen NB. Emotion and Gender-Specific Neural Processing in Men and Women. In: Principles of Gender-Specific Medicine: Gender in the Genomic Era: Third Edition. Elsevier; 2017. p. 183–201. Available from: https://linkinghub.elsevier.com/retrieve/pii/B9780128035061000139

  82. Koch K, Pauly K, Kellermann T, Seiferth NY, Reske M, Backes V, et al. Gender differences in the cognitive control of emotion: An fMRI study. Neuropsychologia. 2007;45(12):2744–54. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0028393207001558

  83. Christov-Moore L, Simpson EA, Coudé G, Grigaityte K, Iacoboni M, Ferrari PF. Empathy: Gender effects in brain and behavior. Neurosci Biobehav Rev. 2014;46(P4):604–27.

    Article  Google Scholar 

  84. Chaplin TM. Gender and emotion expression: A developmental contextual perspective. Emot Rev. 2015;7(1):14–21. Available from: https://journals.sagepub.com/doi/10.1177/1754073914544408

  85. Fonagy P. Attachment Theory and Psychoanalysis. London: Routledge; 2018.

    Book  Google Scholar 

  86. Karaoglan D, Sagir S, Dayioglu M, Saracolgu DS. The impact of maternal education on early childhood development: The Case of Turkey. In: ERC Working Papers 2303. 2023.

  87. Hanifah I, Oviyanti F, Muhtarom M. Child cognitive development based on the maternal education. Aṭfālunā J Islam Early Child Educ. 2022;5(1):15–24.

    Article  Google Scholar 

  88. Sonego M, Llácer A, Galán I, Simón F. The influence of parental education on child mental health in Spain. Qual Life Res. 2013;22(1):203–11.

    Article  Google Scholar 

  89. Decety J, Jackson PL. The functional architecture of human empathy. Behav Cogn Neurosci Rev. 2004;3(2):71–100.

    Article  Google Scholar 

  90. Shamay-Tsoory SG. The neural bases for empathy. Neuroscientist. 2011;17(1):18–24.

    Article  Google Scholar 

  91. Preston SD, de Waal FBM. Empathy: Its ultimate and proximate bases. Behav Brain Sci. 2002;25(1):1–20.

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank all the medical students who participated in this study for their valuable contributions.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

N. H. S. contributed to the all stages of this study.

Corresponding author

Correspondence to Nesrin Hark Söylemez.

Ethics declarations

Ethics approval and consent to participate

Approval for the current study was obtained from the Dicle University Social Sciences Ethics Committee, with the ethical assessment decision dated 17.02.2023 and documented under the number 17.02.2023–53 and ensured compliance with all ethical standards and guidelines of Declaration of Helsinki. Information about the aim of the research was shared with all participants. Participation was voluntary and participants could choose not to participate in the research without repercussion. Informed consent obtained from participant.

Consent for publication

Informed consent obtained from participant.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix A

Appendix A

Table 5 The Turkish Toronto Empathy Questionnaire

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hark Söylemez, N. Examining the empathy levels of medical students using CHAID analysis. BMC Med Educ 25, 726 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12909-025-07296-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12909-025-07296-3

Keywords