1. Introduction
Unsafe behaviors are the primary direct cause of construction accidents. Different types of accidents can be attributed to different sets of unsafe behaviors [
1]. For example, to avoid falls from height the management should take care of unprotected holes/borders and correct workers’ inappropriate use of personal protective equipment (PPE). Safety behavior is traditionally categorized as either safety compliance or safety participation. The former is an in-role task-related behavior, while the latter involves extra-role behaviors, which are voluntary and initiated by employees [
2]. Griffin and Curcuruto further identify two categories of safety participation behavior: affiliative and proactive [
3]. Helping and stewardship behaviors, civic virtue, and caring for safety are typical of affiliative safety participation behavior, whereas proactive safety participation behavior includes safety voice behaviors and initiating safety-related changes. Affiliative safety participation behavior is related to minor incidents, such as property damage and microinjuries, while proactive safety participation behavior is positively associated with near-miss reporting. Therefore, it can be hypothesized that different sets of drivers are accountable for different (un)safety behaviors. This paper attempts to validate this hypothesis with a machine-learning-enabled classification framework.
Besides the theoretical significance, this paper also has both a practical and a methodological significance as well. On the practical front, if different patterns of drivers for different safety behaviors are ascertained, targeted interventions can be proposed accordingly. Specifically, this paper selects five typical safety behaviors, i.e., the use of all necessary safety equipment to do the job (hereafter coded as SB1); following safety procedures in doing the job (hereafter coded as SB2); promoting safety programs willingly (hereafter coded as SB3); put in extra effort to improve workplace safety (hereafter coded as SB4); and help colleagues out when they are under risky conditions (hereafter coded as SB5). On the methodological front, as a subset of artificial intelligence, machine learning enables a system to learn from example data or past experience without explicit programming. Like traditional statistical modelling, it is also intended to seek solutions from data. Unlike traditional methods that are based on assumptions and ignore the nonlinear relationship among independent variables, machine learning methods are more flexible, have fundamental and simple assumptions, and take into consideration the complex relationship among independent variables. Machine learning has seen an increasing use by safety researchers in recent years. Construction workers’ risk perceptions have a direct impact on their safety behavior. The traditional measurement of risk perceptions primarily relies on a post hoc survey-based assessment, which has limitations such as lack of objectivity and continuous monitoring ability. Given this, Lee et al. developed an automatic system to measure workers’ risk perception using physiological signals obtained by wristband-type wearable biosensors in combination with a supervised learning algorithm [
4]. Overexertion-induced work-related musculoskeletal disorders (WMSDs) are a primary cause of the nonfatal injuries for construction workers. To reduce overexertion, appropriate levels of physical loads need to be identified. In this regard, Yang et al. propose to employ a bidirectional long short-term memory algorithm to classify physical load levels, and investigate the feasibility of such an approach with a laboratory experiment [
5]. In view of machine learning’s advantage in predictive accuracy, Goh et al. use six supervised learning algorithms (i.e., support vector machine, random forest, K-nearest neighbor, naïve Bayes, artificial neural network, and decision tree) to assess the relative importance of different cognitive factors derived from the theory of reasoned action in affecting safety behavior [
6].
Given the theoretical, practical, and methodological significance, a machine-learning-enabled safety behavior classification framework should be developed in order to improve construction safety performance in an efficient and effective way. In particular, this paper has two objectives, namely: (a) To identify drivers of different safety behaviors; (b) To propose new machine learning methods in predicting safety behaviors. The former intends to make targeted interventions for different safety behaviors based on the findings and the latter to explore new algorithms which are more suitable for analyzing safety-related behavioral data.
This paper is organized as follows. First, a safety behavior factor analysis and classification system is developed based on the literature review. Second, the sample, measures, machine learning models, and classification outputs are described. Third, results are presented, with an emphasis on model performance and factor importance analysis. Finally, both the contribution and limitations of the findings are discussed along with future research directions.
2. Safety Behavior Factor Analysis and Classification System
Safety behavior is an emergent property of a more complex system. Choi and Lee find that construction workers’ safety behavior is a function of their socio-cognitive process and their interaction with the environment [
7]. Based on bibliometric and content analyses of 101 empirical studies, Xia et al. propose a safety behavior antecedent analysis and classification system [
8], which organizes the antecedents of safety behavior into five levels: (a) Self; (b) Work; (c) Home; (d) Work–home interface; (e) Industry/society. In addition, they put forward a resource flow model to explain how safety behavior emerges from such a complex system. Using Xia et al. ’s framework [
8], this study organizes influencing factors of construction safety behavior at four levels, i.e., client, project, group, and individual, and hence, develops a safety behavior factor analysis and classification system as well. The next section deliberates on the impact of these factors on safety behavior before presenting the system.
2.1. Client Level Factors
Among stakeholders in the construction supply chain, clients have the economic power to encourage other stakeholders to implement safety measures. Therefore, clients play a pivotal role in improving safety performance across construction projects. Specifically, client type and the extent of client involvement in safety management have implications for safety performance [
9,
10].
2.1.1. Client Type
Construction project clients can be categorized as either public or private according to their source of funding. Ma observes that safety records for the projects with public sector clients are better than those projects with private sector clients in Hong Kong [
11], and believes that the reason is that most safety initiatives are mandatorily executed in public works’ contracts, whereas they are voluntarily adopted in the private sector. In Nigeria, Umeokafor also notes that public clients’ safety commitment and attitudes are better than their counterparts [
12]. So, it is hypothesized that there are more safety behaviors in public projects than in private projects.
2.1.2. Client Involvement
Clients’ direct involvement in safety management contributes to safety performance. In Australia, given the important contribution that clients can make to the safety performance of the construction projects, Lingard et al. develop a model client framework [
13]. The framework establishes clients’ safety roles throughout the life-cycle of the project. Using safety climate as a leading indicator of safety performance of small- and medium-sized construction projects, Votano and Sunindijo found that six of the clients’ safety roles depicted by Lingard et al. are related to safety performance, and they are participation in the safety program, review and analysis of safety data, appointment of safety team, selection of safe contractors, safety specifications in tenders, and regular checks on plant/equipment [
13,
14]. Hence, this research postulates that client safety involvement is positively associated with workers’ safety behaviors.
2.2. Project Level Factors
Safety management system at the project level has implications for workers’ safety behavior. In order to curb unsafe acts, Shin et al. suggest that project management should offer a safety incentive as early as possible and facilitate effective communication about accidents in as much detail as possible [
15]. Fang et al. propose a leadership–culture–behavior (LCB) approach, which maintains that leadership creates a safety culture, and hence, promotes safety behavior [
16]. The LCB approach has been implemented in railway and residential projects in mainland China and Hong Kong, and has seen success. Among others, this paper focuses on the following project level factors: stage of project, contract sum, goal congruency, participative decision-making, professional development, organizational support, standardized safety rules and procedures, and safety climate.
2.2.1. Project Information
At least two project characteristics, namely, stage of project and contract sum, have bearing on construction project employees’ safety behavior. Based on the percentage of construction works that has been completed, a project can be categorized into three stages, namely, start-up, advanced, and near close-out. At the start-up stage, the construction work has been completed by less than 30%. At the advanced stage, the construction work has been completed by 30–70%. At the near close-out stage, the construction work that has been completed is more than 70%. Employees usually exhibit more safety behaviors at the start-up and near close-out stages than at the advanced stage. This is because at the start-up stage, employees are new to the site, and act scrupulously. As time passes and production pressure increases, employees are more likely to take shortcuts and more unsafe behaviors ensue. When the project is being completed, as employees are more familiar with the site and some of their unsafe behaviors have been rectified, their safety behavior increase. Awolusi and Marks develop a safety activity analysis framework and tool, and validate the framework and tool using a case study project that is in the construction stage [
17]. Over an eight-month period of the case study project, the occurrence rate of safety behavior experiences a U-shaped curve, initially decreasing from 45.7% to 37.0% and then increasing to 62.8%.
Contract sum is also related to employees’ safety performance. Generally, in jurisdictions where mandatory safety incentive scheme is applied, projects with large a contract sum usually set aside more money on safety measures, and therefore, more safety behaviors result. Take Hong Kong as an example, due to the introduction of safety initiatives, such as the Pay for Safety Scheme (PFSS), the Safety Management System (SMS),the Independent Safety Auditing Scheme (ISAS), and the Site Supervision Plan System (SSPS), the construction industry has seen a dramatic decrease in accidents [
18]. Hence, this paper hypothesizes that a large contract sum contributes to more safety behaviors.
2.2.2. Goal Congruency
Goal congruency has an impact on organizational behavior. Goal congruency is a scenario where employees at different levels of an organization share the same goal. When employees’ personal goals are consistent with organizational goals, they feel more positive about the organization and expend more personal efforts to achieve those goals. Ukraine-based IT professionals De Clercq et al. found that goal congruence between employees and their supervisor negatively affects employees’ organizational deviance, and the indirect effect of goal congruence on organizational deviance through work engagement is moderated by employees’ emotional intelligence [
19]. With 171 employees under the leadership of 24 supervisors, Bouckenooghe et al. found that supervisors’ ethical leadership has a positive effect on followers’ in-role job performance through the sequential mediation of goal congruence and psychological capital [
20]. Hence, when project personnel, both the management and workers, take safety as the first priority, their safety behavior ensues.
2.2.3. Participative Decision-Making
Participative decision-making is positively associated with safety behavior. Participative decision-making refers to the extent to which employers allow or encourage employees to take part in organizational decision-making. Through participation in decision-making, employees bring different perspectives and frames of references to safety discussions and activities, and hence, can reduce all members’ ignorance to hazards and signals of danger [
21]. As employees are aware that their suggestions have been incorporated in safety decisions, they are more likely to take ownership of those decisions and act on them more proactively. As a leadership behavior, participative decision-making is associated with safety participation [
22]. In the medical industry, Lee et al. found that empowering leaders who empower employees to participate in decision-making enhance employees’ safety compliance [
23].
2.2.4. Professional Development
Employees are the most valuable resource in construction projects. Despite the time and resource pressures preventing project managers from investing in employees’ professional development, it pays off. Design for safety has been advocated for quite a long time, and designers need to receive safety training as part of their professional development. Toole elaborates on the opportunities and barriers in increasing designers’ role in construction safety [
24]. In another scenario, if a semi-skilled bar bender is sponsored to receive more professional training, s/he may bring more best safety practices to the crew and promote more safety behaviors.
2.2.5. Organizational Support
Organizational support is critical in creating a safety climate and, hence, safety behavior. Organizational support refers to employees’ global beliefs about the extent to which their organization satisfies their needs and cherishes their contributions. It can be general or specific. Mearns and Reader found that general perceived organizational support has an impact on the UK’s offshore workers’ safety performance [
25]. With Ghanaian industrial workers, Gyekye and Salminen found that general perceived organizational support is positively associated with compliance with safety procedures [
26]. Guo et al. discovered that perceived supervisory and coworker support for safety reduces the negative impact of job insecurity on Chinese high-railway drivers’ safety performance [
27]. Tucker et al. found that urban bus drivers’ perceived organizational support for safety exerts influence on their safety voice behavior through the mediation of their perceived coworkers’ support for safety [
28], highlighting the role played by coworkers.
2.2.6. Standardized Safety Rules and Procedures
Standardization in construction projects is difficult to achieve. Other high-risk industries, such as aviation and nuclear, usually have well-defined work procedures. Since the construction process is characterized by high variety and loose coupling, most of the construction work, to a significant extent, depends on employees’ discretion and experience. Standardized safety rules and procedures make those rules and procedures easy to follow, and hence, contribute to an increase in safety behavior. However, the secondary effect of too much standardization should be restrained [
29].
2.2.7. Safety Climate
Safety climate is a perceptual, collective, and multidimensional phenomenon, referring to individuals’ shared perceptions of how safety is valued in the workplace [
3]. The impact of a safety climate on safety behavior has been well-documented. Safety climates can exert a direct influence on safety behavior, and also can impact safety behavior through mediators, such as the psychological contract [
30], safety knowledge and motivation [
31], etc.
2.3. Group Level Factors
Construction workers usually move from project to project and may work with different main contractors, but they often work in a workgroups for a relatively long period. Therefore, compared with supervisors from the main contractor, workgroup supervisors usually have a bigger influence on construction workers [
32]. This paper focuses on four phenomena at the workgroup level, i.e., supervisors’ transformational leadership and contingent reward behavior (one aspect of their transactional leadership), leader–member exchange, and team–member exchange.
2.3.1. Transformational Leadership
Leadership refers to a process of motivating others to act toward shared goals. It involves setting goals, devising achievement methods, persuading others to accept these goals and achievement methods, and solving problems decisively and quickly. James M. Burns proposes two leadership styles: transactional and transformational. The transactional leader identifies the needs of employees and the organization, and then informs employees what to do to meet these needs. Beyond these needs, transformational leaders arouse and satisfy higher needs within each individual. A transactional–transformational leadership paradigm is broad enough to capture the leadership construct.
Transformational leadership is positively associated with safety behavior. Shen et al. propose and validate a sequential mediation model to explain the impact of supervisory transformational leadership on construction personnel’s safety behavior [
10]. Hoffmeister et al. found that different facets of transformational leadership have a different impact on different sample’s safety behavior [
33]. In particular, idealized influence has an impact on safety compliance behavior in both the apprentice and journeyman samples, but it has an impact on safety participation behavior only in the apprentice sample.
2.3.2. Contingent Reward
Contingent reward is a facet of transactional leadership, and refers to the leader clarifying which employee behaviors are desired, what the rewards for such behaviors will be, and rewarding the followers depending on task fulfilment and outcome. Behaviorism maintains that behavior is a function of its consequences. Leaders engage in contingent reward with regard to safety when they help employees appreciate safety-related goals, keep them focused on meeting these goals, and reward them for engaging in safety behaviors required by those goals [
33]. Therefore, contingent reward should be associated with increased employee safety behaviors.
2.3.3. Leader–Member Exchange
Leader–follower relationships are an essential part of leadership effectiveness, and leader–member exchange refers to the follower’s perceptions of the quality of the exchange between leader and followers [
34]. Leader–member exchange is positively associated with safety behavior [
35,
36,
37].
2.3.4. Team–Member Exchange
Similar to leader–member exchange, team–member exchange refers to an individual’s perception of the quality of the exchange relationships within the team. It is positively associated with safety behavior [
38,
39].
2.4. Individual Level Factors
Safety behavior is complex, and an individual may work safely in some occasions and unsafely in others [
40]. Hence, some individual differences may contribute to an individual’s safety behavior. This study focuses on construction personnel’s personal demographics, habit, affiliation, and safety motivation.
2.4.1. Personal Demographics
Personal demographics, including age, gender, marital status, educational level, number of dependents to support, and industrial experience, may have an influence on safety behavior [
41]. Meng and Chan found that female poorly educated workers exhibit less safety citizenship behavior [
42]. The level of safety citizenship behavior has seen an initial downtrend followed by an uptrend as industrial experience increases.
2.4.2. Habit
Alcohol and tobacco use are more prevalent in blue collar workers than in white collar workers. There is a strong association between unsafe behavior (e.g., infrequently using sunscreen) and smoking and risky drinking [
43].
2.4.3. Affiliation
At least two affiliation-related factors, namely, affiliation type and hierarchical position in affiliation, are related to construction personnel’s safety behavior. Personnel affiliated with clients exhibit more safety behaviors than those with contractors and consultants. Personnel in managerial positions exhibit more safety behaviors than supervisory staff, who in turn exhibit more safety behavior than workers.
2.4.4. Safety Motivation
Safety motivation refers to an individual’s readiness to expend effort to engage in safety behaviors and the valence associated with these behaviors. It directs, energizes and sustains safety behavior [
3]. Griffin and Curcuruto view safety motivation as an outcome of safety climate and a determinant of safety behavior based on theories and empirical evidence [
3].
Based on the arguments made earlier, the safety behavior factor analysis and classification system is proposed and shown in
Figure 1.
5. Discussion
5.1. Findings
This study has achieved the two objectives mentioned earlier, and has theoretical, practical, and methodological implications.
First, in theory, safety behavior as an emergent property of a complex socio-technical system has different drivers. Using machine learning, this study supports the proposition. In particular, this study found that in order to encourage personnel to use all necessary safety equipment on the job (i.e., SB1), clients should set examples for contractors and consultants, safety motivation should be enhanced, and clients, private clients in particular, are encouraged to be involve in safety management as early as possible. In projects with a large contract sum, older personnel with more dependents to support is more likely to follow safety procedures on the job (i.e., SB2). In projects with a large contract sum, construction personnel are more likely to promote safety programs willingly (i.e., SB3) with clients actively engaging in safety management. In public projects with a large contract sum, personnel is encouraged to pursue professional development, and hence, more likely to put in extra effort to improve workplace safety (i.e., SB4). In projects with a sound safety climate and more client involvement, personnel is more likely to help colleagues who are in risky conditions (i.e., SB5). Based on the findings, practicable and targeted measures are proposed to promote the five safety behaviors, respectively.
Second, machine learning has advantage over traditional statistical methods in addressing more complex interrelations among independent variables [
6]. To garner this advantage, this study first evaluates the performance of four common machine-learning methods. Although these four methods achieve the comparatively satisfactory performance, this study develops a combinative method, CatBoost–MOSMA, to train and test the data again. This is because MOSMA has achieved superior performance in hyperparameter tuning, and this study attempts to introduce it into the safety research domain. Through 64 trials, the combinative method has achieved the maximum classification performance, and therefore, is used to establish factor importance. Furthermore, as noted by Poh et al. [
45], the imbalanced distribution of the classes is usually an issue in previous research. This combinative method adopts the SMOTE technique to address this issue and obtains more robust results. This is shown in
Table 5, which compares the classification performance between the proposed combinative method and other classification methods. Compared with other methods of tuning and optimization, MOSMA achieves a higher accuracy score when using the same classifiers. When the performance of classifiers is not significantly different, MOSMA achieves a higher F1-score. Hence, it can be concluded that the proposed combinative strategy of MOSMA-CatBoost is effective and efficient in classifying binary construction safety behavioral data.
5.2. Limitations and Future Research Directions
Although the study has achieved its objectives, it has limitations. First, the sample size can be further enlarged. Although a new machine-learning strategy is developed specifically to tackle the small sample size issue and some seminal studies have used a smaller sample set, it is highly recommended that future researches collect more data. Second, the study uses a sample from Hong Kong, and whether the findings can be extrapolated to other countries/regions needs more research efforts. Third, the factors affecting safety behaviors mentioned in the study are not exhaustive, and their interrelationship is not clearcut. Hence, more in-depth research needs to be undertaken in this regard. Fourth, similar to the third one, this study attempts to propose a generic classification framework, and different construction sites are encouraged to tailor the framework to cater for their own needs. Fifth, this study employs a combination of three feature-selection methods, including FI, CT and BS. Only those input indicators that obtain over half votes were retained. In other word, this approach may omit some input indicators that are strongly correlated with some safety behavior. For instance, the input indicator SmoHab is strongly negatively correlated with SB5, but does not correlate with other safety behaviors. Therefore, it has been deleted. It can be seen in the experiment results that this method generally benefits all of the safety behaviors as a whole since the classification performance improves after deleting those input indicators that were only correlated with certain safety behaviors.
Despite these limitations, the classification framework is highly recommended for future research efforts, given its satisfactory performance.
5.3. Practical Use of the Research
The proposed methods can be used in safety management practice on construction sites, as shown in
Figure 9. A survey is conducted with a representative sample of construction personnel on the site, and the data are stored into a safety behavioral database. After training, safety staff is charged with modeling and algorithm implementation and deriving model results, which suggest different safety behavioral orientation associated with different feature patterns. Using a combination of their experience and this data-driven clue, safety staff shall be able to predict a newcomer’s safety behavioral orientation, and then propose and implement targeted interventions. When the prediction performance turns out to be unsatisfactory, a new round of survey begins, and more data are stored in the database. Complemented with their gut feeling, this data-driven decision support system is supposed to help deter unsafe behaviors on construction sites in an efficient and effective way.
6. Conclusions
Different sets of drivers underlie different safety behaviors, and uncovering such complex patterns, help formulate targeted measures to cultivate safety behaviors. Machine learning can explore such complex patterns among safety behavioral data. Given the theoretical, methodological and practical significance, this paper attempts to develop a classification framework for construction personnel’s safety behaviors with machine-learning algorithms, including LR, SVM, RF, and CatBoost. The classification framework has three steps, i.e., data collection and preprocessing, modeling and algorithm implementation, and optimal model acquisition. For illustrative purposes, five common safety behaviors of a random and representative sample of Hong Kong-based construction personnel are used to validate the classification framework. To achieve a high classification performance, this paper employs a combinative strategy of CatBoost–MOSMA. Results support this combinative strategy in dealing with construction safety behavioral data. From the derived optimal model, a unique set of important features can be identified for each safety behavior, and ten out of the 46 input indicators are found important for all the five safety behaviors. Based on the findings, safety staff is supposed to make concrete and targeted interventions to individual construction personnel on site, and improve safety performance in a more efficient and effective way.