Article

Proposal for a System for the Identification of the Concentration of Students Who Attend Online Educational Models

by William Villegas-Ch. 1,*, Joselin García-Ortiz 1, Isabel Urbina-Camacho 2 and Aracely Mera-Navarrete 3

1 Escuela de Ingeniería en Tecnologías de la Información, FICA, Universidad de Las Américas, Quito 170125, Ecuador
2 Facultad de Filosofía, Letras y Ciencias de la Educación, Universidad Central del Ecuador, Quito 170129, Ecuador
3 Departamento de Sistemas, Universidad Internacional del Ecuador, Quito 170411, Ecuador
* Author to whom correspondence should be addressed.
Computers 2023, 12(4), 74; https://doi.org/10.3390/computers12040074
Submission received: 4 March 2023 / Revised: 28 March 2023 / Accepted: 31 March 2023 / Published: 6 April 2023
(This article belongs to the Special Issue Present and Future of E-Learning Technologies)

Abstract:
Currently, e-learning has revolutionized the way students learn by offering access to quality education in a model that does not depend on a specific space and time. However, because no tutor directly supervises the group of students in e-learning, students can become distracted for various reasons, which greatly affects their ability to learn. Several scientific works try to improve the quality of online education, but a holistic approach is necessary to address this problem. Identifying students’ attention spans is important in understanding how students process and retain information. Attention is a critical cognitive process that affects a student’s ability to learn. Therefore, it is important to use a variety of techniques and tools to assess student attention, such as standardized tests, behavioral observation, and assessment of academic achievement. This work proposes a system that uses devices such as cameras to monitor the attention level of students in real time during online classes. The results are fed back as a heuristic value to analyze both the performance of the students and the teaching standards of the teachers.

1. Introduction

E-learning is a teaching modality that uses information and communication technologies (ICT) to facilitate learning through the Internet [1,2]. In e-learning, students can access educational materials online, interact with content at their own pace, and communicate with teachers and other students through online tools such as discussion forums, chat rooms, and video conferencing [3]. E-learning can take many different forms, from online courses complete with videos and assessments to short, self-administered learning modules. It can also be used in combination with other teaching methods, such as face-to-face classes or tutorials [4]. E-learning has proven to be an effective tool for distance learning and online training, allowing students to access educational resources from anywhere and at any time, which makes education more accessible and convenient.
Even though e-learning has many advantages, it also presents some challenges and problems. For example, some e-learning sessions are carried out asynchronously, which means that students may feel isolated by not having the social interaction of a traditional classroom [5]. This can affect students’ motivation and commitment. In addition, e-learning requires a structured plan that fosters the motivation and discipline students need to remain committed to the learning process, and some students may have difficulty staying motivated and focused on online content. There are also technical problems, such as Internet-connection, software, or hardware failures, that can affect access to educational resources and, therefore, the learning process. In some cases, students may find it difficult to receive immediate feedback from their teachers and peers, which can hinder the learning process and the improvement of their skills [6].
Works like [7] mention that there are different methods and tools to measure student concentration in e-learning. For example, quizzes are common tools used to measure student concentration and engagement in e-learning [8]. These may include questions that measure student attention, level of engagement, and interest in the content. In addition, some systems use gaze tracking software to monitor student eye movement as they interact with online course content [9]. These systems provide information on student attention and concentration at different times of the course. Other systems use biometric sensors to measure a student’s physiological activity, such as heart rate, brain activity, and sweating, to determine the student’s level of concentration and engagement. There are also methods included in learning management systems (LMS) that integrate online interaction analysis tools; the data they collect, such as the frequency and type of interactions in discussion forums or online chats, can provide information about the level of participation and commitment of the students in the course.
In this work, a blink counting system is proposed to determine the level of concentration, relating the resulting statistics to similar works that place the average blink rate of a person between 8 and 21 blinks per minute [10]. However, when a person is deeply focused on a specific visual task, the blink rate drops significantly, to an average of 4.5 blinks per minute [11,12], and it rises to more than 32.5 blinks per minute when the individual’s concentration level is low. In addition, this system integrates an algorithm for the classification of emotions through the identification of the gestures that the student generates during an academic activity. The study [13] explores how the emotional state of students varies during the learning process and how emotional feedback can improve learning experiences. Emotional factors such as happiness, boredom, surprise, and neutrality denote a positive, constructive learning experience, while emotions such as sadness, fear, anger, and disgust represent a negative experience [14,15].
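As a rough illustration of how these published blink rates could drive a rule-based classifier, the following sketch maps a measured rate to a concentration label. The function name and the exact boundaries are chosen from the figures cited above, not taken from the paper’s code:

```python
def classify_concentration(blinks_per_minute: float) -> str:
    """Map a blink rate to a coarse concentration state.

    Cut-offs are illustrative, based on the rates cited above:
    ~4.5 blinks/min under deep focus, 8-21 at rest, and >32.5
    when the concentration level is low.
    """
    if blinks_per_minute <= 4.5:
        return "concentrated"
    if blinks_per_minute <= 21.0:
        return "baseline attention"
    return "low concentration"

print(classify_concentration(4.0))   # concentrated
print(classify_concentration(35.0))  # low concentration
```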
The design of systems that use artificial intelligence (AI) to identify the level of concentration of e-learning students has garnered great interest and motivation on the part of researchers in the educational sector. First, one of the main goals of online education is to make sure that students are fully engaged and focused during the learning process. Inattention or distraction can negatively affect student performance and ultimately affect their ability to learn and retain information. Therefore, the use of AI systems to measure student attention can help ensure that students are fully engaged in the learning process. Second, the development of systems that use AI to measure the concentration level of students through blink counts and emotion identifications helps to improve the quality of online education [16]. The information collected by these systems helps educators identify when students are distracted or having difficulty concentrating. Armed with this information, tutors can tailor their teaching approach and provide additional resources to help students stay focused and engaged during the learning process.
Third, the use of AI systems to measure student attention also helps improve the efficiency of online learning. By providing alerts when students are distracted or inactive, AI systems help students refocus and become more productive in their study time. This allows students to complete their work more efficiently and improve their ability to retain information. Finally, the motivation to develop systems that use AI to measure student attention is related to competitiveness in the online education market. With the growing popularity of this educational model, institutions are constantly looking for ways to improve the quality and efficiency of the learning process to remain competitive in the market.

2. Materials and Methods

The method consists of two components to define the concentration level of students in e-learning. In the first stage, a blink-counting algorithm is developed and used to measure the frequency and duration of eye blinks [17]. Systems with this capability use sensors to detect blinks and then record the information in a database. Blink frequency measurement is useful for gauging a person’s attention and concentration in different situations, including e-learning. Studies have shown that blink frequency can decrease when a person is highly focused on a task [11,18]. Blink counting systems use different detection techniques, such as cameras, electro-oculography sensors, and motion sensors. These systems are portable and non-invasive, making them easy to use in a variety of situations. The results obtained with a blink counting system should be evaluated in conjunction with other measures, such as questionnaires, analysis of online interactions, and identification of emotions, to obtain a more complete understanding of the students’ level of concentration and attention [12].
In the second component, an algorithm is designed to identify student emotions in an e-learning environment. An algorithm for emotion classification uses different techniques, such as the detection of facial expressions, the measurement of electrical activity in the brain, and the detection of physiological changes such as heart rate and respiration [19]. These systems use algorithms and machine learning models to identify and classify a person’s emotions into different categories, such as happiness, sadness, fear, anger, and surprise. In the context of e-learning, emotion rating systems can be useful for measuring students’ emotional responses to different course elements, such as learning materials and interactions with other students and teachers [20]. This information is used to personalize online content and interactions and enhance the learning experience for students.
In this work, the use of the students’ images and the objective of the study were disclosed to the participants; therefore, informed consent was obtained from everyone taking part. The design of the AI-based student concentration level identification system involves several technical considerations, and a minimal skeleton of these stages is sketched after the list:
  • Signal processing: the data collected by sensors or cameras are processed with signal processing techniques to extract relevant features that can indicate the student’s level of concentration;
  • Machine learning: machine learning techniques are used to train models that classify the student’s concentration level based on the extracted features;
  • Integration of software and hardware: both must work together for the system to operate effectively;
  • Performance evaluation and improvement: the system must be continuously evaluated to measure its accuracy, and improvements must be made when necessary;
  • Privacy and security: appropriate measures must be taken to ensure that data are secure and that student privacy is respected;
  • Real-time implementation: the system must be able to process the data and classify the student’s concentration level in real time so that it can be used in an educational environment.
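A minimal sketch of how these components could fit together, assuming nothing about the paper’s actual implementation; every function and field name below is illustrative:

```python
from dataclasses import dataclass
import hashlib

@dataclass
class Features:
    blink_rate: float   # blinks per minute, from the camera stage
    emotion: str        # label from the emotion classifier

def extract_features(frame) -> Features:
    """Signal-processing stage: turn raw camera data into features."""
    return Features(blink_rate=4.8, emotion="concentration")  # placeholders

def classify(features: Features) -> str:
    """Machine-learning stage: map features to a concentration label."""
    focused = features.blink_rate <= 5 and features.emotion == "concentration"
    return "concentrated" if focused else "needs attention"

def anonymize(student_id: str) -> str:
    """Privacy/security stage: store a pseudonym, not the raw identity."""
    return hashlib.sha256(student_id.encode()).hexdigest()[:12]

# One iteration of the real-time loop:
print(anonymize("student-42"), classify(extract_features(frame=None)))
```

In a production system, `extract_features` would wrap the blink counter and emotion classifier described in Sections 2.4 and 2.5, and the pseudonymized identifier would be the only key stored with the readings.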

2.1. Review of Related Works

This paper proposes the use of AI algorithms to identify the level of concentration in university students; AI techniques have the potential to improve education and academic performance. However, works such as [11,12] stress that any AI application must be carefully designed and developed to guarantee that it is accurate, fair, safe, and ethical. Furthermore, according to [21], there must be transparency about how student data are collected and used so that users can trust the tool and feel comfortable using it.
Online education or e-learning has gained popularity in recent years due to its accessibility, flexibility, and convenience for students. However, one of the most important concerns for educators is how to ensure students are fully focused and engaged during online learning. One solution that has been explored is the use of AI systems to monitor and measure the concentration level of students. Currently, several research papers have addressed the issue of the use of AI systems to identify the level of concentration of students in e-learning. An example of this is the study carried out by [22], where they developed an AI model to measure students’ attention using their device’s webcam. The model was based on tracking the student’s gaze and head movement to determine their level of concentration. The study yielded positive results, showing that the model was able to accurately identify the level of attention of the students.
Another interesting study was conducted by [23], where a machine learning-based AI system was used to analyze mouse cursor behavior and student keystrokes. The system was able to detect when students were distracted or inactive and provide alerts to help keep their attention during learning. In addition, the work of [24] focused on the use of physiological sensors to measure the attention level of students. Electroencephalography (EEG) sensors were used to measure the students’ brain activity, and skin conductance sensors measured their emotional responses. The study demonstrated that physiological data can be used to assess students’ attention and emotions during online learning.
Among the problems identified in e-learning, several studies emphasize the identification of the concentration level of students in this modality of study. In the results and conclusions of these works, it is determined that the levels of concentration are affected by various factors such as the design of the course, the technology used, and the level of student participation, among others. However, other related work suggests that students’ concentration in e-learning can be affected by factors such as the amount of social interaction, the level of difficulty of the content, the amount of material to be covered, and the quality of feedback. For example, a study published in the journal Computers & Education [25] found that the amount of social interaction, such as online communication with classmates and teachers, can improve students’ concentration and performance in e-learning. Another study published in the Journal of Educational Psychology [26] suggests that the complexity and level of difficulty of content can affect students’ concentration. Additionally, other studies have found that information overload can affect student concentration in e-learning. Therefore, it is recommended that online courses be designed with an adequate amount of information, organized into manageable modules, and allow frequent feedback to maintain students’ attention and motivation.
As noted in the Introduction, works like [27] mention that there are different methods and tools to measure student concentration in e-learning: quizzes that probe attention, engagement, and interest in the content; gaze tracking software that monitors eye movement during interaction with course content [28]; biometric sensors that measure physiological activity such as heart rate, brain activity, and sweating; and interaction analysis tools integrated into LMS, whose data on the frequency and type of interactions in discussion forums or online chats indicate the level of participation and commitment of the students in the course.
While these studies show promising results, it is important to note that the use of AI systems to measure students’ attention levels has also raised ethical concerns. Some argue that the constant monitoring of students can violate their privacy and create an environment of mistrust. Therefore, AI systems must be implemented with transparency, and the rights of students must be respected.

2.2. Identification of the Environment and the Population

This work is carried out with the participation of students from a university in Ecuador. The sample comprises all the students enrolled in the 2020 cohort at the Faculty of Administrative Sciences, a total of 229 students. Since the volume of students is not high, the entire cohort is considered; by including the entire population, a more precise result is sought. This research is applied to subjects directly related to the use of ICT; therefore, the data included are those obtained in the subject Office Automation II, in which the use of computing devices is a priority for the development of different activities. In the structure of the subject, synchronous and autonomous activities have been defined, during which the students must work with the algorithms that measure their concentration.
The synchronous activities are guided by the tutor, and their development requires personal computing equipment such as computers, cameras, and tablets. The activities are varied, from reading articles to developing practical exercises. Autonomous activities, like synchronous ones, depend on the use of a computer and include readings, research, the development of mind maps, exercises, etc. The course where the system is implemented consists of three 60-minute sessions per week, one of which is synchronous.

2.3. Method

This study makes use of two parameters to calculate the attention level of the e-learning student. The ability to concentrate is calculated from the blink rate and the facial expression, and this calculation is updated continuously every 5 s. Instead of a sequential run, all the models needed to calculate the concentration level are run in parallel once the online class starts. This is achieved by using multi-threading in all functions, which plays an important role in reducing the time consumption of each model as well as of the whole system [29]. Every 5 s, the model generates the attention span score and provides real-time feedback to students in the form of live graphs plotted for each parameter, together with the calculated attention span score [30,31]. The general architecture of the proposed system is shown in Figure 1; it is composed of several stages that focus on image processing, classification, and categorization.
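The paper does not publish its threading code, but the 5 s parallel update cycle it describes can be reproduced with Python’s standard threading module; the helper below is a sketch in which the two worker tasks are placeholders:

```python
import threading
import time

def run_periodically(task, interval_s=5.0):
    """Run `task` every `interval_s` seconds on a daemon thread and
    return an Event that stops the loop when set."""
    stop = threading.Event()

    def loop():
        while not stop.is_set():
            task()
            stop.wait(interval_s)   # sleep, but wake early if stopped

    threading.Thread(target=loop, daemon=True).start()
    return stop

# One worker per model, all started when the online class begins.
stop_blink = run_periodically(lambda: print("blink-rate score updated"))
stop_emotion = run_periodically(lambda: print("emotion score updated"))

time.sleep(11)          # let each worker produce a couple of updates
stop_blink.set()
stop_emotion.set()
```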

2.4. Design and Development of the Blink Count Algorithm

The blinking of the human eye has long been an object of study for psychologists, psychiatrists, ophthalmologists, neurophysiologists, etc., due to the numerous applications attributed to it. Blinking can be used as a criterion for the diagnosis of certain medical conditions or to determine the level of concentration of a person carrying out an activity on a computing device. There is a lot of variation when it comes to blinking, and several studies mention that its pattern differs depending on the task performed and the conditions under which it is performed. Thus, when a person performs tasks with a high degree of concentration, the number of blinks is reduced. A fundamental requirement in the design of an image processing algorithm is that it can detect whether a specific object is present in an area [32]. Figure 2 presents the phases of the blink count and represents the proposed blink-counting algorithm. Its initial step is to establish whether there is a face within an image; once a face has been identified, the algorithm must detect the desired area, in this case the person’s eyes, for a subsequent count of blinks during a defined period.
Several computer vision techniques and libraries automatically detect blinks in a video sequence or stream. These techniques are based on an estimate of movement in the eye region; the face and eyes are detected by a Viola–Jones-type detector [33]. Movement in the eye area is then estimated from the optical flow, either by sparse tracking or by applying frame-to-frame intensity differences with adaptive thresholding. This process makes it possible to identify whether the eyes are covered by the eyelids or not.
For blink detection, the regions containing the pairs of eyes are cropped, and each eye is divided into two halves. The eye aspect ratio (EAR) is then calculated for each frame from the Euclidean distances shown in Figure 3, according to Equation (1), to identify whether the eyes are open or closed [34]. A countdown timer has also been incorporated into the algorithm design. It is activated once a blink is detected and keeps track of the number of seconds the eyes remain closed. The purpose of this event is to conclude that the user has entered a sleepy state (loss of attention) when the eyes are detected to be closed for more than two seconds. To calculate the blink frequency, the number of blinks is taken continuously at an interval of 5 s to determine the user’s average blink rate. The EAR threshold value is set to 0.2 based on the experiments performed.
$$\mathrm{EAR} = \frac{\lVert p_2 - p_6 \rVert + \lVert p_3 - p_5 \rVert}{2\lVert p_1 - p_4 \rVert} \tag{1}$$
In the equation, p1 through p6 are the landmark locations shown in Figure 3. Since blinking is performed by both eyes synchronously, the EAR of the two eyes is averaged to determine whether there is a complete blink.
The algorithm proposed for the blink counter is developed in Python. Several libraries are applied, among which MediaPipe Face Mesh stands out; when a face is detected, it provides 468 reference points distributed over the person’s face [35,36]. Of all the reference points, 12 are used to detect the eyes, six for the left eye and six for the right eye, as shown in Figure 3.
Generally, a low EAR value does not necessarily mean that a person is blinking. A low EAR value can occur when a subject intentionally closes their eyes for a long time, makes a facial expression, yawns, etc., or when the EAR captures a brief random fluctuation of the reference points. In these cases, it is possible to use a classifier that takes as input a time window larger than one frame. Figure 4 shows an example of an EAR signal from a video sequence in which the student is wearing a mask and glasses. Nevertheless, the algorithm easily detects the student’s eyes through the reference points and counts a blink, as shown by the fluctuation in the graph.
The algorithm uses the MediaPipe, OpenCV, NumPy, and Matplotlib libraries; these libraries handle face detection and blink counting. Several functions are created around them in which the detection of eye coordinates is declared using the reference points detected on the face [37,38]. The drawing_output function colors the eye area and displays the results, as can be seen in the previous figure. The eye_aspect_ratio function calculates the three distances shown in Figure 5 and returns the result of the EAR equation.
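Pulling these pieces together, the following condensed sketch implements the EAR computation and the blink/drowsiness logic described above. The MediaPipe landmark indices for the two eyes are the ones commonly used with Face Mesh; the paper does not list its exact indices, so treat them, and the overall structure, as an assumption:

```python
import cv2
import mediapipe as mp
import numpy as np

# Six landmarks per eye (p1..p6 in Figure 3). These index orderings are
# commonly used with MediaPipe Face Mesh; the paper does not list its
# exact indices, so they are an assumption.
LEFT_EYE = [362, 385, 387, 263, 373, 380]
RIGHT_EYE = [33, 160, 158, 133, 153, 144]
EAR_THRESHOLD = 0.2       # threshold reported in Section 2.4
DROWSY_SECONDS = 2.0      # eyes closed longer than this => sleepy state

def eye_aspect_ratio(p):
    """Equation (1): EAR = (|p2-p6| + |p3-p5|) / (2 |p1-p4|)."""
    vertical = np.linalg.norm(p[1] - p[5]) + np.linalg.norm(p[2] - p[4])
    return vertical / (2.0 * np.linalg.norm(p[0] - p[3]))

face_mesh = mp.solutions.face_mesh.FaceMesh(max_num_faces=1, refine_landmarks=True)
cap = cv2.VideoCapture(0)
blinks, closed, closed_at = 0, False, 0.0

while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    h, w = frame.shape[:2]
    result = face_mesh.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
    if result.multi_face_landmarks:
        lm = result.multi_face_landmarks[0].landmark
        pts = lambda idx: np.array([(lm[i].x * w, lm[i].y * h) for i in idx])
        # Blinking is synchronous, so the two EARs are averaged (Section 2.4).
        ear = (eye_aspect_ratio(pts(LEFT_EYE)) + eye_aspect_ratio(pts(RIGHT_EYE))) / 2
        now = cv2.getTickCount() / cv2.getTickFrequency()
        if ear < EAR_THRESHOLD and not closed:
            closed, closed_at = True, now
        elif ear >= EAR_THRESHOLD and closed:
            closed = False
            blinks += 1
        if closed and now - closed_at > DROWSY_SECONDS:
            print("sleepy state detected")   # eyes closed for > 2 s
cap.release()
```

The 5 s blink-rate window and the Matplotlib live plot of Figure 4 can be layered on top of this loop.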

2.5. Design and Development of the Algorithm for the Recognition of Emotions

For the development of the algorithm, three fundamental bases are considered that support the identification of students’ emotions through the gestures their faces generate in a didactic environment, as represented in Figure 6: image databases, affective computing, and AI-based emotion recognition systems.

2.5.1. Image Database and Algorithm Training

The image base in the design of the gesture and emotion recognition algorithm refers to the collection of data used to train the recognition model. The algorithm uses deep learning techniques, such as convolutional neural networks (CNN), to train the gesture recognition model. As the model learns to recognize patterns in images, it can identify gestures in new images it has not seen before. The accuracy of the model largely depends on the quality and diversity of the image base used to train it [39,40]. In Python, different libraries and tools can be used to create and manipulate image bases for a gesture recognition algorithm. For the development of the algorithm, a practical comparison of libraries such as OpenCV, TensorFlow, and Keras was carried out; of these, TensorFlow was chosen, since it offered better characteristics with respect to the available hardware.
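As a sketch of the kind of CNN the paper describes training with TensorFlow, the following compiles a small classifier for the seven emotion classes used later in Section 3. The layer sizes and the assumed 48x48 grayscale input (the format of common facial-expression datasets such as FER-2013) are illustrative, not the paper’s architecture:

```python
import tensorflow as tf

NUM_EMOTIONS = 7  # anger, disgust, fear, happiness, sadness, surprise, neutral

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(48, 48, 1)),      # grayscale face crops
    tf.keras.layers.Conv2D(32, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dropout(0.5),                  # regularization
    tf.keras.layers.Dense(NUM_EMOTIONS, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
# Training would then use the labeled image base:
# model.fit(train_images, train_labels, validation_split=0.1, epochs=20)
```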
Affective computing arises from the need to provide computing equipment with a certain capacity to interact with people. This task is carried out using artificial vision techniques and machine learning algorithms; the objective of human–machine interaction is for the system to be capable of producing an affective response in people [20]. According to [21,22], affective computing is subdivided into four research areas, as follows:
  • The analysis and characterization of affective states that identify through natural interactions the relationships between effect and cognitive processes in learning;
  • Automatic recognition of affective states by analyzing facial expressions and extracting features from linguistic expressions, posture, gaze tracking, and heart rate, among others;
  • The adequacy of the systems to respond to a particular affective state of the users;
  • The design of avatars that show appropriate affective states for better interaction with the user.
As for emotions, these are classified into two groups: primary or basic, and secondary or alternative. In [41], six basic emotions were identified (anger, disgust, fear, happiness, sadness, and surprise), together with the gestures that appear on the face, as shown in Figure 7. Secondary or alternative emotions are complex emotions that appear after primary emotions and depend on the situation and context of the person. For example, a person who is afraid (primary emotion) can turn that fear into anger or rage (secondary emotion), provoking an aggressive reaction.
Facial expression analysis is applied in different areas of interest, such as education, video games, and telecommunications, to name a few. In addition, it is one of the most used in human–machine interactions. Facial expression recognition is an intelligent system that identifies a person’s face and, from it, obtains certain characteristics that it analyzes and processes to know the affective state of the person [42].

2.5.2. Face Detection, Gesture Identification, and Emotion Classification

Face recognition depends on the four steps shown in Figure 8. The first step is to detect the faces in an image by applying the histogram of oriented gradients (HOG) algorithm. In the second step, the facial landmark estimation algorithm identifies 68 landmarks on each face. In the third step, 128 measurements are created for each face through deep learning, corresponding to the unique characteristics of that face; finally, with the unique characteristics of each face, the person is identified.
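These four steps match the pipeline exposed by the dlib-based face_recognition library (HOG detection, landmark alignment, 128-dimensional encodings, comparison), so a sketch under that assumption looks as follows; the image file names are placeholders:

```python
import face_recognition  # dlib-based library matching the four steps above

# Steps 1-3: HOG face detection, landmark alignment, 128-d encoding.
known = face_recognition.load_image_file("enrolled_student.jpg")   # placeholder path
known_encoding = face_recognition.face_encodings(known)[0]

frame = face_recognition.load_image_file("webcam_frame.jpg")       # placeholder path
locations = face_recognition.face_locations(frame, model="hog")
encodings = face_recognition.face_encodings(frame, locations)

# Step 4: identify the person by comparing the 128 measurements.
for enc in encodings:
    match = face_recognition.compare_faces([known_encoding], enc, tolerance=0.6)[0]
    print("student identified" if match else "unknown face")
```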
Figure 9 shows the stages the system performs to correctly identify the emotion. The initial stage validates that the image received by the recognizer contains a face; if the algorithm does not find one, it discards the image. Next, a gray filter is applied to remove the color channels before detecting the important parts of the face, such as the nose, eyes, eyebrows, and mouth. In the next stage, facial points are marked on the detected parts, and an initial reference point is placed in the center of the nose to locate the various facial points. Geometric calculations of the distance between the initial reference point and each facial point detected on the face are then performed. The result of these calculations is a matrix of facial features that is processed, together with its emotion label, by the support vector algorithm so that it learns to classify facial expressions. Finally, further facial feature vectors are sent to the trained classifier to test whether the algorithm has learned to classify gestures and recognize emotions.
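A minimal sketch of the geometric-feature and support-vector stages, using scikit-learn and randomly generated stand-in landmarks; the reference-point index and all data here are illustrative, not the paper’s:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

def distance_features(landmarks: np.ndarray, ref: np.ndarray) -> np.ndarray:
    """Euclidean distance from the nose reference point to every detected
    facial point, as in the geometric-calculation stage described above."""
    return np.linalg.norm(landmarks - ref, axis=1)

# Stand-in data: 200 faces, 68 (x, y) landmarks each, 7 emotion labels.
rng = np.random.default_rng(0)
landmark_sets = rng.random((200, 68, 2))
labels = rng.integers(0, 7, size=200)

# Index 30 is commonly the nose tip in the 68-point scheme (an assumption).
X = np.array([distance_features(lm, lm[30]) for lm in landmark_sets])
X_train, X_test, y_train, y_test = train_test_split(X, labels, random_state=0)

clf = SVC(kernel="rbf")   # support vector classifier with emotion labels
clf.fit(X_train, y_train)
print("held-out accuracy:", clf.score(X_test, y_test))
```

With random features the accuracy hovers near chance; with real landmark geometry, the classifier learns the expression-to-emotion mapping.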

3. Results

The values obtained for the parameters measured by the developed algorithms are presented below. These parameters correspond to blink rate detection and emotion classification, and their values are normalized to calculate the attention score according to Equation (2). Figure 10 shows the live graphs of the EAR with the student’s expected attention level updated in real time; it reflects the variation of the parameters measured with the designed algorithms. Figure 10a shows the blink frequency detection; Figure 10b, the emotion classification; and Figure 10c, the attention level. Facial recognition data are not presented because they do not contribute to determining the student’s level of attention; this parameter is used by the system to offer personalized treatment to students.
$$\mathrm{Att} = \frac{\sum_{i=1}^{n} \mathrm{score}_i}{n} \times 100 \tag{2}$$
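Read as the mean of the n normalized parameter scores expressed as a percentage, Equation (2) is a one-liner; this reading of the equation is an interpretation, since only the normalization step is described in the text:

```python
def attention_score(scores):
    """Equation (2), as interpreted here: average the normalized
    per-parameter scores (each assumed to lie in [0, 1]) and scale to %."""
    return sum(scores) / len(scores) * 100.0

# e.g. a normalized blink-rate score of 0.9 and an emotion score of 0.7:
print(attention_score([0.9, 0.7]))  # 80.0
```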
To determine the performance of the system, a data set of 45 students from the 2020 cohort was analyzed. Although the entire cohort is under analysis, two parallel groups taking the subject Office Automation II were established as the sample. The groups evaluated are composed of 27 women and 18 men. For the evaluation, students were asked to attend online tutorials on specific topics of the subject. The tutorials lasted 45 min, divided into three sections of 15 min (900 s) each. During these sessions, students had to turn on their web cameras and enable the blink frequency counting and emotion identification applications. The measurements are made online, and the data are stored in the cloud, where they are consumed by experts to determine the level of concentration per session. The mechanics of the tutorials comprise three activities: the tutor’s explanation of a topic, a reading activity (especially of scientific articles), and a practical activity related to the topic developed. The sections within the session do not follow a fixed order, so that students cannot anticipate them and bias the measurement.
Table 1 presents the results obtained from the evaluation; at this stage, the aim is to identify the number of students who are concentrated, distracted, and sleepy during the activities of a session. According to the results obtained during the activity developed by the tutor (a class on a specific subject), 27 students are within the concentration range; that is, their blink rate is between four and five per minute. Within this same activity, 13 students were distracted, their blink rate being greater than eight per minute. In this group, five students were found to be in a sleepy state; for this identification, the algorithm measures the time the eyelids remain closed, and if this time exceeds two seconds, it establishes that the student has entered a state of sleepiness.
In the second activity, the reading of an article on a specific theme of the subject was established. This activity lasted 15 min; at the end of this period, it was found that 19 students were concentrated during the reading, 18 had stages of distraction, and 8 presented drowsiness. These results indicate that this type of activity is usually not very effective during an online tutorial. In the third section of the tutorial, students were asked to develop a practical activity; the results reflect high concentration in most of the students: 37 of them, or 82% of the group. Only six students presented a distracted attitude toward the activity, and two of them were in a state of drowsiness.
Table 2 presents the results of the emotion classification algorithm. The support vector machine (SVM) algorithm is used to classify students’ emotions into seven different classes: anger, disgust, fear, happiness, sadness, surprise, and neutral. Each emotion is given a score based on its effect on the user’s level of attention. The designed algorithm considers four classes of emotions, concentration, boredom, surprise, and neutral, these being the states identified within the classroom. However, these emotions are grounded in those supported by the SVM and the six universal emotions [43]. According to the results of the tutoring activity, 13 concentrated students were identified, 7 were bored, 15 were surprised, and 10 showed neutral emotions. Compared with the blink rate results, the emotions of surprise and neutrality bear a certain relationship to distraction and drowsiness. For example, a student can demonstrate interest in the topic during the activity by showing joy or a neutral gesture; similarly, a bored person can generate gestures that the algorithm detects as neutral emotions. This relationship is similar in the following activities; in the development of the practical exercise, the relationship between the algorithms is even more noticeable.
The acceptable limits of quality metrics vary depending on the specific application. Quality metrics are tools for assessing the accuracy of models and predictions; they are used to compare different models or to determine whether a model is accurate enough for its application. Common quality metrics include the root mean square error (RMSE), the mean absolute error (MAE), the coefficient of determination (R2), and the mean absolute percentage error (MAPE). These metrics are presented in Table 3, with the values corresponding to the performance of the system. By relating the results obtained from each algorithm and the EAR, it is possible to evaluate the general performance of the system.
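All four metrics in Table 3 can be computed with scikit-learn; the predicted and observed concentration levels below are hypothetical values for illustration only:

```python
import numpy as np
from sklearn.metrics import (mean_absolute_error, mean_absolute_percentage_error,
                             mean_squared_error, r2_score)

# Hypothetical observed vs. predicted concentration levels.
y_true = np.array([72.0, 85.0, 64.0, 90.0, 55.0])
y_pred = np.array([68.0, 80.0, 70.0, 95.0, 60.0])

rmse = mean_squared_error(y_true, y_pred) ** 0.5
mae = mean_absolute_error(y_true, y_pred)
r2 = r2_score(y_true, y_pred)
mape = mean_absolute_percentage_error(y_true, y_pred) * 100  # as a percentage

print(f"RMSE={rmse:.3f}  MAE={mae:.3f}  R2={r2:.3f}  MAPE={mape:.1f}%")
```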
According to the results obtained, the system’s output reflects the actual state of the students in the educational environment. The overall precision of the system for identifying the level of concentration was obtained as the average of the precisions of each module. Compiling the OpenCV DNN module with Deepface support improved performance and significantly reduced the inference time of the algorithms, as shown in Table 4. The overall accuracy achieved was 92.16% (Table 4).

4. Discussion

According to the results obtained, it was identified that the system, when using AI libraries to recognize the students’ level of concentration, must process a large volume of data to guarantee its effectiveness. The training and test data sets, which contain information on the characteristics of emotions and on how often a person blinks while in a state of concentration, therefore require a wide variety of images covering different states, environments, and settings. With a data set that allows the model to be adjusted and its performance evaluated, a result adequate for use in education is ensured. For this, unlike other works, several quality metrics were applied over a test set before the system moved to a production stage. The results of the quality metrics obtained on the test set are presented below:
  • RMSE: 11.351
  • MAE: 10.924
  • R2: 0.803
  • MAPE: 15%
Now, to determine the acceptable limits of these quality metrics, it is necessary to consider the context of the problem and the accuracy expectations of the model. For example, in testing, the model is expected to have acceptable accuracy if it can predict a student’s concentration level with an average error of +/−15.000. In this case, the RMSE value of 11.351 is below this expectation, so the model can be considered acceptable in terms of precision. However, the MAE value of 10.924 shows a sizeable average deviation in the predictions, and adjustments may be necessary to improve accuracy. The R2 value of 0.803 indicates that the model explains 80% of the variability in the concentration levels demonstrated by a student, which would be considered acceptable in many cases. Meanwhile, the MAPE value of 15% indicates an average percentage error of 15% in the predictions, which may or may not be acceptable depending on the context and expectations of the problem.
It is important to consider that concentration in learning is a subjective experience, with great individual variability in what is considered “concentration”. Therefore, any AI algorithm used for concentration level identification should be designed to account for this individual variability and should not rely solely on technology to assess student performance [44]. Other works mention potential advantages of using AI tools in e-learning, such as the personalization of learning, which can help maximize each student’s learning and ensure that the material is adjusted to their individual needs. Immediate feedback is also improved, since errors can be identified and corrected at once, helping students improve their understanding and retain information better; at the same time, automating many tasks saves time and costs for teachers and educational institutions.
The work carried out combines computer vision and AI techniques to perform a blink count and measure a person’s concentration [39,40]. It builds on similar studies which identified that a focused person blinks less than a distracted or bored one. Therefore, if a person’s number of blinks per minute can be measured, their level of concentration can be inferred. One advantage of blink counting with the developed algorithm is that it is a simple and non-invasive way to measure a person’s concentration, but it has limitations and should not be considered a precise and universal measure of concentration [10]. To overcome this limitation, a second algorithm that classifies students’ emotions during certain activities is used, so that multiple factors are evaluated to obtain a complete picture of a person’s concentration.
The identification of emotions using an AI algorithm is a useful tool to measure the concentration level of students in an e-learning model [42,45]. Emotion and concentration are closely related, and a person who is excited about or interested in a subject is more likely to be focused on it. Therefore, if a student’s emotion can be measured, their level of concentration can be inferred from it [42,43]. AI algorithms need large amounts of data to learn to accurately identify emotions; therefore, pretrained Python libraries are used. With this, the system improves its accuracy and guarantees the results obtained [44].
Compared to related works, the system developed using Python with AI libraries has several advantages. First, Python is a popular programming language in the AI community due to its ability to handle large data sets and its wide selection of AI libraries. AI libraries such as TensorFlow, Keras, and Scikit-Learn provide powerful tools for data processing and predictive modeling, allowing complex and accurate models to be built. Second, the developed system allows greater automation of the data analysis process, which reduces the time and costs needed to perform this type of analysis. Third, the proposed system is highly customizable and adaptable to the specific needs of the educational institution or the online learning program. Machine learning algorithms can be trained to recognize specific patterns of student behavior and adapt to different student needs. In addition, the developed system can be integrated with other online learning management systems through Python and its libraries, which allows a more integrated and complete analysis of student behavior [33].
However, there are also some limitations identified in the development of the proposed system; these are focused on the use of Python with AI libraries for the identification of the concentration level of e-learning students. First, the use of sensors or cameras to capture student data may raise privacy and security concerns. Additionally, the accuracy of AI models can be affected by factors such as the quality of the input data, feature selection, and the choice of the machine learning algorithm [46]. Finally, the developed system has several advantages compared to related works on the identification of the concentration level of e-learning students. The use of AI libraries allows for greater automation, customization, and adaptability to the specific needs of students and educational institutions. However, it is important to keep in mind the limitations in terms of privacy and data security, as well as the accuracy of the AI models.

5. Conclusions

Blink counting can be a useful technique for determining a student’s concentration level, as there is a correlation between attention and blink pattern. When a student is more focused, they generally blink less frequently, and their blinks are longer. However, the blink count only provides a rough estimate of the concentration level and is not a precise measurement. The blink pattern can be affected by factors such as eye strain, ambient lighting, the position of the student’s head, and individual blink habits. Additionally, blink counting can be difficult to perform in practical situations, such as in an e-learning environment where the student may be in different positions or moving around. Therefore, it is necessary to consider a methodological update in the way knowledge is provided. In the proposed environment, three activities have been established that can be considered a guideline to follow in the pedagogical development of a class.
On the other hand, the use of techniques for the identification of emotions using AI algorithms is useful to determine the level of concentration of a student in an e-learning environment. Emotions are closely related to attention and motivation and can indicate whether a student is interested in the learning material or is distracted or bored. However, it is important to note that emotion identification is an active field of research, and there are still challenges in accurately detecting emotions in different contexts and cultures. Additionally, emotion identification can be affected by the quality of the data set and the accuracy of the AI algorithm used. Therefore, although emotion identification can be a useful technique to determine a student’s concentration level in an e-learning environment, it should be used in combination with other monitoring and evaluation techniques to obtain a more accurate and complete picture of student performance.
This work makes use of AI as an integral part of education, considering that these tools are currently gaining ground in the educational sector. Additionally, the integration of AI into e-learning has the potential to significantly improve the quality of learning by providing students with a more personalized and adaptive learning experience. For example, AI algorithms can analyze the learning behavior of students, such as their progress in the course, their strengths and weaknesses, and their interaction patterns with course content. With this information, e-learning systems can offer students content recommendations and personalized learning activities that are tailored to their individual needs. In addition, technical limitations and the need for proper system design must be considered to ensure that AI is used effectively and efficiently in e-learning.
The designed concentration level identification system has demonstrated effectiveness in its usability and efficiency. Therefore, several recommendations can be made that may be useful when operating the system or for institutions designing a similar one. Among the most important, any information collected by the system should be used solely to improve the learning experience of students and should not be shared with third parties without their consent. The system should be validated through rigorous testing, to ensure that it works accurately and reliably before deploying it in a production environment. Likewise, it is important to provide feedback to students: the results of the system should be shared with them to help them improve their level of concentration and performance. Feedback can include advice on how to improve concentration and suggestions on how to adjust the learning environment to increase focus. Regarding the context of learning, it must be taken into account that the level of concentration of students can be affected by external factors, such as the learning environment or the level of stress; therefore, the learning context must be considered when interpreting the results of the system. Finally, it is a priority to evaluate the impact of the system, considering the performance and learning experience of the students.

Author Contributions

Conceptualization, W.V.-C. and A.M.-N.; methodology, W.V.-C.; software, J.G.-O.; validation, I.U.-C.; formal analysis, W.V.-C.; investigation, J.G.-O.; data curation, W.V.-C. and I.U.-C.; writing—original draft preparation, A.M.-N.; writing—review and editing, J.G.-O.; visualization, J.G.-O.; supervision, A.M.-N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Fuchs, K. The Difference Between Emergency Remote Teaching and E-Learning. Front. Educ. 2022, 7, 921332. [Google Scholar] [CrossRef]
  2. Hariyanto, D. The Design of Adaptive Learning System Based on the Collaboration of M-Learning and e-Learning Platform. J. Adv. Comput. Netw. 2014, 2, 311–314. [Google Scholar] [CrossRef] [Green Version]
  3. Chen, C.; Lee, H.; Chen, Y. Personalized E-Learning System Using Item Response Theory. Comput. Educ. 2005, 44, 237–255. [Google Scholar] [CrossRef]
  4. Villegas-Ch, W.; Luján-Mora, S. Systematic Review of Evidence on Data Mining Applied to LMS Platforms for Improving E-Learning. In Proceedings of the International Technology, Education and Development Conference, Valencia, Spain, 6–8 March 2017; Chova, L.G., Martinez, A.L., Torres, I., Eds.; EDULEARN: Palma de Mallorca, Spain, 2017; pp. 6537–6545. [Google Scholar]
  5. Rodriguez-Ascaso, A.; Boticario, J.G.; Finat, C.; Petrie, H. Setting Accessibility Preferences about Learning Objects within Adaptive Elearning Systems: User Experience and Organizational Aspects. Expert Syst. 2017, 34, e12187. [Google Scholar] [CrossRef]
  6. Zamzuri, Z.F.; Manaf, M.; Ahmad, A.; Yunus, Y. Computer Security Threats Towards the E-Learning. In Proceedings of the International Conference on Software Engineering and Computer Systems, Kuantan, Pahang, Malaysia, 27–29 June 2011; Springer: Berlin/Heidelberg, Germany, 2011; pp. 335–345. [Google Scholar]
  7. Lin, H.-M.; Wu, J.-Y.; Liang, J.-C.; Lee, Y.-H.; Huang, P.; Kwok, O.-M.; Tsai, C.-C. A Review of Using Multilevel Modeling in E-Learning Research. Comput. Educ. 2023, 198, 104762. [Google Scholar] [CrossRef]
  8. Lindgren, R.; Morphew, J.W.; Kang, J.; Planey, J.; Mestre, J.P. Learning and Transfer Effects of Embodied Simulations Targeting Crosscutting Concepts in Science. J. Educ. Psychol. 2022, 114, 462–481. [Google Scholar] [CrossRef]
  9. Ali, M.; Hussein, A.; Al-Chalabi, H.K.M. Pedagogical Agents in an Adaptive E-Learning System. SAR J.-Sci. Res. 2020, 3, 24–30. [Google Scholar] [CrossRef] [Green Version]
  10. Boumiza, S.; Souilem, D.; Bekiarski, A. Workflow Approach to Design Automatic Tutor in E-Learning Environment. In Proceedings of the International Conference on Control, Decision and Information Technologies, Saint Julian's, Malta, 6–8 April 2016; IEEE: Saint Julian’s, Malta, 2016; pp. 263–268. [Google Scholar]
  11. Lee, J.; Song, H.D.; Hong, A.J. Exploring Factors, and Indicators for Measuring Students’ Sustainable Engagement in e-Learning. Sustainablity 2019, 11, 985. [Google Scholar] [CrossRef] [Green Version]
  12. Heishman, R.; Duric, Z. Using Image Flow to Detect Eye Blinks in Color Videos. In Proceedings of the IEEE Workshop on Applications of Computer Vision, Austin, TX, USA, 21–22 February 2007; IEEE: Austin, TX, USA, 2007. [Google Scholar]
  13. Sofia Jennifer, J.; Sree Sharmila, T. Edge Based Eye-Blink Detection for Computer Vision Syndrome. In Proceedings of the International Conference on Computer, Communication, and Signal Processing: Special Focus on IoT, Chennai, India, 10–11 January 2017; IEEE: Chennai, India, 2017. [Google Scholar]
  14. Clavijo, G.L.R.; Patino, J.O.; Leon, D.M. Detection of Visual Fatigue by Analyzing the Blink Rate. In Proceedings of the 2015 20th Symposium on Signal Processing, Images and Computer Vision, STSIVA 2015-Conference Proceedings, Bogota, Colombia, 2–4 September 2015; IEEE: Bogota, Colombia, 2015. [Google Scholar]
  15. Zeng, H.; Shu, X.; Wang, Y.; Wang, Y.; Zhang, L.; Pong, T.C.; Qu, H. EmotionCues: Emotion-Oriented Visual Summarization of Classroom Videos. IEEE Trans. Vis. Comput. Graph. 2021, 27, 3168–3181. [Google Scholar] [CrossRef] [PubMed]
  16. Kim, Y.; Soyata, T.; Behnagh, R.F. Towards Emotionally Aware AI Smart Classroom: Current Issues and Directions for Engineering and Education. IEEE Access 2018, 6, 5308–5331. [Google Scholar] [CrossRef]
  17. Tzacheva, A.; Ranganathan, J.; Mylavarapu, S.Y. Actionable Pattern Discovery for Tweet Emotions. Adv. Intell. Syst. Comput. 2020, 965, 46–57. [Google Scholar] [CrossRef]
  18. Vijayalaxmi; Sudhakara Rao, P.; Sreehari, S. Neural Network Approach for Eye Detection. arXiv 2012, arXiv:1205.5097. [Google Scholar]
  19. Onners, J.; Alam, M.; Cichy, B.; Wymbs, N.; Lukos, J. U-EEG: A Deep Learning Autoencoder for the Detection of Ocular Artifact in EEG Signal. In Proceedings of the 2021 IEEE Signal Processing in Medicine and Biology Symposium, SPMB 2021-Proceedings, Philadelphia, PA, USA, 4 December 2021; IEEE: Philadelphia, PA, USA, 2021. [Google Scholar]
  20. Scherer, K.R.; Coutinho, E. How Music Creates Emotion: A Multifactorial Process Approach. Emot. Power Music 2013, 1, 121–145. [Google Scholar]
  21. Greene, G. Guidelines for Assessing and Minimizing Risks of Emotion Recognition Applications. In Proceedings of the 2021 9th International Conference on Affective Computing and Intelligent Interaction (ACII), Nara, Japan, 28 September–1 October 2021; IEEE: Nara, Japan, 2021. [Google Scholar] [CrossRef]
  22. Madureira, J.; Pereira, C.; Paciência, I.; Teixeira, J.P.; de Oliveira Fernandes, E. Identification and Levels of Airborne Fungi in Portuguese Primary Schools. J. Toxicol. Environ. Health-Part A Curr. Issues 2014, 77, 816–826. [Google Scholar] [CrossRef] [Green Version]
  23. Kobai, R.; Murakami, H. Effects of Interactions between Facial Expressions and Self-Focused Attention on Emotion. PLoS ONE 2021, 16, e0261666. [Google Scholar] [CrossRef]
  24. Alzbier, A.M.T.; Cheng, H. Real Time Tracking RGB Color Based Kinect. Mod. Appl. Sci. 2017, 11, 98. [Google Scholar] [CrossRef] [Green Version]
  25. Lai, H.Y.; Ke, H.Y.; Hsu, Y.C. Real-Time Hand Gesture Recognition System and Application. Sens. Mater. 2018, 30, 869. [Google Scholar] [CrossRef] [Green Version]
  26. Ismael, K.D.; Irina, S. Face Recognition Using Viola-Jones Depending on Python. Indones. J. Electr. Eng. Comput. Sci. 2020, 20, 1513–1521. [Google Scholar] [CrossRef]
  27. Huang, J.; Shang, Y.; Chen, H. Improved Viola-Jones Face Detection Algorithm Based on HoloLens. EURASIP J. Imag. Video Process 2019, 2019, 41. [Google Scholar] [CrossRef]
  28. Liu, F.; Kromer, P. Early Age Education on Artificial Intelligence: Methods and Tools. In Advances in Intelligent Systems and Computing; Springer: Cham, Switzerland, 2020; Volume 1156 AISC. [Google Scholar]
  29. Sciarrone, A.; Bisio, I.; Garibotto, C.; Lavagetto, F.; Hamedani, M.; Prada, V.; Schenone, A.; Boero, F.; Gambari, G.; Cereia, M.; et al. Early Detection of External Neurological Symptoms through a Wearable Smart-Glasses Prototype. J. Commun. Softw. Syst. 2021, 17, 160–168. [Google Scholar] [CrossRef]
  30. Uranishi, Y. OpenCV: Open Source Computer Vision Library. Kyokai Joho Imeji Zasshi/J. Inst. Image Inf. Telev. Eng. 2018, 72, 736–739. [Google Scholar] [CrossRef]
  31. Emami, S.; Suciu, V.P. Facial Recognition Using OpenCV. J. Mob. Embed. Distrib. Syst. 2012, 4, 1. [Google Scholar]
  32. Naveenkumar, M.; Ayyasamy, V. OpenCV for Computer Vision Applications. In Proceedings of the National Conference on Big Data and Cloud Computing (NCBDC’15), Trichy, India, 20 March 2016. [Google Scholar]
  33. Sigut, J.; Castro, M.; Arnay, R.; Sigut, M. OpenCV Basics: A Mobile Application to Support the Teaching of Computer Vision Concepts. IEEE Trans. Educ. 2020, 63, 328–335. [Google Scholar] [CrossRef]
  34. Zhu, Z.; Cheng, Y. Application of Attitude Tracking Algorithm for Face Recognition Based on OpenCV in the Intelligent Door Lock. Comput. Commun. 2020, 154, 390–397. [Google Scholar] [CrossRef]
  35. Kumar, Y.; Mahajan, M. Machine Learning Based Speech Emotions Recognition System. Int. J. Sci. Technol. Res. 2019, 8, 722–729. [Google Scholar]
  36. Drowsiness Detection Using Eye-Blink Frequency and Yawn Count for Driver Alert. Int. J. Innov. Technol. Explor. Eng. 2019, 9, 314–317. [CrossRef]
  37. Rosique, F.; Losilla, F.; Navarro, P.J. Using Artificial Vision for Measuring the Range of Motion. IEEE Lat. Am. Trans. 2021, 19, 1129–1136. [Google Scholar] [CrossRef]
  38. Serrano-Ramírez, T.; Lozano-Rincón, N.D.C.; Mandujano-Nava, A.; Sámano-Flores, Y.J. Artificial Vision System for Object Classification in Real Time Using Raspberry Pi and a Web Camera. Rev. Tecnol. Inf. Y Comun. 2021, 5, 20–25. [Google Scholar] [CrossRef]
  39. Ouyang, F.; Jiao, P. Artificial Intelligence in Education: The Three Paradigms. Comput. Educ. Artif. Intell. 2021, 2, 100020. [Google Scholar] [CrossRef]
  40. Xue, Y.; Wang, Y. Artificial Intelligence for Education and Teaching. Wirel. Commun. Mob. Comput. 2022, 2022, 1–10. [Google Scholar] [CrossRef]
  41. Tonguç, G.; Ozaydın Ozkara, B. Automatic Recognition of Student Emotions from Facial Expressions during a Lecture. Comput. Educ. 2020, 148, 103797. [Google Scholar] [CrossRef]
  42. NCT Emotion Recognition Training for Young People. 2015. Available online: https://clinicaltrials.gov/show/NCT02550379 (accessed on 16 February 2023).
  43. Hossain, M.S.; Muhammad, G. An Emotion Recognition System for Mobile Applications. IEEE Access 2017, 5, 2281–2287. [Google Scholar] [CrossRef]
  44. Wani, T.M.; Gunawan, T.S.; Qadri, S.A.A.; Kartiwi, M.; Ambikairajah, E. A Comprehensive Review of Speech Emotion Recognition Systems. IEEE Access 2021, 9, 47795–47814. [Google Scholar] [CrossRef]
  45. Mohammad, S. NRC Emotion Lexicon. Saif Mohammad. 2015. Available online: https://saifmohammad.com/WebPages/NRC-Emotion-Lexicon.htm (accessed on 16 February 2023).
  46. Scraping of Social Media Data Using Python-3 and Performing Data Analytics Using Microsoft Power Bi. Int. J. Eng. Sci. Res. Technol. 2020, 9. [CrossRef]
Figure 1. Proposed architecture for a concentration level identification system in an e-learning educational model.
Figure 2. Architecture for blink rate detection.
Figure 3. Eye measurements for eyelid movement identification.
Figure 4. Identification of the frequency of blinks in a video in real time.
Figure 5. Application of the eye aspect ratio function that determines the EAR calculation to identify the eye distances.
Figure 6. Flowchart of the architecture for the detection of emotions by means of computer vision.
Figure 7. Examples of recognizable gestures on people’s faces that demonstrate an emotion.
Figure 8. Detection phases applied in a convolutional neural network.
Figure 9. Architecture for gesture detection and emotion identification.
Figure 10. Graphs of the parameters measured with the designed algorithms: (a) detection of the blink frequency; (b) classification of emotion; (c) attention level.
Table 1. Identification of the level of concentration of students by counting the frequency of blinks.

                 Tutoring    Reading    Practical Exercise
Concentrated        27          19              37
Distracted          13          18               6
Sleepy               5           8               2
Table 2. Identification of the concentration level of students with the use of an emotion detection algorithm.

Emotion          Tutoring    Reading    Practical Exercise
Concentration       13          11              35
Boredom              7          13               3
Surprise            15          10               2
Neutral             10          11               5
Table 3. Values obtained for the system precision versus the limits established in the system tests.

Metric        RMSE      MAE       R2          MAPE
Value         11.351    10.924    0.803       15%
Boundaries    15.000    13.000    1 (100%)    20%
Table 4. Metrics obtained from the application of the concentration level identification system in an e-learning model.

Module                    Accuracy    Inference Time
Blink rate detection      92.54%      0.0293 ms
Emotion classification    89.11%      0.024 ms
Facial recognition        95.21%      0.061 ms
Overall system            92.16%      0.1143 ms