Next Article in Journal
Knowledge-Based Features for Speech Analysis and Classification: Pronunciation Diagnoses
Previous Article in Journal
Safety-Critical Control for Control Affine Systems under Spatio-Temporal and Input Constraints
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Learning Peer Recommendation Based on Weighted Heterogeneous Information Networks on Online Learning Platforms

1
School of Information and Communication, Guilin University of Electronic Technology, Guilin 541004, China
2
School of Computer and Information Security, Guilin University of Electronic Technology, Guilin 541004, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Electronics 2023, 12(9), 2051; https://doi.org/10.3390/electronics12092051
Submission received: 9 March 2023 / Revised: 13 April 2023 / Accepted: 27 April 2023 / Published: 29 April 2023

Abstract

:
With the development of online education, there is an urgent need to solve the problem of the low completion rate of online learning courses. Although learning peer recommendation can effectively address this problem, prior studies of learning peer-recommendation methods extract only a portion of the interaction information and fail to take into account the heterogeneity of the various types of objects (e.g., students, teachers, videos, exercises, and knowledge points). To better motivate students to complete online learning courses, we propose a novel method to recommend learning peers based on a weighted heterogeneous information network. First, we integrate the above different objects, various relationships between objects, and the attribute values to links in a weighted heterogeneous information network. Second, we propose a method for automatically generating all meaningful weighted meta-paths to extract and identify meaningful meta-paths. Finally, we use the Bayesian Personalized Ranking (BPR) optimization framework to discover the personalized weights of target students on different meaningful weighted meta-paths. We conducted experiments using three real datasets, and the experimental results demonstrate the effectiveness and interpretability of the proposed method.

1. Introduction

The booming development of the Internet has accelerated education reform, and online learning is growing rapidly. The Corona Virus Disease 2019 (COVID-19) pandemic has seriously restricted offline classroom teaching [1], and online learning, as an extension and an essential complement to offline education [2], is brought into the spotlight [3,4]. Currently, it is commonly observed that students drop out of online learning courses in the learning process [5]. This is because teachers and students are separated in space and time. When faced with difficulties, students are unable to communicate with each other in real time, which may result in feelings of loneliness and helplessness [6,7].
The above phenomenon can be effectively alleviated by learning peer recommendation [8,9,10]. Xu et al. [6] constructed a social network based on learners’ activity information in a course forum and recommended learning peers to target learners based on the network’s link relationships. Hu et al. [7] proposed a Learning Peer Recommendation (LPR) framework for learning peer recommendation that used a dynamic interaction tripartite graph to depict the complex relationships among learners, learning content, and interaction behaviors and adopted Convolutional Neural Network (CNN) to adjust the weight of interaction behaviors. Potts et al. [11] created the Open-Source Course-Level Recommendation (RiPPLE) platform to assist target learners in finding suitable learning peers based on their learning logs. Prabhakar et al. [12] developed a reciprocal recommendation system to match learners with similar interests. However, these methods extract only a portion of the interaction information in the online learning process for modeling purposes and fail to take into account the heterogeneity of the various types of objects and the complex interactions among these objects, resulting in a great deal of significant information loss and a negative impact on recommendation performance. In the learning process, heterogeneous information networks provide an effective solution for modeling data heterogeneity, which opens up a lot of potential for data mining [13]. Xu et al. [14] proposed a heterogeneous information network model to provide scholar-friend recommendations. Liu et al. [15] proposed a method based on heterogeneous information network embedding for scientific collaborator recommendation. Unfortunately, the performance of these methods is limited by three factors. First, it is difficult to manually generate the meta-path set in practice. Second, these methods are only applicable to symmetric meta-paths. Third, the effect of attribute values on links is not considered. Karnyoto et al. [16] adopted a heterogeneous graph neural network to detect fake news related to COVID-19. Wu et al. [17] proposed a heterogeneous graph attention network method for course recommendation, which learned student and course representations in a heterogeneous graph. Although these methods can learn more complex and accurate network representations and are suitable for large-scale application scenarios, they have high time and space complexity and require expensive hardware support.
To solve the problem of the low completion rate of online learning courses, this study overcomes the limitations of the above-mentioned existing research. We propose a learning peer-recommendation method based on a weighted heterogeneous information network (LPRWHIN) by leveraging the behavior data and exercises test data collected in the learning process. The main contributions of this study are summarized as follows:
  • We construct a weighted heterogeneous information network to retain semantic, structural, and attribute information more comprehensively, which consists of multiple types of objects (e.g., students, teachers, videos, exercises, and knowledge points), relationships between objects (e.g., students-knowledge points, students-videos, and students-exercises) and attribute values on links (e.g., the degrees of student–system interaction, the degrees of student–teacher interaction, the degrees of student–student interaction, and test scores).
  • A method for automatically generating meaningful weighted meta-paths is proposed, which makes full use of network structural information to flexibly extract and then effectively identify all meaningful meta-paths for learning peer recommendations.
  • The Bayesian Personalized Ranking optimization framework is employed to calculate the personalized weights of target students on selected weighted meta-paths.
The rest of this paper is organized as follows. Section 2 briefly reviews related work. Section 3 presents relevant definitions. Section 4 describes the proposed method in detail. Section 5 reports experimental results. Section 6 concludes this paper and gives future research directions.

2. Related Works

This section provides a brief review of the main approaches commonly used in recommendation systems.

2.1. Content-Based Filtering and Collaborative Filtering Methods

Content-based filtering methods recommend items by matching user preferences in user profiles with item attributes [18]. Huang et al. [19] presented a content-based method for students to choose online courses based on their own interests. Camposet et al. [20] recommended courses for students based on a content-based filtering method. However, the courses recommended by these methods are closely related to those that the target students were previously interested in, resulting in a lack of novelty and diversity in the recommended courses. Collaborative filtering methods provide recommendations based on the similar preferences of peer users. Pang et al. [21] adopted a collaborative filtering recommendation approach to recommend learning paths for students. Zhao et al. [22] developed a user-based collaborative filtering recommendation method based on students’ behavior logs and learning resources’ ratings, and applied it to the Massive Open Online Courses (MOOC) platform to provide them with resources. However, these methods rely heavily on student feedback or learning behavior data, and there are problems associated with the cold start or data sparsity.

2.2. Hybrid Recommendation Methods

Hybrid recommendation methods are characterized by the combination of multiple recommendation methods [23]. Safarov et al. [24] proposed a Deep Neural Networks (DNN) approach that combined the K-Means algorithm and deep neural networks to generate candidates on e-learning platforms. Wu [25] presented a method combining deep learning and collaborative filtering to recommend MOOC resources. Liu et al. [26] proposed a hybrid method based on deep learning and collaborative filtering, which not only recommended courses suitable for student interests and preferences but also predicted their performance in each course. Hu et al. [7] outlined a framework for learning peer recommendation by combining a tripartite graph and CNN. Gong et al. [27] proposed a knowledge concept recommendation method that constructed a heterogeneous information network to capture semantic relationships between different types of entities and incorporated them into the representation learning process. Although these methods can be used to overcome the limitations of a single recommendation method [28,29], their time complexity is relatively high due to the mixing of multiple recommendation models.

2.3. Network-Based Recommendation Methods

Network-based recommendation methods have been widely developed to model more interaction information and improve recommendation performance. Li et al. [30] proposed a novel social recommendation method based on a homogeneous information network that ignored the heterogeneity between users and topics. Ma et al. [31] proposed a novel method that leveraged bipartite graphs to provide social recommendations. Paleti et al. [32] utilized two tripartite graphs to integrate the interaction information among users, items, ratings, and opinions for social recommendations. Although bipartite and tripartite graphs can integrate heterogeneous information, they cannot integrate more types of objects and relationships. Liu et al. [15] modeled data heterogeneity based on a heterogeneous information network to provide scientific collaborator recommendations. Li et al. [33] proposed an approach based on a heterogeneous information network for making paper recommendations. Li et al. [34] applied a heterogeneous information network to approach the expert recommendation problem. Unfortunately, the above studies only consider various types of objects and relationships, and ignore the effect of the attribute value on links in heterogeneous information networks. For example, the order of authors in a paper corresponds to their relative contribution to the work. We can analyze the semantic relationships between authors and papers more precisely using the contribution of the authors as the attribute value on the “Author-Paper” link.

3. Relevant Definitions

This section explains the term definitions used in this study to illustrate the method in this paper more clearly.

3.1. Three Types of Interaction Degrees

3.1.1. The Degree of Student–System Interaction

The degree of student–system interaction can be defined as the workload that students watch knowledge point videos [35]. The degree of student–system interaction S C u , K can be expressed as Formula (1):
S C u , K = λ 1 × f S C u , K + λ 2 × t S C u , K + λ 3 × p S C u , K ,
where f S C u , K represents the frequency of student u learning knowledge point K, t S C u , K represents the duration of student u learning knowledge point K, and p S C u , K represents the pause and drag frequency of student u learning knowledge point K. Literature [36] concludes that the degree of student–system interaction is best characterized when ( λ 1 , λ 2 , λ 3 ) = ( 1 , 5 , 4 ) .

3.1.2. The Degree of Student–Teacher Interaction

The degree of student–teacher interaction is measured by the workload that students interact with the teacher [35], which is primarily determined by the questioning and answering process between the teacher and students. The degree of student–teacher interaction S T u , K can be expressed as Formula (2):
S T u , K = W S T u , K × f S T u , K W S T u , K = t S T u , K max t S T 1 , K , t S T 2 , K , , t S T d , K ,
where W S T u , K represents the weight coefficient of the student–teacher interaction of student u to knowledge point K, f S T u , K represents the student–teacher interaction frequency of student u to knowledge point K, d is the total number of students learning the online course for each grade, t S T u , K represents the student–teacher interaction duration of student u to knowledge point K, and max t S T 1 , K , t S T 2 , K , , t S T d , K is the maximum duration of student–teacher interaction to the knowledge point K.

3.1.3. The Degree of Student–Student Interaction

The degree of student–student interaction represents the workload that a student puts into interacting with other students regarding knowledge points [35]. The degree of student–student interaction S S u , K can be expressed as Formula (3):
S S u , K = v = 1 d 1 W S S u v , K × f S S u v , K d 1 u v W S S u v , K = t S S u v , K max t S S u 1 , K , t S S u 2 , K , , t S S u d , K ,
where W S S u v , K represents the weight coefficient of the student–student interaction between student u and student v to knowledge point K, f S S u v , K represents the frequency of interaction between student u and student v to knowledge point K, t S S u v , K represents the interaction duration between student u and student v to knowledge point K, and max t S S u 1 , K , t S S u 2 , K , , t S S u d , K represents the maximum duration of student–student interaction between student u and other students to the knowledge point K.

3.2. Weighted Heterogeneous Information Network

A weighted heterogeneous information network can be described as a directed graph G =  ( V , E , W ) with an object type mapping function ϕ : V A , a link type mapping function ψ : E R and an attribute value type mapping function θ : W W , where the types of objects | A | > 1 (or the types of relationships | R | > 1 ) and the types of attribute values | W | > 0 . Based on the graph, each object ν V belongs to a particular object type ϕ ( v ) A , each link e E belongs to a particular relationship type ψ ( e ) R , and each attribute value ω W belongs to a particular attribute value type θ ( ω ) W [37]. A conventional heterogeneous information network is an unweighted heterogeneous information network where there are no attribute values on links or we ignore them. For a weighted heterogeneous information network, there are attribute values on links, and these attribute values may be discrete or continuous.
Figure 1 illustrates the weighted heterogeneous information network of objects and their relationships on an online learning platform. R i denotes a particular type of relationship between two different types of objects ( R i 1 represents the inverse relationship of R i ), and δ R i is the range of attribute values on relationship R i . The figure depicts different relationships R 1 , R 1 1 , R 2 , R 2 1 , R 3 , R 3 1 , R 4 , R 4 1 , R 5 , R 5 1 , R 6 , R 6 1 , R 7 , R 7 1 , R 8 and R 8 1 , which denote do, done, examine, examined-by, answer, answered-by, record, recorded-by, include, included-in, watch, watched-by, question, questioned-by, discuss and discussed-by, respectively.
According to literature [17,27], the network consists of five types of objects (e.g., students, teachers, videos, exercises, and knowledge points), sixteen types of relationships, and the attribute values on links. Links between two types of objects represent different relationships. For example, the network can be described in detail using various relationships among objects and the attribute values on links. “Students-do-Exercises” describes exercises test behavior, where links exist between students and exercises denoting the relationship R 1 , and δ R 1 represents the range of test scores. “Students-question-Knowledge points-answered by-Teachers” describes student–teacher interaction behavior, where links exist between teachers and knowledge points denoting the relationship R 3 , and δ R 3 represents the range of the degrees of student–teacher interaction. “Students-watch-Videos-include-Knowledge points” describes student–system interaction behavior, where links exist between students and videos denoting the relationship R 6 and δ R 6 represents the range of the degrees of student–system interaction. “Students-discuss-Knowledge points-discussed by-Students” describes student–student interaction behavior, where links exist between students and knowledge points denoting the relationship R 8 and δ R 8 represents the range of the degrees of student–student interaction.

3.3. Network Schema

The network schema T G  =  ( A , R , W ) is a meta structure of the weighted heterogeneous information network G =  ( V , E , W ) , which comprises a set of object types A  =  { A } , a set of relationships connecting object pairs R  =  { R } , and a set of attribute values on relationships A  =  { A } [37].
Figure 2 illustrates the network schema of the weighted heterogeneous information network on the online learning platform, which contains five different types of objects: students (S), teachers (T), videos (V), exercises (E), and knowledge points (K). Multiple objects are connected via various links. It is worth noting that there are four types of relationships between students and knowledge points: (1) students discuss the knowledge points; (2) knowledge points are discussed by students; (3) students ask the teacher questions about a certain knowledge point; and (4) knowledge points are questioned by students.

3.4. Weighted Meta-Path

A meta-path M P = A 1 R 1 A 2 R 2 R l A l + 1 is a new composite relationship R = R 1 R 2 R l between object types A 1 , A 2 , A l , where A i represents an object type, R j represents a relationship and ∘ represents composition operator on relationships, ( i = 1 , 2 , , l + 1 ; j = 1 , 2 , , l ). In addition, a meta-path with symmetric relationship types is called a symmetric meta-path.
Based on the above description, a weighted meta-path M P = A 1 δ R 1 A 2 δ R 2 δ R l A l + 1 is a path defined based on a network schema T G  =  ( A , R , W ) , where δ R j represents the range of attribute values on relationship, ( i = 1 , 2 , , l + 1 ; j = 1 , 2 , , l ). Basically, a concrete path m p = a 1 ω 1 a 2 ω 2 ω l a l + 1 is a path instance of the weighted meta-path if each link e i = < a i , a i + 1 > R i , each object a i A i , and each attribute value ω i δ R i [38]. A conventional meta-path can be viewed as a special case of a weighted meta-path, and three types of interaction degrees and test scores can be regarded as ratings and used as attribute values on links. As a result, there are two types of attribute values in Figure 2: one is similar to the rating relationships between students and knowledge points, the attribute values on links of which range from 0 to 1, while the other is associated with all the relationships except rating relationships, the attribute values of its links all take value 1. In addition, Table 1 shows examples of weighted meta-paths and their corresponding semantic meanings in this study, these semantic information can effectively reflect the different types of learning behaviors of students and the workload put into learning knowledge points.

4. Proposed Methodology

Based on the above definitions, we propose a learning peer-recommendation method based on a weighted heterogeneous information network, as shown in Figure 3. The proposed method consists of the following five steps. First, we construct a weighted heterogeneous information network containing multiple types of objects and relationships according to the corresponding network schema on an online learning platform. Second, we propose a method for automatically generating all meaningful weighted meta-paths based on the specified network schema for exploring the various links between target students and candidate students. Third, we apply random walk to calculate the recommendation scores of candidate students in each meaningful weighted meta-path. Fourth, we adopt a Bayesian Personalized Ranking optimization framework to learn personalized weights for all weighted meta-paths. Fifth, we make a Top-N recommendation list for each target student based on the recommendation scores.

4.1. Constructing a Weighted Heterogeneous Information Network

The construction of a weighted heterogeneous information network on the online learning platform is shown in Figure 1. First, the types of objects and relationships are determined from the online learning platform dataset (Online dataset) to specify a network schema T G  =  ( A , R , W ) . Second, according to the obtained network schema, we build a weighted heterogeneous information network to model the interaction behaviors among students, teachers, videos, exercises, and knowledge points, in which the degrees of student–system interaction, the degrees of student–teacher interaction, the degrees of student–student interaction and test scores are used as attribute values.
We build the adjacency matrix of this network, and then the adjacency matrix A M N = a i j p × q between object types M and N can be defined as Formula (4):
a i j = w i j if a link exists between M i and N j 0 otherwise ,
where p and q are described as corresponding indexes of M and N, respectively. w i j is the attribute value on the link of the relationship between M i and N j , and all the attribute values will be normalized to a unified range between 0 and 1 [39]. A point worth noting is that there are different relationship types between objects. For example, A S V represents the adjacency matrix of the watch relationship between students and videos, and A V S represents the adjacency matrix of the watched-by relationship between videos and students.

4.2. Generating a Meta-Path Set

Meta-path contains rich semantic information and can be used to discover students’ interests and preferences. It is an effective tool for modeling student preferences in heterogeneous information networks [13]. Literature [14] employed a method for manually specifying meta-paths to develop a meta-path set related to the recommendation task. This method not only relies on domain expert knowledge but also omits some meta-paths due to manual negligence, which leads to an incomplete meta-path set, resulting in the loss of a large amount of information and the degradation of recommendation performance.
Theoretically, all meta-paths can be used for learning peer recommendation [33]. However, the redundancy and the noise in some meta-paths greatly increase the computational cost of personalized weight-learning for meta-paths and bring noise information into the recommendation model.
Therefore, this paper proposes a method for automatically generating meaningful meta-paths that can automatically extract and then effectively identify all meaningful meta-paths for learning peer recommendations. The method comprises two steps. First, a complete meta-path set is generated by the method for automatically extracting meta-paths to ensure that the meta-paths used for learning peer recommendation can cover richer and more objective semantic information and help to improve recommendation performance. Second, the information gained is used to estimate the values of these meta-paths, and based on these values, the meta-paths with noisy and redundant information are eliminated, and the meaningful meta-paths are retained for learning peer recommendation. Due to the reduction of meta-paths containing noisy and redundant information, the weights will be allocated to meaningful meta-paths, which will help to reduce computational costs.

4.2.1. Extracting Meta-Paths

The method for automatically extracting meta-paths can automatically extract any type of symmetric and asymmetric meta-paths. Since the purpose of this study is to recommend learning peers to students, we only focus on meta-paths with student type as source and target objects. In addition, to minimize the time complexity of searching for meta-paths, this paper only extracts meta-paths within a limited length. Figure 2 shows the network schema T G  =  ( A , R , W ) of the weighted heterogeneous information network on the online learning platform. In this paper, the maximum length of meta-paths to be extracted is set to 4, and the source object type and the target object type are both S (student), then meta-paths are automatically searched according to the given network schema.

4.2.2. Selecting Meaningful Meta-Paths

The meaningful meta-paths can reveal students’ unique preferences when they choose learning peers. In the process of automatically generating a complete meta-path set, each merged relationship in the meta-path should add useful information to the recommendation model [40]. Therefore, according to the information gained from each meta-path, we abandon the meta-paths with more noisy and redundant information to ensure that the meaningful meta-paths are selected for our recommendation task. For a certain meta-path P = M R 1 N , the information gain obtained from object type M to object type N following the meta-path P can be expressed as Formula (5):
I G P = H ( M ) H P ( M N ) ,
where I G P is the information gain of the meta-path P, which represents the information value starting from M and arriving at N via P. If I G P is close to zero, it means that the meta-path P does not add additional useful information beyond what is already contained in the dimension M. H ( M ) denotes the entropy of object type M, and H P ( M N ) is the conditional entropy obtained following the meta-path P. H ( M ) can be expressed as Formula (6):
H ( M ) = m M p ( m ) log ( p ( m ) ) ,
where p ( m ) is defined by the degree centrality, which represents the probability of node m with object type M, and can be expressed as Formula (7):
p ( m ) = Degree ( m ) x M Degree ( x ) ,
where D e g r e e ( x ) represents the number of edges that belong to the relationship type R 1 incident on node m, and R 1 is the first relationship type in the meta-path P. x M D e g r e e ( x ) represents the total degree of all nodes of the object type.
Conditional entropy H P ( M N ) is used to measure the information uncertainty of the dimension M given dimension N. H P ( M N ) can be expressed as Formula (8):
H P ( M N ) = n N p ( N = n ) × H P ( M N = n ) = n N p ( n ) × n N p P ( m n ) log p P ( m n ) ,
where p P ( m n ) represents the conditional probability of the meta-path P and can be expressed as Formula (9):
p P ( m n ) = k m M , n N , k P , m k n y N k m M , y N , k P , m k y ,
where k m M , n N , k P , m k n represents the number of path instances from node m to node n following the meta-path P, and y N { k m M , y N , k P , m k y represents the total number of path instances from node m to node y following the meta-path P.
The normalization of information gain can make this method suitable not only for those meta-paths with the same source and target objects in this study. I G P ^ can be expressed as Formula (10):
I G P ^ = I G P H ( M ) H ( N ) ,
The meta-path with the lower information gain value has a higher probability of being noisy and containing less information. Taking the meta-path P e g = Student question Knowledge point answered by Teacher as an example, the information gain calculation process is described. We first present how to obtain H ( S t u d e n t ) . As shown in Formula (6), the key step is to calculate p ( m ) using Formula (7), where D e g r e e ( m ) represents the number of edges that belong to the relationship type q u e s t i o n on the node m, and x M D e g r e e ( x ) represents total number of edges on nodes of type S t u d e n t . We then figure out how to compute H P e g ( S t u d e n t T e a c h e r ) . Its calculation process is similar to that of H ( S t u d e n t ) . p P e g ( m n ) and H P e g ( S t u d e n t T e a c h e r ) can be calculated using Formula (9) and Formula (8), respectively. Finally, the information gain I G P e g of P e g can be calculated using Formula (5) and normalized using Formula (10).
Figure 4 shows the specific process of automatically generating meaningful meta-paths according to the given network schema in Figure 2, where the solid lines indicate the relationships between objects, the dashed lines represent the searched meta-path, and the solid lines marked with circles denote the selected meta-paths using information gain. when the maximum length of the meta-paths is set to 4, the source object type and target object type are both S (student), and the method for automatically extracting meta-paths will perform 4 iterations.
In the first iteration, the meta-paths with source object type S are searched, which are expressed as f s t _ S e t = { S E , S V , S K } . The meta-paths with source object type S and target object type S are extracted in f s t _ S e t , and these meta-paths are represented as p a t h S e t . p a t h S e t is empty in this iteration.
In the second iteration, we extract the target object types of each element in f s t _ S e t of the previous iteration, expressed as t a r g e t O b j e c t s = { E , V , K } . After obtaining t a r g e t O b j e c t s , it will be regarded as the source object type in this iteration. According to the network schema in Figure 2, we can obtain s n d _ S e t = { E K , E S , V K , V S , V T , K S , K T , K E , K V } . Then the f s t _ S e t in this iteration can be obtained by merging the f s t _ S e t in the previous iteration with the current s n d _ S e t , which can be expressed as f s t _ S e t = { S E K , S E S , S V K , S V S , S V T ,   S K S , S K T , S K E , S K V } . Similarly, we only focus on the meta-paths with source object type S and target object type S in f s t _ S e t , p a t h S e t are now updated to { S E S , S V S , S K S } .
The process of the third and fourth iteration is similar to that of the second iteration. After four iterations, we can extract the meta-paths suitable for learning peer recommendations within the set length.
To reduce the impact of meta-paths with noisy and redundant information in p a t h S e t on the recommendation performance, we use information gain to identify meaningful meta-paths for learning peer recommendation.
Algorithm 1 illustrates the detailed pseudo code that describes the method for automatically generating all meaningful meta-paths. Please note that function 1 and function 2 are the same as those used by Lu [41]. In addition, the further difference is that our method can automatically generate meta-paths and utilize the information gained to select meaningful meta-paths, while the previous method only provides the ability to generate meta-paths.
Algorithm 1 Procedures of a method for automatically generating meaningful meta-paths.
  • Input:  T G = ( A , R , W ) : network schema of heterogeneous information network in adjacency matrix
  •      A s : source object type of a meta-path, where A s A
  •      A t : target object type of a meta-path, where A t A
  •      m a x _ L e n : maximum length for meta-paths to be searched
  • Output:  p a t h S e t  =  P 1 , P 2 , : set of meta-paths searched from T G in form of A s -*- A t
  •   1:
    f i n a l P a t h S e t N U L L
      2:
    p a t h S e t N U L L
      3:
    for all R i R , R i = A s i , A t i  do
      4:
        f s t _ S e t l i s t ( A s i )
      5:
       for  i 1 to m a x _ L e n  do
      6:
          s n d _ S e t n e x t _ l i s t ( f s t _ S e t ) //function 1
      7:
          m _ S e t m e r g e _ l i s t ( f s t _ S e t , s n d _ S e t ) //function 2
      8:
          f s t _ S e t m _ S e t
      9:
         for  j 1 to length( f s t _ S e t ) do
      10:
            if last element of f s t _ S e t [ j ] = = A t i  then
    11:
               f i n a l P a t h S e t f i n a l P a t h S e t f s t _ S e t [ j ]
    12:
            end if
    13:
         end for
    14:
       end for
    15:
    end for
    16:
    for k 1 to length( f i n a l P a t h S e t ) do
    17:
       Calculate p ( m ) using Formula (7)
    18:
       Calculate H ( M ) using Formula (6)
    19:
       Calculate p P ( m n ) using Formula (9)
    20:
       Calculate H P ( M N ) using Formula (8)
    21:
       Calculate I G P using Formula (5)
    22:
       Calculate I G P ^ using Formula (10)
    23:
       if  I G P ^ > 0 then
    24:
          p a t h S e t f i n a l P a t h S e t [ k ]
    25:
       end if
    26:
    end for

    4.3. Recommendation Score Calculation Based on Weighted Meta-Paths

    Tracking a student’s history in terms of meta-paths can capture information about students’ preferences in choosing learning peers. PathSim is used to measure the similarity between objects in the meta-path [14]. However, this method can only be applied to symmetric meta-paths and does not consider the effect of the degree of student–student interaction on the similarity measure between two students.
    To overcome the above limitations, we use random walk on each differently weighted meta-path to calculate the corresponding node proximity between source objects and target objects [33]. Given the student u, the student query vector Q ( i ) can be defined as Formula (11):
    Q ( i ) = 1 if i = u 0 otherwise ,
    where Q ( i ) is the i t h term of Q. When i = u , student u is assigned a value of 1, while the others are all 0.
    Once the target student is assigned a value, the value will be transferred to its neighbor objects on this meta-path. Specifically, the transition matrix T M N = t i j p × q is described as a representation of transition probabilities between object types M and N, and can be determined by normalizing the adjacency matrix A M N = t i j p × q by columns as shown in Formula (12):
    t i j = a i j p = 0 p a p j ,
    where p = 0 p a p j is the sum of elements in the column j of A M N = a i j p × q .
    Based on the above statement, for a given weighted meta-path M P k = Student δ R 1 A 1 δ R 2 δ R l 1 Student, we can define the recommendation score vector as shown in Formula (13):
    r u k = T S S A l 1 · · T A 1 S T · Q ( u ) ,
    where r u k represents each candidate student’s recommendation score to target student u on the meta-path k, S T represents the set of target students, S S represents the set of candidate students, T S S A l 1 is the transition matrix of candidate students to object type A l 1 , T A 1 S T is the transition matrix of object type A 1 to target students.

    4.4. Personalized Weight-Learning for Weighted Meta-Paths

    For every student, we develop a linear function that integrates the recommendation scores between the target student and candidate students on all selected weighted meta-paths, therefore obtaining the global score vector as shown in Formula (14):
    g u = k α u k · r u k ,
    where g u represents the global score vector of target student u, α u k denotes personalized weight of student u on weighted meta-path k, r u k denotes the recommendation score vector of student u on meta-path k.
    Each meta-path in the weighted heterogeneous information network has exact semantic information, and the weight of the meta-path indicates how much students attach importance to the information on the meta-path. Different weighted meta-paths reflect different learning behaviors of students (such as student–student interaction behavior, student–teacher interaction behavior, and student–system interaction behavior). Through personalized weighted learning, a higher value can be assigned to the selected weighted meta-path that expresses students’ preference for choosing a learning peer.
    There is no need to pursue accurate recommendation scores of candidate students based on weighted meaningful meta-paths [33]. Therefore, to ensure that the recommendation scores of recommended students are higher than those of students that are not recommended, the BPR optimization framework is employed to build our objective function [42]. We consider students who are learning peers of a target student as a positive set, denoted as P S u , and students who are not learning peers of the target student as a negative set, denoted as N S u [43]. In addition, we outline the objective function for target student u as Formula (15):
    maxobj u ( α ) = i P S u j N S u σ g u ( i ) g u ( j ) P S u N S u ,
    where P S u denotes the positive set of student u, N S u denotes the negative set of student u, P S u denotes the size of P S u , N S u denotes the size of N S u , g u ( i ) is the global recommendation score of candidate student i to target student u, g u ( j ) is the global recommendation score of candidate student j to target student u, and σ ( ) is an indicator function, which can be expressed as Formula (16):
    σ ( x ) = 1 1 + e x ,
    The gradient ascent (GA) algorithm can be applied to maximize the objective function for student u. For each student, we update the parameter α u according to the ascending direction of the gradient, and the update process is shown in Formula (17):
    α u ( t + 1 ) : = m a x ( 0 , α u ( t ) + l r · o b j u ( α ) α ) ,
    where l r denotes learning rate, α u ( t ) and α u ( t + 1 ) are the parameters of student u at time t and t + 1 . The gradient of obj u ( α ) to parameter α is calculated as Formula (18):
    o b j u ( α ) α = i P S u j N S u σ Γ i j Γ i j g u ( i ) α g u ( j ) α P S u N S u ,
    where Γ i j = g u ( i ) g u ( j ) . We can have σ Γ i j / Γ i j = e Γ i j / 1 + e Γ i j 2 by deriving the sigmoid function. The derivative g u / α u k of each parameter α u k is calculated as Formula (19):
    g u α u k = r u k ,
    Please note that if the sum of all parameters α u of student u is not equal to 1, these parameters are normalized to ensure the sum is 1. The specific procedure of personalized weight-learning is shown in Algorithm 2.
    Algorithm 2 Personalized weight-learning for weighted meta-paths.
    Input: 
    student u, learning rate l r , threshold ξ
    Output: 
    personalized weight α u
    1:
    t = 0
    2:
    Initialize α u ( 0 )
    3:
    while α u ( t + 1 ) α u ( t ) > ξ do
    4:
       Compute the recommendation ranking score g u ( v ) of each candidate student v
    5:
       Compute g u α u k using Formula (19)
    6:
       Update α based on Formula (17)
    7:
       Update t = t + 1
    8:
    end while

    4.5. Personalized Learning Peer Recommendation Based on Weighted Meta-Paths

    The basic principle of recommending learning peers is that the target students tend to be interested in interacting with other students who are connected through important weighted meta-paths that the target students emphasize [14]. We use a global recommendation score to incorporate recommendation scores between the target and candidate students through differently weighted meta-paths. The global recommendation score g u ( v ) is determined by the recommendation scores of the candidate student on each meta-path and the target student’s corresponding personalized weight of the weighted meta-path, and can be expressed as Formula (20):
    g u ( v ) = k α u k · r u k ( v ) ,
    For each target student, the Top-N recommendation list is generated based on the recommendation scores of all candidate students to become potential learning peers.

    5. Experiments and Results Analysis

    5.1. Datasets

    We conducted our experiments on three real-world datasets. To demonstrate the ability of our method on other weighted heterogeneous information networks, we used two other datasets, namely the DBLP dataset (DBLP (https://www.aminer.cn/citation, accessed on 7 January 2022)) and Aminer dataset (Aminer (https://www.aminer.cn/data/#Academic-Social-Network, accessed on 17 December 2021)). Additionally, the Online dataset was used for learning peer recommendations. The relevant statistics of three datasets are shown in Table 2.
    The DBLP dataset is obtained from the DBLP original dataset, which extracts conference papers published from 2015 to 2020 and guarantees that each paper is not written by a single author. The network schema for the DBLP dataset [37] is shown in Figure 5a, which covers author (A), paper (P), and conference venue (V), and two types of relationships, namely paper-venue, and paper-author.
    The Aminer dataset is extracted directly from the Aminer original dataset, which selects papers published from 2010 to 2014, and each paper is not written by a single author. Meanwhile, to make use of the text information, We select five terms with the highest Term Frequency Inverse Document Frequency (TF-IDF) score from the title and abstract of each paper. The network schema for the Aminer dataset [44] is illustrated in Figure 5b, which contains author (A), paper (P), venue (V), and term (T). Please note that “1∼N” in Figure 5a,b represents the order of authors in a paper.
    The Online dataset consists of historical behavior data generated by students in grades 2017, 2018, 2019, and 2020 in the process of learning “Data Structure and Algorithm”. The network schema for this dataset is shown in Figure 2, which includes student (S), teacher (T), video (V), exercises (E), knowledge point (K), and their relationships with each other.

    5.2. Evaluation Metrics

    In each experiment, each dataset is divided into five partitions to obtain a training set and a test set. We use four partitions for training different recommendation approaches and the remaining partition for testing. After the training, we recommend learning peers for each target student.

    5.2.1. Precision and Recall

    If the recommended students communicate with the target students, the recommendation is valid [6]. In addition, there are two evaluation metrics used to evaluate performance, namely precision and recall. These two metrics can be calculated as follows:
    Precision = | rec ( u ) real ( u ) | | rec ( u ) | ,
    Recall = | rec ( u ) real ( u ) | | real ( u ) | ,
    where rec ( u ) denotes the recommendation list for target student u, real ( u ) denotes the true learning peer set of target student u.

    5.2.2. The Achievement Degree of Curriculum Objectives

    To study the practical benefits of our method, we use the achievement degree of curriculum objectives in the process of engineering education certification to scientifically evaluate students’ learning outcomes and effectively feed them back to teachers [45]. This method involves the concepts of assessment methods and curriculum objectives. Different assessment methods are used to test whether students have passed the course in all respects. Different curriculum objectives correspond to different engineering index points in the graduation requirements to determine whether students have met the graduation requirements of the course. The evaluation method is described in detail below.
    There are m assessment methods for a certain course, and the total score of each assessment method is S . In addition, the course has n curriculum objectives. Please note that the score of assessment method j corresponding to curriculum objective i is a i j , and the average score of assessment method j corresponding to curriculum objective i of all students participating in the course is b i j . The total achievement degree D of a group’s curriculum objectives is defined as Formula (23):
    D = i = 1 n D i · j = 1 m a i j m · S
    where D i represents the achievement degree of the group’s curriculum objective i. The calculation process can be expressed as Formula (24):
    D i = j = 1 m v j · c i j
    where v j is the weight of assessment method j, and j = 1 m v j = 1 . c i j represents the evaluation value of the assessment method j corresponding to the curriculum objective i of all students, and the calculation process is c i j = b i j / a i j .

    5.3. Baseline Methods

    This paper compares LPRWHIN with the following baseline methods:
    (1)
    Recommendation Based on the Meta-Path (Metapath) [14]: This method first integrates students’ historical preference information into a heterogeneous information network and generates a meta-path set. Second, the similarity between students is calculated based on the meta-path, and the regularization-based optimization method is introduced to learn the personalized weight on the meta-path.
    (2)
    Paper Recommendation Based on a Heterogeneous Information Network (PRHN) [33]: This method first constructs a heterogeneous information network containing multiple types of objects and relationships. Second, random walks are performed on each meta-path to obtain recommendation scores of candidate students, and personalized weight-learning is performed on each meta-path.
    (3)
    Matrix Factorization (MF) [46]: This model maps the interaction between students into a joint latent factor space of dimensionality f, so that the interactions between students are modeled as inner products in this space.
    (4)
    Deep Matrix Factorization (DMF) [47]: This method uses a neural network framework to project target students and candidate students to low-dimensional vectors. In the experiment, the interaction matrix between students is used as input.

    5.4. Parameter Settings

    In the process of personalized weight-learning for weighted meta-paths, threshold ξ and learning rate l r have a greater impact on recommendation performance. In this experiment, we set ξ to be an infinitesimal random number and l r to 0.001 empirically.

    5.5. Results of Comparative Experiments

    5.5.1. Impact of Different Numbers of Meaningful Meta-Paths

    To demonstrate the impact of generating a completed meaningful meta-path set on recommendation performance, the following experiments were conducted in this paper. First, we extracted completed meaningful meta-path sets with a maximum length of 4 on DBLP, Aminer, and Online datasets using a method for automatically generating meaningful meta-paths, respectively. Second, we simulated the manual enumeration method by reducing one meta-path in each of those completed meaningful meta-path sets. Third, we used the corresponding meta-path sets on these three different datasets for experiments. The experimental results are shown in Figure 6.
    Figure 6a–c show the results of LPRWHIN under different numbers of meta-paths on DBLP, Aminer, and Online datasets. The experimental results show that using the complete meaningful meta-path set performs better. Therefore, a method for automatically generating meaningful meta-paths can achieve better precision and recall. Especially on the Online dataset, reducing a meaningful meta-path has a significant impact on the precision and recall. It indicates that a small number of meaningful meta-paths have a great impact on the recommendation performance, which is consistent with the conclusion in the literature [48].

    5.5.2. Analysis of Experimental Results

    We conducted experiments on three different datasets using LPRWHIN, PRHN, Metapath, MF, and DMF. The experimental results are shown in Figure 7, Table 3 and Table 4.
    Although the scale of the three datasets is different, the performance shows similar trends. Figure 7 and Table 3 and Table 4 show that LPRWHIN outperforms other baseline methods in terms of precision and recall.
    As can be seen from Table 3 and Table 4, the precision decreases with increasing recommendation list length, while the recall increases with increasing recommendation list length. Figure 7a shows the recommendation performance of LPRWHIN and other baseline methods on the DBLP dataset. It illustrates that LPRWHIN performs the best, and the highest precision of LPRWHIN is up to 32.4% when N is 1. Figure 7b shows the recommendation performance of five methods on the Aminer dataset. It demonstrates that LPRWHIN has the highest precision of 61.7%, which indicates that LPRWHIN has a 61.7% probability of recommending co-authors in the Top-1 recommendation list. In addition, it shows that the recall of LPRWHIN is 33.8% when N is 10, which indicates that there are 33.8% true co-authors in the Top-10 recommendation list. Figure 7c shows the precision and recall of five methods on the Online dataset. As shown in Figure 7c, the highest precision of LPRWHIN reaches 30.2% when N is 1, which is 6.2% higher than the second-best method PRHN. The highest recall of LPRWHIN reaches 32.4% when N is 10, which is 7.3% higher than the second-best method PRHN.
    In conclusion, the proposed method LPRWHIN significantly outperforms other baseline methods in terms of precision and recall on three different datasets. Compared with PRHN, LPRWHIN utilizes an algorithm to automatically generate meaningful meta-paths to ensure the completeness of the meta-path set, and all the weights are assigned to meaningful meta-paths. In addition, the effect of attribute values on links is also considered. The recommendation performance of Metapath is inferior to that of LPRWHIN because Metapath uses a method of manually generating meta-paths and the similarity measure used is only suitable for symmetric meta-paths as well as does not consider the effect of attribute values on links. MF has the worse performance because it only considers the interaction behavior between students and does not consider other behavior data in the learning process. DMF’s performance is unstable on different datasets and performs worse than expected because it only considers the interaction behavior between students and requires large-scale data.

    5.6. Case Study

    We have conducted case studies on students in grade 2020 to evaluate the practical benefits of this study in the online learning process. The course “Data Structure and Algorithm” we selected has three assessment methods and four curriculum objectives. The assessment methods of this course are composed of final examinations, experimental reports, and homework. The total score of each assessment method is 100, and the corresponding weights are 0.7, 0.2, and 0.1, respectively. In addition, the course has four curriculum objectives, of which curriculum objective 1 mainly examines the application ability of students to basic theories and methods in data structure. Curriculum objective 2 mainly examines students’ ability to accurately describe the process of dealing with complex engineering problems. Curriculum objective 3 mainly examines students’ abilities to comprehensively consider various technical requirements and design modules and solutions to meet specific needs. Curriculum objective 4 mainly examines the ability of students to conduct system simulation and optimization research through correlation analysis of relevant complex engineering problems. Table 5 shows the score relationship between curriculum objectives and assessment methods.

    5.6.1. Comparison of the Total Achievement Degree of Curriculum Objectives

    We divide the students in grade 2020 into three groups: students who become learning peers according to the recommendation results (Group 1), students who do not become learning peers according to the recommendation results (Group 2), and all students (Group 3). In engineering education certification, it is generally believed that the expected value of the achievement degree of curriculum objectives is 0.7, which indicates that the student learning outcomes meet the expectations. Based on this consensus, we analyzed the total achievement degree of the curriculum objectives of Group 1, Group 2, and Group 3. The experimental results of the total achievement degree of curriculum objectives for different groups are shown in Figure 8.
    From Figure 8, it can be observed that the total achievement degree of curriculum objectives in Group 3 is 0.688, which is composed of Group 1 and Group 2. The total achievement degree of curriculum objectives in Group 1 is greater than that in Group 2 and exceeds the expected value, which makes a great contribution to the improvement of the total achievement degree of curriculum objectives in Group 3. However, the total achievement degree of curriculum objectives in Group 2 is lower than 0.7, indicating that the completion of this group’s course has not met expectations. It can be concluded that Group 1 can complete the course well and achieve the expected objectives, indicating that the proposed method can encourage students to keep learning and help them pass the course.

    5.6.2. Comparison of the Achievement Degree of the Four Curriculum Objectives

    We analyzed the achievement degree of the four curriculum objectives for different groups, and the comparative results are shown in Figure 9 and Table 6.
    It can be seen from Figure 9 and Table 6 that the achievement degree of the four curriculum objectives in Group 1 is greater than that in Group 2. The achievement degree of the four curriculum objectives in Group 1 exceeds the expected value, and the achievement degree of curriculum objective 1 is the largest, which is 0.781. The achievement degree of curriculum objective 3 is the smallest, 0.740. On the contrary, the achievement degree of the four curriculum objectives in Group 2 is lower than the expected value, but the achievement degrees of curriculum objective 1 and curriculum objective 2 are close to 0.7, namely 0.683 and 0.680, respectively.
    In groups 1 and 2, the achievement degree of curriculum objective 1 is the highest, and the achievement degree of curriculum objective 3 is the lowest, which indicates that students have a strong ability to apply the basic theories and methods of data structure, but a weak ability to accurately describe the processing process of complex engineering problems. Based on the analysis of the achievement degree of curriculum objectives and the historical data, it can be concluded that there are two reasons for this situation:
    • When some students in Group 1 encounter difficulties, there is less interaction between students and teachers, resulting in insufficient learning of the corresponding difficult knowledge points.
    • Most of the students in Group 2 only have little interaction, and even many students do not complete the learning of course knowledge points.
    Based on the analysis of the total achievement degree of curriculum objectives and achievement degrees of the four curriculum objectives, we can conclude that the learning quality of Group 2 is better. It can be shown that the proposed method can improve students’ participation in the course and help students better learn relevant knowledge, which is of great significance for students to successfully pass the course and meet the graduation requirements.

    6. Conclusions

    In this paper, we propose a learning peer-recommendation method based on a weighted heterogeneous information network to increase student participation in online courses. First, we use a weighted heterogeneous information network to incorporate the multiple types of objects, relationships, and attribute values. Second, we propose a method for automatically generating all meaningful weighted meta-paths, which can effectively generate meaningful meta-paths for learning peer recommendations. Finally, this study employs an optimization process using Bayesian Personalized Ranking to learn the personalized weights on different meta-paths and calculate the recommendation score for candidate students. In addition, the method fuses heterogeneous information in the learning process and provides interpretability for the recommendation results from three perspectives. First, the method analyzes the effect of attribute values on links, which not only reveals what objects with which target students and candidate students commonly interact, but also indicates how much workload they devote to interacting with these objects. Second, the weighted meta-path can reflect students’ preferences. The selected meaningful weighted meta-paths reflect what information target students care about when choosing learning peers. Third, personalized weights for each meaningful meta-path are assigned via personalized weight-learning, reflecting the extent to which target students care about the information when choosing their learning peers. In this paper, we compare the proposed method LPRWHIN with several baseline methods on three real-world datasets. Experimental results indicate that LPRWHIN outperforms other baseline methods in terms of precision and recall. Moreover, we use the achievement degree evaluation system based on the concept of outcome-based education to reasonably evaluate the student learning outcomes. The final evaluation results can not only give feedback about the problems existing in the learning process to students, but also help teachers find the deficiencies in their teaching and provide suggestions for follow-up teaching.
    However, there are some limitations to our research. First, this study relies on students’ historical behavioral preferences, and data sparsity affects recommendation performance. Second, this study only focuses on the information relating to objects and the attribute values on links but does not consider the specific text content involved in the interaction between students and teachers.
    Future research may go in three directions. First, as the learning process continues, interactions between students will constantly evolve. Therefore, future research can implement a dynamic recommendation model using incremental computing in heterogeneous information networks to make adaptive recommendations. Second, broad learning has attracted much attention for its ability to overcome deep learning’s complex network structure and numerous parameters. Future research may integrate heterogeneous information networks and broad learning to discover more fine-grained student preference patterns and greatly improve recommendation performance. Third, real-world feedback from students is vital for improving recommendation methods. Future research should pay more attention to students’ responses to recommendation results.

    Author Contributions

    Conceptualization, Z.S. (Zhaoyu Shou) and Z.S. (Zhixuan Shi); methodology, Z.S. (Zhaoyu Shou) and Z.S. (Zhixuan Shi); software, Z.S. (Zhixuan Shi) and J.L.; validation, Z.S. (Zhaoyu Shou), Z.S. (Zhixuan Shi) and H.W.; formal analysis, Z.S. (Zhixuan Shi); investigation, Z.S. (Zhaoyu Shou) and Z.S. (Zhixuan Shi); resources, Z.S. (Zhaoyu Shou); data curation, Z.S. (Zhaoyu Shou), Z.S. (Zhixuan Shi) and H.W.; writing—original draft preparation, Z.S. (Zhixuan Shi); writing—review and editing, Z.S. (Zhaoyu Shou), Z.S. (Zhixuan Shi) and J.L.; visualization, Z.S. (Zhixuan Shi); supervision, Z.S. (Zhaoyu Shou) and H.W.; project administration, Z.S. (Zhaoyu Shou), Z.S. (Zhixuan Shi), H.W. and H.Z.; funding acquisition, Z.S. (Zhaoyu Shou). All authors have read and agreed to the published version of the manuscript.

    Funding

    This research is supported by The National Natural Science Foundation of China (62177012, 61967005, 62267003), Innovation Project of GUET Graduate Education (2020YCXS022, 2021YCXS033), the Key Laboratory of Cognitive Radio and Information Processing Ministry of Education (CRKL190107).

    Institutional Review Board Statement

    Not applicable.

    Informed Consent Statement

    Not applicable.

    Data Availability Statement

    The original data of the DBLP dataset and Aminer dataset supporting the study can be available at https://www.aminer.cn/citation (accessed on 7 January 2022) and https://www.aminer.cn/data/#Academic-Social-Network (accessed on 17 December 2021). However, the Online dataset is not suitable for direct publishing because it involves the personal privacy of students.

    Acknowledgments

    We are grateful to anonymous reviewers for their helpful comments.

    Conflicts of Interest

    The authors declare no conflict of interest.

    Abbreviations

    The following abbreviations are used in this manuscript:
    BPRBayesian Personalized RankingAn Optimization Framework is Employed to Build the Objective Function
    COVID-19Corona Virus Disease 2019Respiratory Diseases Caused by the Corona Virus
    LPRLearning Peer RecommendationA Method for Learning Peer Recommendation
    CNNConvolutional Neural NetworkA Kind of Feedforward Neural Network with Convolution Calculation and
    Deep Structure
    RiPPLEOpen-Source Course-LevelAn Online Learning Platform for Anyone
    Recommendation Platform
    LPRWHINLearning Peer RecommendationOur Proposed Methodology for Learning Peer Recommendation
    Method Based on a Weighted
    Heterogeneous Information Network
    MOOCMassive Open Online CoursesFree Online Courses are Available for Anyone
    DNNDeep Neural NetworksA Technology in the Field of Machine Learning
    OnlineOnline Learning Platform DatasetA Dataset for Learning Peer Recommendation
    DBLPDBLP DatasetA Dataset for Evaluating Performance
    AminerAminer DatasetA Dataset for Evaluating Performance
    TF-IDFTerm Frequency Inverse DocumentA Commonly Used Weighting Technique
    Frequencyfor Information Retrieval and Data Mining
    MetapathRecommendation Based on theA Method for Scholar-Friend Recommendation
    Meta-Path
    PRHNPaper Recommendation Based on aA Method for Paper Recommendation
    Heterogeneous Information Network
    MFMatrix FactorizationThe Method of Decomposing a Matrix into the Products of Several Matrices
    DMFDeep Matrix FactorizationA Deep Matrix Factorization Model for Recommender Systems

    References

    1. Butnaru, G.I.; Niță, V.; Anichiti, A.; Brînză, G. The effectiveness of online education during covid 19 pandemic—A comparative analysis between the perceptions of academic students and high school students from romania. Sustainability 2021, 13, 5311. [Google Scholar] [CrossRef]
    2. Runtian, Z. The Game and Integration between Online Education and Traditional School Education. In Proceedings of the 2020 11th International Conference on E-Education, E-Business, E-Management, and E-Learning, Osaka, Japan, 10–12 January 2020; pp. 121–124. [Google Scholar] [CrossRef]
    3. Bai, Y.; Li, Z.; Zhao, F.; Jiang, Y. Strategies, methods and problems of online education in China during the epidemic. In Proceedings of the 2020 Ninth International Conference of Educational Innovation through Technology (EITT), Porto, Portugal, 13–17 December 2020; pp. 30–34. [Google Scholar] [CrossRef]
    4. Chang, J. Online Instruction and Offline Classroom Teaching: A Study on Parallel Education Systems. In Proceedings of the 2021 9th International Conference on Information and Education Technology (ICIET), Okayama, Japan, 7–29 March 2021; pp. 157–160. [Google Scholar] [CrossRef]
    5. Mrhar, K.; Douimi, O.; Abik, M. A dropout predictor system in MOOCs based on neural networks. J. Autom. Mob. Robot. Intell. Syst. 2021, 42, 72–80. [Google Scholar] [CrossRef]
    6. Xu, B.; Yang, D. Study partners recommendation for xMOOCs learners. Comput. Intell. Neurosci. 2015, 2015, 15. [Google Scholar] [CrossRef] [PubMed]
    7. Hu, Q.; Han, Z.; Lin, X.; Huang, Q.; Zhang, X. Learning peer recommendation using attention-driven CNN with interaction tripartite graph. Inf. Sci. 2019, 479, 231–249. [Google Scholar] [CrossRef]
    8. Bouchet, F.; Labarthe, H.; Yacef, K.; Bachelet, R. Comparing peer recommendation strategies in a MOOC. In Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization, Bratislava, Slovakia, 9–12 July 2017; pp. 129–134. [Google Scholar] [CrossRef]
    9. Yang, D.; Wen, M.; Rose, C. Peer influence on attrition in massively open online courses. In Proceedings of the Educational data mining 2014, London, UK, 4–7 July 2014. [Google Scholar]
    10. Bouchet, F.; Labarthe, H.; Bachelet, R.; Yacef, K. Who wants to chat on a MOOC? Lessons from a peer recommender system. In Proceedings of the European Conference on Massive Open Online Courses, Madrid, Spain, 22–26 May 2017; Springer: Cham, Switzerland, 2017; pp. 150–159. [Google Scholar] [CrossRef]
    11. Potts, B.A.; Khosravi, H.; Reidsema, C.; Bakharia, A.; Belonogoff, M.; Fleming, M. Reciprocal peer recommendation for learning purposes. In Proceedings of the 8th international conference on learning analytics and knowledge, Tempe, AZ, USA, 4–8 March 2019; pp. 226–235. [Google Scholar] [CrossRef]
    12. Prabhakar, S.; Spanakis, G.; Zaiane, O. Reciprocal recommender system for learners in massive open online courses (MOOCs). In Proceedings of the International Conference on Web-Based Learning, Cape Town, South Africa, 20–22 September 2017; Springer: Cham, Switzerland, 2017; pp. 157–167. [Google Scholar] [CrossRef]
    13. Shi, C.; Li, Y.; Zhang, J.; Sun, Y.; Philip, S.Y. A survey of heterogeneous information network analysis. IEEE Trans. Knowl. Data Eng. 2016, 29, 17–37. [Google Scholar] [CrossRef]
    14. Xu, Y.; Zhou, D.; Ma, J. Scholar-friend recommendation in online academic communities: An approach based on heterogeneous network. Decis. Support Syst. 2019, 119, 1–13. [Google Scholar] [CrossRef]
    15. Liu, X.; Wu, K.; Liu, B.; Qian, R. HNERec: Scientific collaborator recommendation model based on heterogeneous network embedding. Inf. Process. Manag. 2023, 60, 103253. [Google Scholar] [CrossRef]
    16. Karnyoto, A.S.; Sun, C.; Liu, B.; Wang, X. Augmentation and heterogeneous graph neural network for AAAI2021-COVID-19 fake news detection. Int. J. Mach. Learn. Cybern. 2022, 13, 2033–2043. [Google Scholar] [CrossRef]
    17. Wu, Z.; Liang, Q.; Zhan, Z. Course Recommendation Based on Enhancement of Meta-Path Embedding in Heterogeneous Graph. Appl. Sci. 2023, 13, 2404. [Google Scholar] [CrossRef]
    18. Lops, P.; Gemmis, M.d.; Semeraro, G. Content-based recommender systems: State of the art and trends. In Recommender Systems Handbook; Springer: New York, NY, USA, 2011; pp. 73–105. [Google Scholar] [CrossRef]
    19. Huang, R.; Lu, R. Research on Content-based MOOC Recommender Model. In Proceedings of the 2018 5th International Conference on Systems and Informatics (ICSAI), Nanjing, China, 10–12 November 2018; pp. 676–681. [Google Scholar] [CrossRef]
    20. Campos, R.; dos Santos, R.P.; Oliveira, J. A Recommendation System based on Knowledge Gap Identification in MOOCs Ecosystems. In Proceedings of the XVI Brazilian Symposium on Information Systems, São Bernardo do Campo, Brazil, 3–6 November 2020; pp. 1–8. [Google Scholar] [CrossRef]
    21. Pang, Y.; Wang, N.; Zhang, Y.; Jin, Y.; Ji, W.; Tan, W. Prerequisite-related MOOC recommendation on learning path locating. Comput. Social Netw. 2019, 6, 1–16. [Google Scholar] [CrossRef]
    22. Zhao, X.; Liu, B. Application of Personalized Recommendation Technology in MOOC System. In Proceedings of the 2020 International Conference on Intelligent Transportation, Big Data & Smart City (ICITBS), Vientiane, Laos, 11–12 January 2020; pp. 720–723. [Google Scholar] [CrossRef]
    23. Çano, E.; Morisio, M. Hybrid recommender systems: A systematic literature review. Intell. Data Anal. 2017, 21, 1487–1524. [Google Scholar] [CrossRef]
    24. Safarov, F.; Kutlimuratov, A.; Abdusalomov, A.B.; Nasimov, R.; Cho, Y.I. Deep Learning Recommendations of E-Education Based on Clustering and Sequence. Electronics 2023, 12, 809. [Google Scholar] [CrossRef]
    25. Wu, L. Collaborative filtering recommendation algorithm for MOOC resources based on deep learning. Complexity 2021, 2021. [Google Scholar] [CrossRef]
    26. Liu, J.; Yin, C.; Li, Y.; Sun, H.; Zhou, H. Deep Learning and Collaborative Filtering-Based Methods for Students’ Performance Prediction and Course Recommendation. Wirel. Commun. Mobile Comput. 2021, 2021, 2157343. [Google Scholar] [CrossRef]
    27. Gong, J.; Wang, S.; Wang, J.; Feng, W.; Peng, H.; Tang, J.; Yu, P.S. Attentional graph convolutional networks for knowledge concept recommendation in moocs in a heterogeneous view. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual, 25–30 July 2020; pp. 79–88. [Google Scholar] [CrossRef]
    28. Uddin, I.; Imran, A.S.; Muhammad, K.; Fayyaz, N.; Sajjad, M. A systematic mapping review on MOOC recommender systems. IEEE Access 2021, 9, 118379–118405. [Google Scholar] [CrossRef]
    29. Urdaneta-Ponte, M.C.; Mendez-Zorrilla, A.; Oleagordia-Ruiz, I. Recommendation Systems for Education: Systematic Review. Electronics 2021, 10, 1611. [Google Scholar] [CrossRef]
    30. Li, Y.; Liu, J.; Ren, J. Social recommendation model based on user interaction in complex social networks. PLoS ONE 2019, 14, e0218957. [Google Scholar] [CrossRef]
    31. Ma, X.; Dong, L.; Wang, Y.; Li, Y.; Liu, Z.; Zhang, H. An enhanced attentive implicit relation embedding for social recommendation. Data Knowl. Eng. 2023, 145, 102142. [Google Scholar] [CrossRef]
    32. Paleti, L.; Radha Krishna, P.; Murthy, J. User opinions driven social recommendation system. Int. J. Knowl.-Based Intell. Eng. Syst. 2021, 25, 21–31. [Google Scholar] [CrossRef]
    33. Li, Y.; Wang, R.; Nan, G.; Li, D.; Li, M. A personalized paper recommendation method considering diverse user preferences. Decis. Support Syst. 2021, 146, 113546. [Google Scholar] [CrossRef]
    34. Li, H.; Li, J.; Li, G.; Gao, R.; Yan, L. Expert Recommendation in Community Question Answering via Heterogeneous Content Network Embedding. Comput. Mater. Contin. 2023, 75, 1687–1709. [Google Scholar] [CrossRef]
    35. Shou, Z.; Wen, Y.; Chen, P.; Zhang, H. Personalized Knowledge Map Recommendations based on Interactive Behavior Preferences. Int. J. Perform. Eng. 2021, 17, 36–49. [Google Scholar] [CrossRef]
    36. Zhu, H.; Liu, Y.; Tian, F.; Ni, Y.; Wu, K.; Chen, Y.; Zheng, Q. A cross-curriculum video recommendation algorithm based on a video-associated knowledge map. IEEE Access 2018, 6, 57562–57571. [Google Scholar] [CrossRef]
    37. Shi, C.; Zhang, Z.; Ji, Y.; Wang, W.; Yu, P.S.; Shi, Z. SemRec: A personalized semantic recommendation method based on weighted heterogeneous information networks. World Wide Web 2019, 22, 153–184. [Google Scholar] [CrossRef]
    38. Yang, C.; Zhao, C.; Wang, H.; Qiu, R.; Li, Y.; Mu, K. A semantic path-based similarity measure for weighted heterogeneous information networks. In Proceedings of the International Conference on Knowledge Science, Engineering and Management, Changchun, China, 17–19 August 2018; Springer: Cham, Switzerland, 2018; pp. 311–323. [Google Scholar] [CrossRef]
    39. Lu, M.; Wei, X.; Ye, D.; Dai, Y. A unified link prediction framework for predicting arbitrary relations in heterogeneous academic networks. IEEE Access 2019, 7, 124967–124987. [Google Scholar] [CrossRef]
    40. Vahedian, F.; Burke, R.; Mobasher, B. Multirelational recommendation in heterogeneous networks. ACM Trans. Web 2017, 11, 1–34. [Google Scholar] [CrossRef]
    41. Meilian, L.; Danna, Y. HIN_DRL: A random walk based dynamic network representation learning method for heterogeneous information networks. Expert Syst. Appl. 2020, 158, 113427. [Google Scholar] [CrossRef]
    42. Rendle, S.; Freudenthaler, C.; Gantner, Z.; Schmidt-Thieme, L. BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, Montreal, QC, Canada, 18–21 June 2009; pp. 452–461. [Google Scholar]
    43. Pham, T.A.N.; Li, X.; Cong, G.; Zhang, Z. A general recommendation model for heterogeneous networks. IEEE Trans. Knowl. Data Eng. 2016, 28, 3140–3153. [Google Scholar] [CrossRef]
    44. Tang, J.; Zhang, J.; Yao, L.; Li, J.; Zhang, L.; Su, Z. Arnetminer: Extraction and mining of academic social networks. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, NV, USA, 24–27 August 2008; pp. 990–998. [Google Scholar] [CrossRef]
    45. Katawazai, R. Implementing outcome-based education and student-centered learning in Afghan public universities: The current practices and challenges. Heliyon 2021, 7, e07076. [Google Scholar] [CrossRef]
    46. Koren, Y.; Bell, R.; Volinsky, C. Matrix factorization techniques for recommender systems. Computer 2009, 42, 30–37. [Google Scholar] [CrossRef]
    47. Xue, H.J.; Dai, X.; Zhang, J.; Huang, S.; Chen, J. Deep matrix factorization models for recommender systems. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-17), Melbourne, VIC, Australia, 19–25 August 2017; Volume 17, pp. 3203–3209. [Google Scholar] [CrossRef]
    48. Shi, C.; Hu, B.; Zhao, W.X.; Philip, S.Y. Heterogeneous information network embedding for recommendation. IEEE Trans. Knowl. Data Eng. 2018, 31, 357–370. [Google Scholar] [CrossRef]
    Figure 1. A weighted heterogeneous information network on an online learning platform.
    Figure 1. A weighted heterogeneous information network on an online learning platform.
    Electronics 12 02051 g001
    Figure 2. An example of network schema on an online learning platform.
    Figure 2. An example of network schema on an online learning platform.
    Electronics 12 02051 g002
    Figure 3. Block diagram of LPRWHIN.
    Figure 3. Block diagram of LPRWHIN.
    Electronics 12 02051 g003
    Figure 4. Example of automatically generating meta-paths.
    Figure 4. Example of automatically generating meta-paths.
    Electronics 12 02051 g004
    Figure 5. Network schema for different datasets. (a) Network schema for DBLP. (b) Network schema for Aminer.
    Figure 5. Network schema for different datasets. (a) Network schema for DBLP. (b) Network schema for Aminer.
    Electronics 12 02051 g005
    Figure 6. Results of LPRWHIN under different numbers of meta-paths on three datasets. (a) Results on DBLP. (b) Results on Aminer. (c) Results on Online.
    Figure 6. Results of LPRWHIN under different numbers of meta-paths on three datasets. (a) Results on DBLP. (b) Results on Aminer. (c) Results on Online.
    Electronics 12 02051 g006
    Figure 7. Performance comparison of five recommendation methods on three datasets. (a) Results on DBLP. (b) Results on Aminer. (c) Results on Online.
    Figure 7. Performance comparison of five recommendation methods on three datasets. (a) Results on DBLP. (b) Results on Aminer. (c) Results on Online.
    Electronics 12 02051 g007
    Figure 8. Comparison of the total achievement degree of curriculum objectives for different groups.
    Figure 8. Comparison of the total achievement degree of curriculum objectives for different groups.
    Electronics 12 02051 g008
    Figure 9. Results of the achievement degree of the four curriculum objectives.
    Figure 9. Results of the achievement degree of the four curriculum objectives.
    Electronics 12 02051 g009
    Table 1. Examples of weighted meta-paths and their semantic meanings.
    Table 1. Examples of weighted meta-paths and their semantic meanings.
    Weighted Meta-PathPath InstanceSemantic Meaning
    Student δ d o Exercises δ d o n e Student Liu 0.6 Binary tree 0.6 LiLiu and Li have the same test score of 0.6 on the binary tree
    Student δ q u e s t i o n Knowledge point δ ( answered by ) Teacher Wu 0.5 Stack 0.5 ZhangWu puts into interacting with Zhang on the stack, and the degree of student–teacher interaction is 0.5
    Table 2. Statistics of the experimental datasets.
    Table 2. Statistics of the experimental datasets.
    DatasetsObjectsNumberLinksNumber
    papers23,607paper-venues23,607
    DBLPvenues1796paper-authors80,535
    authors4524--
    papers16,358paper-venues16,358
    Aminervenues3765paper-authors59,343
    authors3925paper-terms81,790
    terms10,928--
    students1055videos-teachers207
    videos207students-exercises427,478
    teachers1students-videos283,061
    Onlineexercises163students-knowledge points310,090
    knowledge points207exercises-knowledge points7505
    --videos-knowledge points207
    --teachers-knowledge points10,490
    Table 3. Results of five recommendation methods on different datasets-precision.
    Table 3. Results of five recommendation methods on different datasets-precision.
    DBLP
    Recommendation methodsPre@1Pre@2Pre@3Pre@4Pre@5Pre@6Pre@7Pre@8Pre@9Pre@10
    LPRWHIN0.3240.2770.2290.2030.1830.1670.1530.1430.1340.126
    PRHN0.2960.2480.2100.1830.1640.1490.1370.1270.1190.112
    Metapath0.2050.1700.1550.1400.1310.1220.1140.1080.1030.098
    MF0.1440.1430.1350.1250.1180.1120.1070.1030.1000.096
    DMF0.2520.1950.1630.1440.1290.1160.1060.0970.0900.083
    Aminer
    Recommendation methodsPre @ 1 Pre @ 2 Pre @ 3 Pre @ 4 Pre @ 5 Pre @ 6 Pre @ 7 Pre @ 8 Pre @ 9 Pre @ 10
    LPRWHIN0.6170.5580.4730.4080.3590.3220.2930.2690.2490.231
    PRHN0.5920.5320.4460.3820.3330.2970.2690.2450.2270.211
    Metapath0.4330.3880.3560.3290.3060.2830.2640.2460.2310.218
    MF0.1810.1760.1730.1650.1580.1500.1430.1370.1310.126
    DMF0.2080.1770.1540.1360.1230.1120.1020.0940.0870.081
    Online
    Recommendation methodsPre @ 1 Pre @ 2 Pre @ 3 Pre @ 4 Pre @ 5 Pre @ 6 Pre @ 7 Pre @ 8 Pre @ 9 Pre @ 10
    LPRWHIN0.3020.2490.2120.1870.1630.1500.1410.1290.1200.114
    PRHN0.2400.2070.1830.1660.1440.1290.1170.1090.0980.089
    Metapath0.1920.1700.1480.1270.1150.1010.0970.0900.0850.079
    MF0.1690.1380.1240.1130.0990.0930.0830.0770.0690.062
    DMF0.2390.2040.1880.1650.1470.1330.1200.1100.0990.090
    Table 4. Results of five recommendation methods on different datasets-recall.
    Table 4. Results of five recommendation methods on different datasets-recall.
    DBLP
    Recommendation methodsRec@1Rec@2Rec@3Rec@4Rec@5Rec@6Rec@7Rec@8Rec@9Rec@10
    LPRWHIN0.0500.0860.1070.1260.1420.1550.1660.1770.1870.196
    PRHN0.0460.0770.0970.1140.1270.1380.1480.1580.1660.173
    Metapath0.0320.0530.0720.0870.1020.1130.1240.1350.1430.152
    MF0.0220.0440.0630.0780.0920.1040.1160.1280.1390.149
    DMF0.0390.0600.0760.0890.1000.1080.1150.1210.1250.128
    Aminer
    Recommendation methodsRec@1Rec@2Rec@3Rec@4Rec@5Rec@6Rec@7Rec@8Rec@9Rec@10
    LPRWHIN0.0900.1630.2070.2380.2620.2810.2990.3140.3260.338
    PRHN0.0860.1550.1950.2230.2430.2600.2740.2860.2970.307
    Metapath0.0630.1130.1560.1920.2230.2480.2690.2870.3030.318
    MF0.0260.0510.0760.0960.1150.1310.1460.1590.1720.184
    DMF0.0300.0520.0670.0790.0900.0980.1040.1100.1140.117
    Online
    Recommendation methodsRec@1Rec@2Rec@3Rec@4Rec@5Rec@6Rec@7Rec@8Rec@9Rec@10
    LPRWHIN0.0860.1420.1800.2130.2320.2560.2790.2920.3070.324
    PRHN0.0680.1170.1560.1890.2050.2200.2330.2470.2500.251
    Metapath0.0550.0960.1260.1440.1640.1730.1920.2040.2160.226
    MF0.0480.0780.1060.1280.1400.1580.1640.1740.1760.177
    DMF0.0680.1160.1600.1870.2090.2260.2380.2500.2540.256
    Table 5. The score relationship between curriculum objectives and assessment methods.
    Table 5. The score relationship between curriculum objectives and assessment methods.
    Assessment Method 1Assessment Method 2Assessment Method 3Target ScoreEquivalent Score
    Curriculum objective 12020206020
    Curriculum objective 22020206020
    Curriculum objective 33030309030
    Curriculum objective 43030309030
    Total100100100300100
    Table 6. Comparative results of the achievement degree of the four curriculum objectives.
    Table 6. Comparative results of the achievement degree of the four curriculum objectives.
    Curriculum Objective 1Curriculum Objective 2Curriculum Objective 3Curriculum Objective 4
    Expected values0.7000.7000.7000.700
    Group 10.7810.7660.7400.760
    Group 20.6830.6800.6140.631
    Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

    Share and Cite

    MDPI and ACS Style

    Shou, Z.; Shi, Z.; Wen, H.; Liu, J.; Zhang, H. Learning Peer Recommendation Based on Weighted Heterogeneous Information Networks on Online Learning Platforms. Electronics 2023, 12, 2051. https://doi.org/10.3390/electronics12092051

    AMA Style

    Shou Z, Shi Z, Wen H, Liu J, Zhang H. Learning Peer Recommendation Based on Weighted Heterogeneous Information Networks on Online Learning Platforms. Electronics. 2023; 12(9):2051. https://doi.org/10.3390/electronics12092051

    Chicago/Turabian Style

    Shou, Zhaoyu, Zhixuan Shi, Hui Wen, Jinghua Liu, and Huibing Zhang. 2023. "Learning Peer Recommendation Based on Weighted Heterogeneous Information Networks on Online Learning Platforms" Electronics 12, no. 9: 2051. https://doi.org/10.3390/electronics12092051

    Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

    Article Metrics

    Back to TopTop