Enhanced Social Recommendation Method Integrating Rating Bias Offsets

Han, Lu; Qin, Jiwei; Xia, Boshen

doi:10.3390/electronics12183926

by Lu Han ¹,², Jiwei Qin ¹,²,* and Boshen Xia ¹,²

¹ School of Information Science and Engineering, Xinjiang University, Urumqi 830046, China

² Key Laboratory of Signal Detection and Processing, Xinjiang Uygur Autonomous Region, Xinjiang University, Urumqi 830046, China

* Author to whom correspondence should be addressed.

Electronics 2023, 12(18), 3926; https://doi.org/10.3390/electronics12183926

Submission received: 13 August 2023 / Revised: 7 September 2023 / Accepted: 13 September 2023 / Published: 18 September 2023

(This article belongs to the Special Issue Data Push and Data Mining in the Age of Artificial Intelligence)

Abstract:

Current social recommendations based on Graph Neural Networks (GNNs) often neglect to extract rating bias from user and item statistics, leading to misinterpreting real user preferences. For example, a high rating from a user with lenient rating standards and a high average rating does not always indicate a real preference for the item. This situation highlights inherent flaws in existing recommendation algorithms that do not adequately account for bias in user and item ratings and rating trends. To address this problem, this paper proposes an enhanced social recommendation method based on GNNs with integrated rating bias offsets (SR-BS). Firstly, we obtain rating bias from users and items by subtracting their average rating value from the historical rating value for each user/item. To enhance the model’s learning capability, we transform the rating biases into vector representations. Secondly, in the model learning, diverse meta-paths are predefined for modeling interaction relations between graph nodes (e.g., user–item–user, user–user). The aggregation of semantic information from these relational paths is achieved by stacking multiple GNN layers, enabling the fusion of higher-order information. Finally, the experimental results on four datasets—Ciao, Epinions, Douban, and FilmTrust—show that our method outperforms other state-of-the-art methods in social recommendation tasks, exhibiting high stability and personalization.

Keywords:

recommendation system; social recommendations; graph neural networks; global-layer information

1. Introduction

With the advent of social networks, social recommendation systems have gained widespread application in information filtering and personalized services. By utilizing social relations as supplementary information, these systems predict user preferences and interests more accurately [1,2]. For instance, an effective movie recommendation system can efficiently suggest movies that align with a user’s interests from an extensive collection, thereby saving time and enhancing satisfaction. Nevertheless, the critical challenges in this domain revolve around fully leveraging user–item interaction data and social relations, while also addressing issues like data sparsity and bias.

Specifically, within social networks, users with direct links typically exhibit similar behaviors [3]. Based on this theory, social recommendation algorithms have started considering incorporating social relations as ancillary information into recommendation systems. Early research primarily focused on methods based on Matrix Factorization (MF), such as collaborative decomposition and regularization techniques. However, addressing the challenge of insufficient interaction data between users or items in social recommendations is crucial to improve the accuracy of identifying and predicting user preferences. To counter this issue, Ma et al. [4] devised an MF-based model on social regularization. In this model, the correlation between social relations is introduced into the regularization term of the objective function. While their method did not completely resolve the issue, it effectively leveraged social network information to alleviate the problem of data sparsity in collaborative filtering, highlighting the importance of integrating social relations in the recommendation process. Furthermore, the interactions between users and items in social recommendation systems (e.g., ratings) have nonlinearity and complexity. MF-based methods often find it challenging to handle these nonlinear relations and high-dimensional features, thus limiting the model’s expressive capability and recommendation performance. In contrast to MF-based methods, Graph Neural Networks (GNNs) have shown great strength in handling unstructured data. They have been applied in various fields, such as bioinformatics, computer vision, and social network analysis [5], offering a research direction with enormous potential. In the field of social recommendation, some recommendation methods based on GNNs have been developed. These methods represent user–item interaction and observed social relations as two graphs. 
Then, they apply GNNs to learn effective node representations, aiming to capture users’ social influence and interaction patterns.

Current GNN-based recommendation methods mainly learn from the original interaction graph, and most GNNs are subject to inductive bias, which can cause false positives in the final result. Moreover, statistical information about the graph has not been adequately exploited: ignoring the impact of rating bias on user preferences may lead to a misinterpretation of the user’s true preferences. In our study, user rating bias is the tendency of users to rate items differently based on their personal preferences and rating behavior. Low rating standards describe relatively generous users who give high ratings to a variety of items, potentially overstating the perceived quality of those items. High average ratings reflect the tendency of some users to consistently give very positive ratings, which raises the average rating of the items they interact with. For example, some users are lenient and often give higher ratings, while others are more critical and tend to give lower ratings. Such differences in user behavior can lead to discrepancies between perceived and actual item preferences, affecting the accuracy of recommendations. In a previous study, Koren et al. [6] also considered such factors and designed a collaborative filtering model that introduces user and item bias terms to model user rating bias. However, this approach has limitations, such as the propagation of latent bias and the lack of explicit use of statistical information for modeling and understanding. Rating bias has been recognized as a critical factor affecting the accuracy of recommender systems, and the concept has been extensively studied in the literature [7]. Despite the extensive work in this area, integrating rating bias into recommendation models, especially GNN-based models, remains relatively unexplored. Additionally, existing methods often rely too heavily on the patterns and trends in the original data, which makes capturing and correcting biases challenging. For example, GDSRec [8] is a recommendation model that employs a GNN to improve social recommendations by considering rating bias and social relations. However, it focuses mainly on static correction factors and may not fully address the dynamic nature of rating bias, which limits its ability to capture various aspects of user preferences. Besides direct user–item interaction, many potential higher-order relations (e.g., user similarity and item similarity) can enrich graph information and benefit the recommendation task, but have not been fully exploited. Although some research shows that Graph Neural Collaborative Filtering effectively uses these latent higher-order relations to obtain more accurate and personalized recommendation results [9], many existing social recommendation methods have not effectively implemented this. In conclusion, while current social recommendation methods have progressed in handling sparse data and capturing users’ social influence and interaction patterns, there is still room for improvement in processing statistical information about the graph, correcting rating biases, and fully utilizing higher-order relations.

Addressing the aforementioned issues, this paper proposes an enhanced social recommendation method based on a GNN integrated with rating bias offsets, which utilizes GNNs not only to model user–item interactions, but also to include rating bias offsets as an inherent part of the learning process. To elaborate, within the recommendation models, we engage bias data that are formed based on statistical information, employing the user/item rating bias offset as an auxiliary vector for learning the latent characteristics of users and items. Including a bias vector facilitates the model’s in-depth exploration of the relations between users and items, effectively considering the real preferences of users and the difference in personalization of the items, thereby accurately portraying their characteristics to enhance system performance. To realize this aim, we have designed a decentralized interaction graph that embodies the statistical bias information of users (items). Embedding the decentralized graph into our proposed model and explicitly extracting biased information creates a rich representation that enhances model learning, considering associations among users, among items, and between users and items more comprehensively. Moreover, to handle high-order semantic relations, we utilize a multi-layer GNN architecture to learn high-order node embedding representations to deeply excavate nodes’ interactive features, thereby accurately portraying the nodes’ characteristics and behaviors within the social recommendation system. Our primary contributions can be summarized as follows:

We adopt a relative perspective, utilizing the original data’s statistical information to adjust ratings to be zero-centered, thereby mitigating the impact of overall rating trends. Then, the original heterogeneous infographics are processed into decentralized graphs and biased information is explicitly extracted from the decentralized graphs. Next, we regard rating bias as an additional vector and incorporate it into the model proposed in this paper, thereby modeling the latent representations of users/items.
We define multiple meta-paths by combining rich node relations, and by fusing the Graph Attention Network (GAT) of meta-paths, high-order information is explicitly encoded into the embedded learning to explore user relations and item relations with high-order correlations.
We propose SR-BS, an enhanced social recommendation model that combines GNNs with rating bias. This model learns the higher-order representations of users and items on decentralized graphs, leveraging rating bias offsets.
We conducted extensive experiments on four publicly available datasets to verify the efficacy of the proposed method. The experimental outcomes demonstrate an improvement over the state-of-the-art methods in the Mean Absolute Error (MAE) of Ciao (2.61%), Epinions (1.46%), Douban (0.52%), and FilmTrust (0.55%), indicating that our SR-BS model can generate superior recommendation results.

The remainder of this paper is organized as follows: Section 2 elaborates on the background of social recommendations and related work. Section 3 elaborates on some preliminary content and the proposed framework. Subsequently, in Section 4, we report and analyze the experimental results. Lastly, Section 5 concludes the paper.

2. Related Work

There are two main evolutions of social recommendation. In this section, we briefly describe the classical social recommendation method and GNN-based social recommendation method.

Classic social recommendation method

Early research on social recommendation algorithms focused on MF, leveraging it as a fundamental model due to its flexibility in incorporating prior knowledge. MF-based recommendation models integrated social information to capture more expressive user preference vectors and to achieve noteworthy success in various scenarios [10]. We can broadly classify recommendation models based on MF into two categories: co-factorization-based methods and social regularization-based methods. The co-factorization method is popular due to its efficiency and scalability. This method analyzes the historical rating records of users to identify users with analogous preferences to the target user. Then, overlapping rating information is used to predict target user preferences, thus achieving recommendations [11]. Such methods have many applications and spawned many social network-based recommendation algorithms [11,12,13]. In [12], SoRec is an MF-based co-factorization method. This method cleverly incorporates social information into traditional recommendation algorithms by analyzing the impact of users’ social information and rating records on users’ interests. TrustMF [13] proposes an innovative hybrid recommendation model that integrates the roles of the trustor and the trusted. The model is achieved by employing matrix factorization on the trusted network among users within social interactions to drive rating predictions. TrustSVD [11] proposes that the influence of ratings and trust information on user interest is not limited to the explicit level, but also exists at the implicit level. The model significantly boosts the accuracy of the recommender system by modeling explicit and implicit influences in an integrated way. In addition, to further augment the expressive power of learned user representations, the researchers also introduced an MF-based social regularization strategy, which used regularization constraints to model the social relation among users. 
SoReg [4] is a typical regularization model that uses the social matrix to impose regularization constraints on user feature vectors. SoDimRec [14], on the other hand, takes another perspective by considering the heterogeneity and weak dependency of social relations as a regularization condition. SocialMF [15] shows that a user’s potential interests are influenced, to some extent, by the interests of their social neighbors by incorporating a social network-based trust propagation mechanism into a recommendation algorithm to capture the user’s embedded representation.

Despite these advances, traditional social recommendation algorithms mainly capture linear information and are limited in learning the complexity of user–item interactions and the social domain. Against this background, researchers introduced a Deep Neural Network (DNN)-based social recommendation method [16]. The technique utilizes the modeling capabilities of DNNs to jointly depict the user–item interactions and the user’s social network and to mine nonlinear features from the interaction network, thus giving collaborative filtering recommendation techniques more robust representation and generalization capabilities. NeurMF [17] combines traditional MF and Multilayer Perceptron (MLP) techniques to extract low-dimensional and high-dimensional features efficiently, thus achieving superior recommendation results. DeepSoR [18], a DNN-based social recommendation model, learns embedded representations from social relations and incorporates these representations into probabilistic matrix factorization for accurate prediction.

Nonetheless, these methods still lack explicitness in encoding informative interactions, especially implicit collaborative signals, which they may fail to capture adequately. The NSCR model [19] uses users in social networks as bridges for information propagation, further propagating the user representations generated by attribute-aware deep collaborative filtering models. To summarize, existing research has explored the potential value of interaction data and social relations to some extent. However, a great deal of work remains to be done in excavating and utilizing high-quality and credible relations within social networks.

GNN-based social recommendation method

In the domain of social recommendations, interactions between users and their social neighbors, as well as between users and items, can be intuitively constructed as user–user and user–item graphs. GNNs [20], generalized neural networks based on graph structure, effectively extract feature information from adjacent nodes and capture the network’s topology. Recently, GNNs have gained widespread attention in the study of recommendation systems and achieved significant progress and breakthroughs in numerous applications [21,22,23]. GraphRec [21] is a GNN-based social recommendation algorithm that captures interaction information in the user–item interaction graph and extracts social information in the social graph through a joint modeling strategy, focusing on modeling the influence of first-order neighbors. Recent studies have also made breakthroughs in exploring higher-order influences.

Specifically, the DiffNet [22] model employs a multi-layer graph convolutional neural network based on trust propagation, effectively capturing user preference diffusion in social networks. Furthermore, the DiffNet++ [23] model, an enhancement of DiffNet, introduces a multilevel attention mechanism to compute the respective influence of neighbor nodes and graph information in the aggregation process, thus capturing the characteristics of users and items more accurately. Some previous recommender systems have recognized the importance of rating biases and attempted to address this problem through various mechanisms. For example, in [6], a sophisticated collaborative filtering model is constructed to capture these rating biases by introducing bias terms for users and items. GDSRec [8], a novel model for social recommendation using GNNs, enhances the microscopic nature of social connections by aggregating rating bias offsets for users and items, combined with the strength of social ties based on similarity of preferences. These mechanisms typically focus on static correction factors or post-processing techniques. However, these approaches tend to ignore the dynamic nature of rating bias and may not be fully effective in capturing the intricacies of user preferences. Inspired by the potential of GNNs in processing graph-structured data, researchers are experimenting with GNNs to capture the complex connections between users and items. For instance, DANSER [24], a model for collaborative filtering using dual graph attention neural networks, can simultaneously reveal the social homogeneity and social influence of users and items. Specifically, DANSER integrates social homogeneity information through the vector representation of neighbors and aggregates social influence using neighbors’ context-aware preferences.

DICER [25], based on the social relations between users, collaborative similarities, and the collaborative relations between items, considers them to be deep context relations to model the interaction influence between users and items. The model created a relation-aware neural network to model the interests of users and their friends. On the other hand, the core concept of DESIGN [26] is that social relations reflect similarities in user preferences. However, separately modeling the social and user–item graphs led to two models with different characteristics. To address this issue, DESIGN trained a model that incorporated the user–user social graph and the user–item graph and trained an auxiliary model for each graph independently. Adopting the knowledge distillation method constrained the model’s training process, promoting learning communication between the models.

Nevertheless, although GNN-based social recommendation has successfully modeled explicit social influence, in most cases these methods fail to adequately consider the data bias present in the statistical information of the graph. Moreover, they still need improvement in learning high-order node relations in heterogeneous networks (i.e., relations beyond simple pairwise interactions and directly connected social relations).

3. Proposed Framework and Problem Definition

3.1. Notation and Problem Definition

In GNN-based social recommendation tasks, we deal with two entities: users and items. Here, we first introduce the notation and the problem definition. The user and item sets, $U = \{u_{1}, u_{2}, \dots, u_{M}\}$ and $V = \{v_{1}, v_{2}, \dots, v_{N}\}$, consist of $M$ users and $N$ items, respectively. The preference matrix, denoted as $R = [r_{ij}]_{M \times N}$, reflects users’ real-valued ratings for items. In this matrix, $r_{ij}$ is the rating by the user $u_{i}$ for the item $v_{j}$. The set of observed ratings is $O = \{(u_{i}, v_{j}) \mid r_{ij} \text{ is observed}\}$, where a rating is observed when the user $u_{i}$’s rating value $r_{ij}$ for the item $v_{j}$ is neither 0 nor null. The set of users that interact directly with the item $v_{j}$ is denoted by $I(v_{j})$, and the set of items that interact directly with the user $u_{i}$ is denoted by $I(u_{i})$. $A(u_{i})$ and $A(v_{j})$ are defined as the average ratings of the user $u_{i}$ and the item $v_{j}$, respectively. Next, we extract the higher-order relations $S_{U}$ and $S_{V}$ for users and items, defining different meta-paths to represent inter-user and inter-item relations according to the different higher-order relations. Hence, we extend a typical social recommender system by introducing higher-order relations and meta-paths.

Based on the constructed social relation graph, item similarity graph, user–item decentralized graph, and item–user decentralized graph, we model similarity relations among users and among items, as well as interactions between users and items, to predict the unobserved ratings in R and recommend items to users based on these predicted ratings.

During the learning process, we recognized the potential advantage of incorporating the influence of the time factor into recommendation models, as it can lead to more accurate and personalized recommendations. In practice, users may show different interests in items at different points in time, and the time factor can also reveal some trends regarding changes in user preferences, so changes in user preferences over time can significantly affect the relevance and effectiveness of recommendations. Although our current research emphasizes addressing the issue of rating bias and trends, we cannot ignore the importance of adjusting recommendation models to changing user preferences. Therefore, we intend to explore the integration of time factors in our future research to improve the accuracy and relevance of recommendations further.

3.2. Relation Graph Construction

Most social recommender systems enhance effectiveness by leveraging explicit social relations, such as friendships between users [27]. However, in addition to the explicit relations, the implicit user–item interaction and the graph structure’s semantic relations will play a role. It has been shown that higher-order relations can provide valuable clues to exploit the various relations between features [28] to mine the different preferences of users for items and optimize the learning of user and item representations in the social recommendation. By considering higher-order relations, social recommender systems can better comprehend inter-entity connections, incorporating this information into recommendation algorithms for more accurate, personalized outputs.

3.2.1. Social Graph

One-hop relations among users typically suggest similar preferences. Nevertheless, such relations are frequently constrained by the local graph structure (e.g., degree) and inadequately capture the social relations between one-hop and multi-hop users. This limitation restricts our capacity to harness social information fully. Therefore, we extend direct relations and construct higher-level user relations to compensate for this shortcoming. Specifically, we scrutinize user interactions (e.g., following links) to unearth deeper similarities and construct extended networks containing second-order or higher social relations. This method allows us to form a richer graph of users’ social relations. For the user $u_{i}$, we define the higher-order user relations $S_{U}(i)$ as follows:

$$S_{U}(i) = \{k \mid \| \{ j \in U : c_{ji} = 1 \wedge c_{jk} = 1 \} \| \geq \tau \} \tag{1}$$

Here, $c_{ji} = 1$ signifies a direct following link between the user $u_{j}$ and the user $u_{i}$, while $c_{ji} = 0$ indicates the contrary. $\| \cdot \|$ denotes the number of users, and $\tau$ is a threshold we set to decide whether the following links shared by two users suffice to define a higher-order relation. The formula counts the users $u_{j}$ that have a direct following link with both $u_{i}$ and $u_{k}$; when this number of shared users reaches or exceeds $\tau$, we consider the relation between $u_{i}$ and $u_{k}$ a higher-order relation. Experiments across multiple public datasets (refer to Section 4) show that the higher-order social relations constructed in this way capture more comprehensive user preferences and reveal deeper similarities among users, which helps us learn user interests and analyze behavior in greater depth.
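The counting in Equation (1) can be illustrated with a small sketch. It is not the authors' implementation: the function name `higher_order_user_relations`, the dense matrix layout `C[j, i]` for $c_{ji}$, and the choice to exclude $k = i$ are all assumptions made for illustration.

```python
import numpy as np

def higher_order_user_relations(C, tau):
    """Return {i: set of k} such that u_i and u_k share at least tau users j
    with c_ji = 1 and c_jk = 1 (a reading of Equation (1)).

    C is assumed to be a square 0/1 matrix with C[j, i] = c_ji.
    """
    C = (np.asarray(C) != 0).astype(int)
    # shared[i, k] = number of users j with c_ji = 1 and c_jk = 1
    shared = C.T @ C
    # Assumption: a user is not its own higher-order neighbour.
    np.fill_diagonal(shared, 0)
    return {
        i: set(np.flatnonzero(shared[i] >= tau).tolist())
        for i in range(C.shape[0])
    }

# Toy social graph: users 0 and 1 both have links to users 2 and 3.
C = np.array([
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
])
relations = higher_order_user_relations(C, tau=2)
print(relations)
```

In this toy graph, users 2 and 3 share two linked users, so with `tau=2` they become higher-order neighbours of each other, while users 0 and 1 gain none. For large, sparse social graphs a sparse matrix product would be the more practical choice.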

3.2.2. Item Similarity Graph

Traditional item-based collaborative filtering methods rely heavily on inter-item similarity for user rating predictions, typically utilizing metrics like the Pearson Correlation Coefficient, which predominantly considers the proximity of ratings between items but often overlooks the number of users who have rated the items. However, two items are more similar when they receive close ratings and have a large number of rating users. Therefore, to measure the similarity between items more accurately, we propose to calculate the similarity between items with close ratings based on the number of users who rated them, and then construct the item similarity graph. Specifically, given that the user $u_{i}$ has given the ratings $r_{ij}$ and $r_{ik}$ to the items $v_{j}$ and $v_{k}$, respectively, we define the similarity $l_{jk}^{i}$ between the items $v_{j}$ and $v_{k}$ to be:

$$l_{jk}^{i} = \frac{1}{|r_{ij} - r_{ik}| + 1} \tag{2}$$

This formula means that the closer the ratings of the items $v_{j}$ and $v_{k}$ are, the higher their similarity $l_{jk}^{i}$ is; the more the ratings differ, the lower the similarity. $|\cdot|$ denotes the absolute value. Then, considering that the number of rating users also influences item similarity, we further incorporate the set of users who have rated both items into the definition of item similarity, i.e., $l_{jk} = \sum_{i \in U_{jk}} l_{jk}^{i}$, where $U_{jk}$ denotes the set of users who have rated both of the items $v_{j}$ and $v_{k}$. When two items gain similar ratings from a large user base, it generally suggests that these items possess feature similarity, which should amplify the inter-item similarity. To mine richer higher-order item similarity relations, we require $l_{jk} > 1$ and select the top 20 most similar items, ranked by $l_{jk}$ from largest to smallest. For the item $v_{j}$, we define the higher-order item relations $S_{V}(j)$ as follows:

$$S_{V}(j) = \{k, \dots \mid l_{jk} > \dots > 1\} \tag{3}$$

We employ a similarity measure $l_{jk}$, which combines both the item rating differences and the set of users who have rated both items, and define higher-order similarity relations for each item based on this measure. This integrated assessment helps to reveal more precise item similarity.
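As a rough illustration of Equations (2) and (3), the sketch below accumulates $l_{jk}$ over co-rating users and keeps, for each item, up to 20 neighbours with $l_{jk} > 1$. The nested-dictionary input format, the function names, and the tie-breaking by item identifier are assumptions for this example, not details taken from the paper.

```python
from collections import defaultdict
from itertools import combinations

def item_similarity(ratings):
    """ratings: {user: {item: rating}}. Returns {(j, k): l_jk} with j < k,
    where l_jk sums 1 / (|r_ij - r_ik| + 1) over users who rated both items."""
    sim = defaultdict(float)
    for user_ratings in ratings.values():
        for j, k in combinations(sorted(user_ratings), 2):
            sim[(j, k)] += 1.0 / (abs(user_ratings[j] - user_ratings[k]) + 1.0)
    return dict(sim)

def higher_order_item_relations(sim, top_n=20, min_sim=1.0):
    """For each item, keep up to top_n neighbours with l_jk > min_sim,
    ordered from most to least similar (ties broken by item id)."""
    neighbours = defaultdict(list)
    for (j, k), value in sim.items():
        if value > min_sim:
            neighbours[j].append((value, k))
            neighbours[k].append((value, j))
    return {
        j: [k for _, k in sorted(cands, key=lambda p: (-p[0], p[1]))[:top_n]]
        for j, cands in neighbours.items()
    }

# Toy data: v1 and v2 receive close ratings from three users.
ratings = {
    "u1": {"v1": 5, "v2": 5, "v3": 1},
    "u2": {"v1": 4, "v2": 4, "v3": 2},
    "u3": {"v1": 3, "v2": 2},
}
sim = item_similarity(ratings)
relations = higher_order_item_relations(sim)
print(sim[("v1", "v2")], relations)
```

Here `v1` and `v2` accumulate a similarity of 2.5 from three co-rating users and pass the threshold, whereas the pairs involving `v3` stay below 1 and are dropped. Note that a single co-rating user can contribute at most 1, so the $l_{jk} > 1$ condition implicitly requires at least two co-rating users.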

3.2.3. Decentralized Graph

The user–item interaction graph, as shown in Figure 1, describes user and item interaction behaviors, where the edges represent users’ ratings for items; for example, the ratings of users $u_{1}$ and $u_{2}$ for item $v_{1}$ are $r_{11}$ and $r_{21}$, respectively. The social graph contains the following links between users, and the start and end nodes of each arrow represent the following user and the followed user, respectively. In the social graph, the user $u_{1}$ has direct following links with the users $u_{2}$, $u_{3}$, and $u_{5}$, and we can represent these relations as $c_{12} = c_{13} = c_{15} = 1$. When we align the user nodes in these two graphs, we form a heterogeneous infographic containing users’ social and user–item interaction information. In this graph, users have both neighbors in the social space (friends) and neighbors in the interaction space (items); items have both neighbors in the interaction space (items interacted with by the same user) and neighbors in the social space (items interacted with by similar users). However, directly deciphering user preferences from heterogeneous infographics may not yield accurate results. For example, a lenient user may rate all items highly, but this does not mean they like them all. This user-generated bias may impact our ability to discern the user’s latent interests. To understand user preferences more accurately, we need to consider the overall rating tendencies of users and items. Mitigating this bias by utilizing the graph’s statistical information further helps us distinguish users’ real preferences. Figure 1 shows the specific handling.

To decompose the original heterogeneous infographic into two decentralized graphs, we apply the following process. As shown in Figure 1, for user $u_{1}$, we subtract the average rating of the corresponding item from the rating of each user–item pair (e.g., $r_{11} - A(v_{1})$, $r_{12} - A(v_{2})$, $r_{13} - A(v_{3})$) and use the resulting difference as the value of the edge to that item node; with the user node as the unique center node, this yields the user–item decentralized graph. For item $v_{1}$, we subtract the average rating of the corresponding user from each rating the item received (e.g., $r_{11} - A(u_{1})$, $r_{21} - A(u_{2})$) and use the resulting difference as the value of the edge to that user node; with the item node as the unique center node, this yields the item–user decentralized graph. If a user or item lacks historical interactions, we use the dataset’s average rating as the mean for that user or item.
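The decentering step can be sketched as follows, assuming ratings arrive as `(user, item, rating)` triples. The function names and the dictionary-based edge representation are hypothetical; the global-mean fallback mirrors the rule stated above for users or items without history, although in this toy example every node has at least one rating, so the fallback is never triggered.

```python
from collections import defaultdict

def mean_ratings(ratings):
    """Compute A(u), A(v), and the dataset mean from (user, item, rating) triples."""
    by_user, by_item = defaultdict(list), defaultdict(list)
    for u, v, r in ratings:
        by_user[u].append(r)
        by_item[v].append(r)
    avg_user = {u: sum(rs) / len(rs) for u, rs in by_user.items()}
    avg_item = {v: sum(rs) / len(rs) for v, rs in by_item.items()}
    global_mean = sum(r for _, _, r in ratings) / len(ratings)
    return avg_user, avg_item, global_mean

def decentralized_graphs(ratings):
    """Return edge values of the two decentralized graphs:
    user-item edges carry r_ij - A(v_j); item-user edges carry r_ij - A(u_i)."""
    avg_user, avg_item, global_mean = mean_ratings(ratings)
    # .get(..., global_mean) is the fallback for nodes without rating history.
    user_item = {(u, v): r - avg_item.get(v, global_mean) for u, v, r in ratings}
    item_user = {(v, u): r - avg_user.get(u, global_mean) for u, v, r in ratings}
    return user_item, item_user

# Toy data: u1 is lenient (mean 4.5) and u2 is strict (mean 1.5).
ratings = [("u1", "v1", 5), ("u1", "v2", 4), ("u2", "v1", 2), ("u2", "v2", 1)]
user_item, item_user = decentralized_graphs(ratings)
print(item_user[("v1", "u1")], item_user[("v1", "u2")])
```

Although `u1` rates `v1` with 5 and `u2` rates it with 2, both ratings sit 0.5 above the respective user's own mean, so the item–user decentralized graph assigns them the same edge value. This is the effect the zero-centering is meant to expose.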

By calculating the rating difference and combining it with the overall rating trend, the rating bias of the user, and the item’s popularity, we can obtain a more realistic picture of the user’s preference for the item and the attractiveness of the item to the user. Using decentralized graphs for training recommendation models, compared to original graphs, effectively curbs noise introduced by averaging, enabling a more accurate capture of user interests and item popularity.

In our approach, we represent the interactions between users and items as a bipartite graph. This graph structure allows us to capture higher-order information about users and items, where k denotes the number of hops in the graph (k > 1).

3.3. Predefined Meta-Paths

During training, the model might learn low-relevance connections, which may lead to distraction and thus ignore the higher-order user–item correlations. For instance, there is a direct following link between the users

u_{2}

and

u_{1}

in the heterogeneous infographics in Figure 1. The user

u_{2}

rated the item

v_{1}

as 2, and the user

u_{1}

rated the item

v_{1}

as 3. They rated the item

v_{1}

similarly, so the relation among the nodes

u_{2}

,

u_{1}

, and

v_{1}

is much closer. In contrast, the nodes on the path linking

u_{3} - u_{1} - u_{2} - v_{1}

are not so closely related because there is no direct interaction between the user

u_{3}

and item

v_{1}

. We hypothesize that by preferring reliable links and appropriately ignoring unreliable links, we can enhance the learning ability of the final embedding representation and reduce the effect of noise. Drawing from successful network embedding models [22], we define a variety of meta-paths by combining the relationship graphs constructed above and based on the similarity of influence effects and collaborative signals, which are designed to help identify various connections in the relationship graphs. By utilizing meta-paths, we can not only represent traditional low-order relations, but also reveal higher-order relations that are important for enhancing user and item preference learning. Moreover, this approach also helps us to dig deeper into the semantic correlations between different types of nodes. Specifically, we predefine a set of meta-paths to describe different types of interactions between users (U) and items (I) in the graph. Each meta-path is a specific sequence of nodes and edges representing a particular interaction pattern. For example, an interaction path

Path_{4}

representing items with rating bias reflecting user preferences allows our model to recognize that users tend to interact with similar items that their friends have interacted with, thus revealing higher-order correlations between distant user–item pairs. Thus, the meta-paths we define for user nodes intuitively reflect users’ intrinsic behavior and rating similarity, while those defined for item nodes reveal the similarity and popularity of items, providing a comprehensive view of the graph structure. Table 1 details the specific meanings of the meta-paths we have designed, which will inform our subsequent work.

3.4. Proposed Model

3.4.1. Model Framework

This subsection describes the SR-BS model for social recommendation, whose framework is shown in Figure 2. It comprises three modules: user modeling, item modeling, and rating prediction. Using data from the decentralized graphs, the model introduces rating bias offsets and mines higher-order user and item relations from the constructed social graphs and item similarity graphs. It then predefines various meta-paths over these interactions that integrate the rating bias, and learns the latent representations of users and items through GAT modeling to identify user preferences and item popularity. Taking Figure 2 as an example, in learning the latent representation of user

u_{1}

, the information of the items that users interact with (i.e., aggregated items

v_{1}

,

v_{2}

, and

v_{3}

) and the information of the users for which direct and indirect relations exist (i.e., aggregated users

u_{2}

,

u_{3}

,

u_{4}

, and

u_{5}

) will be synthesized. Similarly, for the potential representation of item

v_{1}

, information about the users with whom the item interacts (i.e., aggregated users

u_{1}

and

u_{2}

) and information about the items with which the item interacts with a high degree of similarity (i.e., aggregated items

v_{2}

and

v_{3}

) will be combined. In the user modeling and item modeling modules, with the help of the previously defined meta-paths of users and items, we utilize GNNs to learn the intrinsic information of users and items with multivariate interactions in the implicit space. Applying GNNs indiscriminately to meta-paths with higher-order relations lets nodes with too many interactions introduce noise. To address this, we incorporate Dropout during model training, which mitigates noise by randomly discarding some nodes and works together with GAT to achieve robust representations. The rating prediction module combines the outputs of the user and item modeling modules and feeds the learned representation vectors into an MLP. Through training, the model learns the parameters needed to predict ratings accurately. After obtaining the set of predicted scores, we generate recommendations based on them. Specifically, for each user, we rank the items by predicted score from highest to lowest and select the top-ranked items that the user has not yet interacted with as recommendations. This approach ensures that the recommended items are those that the model considers most likely to interest the user.
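
The final ranking step can be sketched as follows. This is a minimal illustration that assumes the predicted scores for one user are already available as a dictionary; the function name and the toy scores are hypothetical.

```python
def recommend_top_n(predicted, interacted, n=3):
    """Rank unseen items for one user by predicted score.

    predicted:  dict mapping item -> predicted rating for this user
    interacted: set of items the user has already rated
    """
    candidates = [(item, score) for item, score in predicted.items()
                  if item not in interacted]
    candidates.sort(key=lambda pair: pair[1], reverse=True)
    return [item for item, _ in candidates[:n]]


scores = {"v1": 4.2, "v2": 3.1, "v3": 4.8, "v4": 2.0}
top_items = recommend_top_n(scores, interacted={"v3"}, n=2)
```

Item v3 has the highest predicted score but is excluded because the user has already interacted with it.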

3.4.2. Graph Attention Networks Fusing Meta-Paths

Researchers frequently use GAT [29] to encode the neighbor information of nodes in graph structures, facilitating the creation of dense, low-dimensional node embeddings. From this line of research, several GAT-based recommendation methods [29,30] have emerged, embedding users and items as vectors via a GAT-driven message-passing mechanism. Thus, in the SR-BS model, we utilize the GAT to aggregate semantic relations from various meta-paths, thereby capturing a wide range of features associated with each node. Specifically, we define the aggregation process as shown in Equation (4). This process involves incorporating information from local neighbors

N (i)

in a given meta-path and utilizing the attention mechanism to determine the importance weight (

β_{i j}

) of each neighbor in the representation of node

i

.

E_{i} = \mathrm{ReLU} \left( W \cdot \left\{ \sum_{j \in N (i)} β_{i j} \cdot P_{j} \right\} + b \right)

(4)

where

\mathrm{ReLU}

is the activation function,

W \in ℝ^{d \times d}

and

b \in ℝ^{d}

denote the weight matrix and bias vector of the neural network, respectively,

d

is the embedding size,

P_{j}

is the interactive embedding of user

u_{i}

and item

v_{j}

, and

β_{i j}

denotes the importance weight of each

j \in N (i)

in the learning representation. This approach allows us to better capture the features of node

i

under different semantic spaces. The importance weight is computed as follows:

β_{i j} = \mathrm{softmax} ({\bar{β}}_{i j}) = \frac{\exp ({\bar{β}}_{i j})}{\sum_{j \in N (i)} \exp ({\bar{β}}_{i j})}

(5)

where the importance weight

β_{i j}

is obtained by normalizing the raw attention values

{\bar{β}}_{i j}

with the

\mathrm{softmax}

function, which keeps the final node embedding stable during gradient descent. We parameterize

{\bar{β}}_{i j}

with a two-layer neural network whose inputs are the embeddings

P_{i}

and

P_{j}

and whose output is the attention value. This network, which serves as the GAT attention layer, is detailed in Equation (6).

{\bar{β}}_{i j} = W_{2}^{T} \cdot \mathrm{LeakyReLU} (W_{1} [P_{i} \oplus P_{j}] + b_{1})

(6)

The GAT employs

\mathrm{LeakyReLU}

as a nonlinear activation function, enhancing the model’s ability to capture complex, nonlinear relations within the data.

W_{1}, W_{2} \in ℝ^{d \times d}

and

b_{1}, b_{2} \in ℝ^{d}

are the weight matrices and bias vectors, and

\oplus

represents the concat operation. By considering the interaction of node

i

with neighboring nodes on specific meta-paths, the characteristics of node

i

in different semantic spaces are captured. This allows the model to understand the importance of each neighbor node in influencing the representation of the central node, thus encoding aspects of the node’s context and behavior. By stacking multiple graph learning layers, we can integrate the information of individual meta-paths to further improve the efficiency and accuracy of learning on each meta-path.
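
Equations (4)–(6) can be traced in a small pure-Python sketch. The shapes used here are assumptions chosen so the operations compose: the first-layer weight `w1` maps the concatenated 2d-dimensional input to d dimensions, and the second-layer weight `w2` is a length-d vector that yields a scalar score. All names and toy values are illustrative.

```python
import math


def matvec(matrix, vector):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x


def softmax(scores):
    """Equation (5): normalize raw attention values over the neighbors."""
    peak = max(scores)  # subtracting the max avoids overflow in exp
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def raw_attention(p_i, p_j, w1, b1, w2):
    """Equation (6): two-layer scoring network on the concatenation of P_i and P_j."""
    pre = matvec(w1, p_i + p_j)  # list '+' concatenates the two embeddings
    hidden = [leaky_relu(h + b) for h, b in zip(pre, b1)]
    return sum(w * h for w, h in zip(w2, hidden))


def aggregate(p_i, neighbors, w1, b1, w2, w, b):
    """Equation (4): attention-weighted sum of neighbors, affine map, then ReLU."""
    beta = softmax([raw_attention(p_i, p_j, w1, b1, w2) for p_j in neighbors])
    dim = len(p_i)
    pooled = [sum(bt * p_j[k] for bt, p_j in zip(beta, neighbors))
              for k in range(dim)]
    e_i = [max(0.0, v + bias) for v, bias in zip(matvec(w, pooled), b)]
    return e_i, beta


# Toy example with d = 2 and two identical neighbors.
p_i = [1.0, 0.0]
neighbors = [[1.0, 2.0], [1.0, 2.0]]
w1 = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
b1 = [0.0, 0.0]
w2 = [1.0, 1.0]
w = [[1.0, 0.0], [0.0, 1.0]]
b = [0.0, 0.0]
e_i, beta = aggregate(p_i, neighbors, w1, b1, w2, w, b)
```

With identical neighbors the attention weights are uniform, so the aggregated embedding equals the shared neighbor embedding; distinct neighbors would receive different weights.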

There is a standard subgraph sampling method called node sampling for aggregating information about neighboring nodes. This approach can represent the distribution of neighboring nodes in isomorphic graphs by restricting the number of nodes sampled. This technique enhances model robustness in heterogeneous graphs by providing diverse perspectives to effectively capture users’ and items’ rating patterns [31]. The proposed SR-BS model introduces rating bias offsets among predefined meta-paths and incorporates the meta-paths in the representation learning process. Specifically, we perform a discard mechanism to discard user and item nodes randomly. Finally, rating prediction is performed based on the learned representation.
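
A fixed-size neighbor sampler of the kind described above might look like the sketch below. Uniform sampling without replacement is an assumption, since the text does not specify the sampling distribution; the sample size of 25 follows the user–item setting reported in Section 4.4, and the function name is hypothetical.

```python
import random


def sample_neighbors(neighbors, sample_size, rng):
    """Keep at most sample_size neighbors, chosen uniformly without replacement."""
    neighbors = list(neighbors)
    if len(neighbors) <= sample_size:
        return neighbors
    return rng.sample(neighbors, sample_size)


rng = random.Random(0)
items_of_user = [f"v{k}" for k in range(40)]
sampled = sample_neighbors(items_of_user, 25, rng)
```

Nodes with fewer neighbors than the sample size keep all of them, so only highly connected nodes are thinned.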

User Modeling

When learning the latent representation

h_{e}^{I}

of a user

u_{e}

from a user–item decentralized graph, we consider the user’s interaction history and their rating bias offsets for each item in the graph. We introduce rating bias offsets in user modeling, enabling the learning of potential representations that reflect statistical differences among users, rather than relying on raw rating data. The rating bias offset

{\dot{r}}_{e f}

for a user

u_{e}

is computed as shown in Equation (7), which takes into account the absolute difference between the user’s historical rating

r_{e f}

and their average rating

A (v_{e})

. This offset quantifies the deviation of a user’s rating from their typical behavior. We transform these rating bias offsets into vector representations to further enhance our model’s learning capability.

{\dot{r}}_{e f} = \lceil |r_{e f} - A (v_{e})| \rceil

(7)

where |·| denotes the absolute value function and ⌈·⌉ denotes the ceiling function. To transform the rating bias offsets into vector representations, we map all of the rating bias offsets of user

u_{e}

to an Embedding Lookup Table. This table allows us to obtain a rating difference vector, denoted as

d_{{\dot{r}}_{e f}}

, for the user

u_{e}

. The embedding process encodes the rating bias information into a dense vector space, which the model can efficiently learn from during training. These vectors serve as informative features that capture nuanced user preferences, allowing the model to better discern genuine user preferences from biased or inconsistent ratings. This enhancement ultimately leads to more accurate and reliable recommendations. Given that the computation of the rating bias

r_{e f} - A (v_{e})

may yield small values, we used an Embedding Lookup Table to avoid the problems associated with embedding methods when dealing with this case. Next, we model the interaction between the user

u_{e}

and item

v_{f}

with rating bias offset

{\dot{r}}_{e f}

, referred to here as the rating difference interaction representation

T_{e f}

, and use it for modeling subsequent potential representations of the user

u_{e}

. It is defined as follows:

T_{e f} = \mathrm{MLP}_{u} [q_{v_{f}} \oplus d_{{\dot{r}}_{e f}}]

(8)

\mathrm{MLP}_{u}

is an MLP, and

q_{v_{f}} \in ℝ^{d}

is the embedding vector of the item

v_{f}

. By incorporating rating biases in the form of vectors into the learning process of user latent representations, we can more effectively mine the statistical data for the latent preferences of user interaction behaviors, thus improving the accuracy of the recommendations.
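
The offset in Equation (7) and the lookup step can be sketched as follows. The `EmbeddingLookup` class, the initialization range, and the toy embedding values are hypothetical; in a PyTorch implementation an embedding layer would typically play this role.

```python
import math
import random


def rating_bias_offset(rating, average):
    """Equation (7): ceiling of the absolute rating deviation."""
    return math.ceil(abs(rating - average))


class EmbeddingLookup:
    """One randomly initialized vector per offset level (hypothetical helper)."""

    def __init__(self, num_levels, dim, seed=0):
        rng = random.Random(seed)
        self.table = [[rng.uniform(-0.1, 0.1) for _ in range(dim)]
                      for _ in range(num_levels)]

    def __call__(self, level):
        return self.table[level]


# On a 1-5 scale the deviation lies in [0, 4], so the offset takes five values.
lookup = EmbeddingLookup(num_levels=5, dim=4)
offset = rating_bias_offset(rating=5, average=3.4)  # |5 - 3.4| = 1.6, ceiling 2
d_vec = lookup(offset)
q_item = [0.1, 0.2, 0.3, 0.4]   # placeholder item embedding
mlp_input = q_item + d_vec      # concatenation passed to the MLP in Equation (8)
```

Because the ceiling maps every deviation to an integer level, small real-valued differences index a discrete table instead of being embedded directly.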

Utilizing multi-relational paths for learning the potential representations of users and items can optimize the accuracy of recommendations in the recommendation process. We define a series of meta-paths in Section 3.3, comprising five user-centric paths and three item-centric paths. These meta-paths are embedded with different higher-order relations, thus helping users and items to dig deeper and learn their latent factors. Subsequently, we incorporated these meta-paths in the learning process of GAT and aggregated the different levels of adjacencies to obtain a semantically embedded representation of the meta-paths. For example, in the case of meta-path

Path_{1}

, the model takes the user–item interaction representation

T_{e f}

previously gained and uses it as input to Equation (4), and based on the item embedding

q_{v_{f}}

, generates preliminary user representations

h_{e}^{I} \in ℝ^{d}

through item aggregation. Similarly, we can obtain the preliminary user representations

h_{e}^{S_{i}}

,

h_{e}^{S}

,

h_{e}^{S f}

, and

h_{e}^{S f_{i}} \in ℝ^{d}

for the other meta-paths from the social relation and item–user decentralized graphs by item aggregation and social aggregation, respectively. The specific definitions are as follows:

h_{e} = \mathrm{comb}_{u} ([h_{e}^{I} \oplus h_{e}^{S_{i}} \oplus h_{e}^{S f_{i}} \oplus h_{e}^{S} \oplus h_{e}^{S f}])

(9)

We merge the five representations as inputs to the GAT and learn their relative importance through the model to obtain the final potential representation

h_{e}

of the user

u_{e}

. Higher-order forms of user relations can be characterized through the collocation of

h_{e}^{I} \oplus h_{e}^{S_{i}} \oplus h_{e}^{S f_{i}} \oplus h_{e}^{S} \oplus h_{e}^{S f}

.

Item Modeling

The item–user decentralized graph contains the item’s interaction history with different users and the rating bias offsets obtained by the item from these users. The graph reveals varying user attitudes towards the same item, informing us of its characteristics based on these diverse responses. Here, unlike the method of calculating rating bias in user modeling, the rating bias between the item

v_{k}

and the user

u_{l}

is calculated as

r_{k l} - A (u_{k})

. We then use the information in this graph to learn the potential representation

z_{k}^{U}

of the item

v_{k}

. We define the rating bias offset

{\dot{r}}_{k l}

for the item

v_{k}

as follows:

{\dot{r}}_{k l} = \lceil |r_{k l} - A (u_{k})| \rceil

(10)

In the next operation, similar to user modeling, the model learns different users’ potential representations of the same item. We first obtain the rating difference vector

d_{{\dot{r}}_{k l}}

for the item

v_{k}

, concatenate it with the embedded representation

p_{u_{l}} \in ℝ^{d}

of the user

u_{l}

, and inject it into the MLP. In this way, we learn the interaction representation

O_{k l}

of rating differences between the user

u_{l}

and item

v_{k}

with rating bias offset

{\dot{r}}_{k l}

. It is defined as follows:

O_{k l} = \mathrm{mlp}_{v} [p_{u_{l}} \oplus d_{{\dot{r}}_{k l}}]

(11)

Similarly,

\mathrm{mlp}_{v}

is an MLP, and the rating difference vector

d_{{\dot{r}}_{k l}}

is obtained analogously to

d_{{\dot{r}}_{e f}}

. The goal of item modeling is to gain the potential representation

z_{k}

of the item

v_{k}

from user–item interactions and similarity relations among items. As shown in Table 1, we use three meta-paths

Path_{6}

,

Path_{7}

, and

Path_{8}

, respectively, to reflect the associative relations between different nodes, and further obtain a preliminary item representation

z_{k}^{U}

,

z_{k}^{I f}

, and

z_{k}^{I f_{u}} \in ℝ^{d}

for each meta-path. Next, another MLP

\mathrm{comb}_{v}

is employed to fuse these three representations, and the final potential representation

z_{k}

of the output of the item

v_{k}

is:

z_{k} = \mathrm{comb}_{v} ([z_{k}^{U} \oplus z_{k}^{I f} \oplus z_{k}^{I f_{u}}])

(12)

Rating Prediction

The latent representations of users and items learned by the model capture the implicit relations and interactions between users and items, and by concatenating the latent representations of users and items and their dot product results, the user’s features and the item’s features can be correlated with each other to express the correlation between users and items. This combined information is fed into the three-layer neural network, where the model can learn more complex nonlinear relations, capture deeper relations between user preferences and item features, and compute preference ratings. To derive the preference rating

{\hat{r}}_{i j}

, we use the following operation:

F_{1} = \mathrm{ReLU} (W_{1} [h_{e} \oplus z_{k} \oplus (h_{e} \cdot z_{k})] + b_{1})

(13)

F_{2} = \mathrm{ReLU} (W_{2} \cdot F_{1} + b_{2})

(14)

{\hat{r}}_{i j} = W^{T} \cdot F_{2}

(15)

Among them,

\cdot

in Equation (13) represents the dot product operation, and

\cdot

in Equation (14) is matrix multiplication. The

\mathrm{ReLU}

activation function introduces nonlinearities to the inputs during the operation, allowing the model to better adapt to complex patterns in the data. The output of each layer is then passed on to the next layer using a multi-layer neural network to improve the generalization of the model, and, finally, the result is mapped using the transposed weight matrix

W^{T}

to compute the predicted preference ratings

{\hat{r}}_{i j}

.
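
The forward pass in Equations (13)–(15) can be illustrated with a small sketch. It follows the text in treating the product of the two representations as a dot product, which is appended here as one extra scalar feature; this reading, the weight shapes, and the toy values are assumptions for illustration only.

```python
def affine(weights, vector, bias):
    """Compute W x + b for a row-major weight matrix."""
    return [sum(w * x for w, x in zip(row, vector)) + b_k
            for row, b_k in zip(weights, bias)]


def relu_vec(vector):
    return [max(0.0, v) for v in vector]


def predict_rating(h_e, z_k, w1, b1, w2, b2, w_out):
    """Two hidden ReLU layers followed by a linear output."""
    dot = sum(a * c for a, c in zip(h_e, z_k))
    features = h_e + z_k + [dot]           # concatenated input to Equation (13)
    f1 = relu_vec(affine(w1, features, b1))
    f2 = relu_vec(affine(w2, f1, b2))      # Equation (14)
    return sum(w * f for w, f in zip(w_out, f2))  # Equation (15)


h_e = [1.0, 2.0]
z_k = [0.5, 1.0]
w1 = [[1.0, 0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0, 1.0]]
b1 = [0.0, 0.0]
w2 = [[1.0, 1.0], [1.0, -1.0]]
b2 = [0.0, 0.0]
w_out = [1.0, 1.0]
rating_hat = predict_rating(h_e, z_k, w1, b1, w2, b2, w_out)
```

In this toy setting the second hidden unit is negative before the activation and is zeroed by the ReLU, so only the first unit contributes to the prediction.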

3.4.3. Model Training

In social recommendation, we employ a standard objective function to measure prediction error, ensuring recommendation accuracy and enabling performance evaluation. This function utilizes the mean square error between real and predicted ratings as its error metric. We denote the function as:

L = \frac{1}{2 |O|} \sum_{(u_{i}, v_{j}) \in O} {(r_{i j} - {\hat{r}}_{i j})}^{2}

(16)

where |O| denotes the number of observed ratings in O,

r_{i j}

is the ground-truth rating given by user

u_{i}

on item

v_{j}

, and

{\hat{r}}_{i j}

is the preference rating predicted by the model. During training, we randomly initialize the learned embedding vectors; the rating difference embedding depends on the rating scale, which here comprises the five rating levels in [1, 5]. We optimize the model by using a node-dropping mechanism to curb overfitting and bolster generalization.
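
As a quick check of the objective in Equation (16), the sketch below evaluates it on two toy ratings. The function name and the values are illustrative.

```python
def training_loss(true_ratings, predicted_ratings):
    """Halved mean squared error over the observed ratings, as in Equation (16)."""
    count = len(true_ratings)
    squared = sum((r - p) ** 2 for r, p in zip(true_ratings, predicted_ratings))
    return squared / (2 * count)


loss = training_loss([4.0, 2.0], [3.0, 2.0])  # (1 + 0) / (2 * 2)
```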

4. Experimental Evaluation

To verify the efficacy of the proposed SR-BS model, we conducted experiments across various social recommendation datasets. Our experimental design focuses on the following research questions:

RQ1: How does SR-BS perform in rating prediction relative to the current mainstream social recommendation methods?

RQ2: How does the application of rating differences under different relation types affect the performance of the SR-BS model?

RQ3: How do different hyperparameter settings affect recommendation performance?

4.1. Datasets

The experiments utilize four real-world datasets: Epinions, Ciao, FilmTrust, and Douban. These datasets contain information about users, items, ratings, and social relations. Below, we provide a specific description of the datasets.

Epinions, Ciao: Epinions and Ciao are social network-based consumer review platforms. Users rate items on a scale of 1 to 5 and form social relations by adding other users to a trust list. The datasets capture user–item ratings and user–user trust relations.

FilmTrust: FilmTrust is an online movie review platform that contains user ratings and reviews of movies and allows for a one-way trust relation to share movie reviews and opinions.

Douban: Douban is the most popular online review platform in China, where a user can rate items according to their preferences and can build social relations.

To address dataset sparsity, we excluded users with fewer than five interactions. The rating data were then randomly partitioned into 80% training, 10% validation, and 10% testing sets. Table 2 summarizes the statistics of the four datasets.
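
The filtering and partitioning step can be sketched as follows. The triple format, the fixed seed, and the function name are assumptions; the threshold of five interactions and the 80/10/10 proportions come from the description above.

```python
import random
from collections import Counter


def filter_and_split(ratings, min_interactions=5, seed=42):
    """Drop sparse users, shuffle, and split into train/validation/test."""
    counts = Counter(user for user, _, _ in ratings)
    kept = [t for t in ratings if counts[t[0]] >= min_interactions]
    rng = random.Random(seed)
    rng.shuffle(kept)
    n_train = int(0.8 * len(kept))
    n_valid = int(0.1 * len(kept))
    train = kept[:n_train]
    valid = kept[n_train:n_train + n_valid]
    test = kept[n_train + n_valid:]
    return train, valid, test


toy = [("a", f"v{k}", 1 + k % 5) for k in range(10)]
toy += [("b", "v0", 4), ("b", "v1", 2)]  # user b has fewer than five ratings
train, valid, test = filter_and_split(toy)
```

The remainder after the training and validation cuts goes to the test set, so no rating is lost to rounding.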

4.2. Evaluation Metrics

To assess the rating prediction performance of the recommendation methods, we use the MAE and Root-Mean-Square Error (RMSE) as the evaluation metrics. When stability and robustness are critical, MAE is preferred because it is less sensitive to outliers. When user personalization is a priority, RMSE is preferred because it gives greater weight to the larger error due to the squared difference. These metrics are widely used in recommender systems, and they quantify the difference between the predicted results and the actual scores, thus measuring the model’s predictive accuracy. These two metrics are defined as follows:

\mathrm{MAE} = \frac{1}{|D_{T}|} \sum_{(u_{i}, v_{j}) \in D_{T}} |{\hat{r}}_{i j} - r_{i j}|

(17)

MAE computes, for each user–item pair (

u_{i}

,

v_{j}

) in the test set, the absolute difference between the predicted rating

{\hat{r}}_{i j}

and the true rating

r_{i j}

, and averages these differences.

|D_{T}|

indicates the size of the test set, i.e., the number of user–item pairs.

\mathrm{RMSE} = \sqrt{\frac{1}{|D_{T}|} \sum_{(u_{i}, v_{j}) \in D_{T}} {({\hat{r}}_{i j} - r_{i j})}^{2}}

(18)

The RMSE is similar to the MAE in that for each user–item pair, the squared difference between the predicted rating

{\hat{r}}_{i j}

and the true rating

r_{i j}

is computed, these squared differences are averaged, and the square root of the mean is taken.

The smaller the value of these two indicators, the closer the model’s prediction results are to the actual scores and the higher the accuracy of the prediction. We replicate experiments to determine the average performance of the test set with the best epoch. Despite our model’s minor improvements in these metrics, the research in [6] indicates that even subtle improvements in MAE or RMSE can significantly enhance recommendation quality.
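
Both metrics are straightforward to compute; the sketch below uses illustrative function names and toy predictions to show how RMSE weights a single large error more heavily than MAE does.

```python
import math


def mae(predicted, actual):
    """Mean absolute error, as in Equation (17)."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


def rmse(predicted, actual):
    """Root-mean-square error, as in Equation (18)."""
    mean_sq = sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)
    return math.sqrt(mean_sq)


predicted = [3.0, 5.0, 2.0]
actual = [4.0, 5.0, 4.0]
mae_value = mae(predicted, actual)    # errors 1, 0, 2 give a mean of 1.0
rmse_value = rmse(predicted, actual)  # sqrt((1 + 0 + 4) / 3)
```

Here the error of 2 on the third pair pushes RMSE above MAE, which is why RMSE is the more sensitive of the two to large individual mispredictions.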

4.3. Baseline

To validate the effectiveness of our proposed model, we conducted four sets of comparative experiments. In each group, we selected representative methods as baselines:

Traditional recommendation methods:

PMF [32]: Utilizes an MF-based probabilistic model that only uses the user’s rating information to predict the user’s rating of an item by learning potential factors.

Matrix factorization-based social recommendation methods:

RSTE [33]: Utilizes random mapping to mitigate the impact of data noise by jointly modeling users’ social and rating information.

SocialMF [15]: Integrates the trust propagation mechanism with MF techniques, aiming to boost recommendation accuracy by considering user trust relations.

SoReg [4]: Constrains user relations as social regularization conditions onto the MF objective, acting as a social regularization model.

Social recommendation methods based on social relations:

LOCABAL [34]: Advocates for recommendations based on local social influence, underlining that a user’s social neighbors may significantly influence their tastes.

SREPS [30]: Generates recommendations by considering social relations and item characteristics.

Social recommendation algorithms based on graph neural networks:

GraphRec [21]: Leverages two GATs to model user–item interactions and interactions between users, primarily focusing on the first-order neighbors of users and items.

ConsisRec [29]: Enhances GNNs-based recommendation models by sampling consistent neighbors and introducing relational attention during aggregation.

GDSRec [24]: Boosts social connection strength by aggregating rating bias vectors from dispersed neighborhoods and incorporating preference similarity to improve prediction accuracy, eliminate rating bias, and advance collaborative filtering for social recommendation.

S4Rec [35]: Predicts results by adaptively merging the depth graph and SVD models while employing TransH to model the rating difference behavior, consequently improving the model’s generalization ability.

4.4. Parameter Settings

The social recommendation model in this paper is built using the PyTorch framework and trained and tested on an RTX3090 Ti with 24G RAM. We optimize the model’s objective function using RMSprop as the optimizer, randomly selecting a training instance and updating each model parameter in the negative gradient direction. On all four datasets, we set the batch size to

B = 256

, the embedding size to

d = 80

, the dropout rate to

p = 0.5

, and the learning rate to

L_{r} = 0.001

. In the Attention Mechanisms Module, we choose the

\mathrm{LeakyReLU}

activation function with a slope of 0.2. Sample sizes are set to 25 for the user–item bipartite graph in DropNode and 20 for the friendly neighbor nodes in the social graph. To ensure fairness, we refer to the optimal parameter settings reported from the original baseline paper and adapt all of them to ensure optimal performance. Section 4.7 discusses how different parameters (i.e.,

d

and

L_{r}

) and dropout mechanisms affect the model’s performance.

4.5. Performance Comparison (RQ1)

To show that our model has better stability and user personalization, we compare it with 10 existing baseline models for social recommendation and report the results on the four datasets in Table 3.

In the table, the underlined values represent the best performance among all baselines, and the bold values represent the best performance among all models. The results show that PMF (Probabilistic Matrix Factorization), which solely uses users’ rating information to model latent factors, performs worse than models such as RSTE, SocialMF, and SoReg, which utilize social information. This suggests that incorporating social information can effectively improve recommendation performance. Furthermore, comparing the LOCABAL and SREPS models highlights the importance of fully exploring social relations and item characteristics. The GNN-based models, i.e., GraphRec and ConsisRec, effectively utilize first-order social neighborhood information, demonstrating the advantages of GNNs in exploiting interaction relations to enhance recommender systems. In addition, GDSRec incorporates rating bias data and preference similarity to enhance its predictive power, while the S4Rec model learns by combining semantic and structural views using implicit relations and user rating behaviors and performs the best among all baselines. Nevertheless, the proposed SR-BS model outperforms all baseline models across all datasets. This superior performance can be attributed to the model’s ability to effectively model user and item rating difference information, as well as different interaction information, and to its efficient use of meta-paths to integrate varying types of relations. Compared to the second-best method, the SR-BS model improves by 2.612%, 1.457%, 0.522%, and 0.546% on MAE and by 1.127%, 1.241%, 1.444%, and 0.189% on RMSE on the four datasets, respectively. Furthermore, the SR-BS model continues to outperform the baseline models even on the larger datasets, Ciao and Epinions.

4.6. Ablation Experiment (RQ2)

In this section, we perform ablation studies to validate the impact of the SR-BS model components.

4.6.1. Effect of Rating Differences

This experiment illustrates the effectiveness of our proposed model. Our model considers three types of information to enhance accuracy: (1) the introduction of rating bias offsets for users, (2) the introduction of rating bias offsets for items, and (3) the integration of rating difference information in user–item interactions to learn latent factor representations. To better understand the proposed model, we compare the performance of SR-BS and its four variants with the same hyperparameters: SR-BS|Ud removes the user’s rating bias offset; SR-BS|Id removes the item’s rating bias offset; SR-BS|UId uses raw rating data instead of statistical rating differences in user–item interactive learning; and SR-BS|Nd processes all interaction information using raw rating information during representation learning. Figure 3 shows the experimental results, indicating that the MAE performance on the four datasets degrades by 0.2350%, 0.9435%, 0.8669%, and 0.3434%, respectively, when we remove the user’s rating bias offsets. This suggests that considering the user’s preference tendency enhances modeling effectiveness. Removing the item bias offset also degrades MAE performance on the four datasets, which suggests that adding item rating difference information helps the model learn a representation that better reflects item characteristics and thus improves recommendation performance. On the other hand, SR-BS|UId shows a similar MAE to SR-BS, yet the latter shows a 3.83% improvement in RMSE, suggesting that learning from rating difference data captures more useful information than learning from raw rating data. SR-BS|Nd exhibits higher MAE and RMSE on all datasets, indicating its inferior performance compared to SR-BS. These results reinforce the effectiveness of our processing of raw user and item rating data, and the importance of considering user and item preference tendencies and analyzing rating difference information to enhance model performance.

4.6.2. Effect of Attention Networks

In the embedding aggregation process of the model, we employ the GAT to adaptively learn the semantic contributions of different meta-paths to the representation. This experiment further validates the functionality and effectiveness of GAT in the proposed SR-BS model. Additionally, the SR-BS model employs the

\mathrm{softmax}

function to normalize attention values. To test the effect of the attention mechanism on meta-path aggregation in the SR-BS model, we designed two different aggregation operations in our experiments. We denoted these two variants by SR-BS_mean and SR-BS_max, respectively. We have changed Equation (5) in the paper, as follows:

β_{i j} = \frac{1}{|N (i)|}

(19)

or

β_{i j} = \max_{j \in N (i)} \frac{\exp ({\bar{β}}_{i j})}{\sum_{j \in N (i)} \exp ({\bar{β}}_{i j})}

(20)

Table 4 shows the performance results of these two variants and the proposed model on four datasets with different evaluation metrics. The results show that the SR-BS_mean variant has the worst performance. This is attributed to the averaging operation’s inability to recognize the importance of meta-paths, making it potentially ineffective in handling noisy data. In comparison, SR-BS_max performs better because its maximum operation is better at capturing critical meta-paths and is more robust to noisy data. Both variants, SR-BS_mean and SR-BS_max, yield the same output weights when processing attention values from different inputs. Therefore, they are categorized together. However, both SR-BS_mean and SR-BS_max fall short of the best performance of SR-BS. Their MAE performance degrades by 3.6322%, 4.5728%, 1.6194%, 0.9104%, and 1.4446%, 2.2209%, 0.5457%, 0.2945% on the four datasets, respectively. This discrepancy is due to SR-BS’s fine-grained ability to distinguish the importance of different meta-paths, which enables its superior performance. This also demonstrates the effectiveness of applying GAT in social recommendation modeling.

4.7. Hyper-Parameter Study (RQ3)

In this subsection, we comparatively analyze the impact of various hyperparameters on model performance, including the embedding size

d

of the model, the learning rate

L_{r}

, and the sample sizes of DropNode in the user–item interaction graph and the social graph. Given the space constraints, we only show the performance variations on the Epinions dataset. As mentioned before,

τ

is the threshold we set to measure whether the shared following among users reaches the level that defines a higher-order relation. We observe that a larger τ can help identify more stable higher-order relations. We ignore

τ

here due to its minimal impact on model performance.

Impact of embedding size: Embedding size plays a crucial role in model capacity and expressiveness. A small size limits model expressiveness, while a large size may result in overly sparse embedding vectors, thus reducing performance. As shown in Figure 4A, accuracy improves as we gradually increase the embedding size, because the model becomes more expressive. However, once $d$ reaches a certain threshold (80 on the Epinions dataset), the gain is no longer significant; with a further increase in embedding size, the computational complexity rises sharply and performance decreases. Hence, choosing an embedding size that balances model capacity against computational complexity is critical.
Impact of learning rate: The learning rate, which is the step size used to update the parameters at each training iteration, significantly impacts the model’s performance. A learning rate that is too large or too small hinders the optimization of the model. Figure 4B shows that the model performs best under all evaluation metrics when $L_{r} = 10^{-3}$. When the learning rate exceeds $10^{-3}$, the model’s performance starts to decline gradually. Therefore, choosing an appropriate learning rate is crucial to achieving the best performance.
Impact of sample size in DropNode: As demonstrated in Figure 4C,D, for both the user–item interaction graph and the social graph, model performance improves gradually as the sample size increases, primarily due to the additional learning information this provides. However, once the sample size grows beyond a certain scale, the learned representations may become biased and confusing, resulting in decreased model performance and increased computational burden. Therefore, controlling the sample size appropriately can reduce model complexity while preserving prediction accuracy.
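The trade-off in sample size can be illustrated with a generic fixed-size neighbor sampler. The sketch below is a stand-in and is not the DropNode procedure itself; the function name `sample_neighbors`, the `seed` argument, and the toy adjacency lists are hypothetical. Each node keeps at most `sample_size` neighbors, so a larger value passes more information to aggregation at a higher cost.

```python
import random


def sample_neighbors(neighbors, sample_size, seed=0):
    """Keep at most `sample_size` randomly chosen neighbors per node.

    `neighbors` maps a node id to an iterable of neighbor ids.
    Nodes with few neighbors keep all of them.
    """
    rng = random.Random(seed)  # fixed seed for reproducible sampling
    sampled = {}
    for node, adjacent in neighbors.items():
        adjacent = sorted(adjacent)
        if len(adjacent) <= sample_size:
            sampled[node] = adjacent
        else:
            sampled[node] = sorted(rng.sample(adjacent, sample_size))
    return sampled


# Toy user-item interaction graph (hypothetical data).
interactions = {
    "u1": ["v1", "v2", "v3", "v4", "v5"],
    "u2": ["v2", "v6"],
}

sampled = sample_neighbors(interactions, sample_size=3)
print(sampled)
```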

5. Conclusions

Most current recommendation algorithms fail to adequately consider the effects of user and item rating bias offsets and overall rating trends. To address this issue, we introduce SR-BS, a model for rating prediction in social recommendation. In user and item modeling, we adopt a GNN and learn rich, fine-grained latent representations of users and items by fusing different types of rating bias information. We employ GAT during feature fusion, enhancing the node representations in the semantic space by aggregating predefined meta-paths over diverse interaction relations. Extensive comparative experiments on four public datasets demonstrate that SR-BS achieves better rating prediction performance than state-of-the-art models. Compared with the most effective baseline, our model achieves maximum improvements of 2.612% and 1.444% in MAE and RMSE, respectively. In addition, we performed ablation experiments to validate the effectiveness of each component. Specifically, considering the preference tendencies of users and items and analyzing the rating difference information improves model performance, and applying GAT lets the model differentiate the importance of different meta-paths at a finer granularity, which further improves performance. Overall, SR-BS consistently improves the quality of social recommendations, as evidenced by our evaluation metrics.

Author Contributions

Conceptualization, L.H.; methodology, L.H.; software, L.H.; validation, L.H.; formal analysis, L.H.; writing—original draft preparation, L.H.; writing—review and editing, J.Q.; supervision, B.X.; funding acquisition, J.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science Fund for Outstanding Youth of Xinjiang Uygur Autonomous Region under Grant No. 2021D01E14.

Data Availability Statement

We tested our framework on the Epinions, Ciao, Douban, and FilmTrust datasets. The download links for each of these four datasets are: https://github.com/Wang-Shuo/GraphRec_PyTorch/tree/master/datasets/Epinions/; https://github.com/Wang-Shuo/GraphRec_PyTorch/tree/master/datasets/Ciao/; http://konect.cc/networks/douban/; https://guoguibing.github.io/librec/datasets/filmtrust.zip (accessed on 10 May 2023).

Conflicts of Interest

The authors declare no conflict of interest.

Notation

$r_{i j}$	The real-valued rating given by the user $u_{i}$ to the item $v_{j}$
$I (v_{j})$	The set of users that interact directly with the item $v_{j}$
$I (u_{i})$	The set of items that directly interact with the user $u_{i}$
$A (u_{i})$	The average rating of the user $u_{i}$
$A (v_{j})$	The average rating of the item $v_{j}$
$N (i)$	Local neighbor of node $i$
$c_{j i}$	The connection relations that exist between the user $u_{j}$ and the user $u_{i}$
$τ$	The threshold on the shared following between users that defines a higher-order relation
$S_{U} (i)$	The higher-order user relations for user $u_{i}$
$l_{j k}^{i}$	For the user $u_{i}$ , the similarity of the items $v_{j}$ and $v_{k}$
$l_{j k}$	For all users, the similarity of the items $v_{j}$ and $v_{k}$
$S_{V} (j)$	The higher-order item relations for the item $v_{j}$
$β_{i j}$	The importance weights for nodes $i$ and $j$
$P_{i}$	The embedding of the user $u_{j}$ and the item $v_{i}$
$P_{j}$	The embedding of the user $u_{i}$ and the item $v_{j}$
${\dot{r}}_{e f}$	The rating bias offset for the user $u_{e}$
$d_{{\dot{r}}_{e f}}$	The rating difference vector for the user $u_{e}$
$q_{v_{f}}$	The embedding vector of the item $v_{f}$
$T_{e f}$	The interaction representation of rating differences between the user $u_{e}$ and item $v_{f}$
$h_{e}^{I}$ , $h_{e}^{S}$ , $h_{e}^{S f}$	The item aggregation user latent factor from local neighbor $N (i)$ of user $u_{e}$
$h_{e}^{S_{i}}$ , $h_{e}^{S f_{i}}$	The social aggregation user latent factor from local neighbor $N (i)$ of user $u_{e}$
$h_{e}$	The final potential representation of the user $u_{e}$
${\dot{r}}_{k l}$	The rating bias offset for the item $v_{k}$
$d_{{\dot{r}}_{k l}}$	The rating difference vector for the item $v_{k}$
$O_{k l}$	The interaction representation of rating differences between the user $u_{l}$ and item $v_{k}$
$z_{k}^{U}$	The user aggregation item latent factor from local neighbor $N (i)$ of item $v_{k}$
$z_{k}^{I f}$	The item aggregation item latent factor from local neighbor $N (i)$ of item $v_{k}$
$z_{k}^{I f_{u}}$	The social aggregation item latent factor from local neighbor $N (i)$ of item $v_{k}$
$z_{k}$	The final potential representation of the output of the item $v_{k}$
${\hat{r}}_{i j}$	The predicted preference ratings of the user $u_{i}$ and item $v_{j}$
$\oplus$	The concat operation of two vectors
$W$ , $b$	The weight and bias in neural network


Figure 1. Two approaches to manipulating decentralized graphs.

Figure 2. Proposed modeling framework. The model contains three components: user modeling, item modeling, and rating prediction. Here, Graph Attention Network (GAT) is used to aggregate the features of nodes based on predefined meta-paths.

Figure 3. Impact of different types of rating differences on four datasets.

Figure 4. Effect of different hyperparameters on the Epinions dataset.

Table 1. Meta-path information for social recommendation design.

	Meta-Path	Schema	Description
User	$P a t h_{1}$	$U - I$	Items with rating bias reflect user preferences
	$P a t h_{2}$	$U - U$	Users have similar preferences to the friends they interact with directly
	$P a t h_{3}$	$U - U - U$	Users have similar preferences to users with whom there is a co-following link
	$P a t h_{4}$	$U - U - I$	Users tend to interact with similar items that their friends have interacted with
	$P a t h_{5}$	$U - U - U - I$	Users interacting with similar items with friends who share their following
Item	$P a t h_{6}$	$I - U$	A user’s rating bias reflects the popularity of items
	$P a t h_{7}$	$I - U - I$	The similarity of items with the same user interaction
	$P a t h_{8}$	$I - U - I - U$	Users who rate similar items may have similar preferences

Table 2. Statistics of the datasets.

Dataset	Epinions	Ciao	Douban	FilmTrust
# of Users	23,251	7371	1631	1227
# of Items	120,711	90,913	29,609	1892
# of Ratings	613,574	268,174	536,842	34,887
Density of Interaction	0.0219%	0.0400%	1.1117%	1.5028%
# of Social Relations	374,039	111,749	17,454	1333
Density of Social Relations	0.0692%	0.2057%	0.6561%	0.0885%
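The two density rows of Table 2 are consistent with dividing the number of observed links by the number of possible links. The sketch below reproduces them from the count rows. The formulas are inferred from the reported numbers and are not stated in the text; the function names are ours.

```python
# Counts copied from Table 2.
stats = {
    "Epinions": {"users": 23251, "items": 120711, "ratings": 613574, "social": 374039},
    "Ciao": {"users": 7371, "items": 90913, "ratings": 268174, "social": 111749},
    "Douban": {"users": 1631, "items": 29609, "ratings": 536842, "social": 17454},
    "FilmTrust": {"users": 1227, "items": 1892, "ratings": 34887, "social": 1333},
}


def interaction_density(s):
    """Ratings as a percentage of all possible user-item pairs (inferred formula)."""
    return s["ratings"] / (s["users"] * s["items"]) * 100.0


def social_density(s):
    """Social relations as a percentage of all user-user pairs (inferred formula)."""
    return s["social"] / (s["users"] ** 2) * 100.0


densities = {
    name: (interaction_density(s), social_density(s)) for name, s in stats.items()
}

for name, (inter, social) in densities.items():
    print(f"{name}: interaction {inter:.4f}%, social {social:.4f}%")
```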

Table 3. Performance comparison of two metrics (Mean Absolute Error (MAE) and Root-Mean-Square Error (RMSE)) of different recommendation models under four datasets.

Dataset	Metrics	SR-BS	PMF	RSTE	SocialMF	SoReg	LOCABAL	SREPS	GraphRec	ConsisRec	GDSRec	S4Rec	Improv.
Ciao	MAE	0.6824	0.9021	0.8542	0.8321	0.8593	0.8431	0.7756	0.7253	0.7252	0.7323	0.7007	2.612%
Ciao	RMSE	0.9391	1.1238	1.0789	1.0657	1.0782	1.0627	0.9981	0.9850	0.9581	0.9740	0.9498	1.127%
Epinions	MAE	0.7641	0.9952	0.8923	0.8730	0.8732	0.8687	0.8232	0.8029	0.8029	0.8047	0.7754	1.457%
Epinions	RMSE	1.0267	1.2128	1.1672	1.1494	1.1477	1.1274	1.1087	1.0656	1.0542	1.0566	1.0396	1.241%
FilmTrust	MAE	0.6095	0.6973	0.6550	0.6189	0.6114	0.6667	0.6678	0.6891	0.6117	0.6850	0.6127	0.522%
FilmTrust	RMSE	0.7713	0.9293	0.8628	0.8085	0.8064	0.8920	0.87083	0.9124	0.7862	0.8921	0.7826	1.444%
Douban	MAE	0.5832	0.8628	0.7232	0.5879	0.5976	0.5874	0.71942	0.5957	0.5864	0.6105	0.5891	0.546%
Douban	RMSE	0.7392	1.1361	0.9357	0.7406	0.7608	0.7512	0.96239	0.7527	0.7430	0.7698	0.7507	0.189%
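The two largest entries in the Improv. column, which are also the maxima quoted in the Conclusions, can be recomputed from the corresponding rows of Table 3. This is a minimal sketch assuming the column reports the relative error reduction of SR-BS against the lowest baseline error in the row; the function name `improvement` is ours.

```python
# Rows copied from Table 3: SR-BS first, then the ten baselines in table order.
rows = {
    ("Ciao", "MAE"): (
        0.6824,
        [0.9021, 0.8542, 0.8321, 0.8593, 0.8431,
         0.7756, 0.7253, 0.7252, 0.7323, 0.7007],
    ),
    ("FilmTrust", "RMSE"): (
        0.7713,
        [0.9293, 0.8628, 0.8085, 0.8064, 0.8920,
         0.87083, 0.9124, 0.7862, 0.8921, 0.7826],
    ),
}


def improvement(ours, baselines):
    """Relative error reduction (%) against the best (lowest) baseline error."""
    best = min(baselines)
    return (best - ours) / best * 100.0


improvements = {key: improvement(ours, base) for key, (ours, base) in rows.items()}

for (dataset, metric), value in improvements.items():
    print(f"{dataset} {metric}: {value:.3f}%")
```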

Table 4. Effect of attention networks on four datasets.

Variant	Metric	Epinions	Ciao	Douban	FilmTrust
SR-BS_mean	MAE	0.7929	0.7151	0.5928	0.6151
SR-BS_mean	RMSE	1.0434	0.9497	0.7508	0.7797
SR-BS_max	MAE	0.7753	0.6979	0.5864	0.6113
SR-BS_max	RMSE	1.0495	0.9474	0.7464	0.7768
SR-BS	MAE	0.7641	0.6824	0.5832	0.6095
SR-BS	RMSE	1.0267	0.9391	0.7392	0.7713

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
