Abstract
The analysis of the attitudes of online users plays an important role in real life and the future society, which can help us to analyze and predict the behavior of the users. The researchers also carried out a lot of related work. However, most of the previous classic methods searched the rules of the structure of the graph or the propagation path and did not consider the internal psychological dynamic mechanism of the user. In the context of big data, identifying online behavioral motivations of users through online information should be a future development trend. In this paper, we delve into the users’ internal psychological energy based on the Elaboration Likelihood Model and divide the user’s behavioral drive into the central route and the peripheral route. The degree of user’s processing of the known information and elaboration determines which route is more effective. The central route is mainly the user’s behavioral habits studied through user role theory. In the peripheral route, it calculates the influence of other users through a kind of hydromechanics algorithm—FluidRating. The final model we called ELMFluid which can express the flow of psychological energy through thermodynamics and hydromechanics. The experiments are performed using two real datasets in the end. The results show that the new model is superior to other algorithms.
1. Introduction
In modern age, more and more people have joined websites that provide product comparison information in order to obtain personalized recommendation services [1]. The most popular are based on trusted relationships, such as Epinions and Ciao [2]. There are two important functions of this type of service system. One is that the registered users can add other users into their trusted user list and the other is that all registered users can comment on a certain product and give a one- to five-star rating. These reviews and scores are provided for those who have not purchased as a judgment basis of their purchase.
Predecessors have already done a lot of work on recommendation systems based on trusted relationships. Two of the most classic models TidalTrust and MoleTrust are proposed by Golbeck and Massa, respectively [3, 4]. However, these models are still looking for general rules and do not go into the users’ cognitive process. What is the real driving force of human behavior? Sigmund Freud thought it was “sex” and he began to introduce the model of the energy of physics into psychology [5]. Carl Gustav Jung developed this theory and used the concept of mental energy directly. Mental energy can do psychological work and produce people’s perceptions, cognitions, emotions, thoughts, personality, and so on. In the field of market and psychology, many researchers have done some work in recent years, including finding the relationship between the user perception and content, usability and aesthetics of the online website [6, 7], social selection mechanisms [8], the consumer mindset metrics [9], and so on. What we need to do is apply the existing psychological conclusions.
So, in order to explore user behavior from a deeper level, we generalize a classic market model—Elaboration Likelihood Model (ELM) [10, 11]. The model divides the user’s mental state into central route and peripheral route. That is, whenever a user generates an attitude to a product, this user is affected by two aspects at the same time. On the one hand, it is the rational thinking about the value of goods. This part is the central route. The factors that work on the other hand include emotional factors, the expression of information, and so on. This part is the peripheral route. The first step of the model is to distinguish the user’s motivation and ability to analyze and comprehend information. When the user has a high degree of processing and digesting the known information, the user’s rational thinking in the central route will play a leading role. Otherwise, the peripheral route plays a leading role. For example, if a product is a celebrity endorsement which the user likes, the reason why the user likes the product is the emotional factors so the peripheral route is more effective. If a user is a digital product master, his own subjective opinions are dominant because he is well aware of the configuration information. This time the central route is more effective. On this basis, we express these two routes in the form of fluid mechanics from the idea of FluidRating [12, 13]. Finally, the model was proposed, which we called ELMFluid.
Our new method makes the following contributions:(1)The new model describes the user’s internal psychological state and behavioral dynamics. This new model uses fluid mechanics and thermodynamics to express the transmission of mental energy.(2)In the new model, we use the number of review items which indicates how deeply the user relies on the website and has processed the known information.(3)ELMFluid is evaluated by using real datasets of Epinions.com and Ciao.com. The results show that the method is very efficient and effective.
The remaining part of the article is organized as follows:
The second part of the article investigates the relevant work. The solution section describes the problem and explains and extends the classic model. Then, a hypothesis about the new model is made and the performance of the algorithm is analyzed. The experimental part includes a wide range of experiments driven by real datasets to verify the hypothesis. In the final section, the article is summarized and future work is given.
2. Related Work
In this section, we introduce the previous work on the classic consumer research models, trust models and give the differences between the previous work and our model.
2.1. Consumer Research Model
The classic consumer research models are mostly based on psychology and sociology. Kahle et al. proposed a conceptual model of regional consumption differences, known as the TOFA (Traditionalism, Optimism, Fashion and Financing, Advance) model. The model uses the two indices S (Style) and R (Risk) to divide the consumers into four types and identify the changes in the basic style and the form of regional consumption [14]. Arnold Mitchell designed Values and Lifestyle Systems (VALS system). VALS combines Maslow’s hierarchy of needs with the internal drivers and external drivers which are proposed by the American sociologist David Riesman. Nine lifestyle groups are identified, which have now been upgraded to VALS2 and divided into eight markets [15]. Based on value theory, Sheth, Newman, and Gross propose that products provide customers with five values. These are functions, emotions, conditions, society, and cognition. In different consumption context, consumers are affected differently by different values [16]. Most of the methods are based on statistics to classify the customers directly. The model used in this article is the Elaboration Likelihood Model (ELM) by the psychologists Petty and Cacioppo [10, 11].
2.2. Trust Model
At present, most of the recommendation systems are based on the relationship graph. The most important step is to calculate the relationship value or trust transfer rules [17, 18]. Among the trust models, two well-known models are TidalTrust based on breadth-first search proposed by Folbeck and MoleTrust proposed by Massa. TidalTrust is a method based on the shortest path, with high scalability and low complexity [3]. MoleTrust calculates the spread range of trust relationships by figuring out the central node [4]. Personalized PageRank calculates the trust value of the whole network node via Markov chain [19, 20]. Ziegler and Lausen used the local group trust propagation mechanism of the activation diffusion mechanism in psychology to calculate the trust value [21]. Kuter et al. predict the probability of trust among Bayesian network nodes by using probabilistic sampling techniques [22]. Liu et al. divide trusted users into different fields to improve recommendation accuracy [23]. Asghari and Azadi proposed an inverted ant colony optimization algorithm to find the reliable propagation path [24]. Deng used deep learning with the community effect for recommendations [25]. Ghavipour and Meybodi proposed DLATrust which is a learning automaton for discovering reliable paths [26]. However, most models do not take time evolution into account. FluidRating contacts the temperature and the user’s rating by the dynamics theory [13]. The model we proposed considers the user’s internal deeper and inner psychodynamics. In order to explore the user’s behavioral motivation, the model considers the user’s own inherent attitudes and personality (central route) and the part affected by other users (peripheral route). The new model also conforms to the general principle of transitivity.
2.3. Personality
The psychologists believe that personality is the driving force of behavior [27]. Personality is defined as a series of habitual behaviors evolved from biological and environmental factors, cognitive and emotional patterns. It is the stable difference between different people [28, 29]. It also can have an impact on online behavior [30]. Through the study of personality traits, the online behavior of users can be well analyzed. For example, Feng uses personality traits to predict customer satisfaction and loyalty to the service companies through online replies [31]. Caci et al. analyze the use of Facebook by 600 users through personality traits [32]. Cojocaru et al. analyze social influence based on personality traits [33]. Hamburger believes that based on the Big Five personality, men with extraversion personalities prefer to use the leisure and entertainment services on the internet. Women with extraversion personality are more reluctant to use social networking services [34]. Most of the judgments on the user’s personality are based on the text information posted by the user on the internet. When the website has only scores and no comments, it is very difficult to identify the user’s personality, so at this point, our method only considers the user’s rating habits about the product.
3. Preliminary
3.1. FluidRating
The idea of FluidRating is an important part of our model [12, 13]. For a certain product, FluidRating divides the users into two categories. One is the user who has already given an opinion on the product (defined as raters, denoted as R) and the other is the user who has not given an opinion (nonraters, defined as N). First, FluidRating builds the relationship net shown in Figure 1. It can be seen that raters a1 and a2 give 3 and 5 stars for the product. Raters a3, a4, and a5 are nonraters. The trust relationships are represented by the arrows. User a3 trusts user a1, so user a1 points to user a3. A system is built based on a well-constructed net. Then, users are mapped into different containers. The trust relationship is mapped to a one-way channel. The temperature of the liquid in the container represents the user’s opinion.

(a)

(b)
Finally, the liquid in containers a1 and a2 will flow to a3, a4, and a5, and the temperature of the fused liquid from different-source containers represents the predicted rating.
3.2. ELM
The basic principle in ELM is shown in Figure 2. The proposed method simulates the user’s thinking process and expresses that different persuasion routes play different roles. One is the central route and the other is the peripheral route. When users process information deeply, the proportion of the central route is more effective. On the other hand, when users think about the known information very lightly, the peripheral route is more effective [16]. Because each user’s dependence on the website is different, this model provides great help for us to explore the psychological processes of different users.

Our new method configures two routes by using the product ratings and trust relationships on the website rather than the traditional cumbersome psychological measures methods in the form of questionnaires.
4. Solution
4.1. Development of the Classical Model and Hypotheses
According to the classical theoretical frameworks, we propose an improved model. We fuse FluidRating and personality theory in ELM as shown in Figure 3. For a product, each user’s attitude is divided into two parts. One part is the central route generated by the user after the consideration of the information. In this article, we use the personality theory to solve this part. The other part is the attitude generated by users who are influenced by the reviews of users they trust, calculated using FluidRating, which is the peripheral route. Then, we take the number of user reviews as the proportion of the two parts, which could denote the degree of dependence of the website. Under the circumstances without text information, we classify users into four types by the reviews. Different types of users are affected by two routes differently. Finally, the method proposes a representation of fluid mechanics and thermodynamics that facilitates the computation of the mix and interaction with mental energy.

The psychological state of user x or the user’s comprehensive mental state is described as follows:where F (x) is the central route of user x and is the peripheral route of user x. G (x, y) is the effect of user x receiving external factors y, like other users’ ratings. Axyis the weight of the factor.
Through the above model, we make the following hypotheses: H1: the more reviews the user has, the more the user can be affected by the peripheral route. When a user evaluates more products, it proves that the user is also more likely to rely on the evaluation system and consider others’ reviews. On the contrary, if a user has fewer evaluations, the user has deeper processing and his judgment is not easily affected. H2: evaluation habit is stable. If a user’s personality trait is that he often gives a rating higher or lower than the average rating, he will maintain this characteristic for a certain period of time. The formal representation is shown as follows: where is the average rating of the nth product, ratingn(x) is the rating of user x of the nth product, and user x rates N products in all. βx represents the deviation degree between the evaluation of user x and the average opinion. We believe that the bias βx of each user in the evaluation is stable. H3: a user can be impacted differently by the users with different influence. where FluidRating (x, y) is the peripheral route based on FluidRating. γxy is the parameter about the influence of different users.
Finally, based on the above hypotheses, we construct the ELMFluid, as shown in Figure 3.
4.2. Problem Formulation and Overview of the Model
Because the system is an idealized model, we make the following provisions:
Assumption 1. The system is a closed system; the entropy of the system is constant.
Assumption 2. The temperature in the container will not be changed until there is liquid exchanging in the container.
As in FluidRating, the new model is divided into 3 parts: one-way connection pipes, fluid, and containers. In our model, the container is used to represent each node in the relation graph and assume that the container is large enough. The bottom of the containers is at the same level so that liquid in the container has the same potential energy. The temperature of the fluid indicates users’ attitudes. There is no liquid in containers of nonraters before the system initializes because the nonraters have not given the rating yet. The one-way pipe represents the trust relationship and this part is the peripheral route.
We added one extra pipe to each container. The temperature of this liquid represents the self-generated internal attitude of the user, and that is the central route.
4.3. Algorithm Details
4.3.1. Parameter Initializing
When the model starts, the fluid height in the container h and the cross-sectional area b will be given. Until the empty container is filled with liquid, we divided the process of liquid exchange into k time slots from the start.
4.3.2. Fluid Updating Execution
The updating process is shown in Figure 4 and the update of the liquid volume between the adjacent containers is computed by the Bernoulli formula in physics. The updated volume from container a to container a′ at ith time slot is shown in (5). This part can also be understood as the volume in the pipe from container a to container a′.

Here, is the constant gravitational acceleration. Δt is the time gap and and are the fluid height in containers a and a′.
The volume of the fluid in a certain container is the remaining plus which is the fluid flowing in from each container which has the height difference and minus which is the fluid flowing out at the next time slot. So, the volume in container a at the i + 1th time slot could be figured out by
The temperature of the liquid in the container is calculated by the formula which figures out the temperature of the mixed liquid. is the temperature of the liquid in container a at the i + 1th time slot. is the temperature of the liquid flowing from container a″ to a.
4.3.3. Time Complexity Analysis
The algorithm is shown in Algorithm 1. Lines from 1 to 10 show the discrete process to figure out the liquid flowing in with k time slots. From line 2 to 6, we figure out the updated fluid which should flow to each nonrater’s container in each time slot. From line 7 to 9, we maintain the fluid height and temperature of the raters’ containers. Each round of fluid updating from line 1 to 10 takes a time complexity of O (V + E). So, the total time complexity of the algorithm in k time slots is O (k ∗ (V + E)). Finally, in line 11, we get the predicted rating represented by the temperature.
|
Above all, ELMFluid could be very efficient for real-world rating predictions involving millions of users because of the sparse network.
5. Evaluation
5.1. Experimental Design
5.1.1. Datasets
The datasets we used are from Epinions.com and Ciao.com. These two websites are very popular about the consumer reviewing [35]. Visitors can browse a variety of comments to help them make the right choice for shopping based on trust relationship. The Epinions dataset is a classical set, which is published by Massa and Avesani [4]. There is a total of 49,290 users with 487,181 trust relationships and a total of 139,738 different products with 664,824 reviews. Another dataset is crawled from the Ciao.com by Tang Jiliang. There are more than 57544 relationships on 2378 users and nearly 284086 ratings of 16,861 products. The basic information of two datasets is shown in Figures 5–7. The Epinions and Ciao data used to support the findings of this study are included within the Supplementary Material. Both datasets can also be downloaded form http://www.cse.msu.edu/tangjili/ [36] and for the reason of privacy protection, the original user ID has been processed and hashed.

(a)

(b)

(a)

(b)

(a)

(b)
The rating distribution of two datasets is shown in Figure 5.
It can be seen that most of them are concentrated on four stars and five stars. Users prefer to give praise. This appeals to common sense.
The distribution of the reviews is shown in Figure 6. The number of users with most reviews is lower than any other user group.
The distribution of the followers of the user is shown in Figure 7. Half of the users on Epinions have more than 15 followers and half of the users on Ciao have more than 30 followers.
It can be seen that the users of Ciao pay more attention to the trust relationship and users of Epinions pay more attention to the reviews. The users in Epinions usually give more reviews than Ciao and the users in Ciao follow more other users than Epinions.
5.1.2. Evaluation Method
To measure the prediction performance of the ELMFluid, we use the leave-one-out method [4, 37]. For the subgraph of a user, the true rating of that user is masked and then it will be predicted based on the subgraph. Then, the prediction error is the difference between the predicted rating and the masked rating.
In order to verify the propagation ability of the algorithm, for the target user, we tested three different calculation scopes of a target user. As is shown in Figure 8, the 1-hop subgraph only considers the target user and his followees with ratings. Then, we add all the followees of the target user into the 2-hop subgraph and if a followee of these has not given a rating, we also add his followees. The 3-hop subgraph is the next extension layer of the 2-hop subgraph.

The process of simulation is also very simple. Initialize the model based on the parameters in the table. The fluid height in the container is 10 and the cross-sectional area is 1. Then update the exchanging liquid according to the algorithm. We divided the process of liquid exchanging into 20 time slots and the predicted values (temperature values) of each time slot were recorded. Finally, we get the attitude of the target user.
Table 1 shows the parameter settings.
5.1.3. Accuracy Metrics
The accuracy evaluation methods we used are RMSE and F1-score [38].
Root Mean Square Error (RMSE) is a commonly used measure of the difference between the samples and total values.where N is the total number of user/item pairs which need to be predicted and Xobs,i and Xmodel,i denote the real ratings and predicted ratings.
The second performance metric is the classic F-score, which is the harmonic mean of precision and recall:
FN (the false-negative case) is the number of incorrect predictions when the ground truth is that the corresponding user has a rating score of lower than three (called negative rating). TP and FP (the true and the false-positive cases) are the numbers of correct and incorrect predictions, respectively, when the ground truth is that the corresponding user has a rating of no less than three (called positive rating).
A smaller RMSE indicates a better prediction. A larger F-score indicates a better prediction. So, when we get the prediction evaluation with a smaller RMSE value and a larger F-score at the same time, this shows that they are consistent, and the method is effective.
5.2. Experimental Results and Analysis
5.2.1. Impact of the Process Depth
In this part of the experiment, we compare the results of four runnings of our algorithm with different values of the central and peripheral routes. First, the central route and the peripheral route are the same. Second, the percent of the central route is larger than the peripheral route. The percentage of central route is 60 and the percentage of the peripheral route is 40. Third, the percentage of central route is 40 and the percentage of the peripheral route is 60. Last, we divide users into four types by the number of reviews and we set different parameters to different users as in Table 2 to show the personal processing depth.
The comparison of the results in Figures 9 and 10 shows that the personal processing depth is the most accurate for the highest F-score and smallest RMSE. These results meet hypothesis H1. The accuracy is 5% higher than the default model in each subgraph from RMSE, followed by the large proportion of the central route and then the small proportion and we can draw the same result from F-score. Under the similar recommended website based on trust relationship, the user’s central route is more important than the peripheral route because the prediction of a larger central route is more accurate than others except for the personalized process.

(a)

(b)

(a)

(b)
From the subgraph of different hops to analysis of the results, the 1-hop works best on the Epinions dataset, and the third hop results from the Ciao dataset are the best. The results on both datasets are almost identical and the nuances can be due to user’s different concerns because the results of three subgraphs are very close.
5.2.2. Impact of the Personality
The second experiment compares the default central route and the personalized central route which considers the personality. The first result is based on the default model and the second is based on the personalized rating preference which improves the central route of the model.
The results in Figures 11 and 12 show that the model with user’s personalized scoring habits is more effective and the two datasets produced similar results. The RMSE with personal improvement in both datasets is smaller than the default and the F-score is higher. In the Epinions dataset, three conditions with different hops are similar. The 2-hop and 3-hop on the Ciao dataset are better than the first. The accuracy of the personalized central route on both datasets is about 5% higher than the default, so we conclude that considering the preference of the user’s scoring, the predicted result is more accurate. These meet hypothesis H2.

(a)

(b)

(a)

(b)
5.2.3. Impact of the Trust Degree
In this part of the experiment, we use the number of users’ followers as personalized parameters. If the number of the followers of a user is greater than a common user, the temperature of the liquid flowing from the container represented by this user should be increased. When the user has fewer followers, the temperature should be decreased. In the process of parameter adjustment, if the coefficient of change is too small, the effect is not obvious. If the coefficient of change is large, the predictions are not good. So, we update the temperature by 10, where n is the number of followers. When the number of followers is higher, the number of this type of user is more sparse, so the logarithmic operation is added. The number of followers is less than 100, so the final range of the coefficients is [−0.1,0.1].
The results are shown in Figures 13 and 14.

(a)

(b)

(a)

(b)
From the results in Figures 13 and 14, by using the current parameters, we can get a slight improvement. In each subgraph of the two datasets, we get a slightly smaller or equal RMSE compared with the default condition. The F-score value is similar with the RMSE value.
It is worth noting that when we get equal RMSEs, this prediction is also meaningful if the F-score value is larger. Or if F-scores are equal, RMSE is smaller. The first case shows that although the deviation from our improvement is equal to the default method, we get truer positive cases, just like our results from the Epinions. The second case shows that when we get the same number of true positive cases, our prediction deviation is less from the ground truth, just like our results from the Ciao.
5.2.4. Multiple Algorithm Comparison
In this section, we tested the comparison among the ELMFluid model constructed by individual factors in the case of previously set parameters, every single factor, and other methods including the FluidRating model, the basic model of this article. Because a smaller RMSE means a better result, while a larger F-score means a better result, from the results in Figures 15 and 16, the superiority of our algorithm has been clearly seen compared with the classical algorithms, including TidalTrust, MoleTrust, RandomWalk, PageRank, and FluidRating. On the three subgraphs with different hops, we reached a similar conclusion. From RMSE metrics, in the Epinions dataset, the accuracy of the user character role theory increased 10% than FluidRating and processing depth contribute 5%. For the Ciao dataset, the accuracy improvements were 15% and 20%, respectively. That means we have a smaller error. From the F-score metrics, we can come to the consistent conclusion with the RMSE metric that we have higher F-score values. That means we get truer positive cases.

(a)

(b)

(a)

(b)
5.2.5. A Deeper Discussion
In this part of the experiment, we tested the effect of time evolution from the 1st time slot to the 20th time slot on the model first. From the result in Figure 17, each iteration of the model has little impact on the model. The result has little change from the first time slot to the end.

(a)

(b)
Next, we conduct experiments on users with different attributes. In the Epinions dataset, we set up two subsets with the users whose followers are more than 10 and more than 20. In the Ciao dataset, two new datasets are made up of the users whose followers are more than 20 and 40, respectively.
Another experiment is also conducted under different subsets. In the Epinions dataset, users whose reviews are more than 20 and more than 40 will compose the two new datasets and the number of users’ reviews of two new Ciao subsets is more than 10 and 20.
The reason for dividing the new dataset in this way is that each subset accounts for 30% or 60% of the full dataset approximately, which is good for our comparison between the users with different attributes.From the first experiment, it can be seen in Figures 18 and 19 that in the Epinions dataset, the users with the number of followers greater than 20 are predicted the most accurately with the smallest RMSE and the highest F-score. Then, the users with the number of followers greater than 10 are the second and the complete dataset has the worst predicted results. We can draw a similar conclusion on the Ciao dataset and the little difference is that we get the lightly truer positive cases from the subdataset made up of the users with the number of followers greater than 20. So usually, the more followers the user has, the more accurate our model predicts.

(a)

(b)

(a)

(b)
The second experiment shown in Figures 20 and 21 studies the relationship between the model’s accuracy and the number of user reviews. It can be seen in Epinions dataset that, generally, users with more reviews will get smaller RMSE, higher F-score, so they can get more accurate prediction than users with fewer reviews, but when the number of reviews exceeds a certain range, the accuracy of the prediction result will be reduced. The results of the Ciao dataset also confirm this point.

(a)

(b)

(a)

(b)
6. Conclusion and Future Work
In this article, we combine the user personality theory and FluidRating as the central route and peripheral route of ELM. This method starts from the user’s internal psychological structure so that it can better express the spread of mental energy through thermodynamics and fluid mechanics. The method is closer to the user’s true psychological state, so it can predict more accurately and comprehensively. This paper verifies the rationality of the model and the validity of the hypothesis through experiments on two real datasets.
In future work, the model can be extended by the role of the user in the community, as well as the user’s sense of efficacy in community, and the user’s psychological process needs to be expressed with more accurate physical form. Other psychological concepts and analytical methods should also be introduced, such as the conformity biases and more detailed personality by the users’ comments and the LIWC dictionary.
Data Availability
The data used to support the findings of this study are included within the article.
Disclosure
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Conflicts of Interest
The authors declare that they have no conflicts of interest regarding the publication of this paper.
Acknowledgments
This paper was supported by the National Natural Science Foundation of China under Grants nos. 61672179, 61370083, and 61402126, the Research Fund for the Doctoral Program of Higher Education of China under Grant no. 20122304110012, the Youth Science Foundation of Heilongjiang Province of China under Grant no. QC2016083, Heilongjiang Postdoctoral Fund no. LBHZ14071, and the Fundamental Research Funds for the Central Universities no. HEUCF20181312.