Research Article

The Recommending Agricultural Product Sales Promotion Mode in E-Commerce Using Reinforcement Learning with Contextual Multiarmed Bandit Algorithms

Table 3

Simulated user website action score.

Action typeScoreDescription

Click on the product1Write a score when you click to enter the product page
Adding to shopping cartProduct price multiplied by coefficientWhen adding a product into the shopping cart, write a score
Shopping cart checkoutCommodity price multiplied by coefficientWrite the product price into a score when the shopping cart is checked out