Research Article
Enhancing Fairness in Federated Learning: A Contribution-Based Differentiated Model Approach
Algorithm 2
DQN-based contribution scores update for FL.
| | Input: current state | | | Output: contribution scores set | | (1) | Initialize the global model parameters for each cluster, experience replay memory | | (2) | Initialize action-value function with random weights | | (3) | Initialize target action-value function with weights | | (4) | for to do | | (5) | With probability select random actions | | (6) | Otherwise select actions | | (7) | Execute actions , update contribution score for each client. | | (8) | Obtain clustering results by using Algorithm 1 based on contribution score | | (9) | Run the FL algorithms independently on each cluster | | (10) | Observe reward and next state | | (11) | Store transition in | | (12) | Sample random mini-batch of transitions from | | (13) | Update weights by minimizing the loss | | (14) | At every certain step, update the target network weights: | | (15) | end for |
|