model
Initializes the weights of a neural network layer using the Xavier normal distribution and sets biases to zero.

Policy network for generating actions and corresponding log-probabilities.

Q-network for estimating the state-action value in reinforcement learning.

Implementation of the Soft Actor-Critic (SAC) algorithm for reinforcement learning.
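
The weight initialization described above can be illustrated with a minimal, dependency-free sketch. It is not the repository's code: the function name `xavier_normal_init`, its parameters, and the layer sizes are hypothetical, chosen only for illustration. It assumes the usual Xavier (Glorot) normal rule, in which weights are drawn from a zero-mean normal distribution with standard deviation `gain * sqrt(2 / (fan_in + fan_out))`.

```python
import math
import random


def xavier_normal_init(fan_in, fan_out, gain=1.0, seed=None):
    """Return (weights, biases) for a dense layer (hypothetical helper).

    weights is a fan_out x fan_in nested list drawn from N(0, std^2),
    where std = gain * sqrt(2 / (fan_in + fan_out)); biases are all zero.
    """
    rng = random.Random(seed)  # local generator so results are reproducible
    std = gain * math.sqrt(2.0 / (fan_in + fan_out))
    weights = [
        [rng.gauss(0.0, std) for _ in range(fan_in)]
        for _ in range(fan_out)
    ]
    biases = [0.0] * fan_out
    return weights, biases


# Example: a layer mapping 256 inputs to 128 outputs (sizes are assumptions).
weights, biases = xavier_normal_init(fan_in=256, fan_out=128, seed=0)
```

In a PyTorch codebase the same effect is usually obtained by calling `torch.nn.init.xavier_normal_` on a layer's weight tensor and `torch.nn.init.zeros_` on its bias, rather than sampling by hand.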