PPO-Q: Proximal Policy Optimization with Parametrized Quantum Policies or Values