Abstract: Conventional deep reinforcement learning (DRL) frameworks for power electronics face several critical limitations, including non-real-time training environments, poor generalization due to ...