Author: Su, C.G.
Paper Title Page
TUPB036
Optimisation of RFQ Transmission Efficiency Based on Reinforcement Learning Control Policy  
 
  • C.G. Su, Z.J. Wang
    IMP/CAS, Lanzhou, People’s Republic of China
 
  The reinforcement learning (RL) algorithm is utilized to control the low-energy beam transport (LEBT) and radiofrequency quadrupole (RFQ) in linear accelerators, with the aim of improving RFQ transmission efficiency, achieving high beam intensity, reducing debugging time, and improving operational efficiency. A neural network model is established as part of the Interaction environment to partially replace the Tracewin software for RL training proceess. The SAC algorithm is a reinforcement learning algorithm used to optimize control policies for continuous action spaces. By using the SAC algorithm and interacting with the neural network model, a policy was trained to control the LEBT solenoids, optimizing the RFQ transmission efficiency to above 95% on the simulation software Tracewin. To test the generalization ability of the strategy, we applied it to a real accelerator and successfully validated its ability to optimize the RFQ transmission efficiency. The results demonstrate that RL policy trained in simulation-based environments can be applied on real accelerator control.  
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)