A Method for Nuclear Spin State Control Based on Reinforcement Learning

doi:10.11938/cjmr2026-3214

Chinese Journal of Magnetic Resonance (波谱学杂志)

Fu Gaocheng, Zhang Shiji, Huang Kai, Wei Daxiu, Yao Yefeng

Shanghai Key Laboratory of Magnetic Resonance, Institute of Magnetic Resonance and Molecular Imaging in Medicine, School of Physics, East China Normal University, Shanghai 200241, China

Received: 2026-03-20 Revised: 2026-04-21 Accepted: 2026-05-11
Corresponding authors: Wei Daxiu; Yao Yefeng E-mail: dxwei@phy.ecnu.edu.cn; yfyao@phy.ecnu.edu.cn
Funding:
National Key Research and Development Program of China (2023YFF1204801)

Abstract: High-fidelity preparation of target spin states underpins high-sensitivity detection in nuclear magnetic resonance (NMR) spectroscopy. The conventional gradient ascent pulse engineering (GRAPE) algorithm is sensitive to its initial guess and prone to becoming trapped in local extrema, whereas reinforcement learning (RL) on its own often suffers from policy divergence caused by sparse rewards when applied to complex quantum systems. To address these limitations, this paper proposes a hybrid optimization framework that runs RL and GRAPE in series. The framework uses the model-free nature of RL for a global search over evolution trajectories and then applies GRAPE for local, continuous gradient-based fine-tuning. Liquid-state NMR experiments on the two-spin ¹H system of citric acid show that the method reliably generates robust pulses with a theoretical fidelity above 0.99 despite the chemical shift drift induced by pH variation. In addition, the markedly shorter pulse duration reduces the coherence loss caused by relaxation. By balancing evolution time against parameter robustness, this control strategy provides a reliable methodological basis for detecting complex metabolites in clinical spectroscopy.

Key words: Nuclear Magnetic Resonance Spectroscopy, Optimal Control Pulse, Citric Acid, Reinforcement Learning
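
The two-stage idea in the abstract (a global search followed by local gradient fine-tuning, scored by a fidelity averaged over a range of chemical-shift offsets) can be illustrated with a deliberately small toy. The sketch below is not the authors' method: it uses a single spin-1/2 in the Bloch picture instead of the two-spin citric acid system, plain random search as a stand-in for the RL stage, and finite-difference gradient ascent as a stand-in for GRAPE's analytic gradients. All function names, the dimensionless units (rotation angle per segment), and the offset values are assumptions chosen for illustration.

```python
import math
import random

def rotate(m, axis, angle):
    """Rodrigues rotation of vector m about the unit vector `axis` by `angle` (rad)."""
    kx, ky, kz = axis
    c, s = math.cos(angle), math.sin(angle)
    dot = kx * m[0] + ky * m[1] + kz * m[2]
    cross = (ky * m[2] - kz * m[1], kz * m[0] - kx * m[2], kx * m[1] - ky * m[0])
    return tuple(m[i] * c + cross[i] * s + axis[i] * dot * (1.0 - c) for i in range(3))

def fidelity(params, offsets, dt, target=(0.0, 0.0, -1.0)):
    """Mean state-transfer fidelity (1 + M.target)/2 over all offsets.

    params is a flat list [ux0, uy0, ux1, uy1, ...] of piecewise-constant
    RF amplitudes; the spin starts along +z and relaxation is ignored.
    """
    total = 0.0
    for off in offsets:
        m = (0.0, 0.0, 1.0)
        for k in range(0, len(params), 2):
            ux, uy = params[k], params[k + 1]
            w = math.sqrt(ux * ux + uy * uy + off * off)
            if w > 0.0:
                m = rotate(m, (ux / w, uy / w, off / w), w * dt)
        total += 0.5 * (1.0 + sum(a * b for a, b in zip(m, target)))
    return total / len(offsets)

def coarse_search(offsets, dt, n_seg, u_max, n_trials, rng):
    """Stage 1 (stand-in for RL): keep the best of n_trials random pulses."""
    best, best_f = None, -1.0
    for _ in range(n_trials):
        cand = [rng.uniform(-u_max, u_max) for _ in range(2 * n_seg)]
        f = fidelity(cand, offsets, dt)
        if f > best_f:
            best, best_f = cand, f
    return best, best_f

def refine(params, offsets, dt, n_iter=100, step=0.5, eps=1e-6):
    """Stage 2 (stand-in for GRAPE): finite-difference gradient ascent.

    A step is accepted only if it raises the fidelity, so the result is
    never worse than the starting pulse.
    """
    params = list(params)
    f = fidelity(params, offsets, dt)
    for _ in range(n_iter):
        grad = []
        for k in range(len(params)):
            shifted = list(params)
            shifted[k] += eps
            grad.append((fidelity(shifted, offsets, dt) - f) / eps)
        trial = [p + step * g for p, g in zip(params, grad)]
        f_trial = fidelity(trial, offsets, dt)
        if f_trial > f:
            params, f = trial, f_trial
            step *= 1.2   # grow the step after a success
        else:
            step *= 0.5   # shrink the step after a failure
    return params, f

# Hypothetical offsets (rad per segment) standing in for chemical-shift drift.
rng = random.Random(0)
offsets = [-0.3, -0.15, 0.0, 0.15, 0.3]
coarse, f_coarse = coarse_search(offsets, dt=1.0, n_seg=10, u_max=1.0,
                                 n_trials=200, rng=rng)
refined, f_refined = refine(coarse, offsets, dt=1.0)
print(f"coarse search fidelity:    {f_coarse:.4f}")
print(f"after gradient refinement: {f_refined:.4f}")
```

Averaging the fidelity over several offsets is what makes the resulting pulse robust in this toy: a pulse that only works on resonance scores poorly on the shifted copies. A faithful implementation would replace the Bloch rotation with two-spin propagators including the J coupling, bound the RF amplitude during refinement, and use an actual RL agent for the first stage.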

Fu Gaocheng, Zhang Shiji, Huang Kai, Wei Daxiu, Yao Yefeng. A Method for Nuclear Spin State Control Based on Reinforcement Learning[J]. Chinese Journal of Magnetic Resonance, doi: 10.11938/cjmr2026-3214.
