Transfer Reinforcement Learning for Mixed Observability Markov Decision Processes with Time-Varying Interval-Valued Parameters and Its Application in Pandemic Control
点击次数:
发表时间:2024-11-02
发表刊物:INFORMS JOURNAL ON COMPUTING
页面范围:1-23
ISSN号:1526-5528
发表时间:2024-11-02