LSE creators
Number of items: 1.
Report
Ma, Tao, Yang, Xuzhi, Szabo, Zoltan (2024). To switch or not to switch? Balanced policy switching in offline reinforcement learning. arXiv. https://doi.org/10.48550/arXiv.2407.01837