LSE creators

Number of items: 1.

Report

Ma, Tao, Yang, Xuzhi, Szabo, Zoltan (2024). To switch or not to switch? Balanced policy switching in offline reinforcement learning. arXiv. https://doi.org/10.48550/arXiv.2407.01837

Up a level

EndNote

BibTeX

Reference Manager

Refer

Dublin Core

JSON

Multiline CSV

Atom RSS

Report