Funder
Number of items: 1.
H
Shi, Chengchun, Uehara, Masatoshi, Uehara, Masatoshi, Huang, Jiawei, Jiang, Nan
(2022).
A minimax learning approach to off-policy evaluation in confounded Partially Observable Markov Decision Processes.
Proceedings of Machine Learning Research,
picture_as_pdf