Artificial replay: a meta-algorithm for harnessing historical data in bandits

Publication
Preprint