online decision-making

Sequential Fair Allocation With Replenishments: A Little Envy Goes An Exponentially Long Way

We study the trade-off between envy and inefficiency in repeated resource allocation settings with stochastic replenishments, motivated by real-world systems such as food banks and medical supply chains. Specifically, we consider a model in which a …

Pricing and Optimization in Shared Vehicle Systems: An Approximation Framework

Optimizing shared vehicle systems (bike/scooter/car/ride-sharing) is more challenging compared to traditional resource allocation settings due to the presence of *complex network externalities* -- changes in the demand/supply at any location affect …

Adaptive Discretization for Model-Based Reinforcement Learning

We introduce the technique of adaptive discretization to design efficient model-based episodic reinforcement learning algorithms in large (potentially continuous) state-action spaces. Our algorithm is based on optimistic one-step value iteration …

Online Allocation and Pricing: Constant Regret via Bellman Inequalities

We develop a framework for designing simple and efficient policies for a family of online allocation and pricing problems, that includes online packing, budget-constrained probing, dynamic pricing, and online contextual bandits with knapsacks. In …

Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces

We present an efficient algorithm for model-free episodic reinforcement learning on large (potentially continuous) state-action spaces. Our algorithm is based on a novel Q-learning policy with adaptive data-driven discretization. The central idea is …

Sequential Fair Allocation With Replenishments: A Little Envy Goes An Exponentially Long Way

Pricing and Optimization in Shared Vehicle Systems: An Approximation Framework

Adaptive Discretization for Model-Based Reinforcement Learning

Online Allocation and Pricing: Constant Regret via Bellman Inequalities

Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces

Predict and Match: Prophet Inequalities with Uncertain Supply

Uniform Loss Algorithms for Online Stochastic Decision-Making With Applications to Bin Packing

Predict and Match: Prophet Inequalities with Uncertain Supply

The Bayesian Prophet: A Low-Regret Framework for Online Decision Making

Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces