Keynote Talk: Sample Complexity of Partition Identification Using Multi-armed Bandits
Sandeep Juneja
School of Technology and Computer Science,
Tata Institute of Fundamental Research
Given a vector of probability distributions, or arms, each of which can be
sampled independently, we consider the problem of identifying the
partition to which this vector belongs from a finitely partitioned
universe of such vector of distributions. We study this as a pure
exploration problem in multi armed bandit settings and develop sample
complexity bounds on the total mean number of samples required for
identifying the correct partition with high probability. This framework
subsumes well studied problems in the literature such as finding the best
arm or the best few arms. The partitions considered include half spaces,
polytopes or their complements, or more generally, convex sets or their
complements. In these settings, exploiting problem geometry, we
characterise the lower bounds on mean number of samples for each arm.
Further, we propose algorithms that can match these bounds asymptotically
with decreasing probability of error. Applications of this framework are
diverse. We briefly discuss one associated with nested Monte Carlo and its
applications to finance.