Infrequent Exploration in Linear Bandits
arxiv.org·19h

Abstract: We study the problem of infrequent exploration in linear bandits, addressing a significant yet overlooked gap between fully adaptive exploratory methods (e.g., UCB and Thompson Sampling), which explore potentially at every time step, and purely greedy approaches, which require stringent diversity assumptions to succeed. Continuous exploration can be impractical or unethical in safety-critical or costly domains, while purely greedy strategies typically fail without adequate contextual diversity. To bridge these extremes, we introduce a simple and practical framework, INFEX, explicitly designed for infrequent exploratio…
