“Reasoning with Sampling” — Notes on Karan & Du (2025)
kosti.bearblog.dev·15h
Flag this post

“Reasoning with Sampling” — Notes on Karan & Du (2025)

  • 09 Nov, 2025 *

I read this nice paper on power sampling and reasoning for LLMs1.

LLMs can generate reasoning before answering a request and they will be more accurate if they do so, e.g., o1 from OpenAI. While there are experiments that indicate that a lot of the reasoning is surface-level or losing accuracy quickly once we move out of distribution (see work by Apple as well this review work by Mondorf and Plank), there is a whole movement now to use various techniques from RL to g…

Similar Posts

Loading similar posts...