Weighted Quantile Weirdness and Bugs
practicalsignificance.com·10h·
Discuss: Hacker News
Flag this post

At some point in their career, many statisticians learn that calculating a quantile can be surprisingly complicated. In probability theory, quantiles have certain elegance: they’re simply defined based on the inverse of a cumulative distribution function. But when it comes time to actually estimate a quantile or compute it using real-world data, it’s surprisingly complicated, and it turns out that there are multiple reasonable ways to define a quantile for a dataset. Different software packages like SAS or R each offer at least a half-dozen ways to compute quantiles, with subtle differences. That’s why I often see data analysts surprised by the fact that their preferred software package spits out a different estimate for the median compared to someone else’s software.

Things get stil…

Similar Posts

Loading similar posts...