Expertise and statistics

27 July 2023 psychology, statistics, education, communication, personal development, werk

It’s been a while since I wrote anything here, so I’m going to briefly write about some things that I learned recently.

An article with the captivating title “Ability to See Expertise is a Milestone Worth Aiming For” discusses how being able to see expertise requires expertise, and therefore worth striving for if you care about knowing what you’re doing. (There’s also a tangent (or main thread?) of how you should use social networks (no, not websites) to get a better salary.) I’ve recognized this in myself and others: that you were missing or recently gained the ability to see the expertise of someone or fully appreciate a particular insight.

One such moment was reading this article, on how long it takes to become gaussian (for convoluting non-gaussian distribution, i.e. having various sets of sampling from an arbitrary non-gaussian distribution). Turns out: sometimes really long! I’ve not had much of a statistical education, but it has always had my interested ever since I discovered a quantity that had a non-gaussian distribution, which illuminated for me the fact that we assume this perhaps way too often. A quote written after my own heart:

Blindly slapping a normal distribution on things is a convenience from the time when we didn’t have fast computers, because the normal distribution has nice theoretical properties that make pen-and-paper analysis convenient.

I failed to convince people in my time working in a hospital research group of this, probably because I lacked the ability to express myself as a layman as described in the Expertise article. The Gaussian is everywhere in medicine, and the mean is usually the only thing that’s calculated and reported. I argued that we had the data to investigate this, but it fell on deaf ears, and since I was not tasked with data analysis I didn’t have the time to do it myself. What I learned now is a bit of vocabulary to express this, and also the sifnificance of mathematical moments. I’m sure someone has tried to educate me on them, but I never remembered the concept or their significance. I would use the difference between the measures of centralcy to explain myself, and now I can use the four basic moments instead, because distributions can be categorized according to these values. A table of P(exceeding) for the normal distribution and the Cantelli and Chebychev estimates is a good way to illustrate the relevance for medicine: outliers may be much more normal than doctors think!

Sigma	Normal	Cantelli	Chebyshev
1	16 %	50 %	50%
2	2.3 %	20 %	25%
3	0.14 %	10 %	11.11%
4	0.003 %	6 %	6.25%
5	0.00003 %	4 %	4%