pglewis said:
It still seems to me that there is an appreciable amount of quantization distortion of high-frequency content that is still well in the audible range at 44.1kHz. Even 8k frequency components get sampled with less than 6 points per cycle at 44.1.
I don't think there
is any quantization error. An 8k frequency component means a
sine wave of 8k. It only takes two sampling points to represent a sine wave (ignoring the problem of aligning the sampling freqency with the sine wave properly... which is why the Nyquist limit is usually stated as
below half the sampling frequency, not equal to or below).
So, 6 points per cycle on an 8k signal is all you need to
perfectly define the sine wave. Now, perhaps you're thinking is that it's not a perfect sine wave, but has some irregularities which limited sampling points will not accurately capture.

And what causes those irregularities? Higher frequency components!

So, in a sense, you are correct in thinking that a
real instrument playing at 8k might not be accurately represented by 6 sampling points, but that is only because there are harmonics created by the instrument that are
well above 8k, which add color etc. up to a point (when they become inaudible... or imperceptible, take your pick on that argument). A true 8k signal is perfectly captured by 6 points (or three, or perhaps two).
So, while it is true that using 44.1 kHz sampling frequency means you can only capture
sine waves up to 22 kHz, it probably doesn't matter much. If it weren't a
true sine wave, then it would be because of higher (than 22 kHz) frequency components, which are probably inaudible or imperceptible anyway.
As pointed out above, your ear just doesn't know the difference between a 20 kHz sine wave, square wave, dirty sine, or anything in between.
And I agree with the statements up above that the antialiasing filter is probably the biggest single benefit of using a higher sampling frequency. The high frequency components captured by the higher sampling rate are probably not audible (or perceptible) for most people on the vast majority of reproduction equipment, but the antialasing can be. I may be wrong, but I think some A/D converters actually sample at a higher rate, apply the filter, and then downsample. Even though there are errors introduced by the downsampling, in some cases they are less perceptible those caused by high frequency filtering.
At least, that's what I seem to recall from my very limited knowledge of digital audio theory.