I cannot, for the sake of my life, figure out what is the need of 192Khz. I did some recording tests. I was not able to get any ultrasonics in that range. Looked at some music. Any content in the 50-96khz response was non-existant. So why, WHY, WHY SHOULD I BE INDULGING IN IT? To me it just seems like its wasting space. Unless people like the sound of ultrasonic noise.
Also a 80 minute 5.1 24/192 file is pretty big.
I was wondering about that 10 years or so ago. So a test. Let's try put any difference between 96k and 192k out of the range of perception bias and into the obvious. So 100 lossy conversions back and forth between 96k and 192k! If something is going to snowball, this should do it. (Seen those Youtube videos re-uploaded 100 times and it's just noise artifacts at the end? Like that.) Using SOX sample rate conversion at highest settings.
Obviously the first conversion from 192k to 96k will alter the data set and not completely null. We're looking for cumulative damage though. If the premise is that 96k somehow cannot hold a complete audio signal (even if just in the decimal dust), then it should snowball over many iterations. Like wav to mp3 would.
The first 192k -> 96k -> 192k.
A/B those two 192k files and they sound the same. (Subjective and all.)
Null them and the difference is not audible at normal monitor settings. The meter wiggles around -100db.
Cut to the chase: Iteration 100 nulled perfectly with iteration 3. Once that initial ultrasonic margin noise was sliced off, the audio stayed digitally perfectly intact over 100 lossy conversions between 192k and 96k.
My conclusion is that if someone determines there's some restoration processing math that can achieve something with sample rates above 96k, that might be all well and good. Not sure how you might prove that. But at the end of the day a 96k "container" can hold a 100% complete audio signal.
Keeping your audio always 24 bit and using balanced connections for analog signals makes an actual difference. If you have some consumer DAC that sounds more accurate at HD vs SD sample rates, you can always upsample any SD program as a workaround.
PS. No one knows what the sound of ultrasonic noise is because no audio speakers reproduce that high up. Or human hearing. If someone things they're "feeling" something... Well it's not coming out of your speakers. Perceiving transistor aura perhaps?
PSS. In listening tests I've done in the past I could hear a difference in the DAC running at SD vs HD sample rates in my older MOTU interfaces. I can't for the life of me tell the difference in the Apogee units. So exactly what I paid for there! Transparent operation even at SD sample rates. 24 bit at 96k seems like a sweet spot to "set it and forget it" and have perfect audio reproduction no matter what edge case. But any sample rate with 24 bit is a really near perfect container.