Have you ever seen the term “16-bit” used in relation to audio? Do you ever wonder what the heck it means? 16-bit audio is probably the most common type of digital audio. Also, bit-digital we’ve been listening to it for decades since CDs use 16-bit audio. It all has to do with ones and zeros. The number of bits is sometimes referred to as “bit depth.” Whatever. What you need to know is that we’re talking about the length of words here…digital words. I referred to “ones and zeros” above because computer language is all math and the highest single digit is “1.” So what does this have to do with audio? Hold on a sec. Let’s talk math for a bit (ha!).
How Bit Depth is Like Champagne
Think about human math. The highest single digit is 9. Now imagine our math had only 1 digit available. If I try to text a friend how many bottles of champagne I have for the party (we have ten), ichimame I would not be able to convey that information very well. I’d have to say something like “I have 9 bottles and I also have 1 bottle.” Silly huh? If I could use 2 digits, now I can say “I have 10 bottles.” I can convey more information. But that’s only good up to 99 bottles. To say I had any number between 100 and 999 bottles (woo hoo, now that’s a party!), I’d need another digit. And so on. The more digits I have available, the more and better information I can convey.
Here is how this relates to digital audio. During analog-to-digital conversion sounds is captured or “sampled” thousands of times per second. The converter has to be able to describe how loud or quiet (amplitude) the audio from each sample is. And since computers use binary language, thewordcounter ones and zeros, they don’t get the luxury of 9 “levels” per digit like us (well, it’s 10 really, but what good is zero bottles of champagne really?). They only have 2 levels per digit, a 1 or a 0. So converters need several digits to accurately represent that amplitude. The more digits available, the longer the “digital word” and the wider the dynamic range that can be accurately represented and played back on our systems. Each digit is actually called a “bit” (hey remember the Tron reference and the bit character that can only say “yes” or “no”?) and we need a long enough digital word to reproduce really quiet sounds as well as the loud ones. If the word is sort, say 8 bits long, our audio would have quite a lot of noise.
So how much dynamic range do we need? Dynamic range is the difference between the quietest sound and the loudest sound. If your digital converters can’t translate enough dynamic range for a vocal recording, that narrow dynamic range will bottom out (at the digital noise floor) before it can deal with the quietest parts of, say, an audio recording. So instead of a nice intimate vocal you get an ugly noise.
How do you know how much dynamic range you need?
In theory, the more the better. But in real life you want the “real” (or “analog”) noise, stuff like computer drive noise, electronics, and ambient room sounds to be louder than the digital noise floor. In most systems these days, you can go all the way down to about -80 to -90 dB or so before you just can’t get the “real” noise any quieter (remember that in digital audio, Zero dB is the loudest point or “digital ceiling,” so loudness is depicted as negative numbers). So in theory, as long as the digital noise floor is lower than -90dB, you should be OK.
So how do you know how what number bits in a digital word will give you enough digital dynamic range?
Here’s the basic rule-of-thumb. For every bit (digit) added to a digital word, you get 6 dB or added dynamic range. So if your converters use 8-bit conversion, you get 48 dB of dynamic range (8 x 6). That means you can record audio whose loudest peak is just under the 0 dB mark, only-and-one and whose quietest part is right around -48dB. Anything quieter than -48 dB won’t be converted properly and will sound a mess! Just garbage-y awfulness. A 16-bit converter gets you 96 dB of dynamic range, which is well beneath our analog noise floor of -80 to -90 dB.
The Rest is History
With the above knowledge, the audio industry sort of adopted 16-bit audio as its standard, which among other things is why music CDs are 16-bit audio. For most people recording in home recording studios, 16-bit audio will probably be good enough. Your computer can also handle 16-bit audio files faster than higher bit words (24, 48 and 96 are other common bit lengths in audio).
There is more to tell about the properties and relative advantages of using 16, 24 or even higher bit rates. Then there’s the whole issue of sampling rate (you’ve probably heard or seen the term 44.1 KHz bandied about). I will address these in other articles, giving each the Home Brew Audio treatment so “regular people” can understand the concepts.