MP3 Players
MPEG Audio Layer 3, more commonly referred to as MP3, is a popular digital audio encoding and lossy compression format invented and standardized in 1991 by a team of engineers working in the framework of the ISO/IEC MPEG audio committee under the chairmanship of Professor Hans Musmann (University of Hannover - Germany). It was designed to greatly reduce the amount of data required to represent audio, yet still sound like a faithful reproduction of the original uncompressed audio to most listeners. In popular usage, MP3 also refers to files of sound or music recordings stored in the MP3 format on computers.
Overview
MP3 is a compression format. It provides a representation of pulse-code modulation-encoded (PCM) audio data in a much smaller size by discarding portions that are considered less important to human hearing (similar to JPEG, a lossy compression for images).
A number of techniques are employed in MP3 to determine which portions of the audio can be discarded, including psychoacoustics. MP3 audio can be compressed with different bit rates, providing a range of tradeoffs between data size and sound quality.
The MP3 format uses, at its heart, a hybrid transformation to transform a time domain signal into a frequency domain signal:
32-band polyphase quadrature filter
36 or 12 tap MDCT; size can be selected independent for sub-band 0...1 and 2...31
Aliasing reduction postprocessing
MP3 Surround, a version of the format supporting 5.1 channels for surround sound, was introduced in December 2004. MP3 Surround is backward compatible with standard stereo MP3, and file sizes are similar.
In terms of the MPEG specifications, AAC (Advanced audio coding) from MPEG-4 is to be the successor of the MP3 format, although there has been a significant movement to create and popularize other audio formats. Nevertheless, any succession is not likely to happen for a significant amount of time due to MP3's overwhelming popularity (MP3 enjoys extremely wide popularity and support, not just by end-users and software but by hardware such as DVD and CD players).
MP3 goes public
A reference simulation software written in C language known as ISO 11172-5 was developed by the members of the ISO MPEG Audio committee in order to produce bit compliant MPEG Audio files (Layer 1, Layer 2, Layer 3). Working in non real time on a number of operating systems it was able to demonstrate the first real time hardware decoding (DSP based) of compressed audio. Some other real time implementation of MPEG Audio encoders were available for the purpose of digital broadcasting (radio DAB, television DVB) towards consumer receivers and set top boxes.
Later on, on July 7, 1994 the Fraunhofer Society released the first software MP3 encoder called l3enc. The filename extension .mp3 was chosen by the Fraunhofer team on July 14, 1995 (previously, the files had been named .bit). With the first real-time software MP3 player Winplay3 (released September 9th, 1995) many people were able to encode and playback MP3 files on their PCs. Because of the relatively small hard drives back in that time (~500 MB) the technology was essential to store music for listening pleasure on a computer.
MP2, MP3 and the Internet
In October 1993, MP2 (MPEG-1 Audio Layer 2) files appeared on the Internet and were often played back using the Xing MPEG Audio Player, and later in a program for Unix by Tobias Bading called MAPlay, which was initially released on February 22nd, 1994 (MAPlay was also ported to the Microsoft Windows).
Initially the only encoder available for MP2 production was the Xing Encoder, accompanied by the program CDDA2WAV, a CD ripper that transformed CD audio tracks to computer data files.
The Internet Underground Music Archive (IUMA) is generally recognized as the start of the on-line music revolution. IUMA was the Internet's first high-fidelity music web site, hosting thousands of authorized MP2 recordings before MP3 or the web was popularized. IUMA was started by Rob Lord (who later headed pioneering Nullsoft) and Jeff Patterson, both from the University of California, Santa Cruz, in 1993. Other founding members include Jon Luini, Brandee Selck, and Ahin Savara.
In the first half of 1995 through the late 1990s, MP3 files began flourishing on the Internet. MP3 popularity was mostly due to, and interchangeable with, the successes of companies and software packages like Nullsoft's Winamp (released in 1997), mpg123, and Napster (released in 1999). Those programs made it very easy for the average user to playback, create, share, and collect MP3s.
Controversies regarding peer-to-peer file sharing of MP3 files have flourished in recent years — largely because high compression enables sharing of files that would otherwise be too large and cumbersome to share. Due to the vastly increased spread of MP3s through the Internet some major record labels reacted by filing a lawsuit against Napster to protect their Copyrights (see also intellectual property).
Commercial online music distribution services (like the iTunes Music Store) usually prefer other/proprietary music file formats that support Digital Rights Management (DRM) to control and restrict the use of digital music. The use of formats that supports DRM is in an attempt to prevent piracy of copyright protected materials, but any computer savvy person can easily rip the DRM from a song file turning it into a file that is not locked to any computer.
Quality of MP3 audio
Because MP3 is a lossy format, it is able to provide a number of different options for its "bit rate"—that is, the number of bits of encoded data that are used to represent each second of audio. Typically rates chosen are between 128 and 256 kilobit per second. By contrast, uncompressed audio as stored on a compact disc has a bit rate of about 1400 kbit/s.
MP3 files encoded with a lower bit rate will generally play back at a lower quality. With too low a bit rate, "compression artifacts" (i.e., sounds that were not present in the original recording) may appear in the reproduction. A good demonstration of compression artifacts is provided by the sound of applause: it is hard to compress because it is random, therefore the failings of the encoder are more obvious, and are audible as ringing.
As well as the bit rate of the encoded file, the quality of MP3 files depend on the quality of the encoder and the difficulty of the signal being encoded. For average signals with good encoders, many listeners accept the MP3 bit rate of 128 kibit/s as near enough to compact disc quality for them, providing a compression ratio of approximately 11:1. However, listening tests show that with a bit of practice many listeners can reliably distinguish 128 kbit/s MP3s from CD originals; in many cases reaching the point where they consider the MP3 audio to be of unacceptably low quality. Yet other listeners, and the same listeners in other environments (such as in a noisy moving vehicle or at a party) will consider the quality acceptable. Obviously, imperfections in an MP3 encode will be much less apparent on low-end computer speakers than on a good stereo system connected to a computer or -- especially -- using high-quality headphones.
Encoding of MP3 audio
The MPEG-1 standard does not include a precise specification for an MP3 encoder. The decoding algorithm and file format, as a contrast, are well defined. Implementers of the standard were supposed to devise their own algorithms suitable for removing parts of the information in the raw audio (or rather its MDCT representation in the frequency domain).During encoding 576 time domain samples are taken and is transformed to 576 frequency domain samples. If there is a transient 192 samples are taken instead of 576. This is done to limit the temporal spread of quantization noise accompanying the transient.
This is the domain of psychoacoustics, which aims at understanding how human acoustical perception works (both in our ears and in our brain).
As a result, there are many different MP3 encoders available, each producing files of differing quality. Comparisons are widely available, so it is easy for a prospective user of an encoder to research the best choice. It must be kept in mind that an encoder that is proficient at encoding at higher bitrates (such as LAME, which is in widespread use for encoding at higher bitrates) is not necessarily as good at other, lower bitrates.
Decoding of MP3 Audio
Decoding, on the other hand, is carefully defined in the standard. Most decoders are "bitstream compliant", meaning that the uncompressed output they produce from a given MP3 file will be the same (within a specified degree of rounding tolerance) as the output specified mathematically in the standard document. The MP3 file has a standard format which is a frame consisting of 384, 576, or 1152 samples (depends on MPEG version and layer) and all the frames have associated header information(32 bits) and side information(9, 17, or 32 bytes, depending on MPEG version and stereo/mono).The header and side information help the decoder to decode the associated huffman encoded data correctly.
Therefore, for the most part, comparison of decoders is almost exclusively based on how computationally efficient they are (i.e., how much memory or CPU time they use in the decoding process).