- Source: MPEG-4 Part 3
MPEG-4 Part 3 or MPEG-4 Audio (formally ISO/IEC 14496-3) is the third part of the ISO/IEC MPEG-4 international standard developed by the Moving Picture Experts Group. It specifies audio coding methods. The first version of ISO/IEC 14496-3 was published in 1999.
MPEG-4 Part 3 consists of a variety of audio coding technologies: lossy speech coding (HVXC, CELP), general audio coding (AAC, TwinVQ, BSAC), lossless audio compression (MPEG-4 SLS, Audio Lossless Coding, MPEG-4 DST), a Text-To-Speech Interface (TTSI), Structured Audio (using SAOL, SASL, MIDI) and many additional audio synthesis and coding techniques.
MPEG-4 Audio does not target a single application such as real-time telephony or high-quality audio compression. It applies to every application that requires advanced sound compression, synthesis, manipulation, or playback.
MPEG-4 Audio is a new type of audio standard that integrates many different types of audio coding: natural sound and synthetic sound, low-bitrate delivery and high-quality delivery, speech and music, complex soundtracks and simple ones, traditional content and interactive content.
Versions
Subparts
MPEG-4 Part 3 contains the following subparts:
Subpart 1: Main (list of Audio Object Types, Profiles, Levels, interface to ISO/IEC 14496-1, MPEG-4 Audio transport stream, etc.)
Subpart 2: Speech coding – HVXC (Harmonic Vector eXcitation Coding)
Subpart 3: Speech coding – CELP (Code Excited Linear Prediction)
Subpart 4: General Audio Coding (GA) (Time/Frequency Coding) – AAC, TwinVQ, BSAC
Subpart 5: Structured Audio (SA)
Subpart 6: Text to Speech Interface (TTSI)
Subpart 7: Parametric Audio Coding – HILN (Harmonic and Individual Line plus Noise)
Subpart 8: Technical description of parametric coding for high quality audio (SSC, Parametric Stereo)
Subpart 9: MPEG-1/MPEG-2 Audio in MPEG-4
Subpart 10: Technical description of lossless coding of oversampled audio (MPEG-4 DST – Direct Stream Transfer)
Subpart 11: Audio Lossless Coding (ALS)
Subpart 12: Scalable Lossless Coding (SLS)
MPEG-4 Audio Object Types
MPEG-4 Audio includes a system for handling a diverse group of audio formats in a uniform manner. Each format is assigned a unique Audio Object Type to represent it. The object type distinguishes between different coding methods and directly determines the MPEG-4 tool subset required to decode a specific object. The MPEG-4 profiles are based on the object types, and each profile supports a different list of object types.
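As a rough illustration of how an object type identifies the coding method, the sketch below reads the Audio Object Type from the first two bytes of a decoder configuration header. The header layout (5 bits object type, 4 bits sampling frequency index, 4 bits channel configuration) is not described in this article and is assumed here. The table lists only a small subset of object types, and the function name `parse_audio_config` is hypothetical.

```python
# A small subset of Audio Object Type IDs (assumed from ISO/IEC 14496-3).
AUDIO_OBJECT_TYPES = {
    1: "AAC Main",
    2: "AAC LC",
    3: "AAC SSR",
    4: "AAC LTP",
    5: "SBR",
    7: "TwinVQ",
    8: "CELP",
    9: "HVXC",
    12: "TTSI",
    22: "ER BSAC",
    29: "PS",
}

# Sampling rates addressed by the 4-bit sampling frequency index (0..12).
SAMPLING_FREQUENCIES = [
    96000, 88200, 64000, 48000, 44100, 32000, 24000,
    22050, 16000, 12000, 11025, 8000, 7350,
]


def parse_audio_config(data: bytes) -> dict:
    """Decode the leading fields of a two-byte configuration header."""
    if len(data) < 2:
        raise ValueError("need at least two bytes")
    bits = int.from_bytes(data[:2], "big")
    object_type = bits >> 11            # top 5 bits
    freq_index = (bits >> 7) & 0x0F     # next 4 bits
    channel_config = (bits >> 3) & 0x0F  # next 4 bits
    # Escape values (object type 31, frequency index 15) extend the header;
    # this sketch does not handle them.
    if object_type == 31 or freq_index == 15:
        raise ValueError("escape codes are not handled in this sketch")
    rate = (SAMPLING_FREQUENCIES[freq_index]
            if freq_index < len(SAMPLING_FREQUENCIES) else None)
    return {
        "object_type": object_type,
        "name": AUDIO_OBJECT_TYPES.get(object_type, "unknown"),
        "sampling_rate": rate,
        "channel_config": channel_config,
    }


if __name__ == "__main__":
    print(parse_audio_config(bytes([0x12, 0x10])))
```

A decoder would use the returned object type to pick the tool subset it needs, and reject the stream if that object type is not supported.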
Audio Profiles
The MPEG-4 Audio standard defines several profiles. These profiles are based on the object types, and each profile supports a different list of object types. Each profile may also have several levels, which limit some parameters of the tools present in a profile. These parameters are usually the sampling rate and the number of audio channels decoded at the same time.
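The relationship between profiles, object types and levels can be sketched as a simple capability check. The profile contents below are a simplified subset, and the level names and limits are illustrative values rather than figures copied from the standard. `Level`, `EXAMPLE_LEVELS` and `can_decode` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Level:
    max_sampling_rate: int  # Hz
    max_channels: int


# Simplified subset: each profile is the set of object types it supports.
PROFILES = {
    "AAC Profile": {"AAC LC"},
    "High Efficiency AAC Profile": {"AAC LC", "SBR"},
    "High Efficiency AAC v2 Profile": {"AAC LC", "SBR", "PS"},
}

# Illustrative level limits (assumed values, not taken from the standard).
EXAMPLE_LEVELS = {
    "low": Level(max_sampling_rate=24000, max_channels=2),
    "high": Level(max_sampling_rate=48000, max_channels=5),
}


def can_decode(profile, level, object_types, sampling_rate, channels):
    """Return True if a stream fits within the given profile and level."""
    supported = PROFILES[profile]
    limits = EXAMPLE_LEVELS[level]
    return (
        set(object_types) <= supported
        and sampling_rate <= limits.max_sampling_rate
        and channels <= limits.max_channels
    )


if __name__ == "__main__":
    print(can_decode("AAC Profile", "low", ["AAC LC"], 24000, 2))
    print(can_decode("AAC Profile", "low", ["AAC LC", "SBR"], 24000, 2))
```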
Audio storage and transport
There is no standard for transporting elementary streams over a channel, because the delivery requirements of the broad range of MPEG-4 applications are too varied to characterize with a single solution.
The capabilities of a transport layer and the communication between transport, multiplex, and demultiplex functions are described in the Delivery Multimedia Integration Framework (DMIF) in ISO/IEC 14496-6. A wide variety of delivery mechanisms exist below this interface, such as the MPEG transport stream and the Real-time Transport Protocol (RTP).
Transport over the Real-time Transport Protocol is defined in RFC 3016 (RTP Payload Format for MPEG-4 Audio/Visual Streams) and RFC 3640 (RTP Payload Format for Transport of MPEG-4 Elementary Streams). Related signalling is covered by RFC 4281 (The Codecs Parameter for "Bucket" Media Types) and RFC 4337 (MIME Type Registration for MPEG-4).
LATM and LOAS were defined for natural audio applications that do not require sophisticated object-based coding or other functions provided by MPEG-4 Systems.
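To give a feel for the codecs parameter mentioned above, the sketch below splits a value of the commonly seen form `mp4a.40.2` into its parts. The syntax assumed here (a hexadecimal object type indication followed by a decimal Audio Object Type) is not spelled out in this article, and `parse_mp4a_codec` is a hypothetical helper.

```python
def parse_mp4a_codec(codec: str) -> tuple:
    """Split an 'mp4a.OO.A' codecs value into (indication, object_type).

    Assumes OO is a hexadecimal object type indication and A is a
    decimal Audio Object Type; other forms are rejected.
    """
    parts = codec.strip().split(".")
    if len(parts) != 3 or parts[0] != "mp4a":
        raise ValueError(f"unsupported codecs value: {codec!r}")
    indication = int(parts[1], 16)
    object_type = int(parts[2], 10)
    return indication, object_type


if __name__ == "__main__":
    indication, object_type = parse_mp4a_codec("mp4a.40.2")
    print(hex(indication), object_type)
```

Under that assumption, `mp4a.40.2` would denote MPEG-4 Audio with object type 2 (AAC LC), and `mp4a.40.5` would denote object type 5 (SBR).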
Bifurcation in the AAC technical standard
The Advanced Audio Coding in MPEG-4 Part 3 (MPEG-4 Audio) Subpart 4 was enhanced relative to the previous standard MPEG-2 Part 7 (Advanced Audio Coding), in order to provide better sound quality for a given encoding bitrate.
It is assumed that any Part 3 and Part 7 differences will be ironed out by the ISO standards body in the near future to avoid the possibility of future bitstream incompatibilities. At present there are no known player or codec incompatibilities due to the newness of the standard.
The MPEG-2 Part 7 standard (Advanced Audio Coding) was first published in 1997 and offers three default profiles: Low Complexity profile (LC), Main profile and Scalable Sampling Rate profile (SSR).
The MPEG-4 Part 3 Subpart 4 (General Audio Coding) combined the profiles from MPEG-2 Part 7 with Perceptual Noise Substitution (PNS) and defined them as Audio Object Types (AAC LC, AAC Main, AAC SSR).
HE-AAC
High-Efficiency Advanced Audio Coding is an extension of AAC LC using spectral band replication (SBR) and Parametric Stereo (PS). It is designed to increase coding efficiency at low bitrates by using a partial parametric representation of the audio.
AAC-SSR
AAC Scalable Sample Rate was introduced by Sony to the MPEG-2 Part 7 and MPEG-4 Part 3 standards. It was first published in ISO/IEC 13818-7, Part 7: Advanced Audio Coding (AAC) in 1997. The audio signal is first split into 4 bands using a 4-band polyphase quadrature filter (PQF) bank. These 4 bands are then further split using MDCTs with a size k of 32 or 256 samples. This is similar to normal AAC LC, which uses MDCTs with a size k of 128 or 1024 directly on the audio signal.
The advantage of this technique is that short block switching can be done separately for every PQF band. High frequencies can therefore be encoded using a short block to enhance temporal resolution, while low frequencies can still be encoded with high spectral resolution. However, due to aliasing between the 4 PQF bands, coding efficiency around (1, 2, 3) × fs/8 is worse than with normal MPEG-4 AAC LC.
MPEG-4 AAC-SSR is very similar to ATRAC and ATRAC-3.
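The block sizes and band boundaries described above can be checked with a little arithmetic. This is a minimal sketch that assumes four equal-width PQF bands covering 0 to fs/2. `ssr_layout` is a hypothetical helper, not part of any codec API.

```python
PQF_BANDS = 4


def ssr_layout(fs_hz: float, long_block: bool = True):
    """Return (total MDCT size per block, interior PQF band edges in Hz)."""
    per_band = 256 if long_block else 32   # MDCT size k per PQF band
    total = PQF_BANDS * per_band           # matches AAC LC: 1024 or 128
    band_width = fs_hz / (2 * PQF_BANDS)   # each band spans fs/8
    edges = [k * band_width for k in range(1, PQF_BANDS)]
    return total, edges


if __name__ == "__main__":
    for long_block in (True, False):
        print(ssr_layout(48000, long_block))
```

At a 48 kHz sample rate the interior band edges fall at 6, 12 and 18 kHz, which are the (1, 2, 3) × fs/8 frequencies where aliasing reduces coding efficiency.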
Why AAC-SSR was introduced
The idea behind AAC-SSR was not only the advantage listed above, but also the possibility of reducing the data rate by removing 1, 2 or 3 of the upper PQF bands. A very simple bitstream splitter can remove these bands and thus reduce the bitrate and sample rate.
Example:
4 subbands: bitrate = 128 kbit/s, sample rate = 48 kHz, f_lowpass = 20 kHz
3 subbands: bitrate ~ 120 kbit/s, sample rate = 48 kHz, f_lowpass = 18 kHz
2 subbands: bitrate ~ 100 kbit/s, sample rate = 24 kHz, f_lowpass = 12 kHz
1 subband: bitrate ~ 65 kbit/s, sample rate = 12 kHz, f_lowpass = 6 kHz
Note: although possible, the resulting quality is much worse than typical
for this bitrate. So for normal 64 kbit/s AAC LC a bandwidth of 14–16 kHz is
achieved by using intensity stereo and reduced NMRs. This degrades audible quality
less than transmitting 6 kHz bandwidth with perfect quality.
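The bandwidth figures in the example follow from the PQF band width of fs/8. The sketch below computes the upper band edge that remains after dropping upper bands, together with the lowest sample rate that could still represent it. It assumes four equal-width bands, and `strip_upper_bands` is a hypothetical name. The bitrates in the example are not modelled.

```python
def strip_upper_bands(fs_hz: float, bands_kept: int):
    """Return (remaining bandwidth in Hz, minimum sample rate in Hz)."""
    if not 1 <= bands_kept <= 4:
        raise ValueError("bands_kept must be between 1 and 4")
    band_width = fs_hz / 8              # width of one PQF band
    bandwidth = bands_kept * band_width
    min_sample_rate = 2 * bandwidth     # Nyquist lower bound
    return bandwidth, min_sample_rate


if __name__ == "__main__":
    for kept in (4, 3, 2, 1):
        print(kept, strip_upper_bands(48000, kept))
```

The result is only a lower bound: the example above keeps a 48 kHz sample rate for three subbands, and the 20 kHz low-pass listed for four subbands is below the 24 kHz band edge.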
BSAC
Bit Sliced Arithmetic Coding is an MPEG-4 standard (ISO/IEC 14496-3 subpart 4) for scalable audio coding. BSAC uses an alternative noiseless coding to AAC, with the rest of the processing being identical to AAC. This support for scalability allows for nearly transparent sound quality at 64 kbit/s and graceful degradation at lower bit rates. BSAC performs best in the range of 40 kbit/s to 64 kbit/s, though it operates in the range of 16 kbit/s to 64 kbit/s. The AAC-BSAC codec is used in Digital Multimedia Broadcasting (DMB) applications.
Licensing
In 2002, the MPEG-4 Audio Licensing Committee selected the Via Licensing Corporation as the Licensing Administrator for the MPEG-4 Audio patent pool.
See also
TwinVQ – one of the object types defined in MPEG-4 Audio version 1
MPEG-4 Part 2
MPEG-4 Part 14 container format (MP4)
Digital rights management
Advanced Audio Coding (AAC)
ISO/IEC JTC 1/SC 29
References
External links
Apple: MPEG-4: AAC
"AAC" (VideoLAN WIKI)
EBU subjective listening tests on low-bitrate audio codecs Archived 2007-02-16 at the Wayback Machine
AAC radio stations – Online radio stations in AAC format
Tuner2 – Directory of radio stations in AAC+ format at various bitrates
RadioFeeds UK & Ireland – Page containing plenty of terrestrial stations webcasting in AAC+ format.
Results of 64 kbit/s Listening Test A page comparing codecs including HE-AAC @64 kbit/s by listening tests. (Page is offline)
Official MPEG web site
RFC 3016 – RTP Payload Format for MPEG-4 Audio/Visual Streams
RFC 3640 – RTP Payload Format for Transport of MPEG-4 Elementary Streams
RFC 4281 – The Codecs Parameter for "Bucket" Media Types
RFC 4337 – MIME Type Registration for MPEG-4