📄 musicmatch.txt
字号:
$Id: musicmatch.txt,v 1.4 2000/09/09 23:02:37 eldamitri Exp $ MusicMatch (TM) tag format descriptionStatus of this document This document is a description of a deprecated tagging format. The information contained herein is not a specification; its intent is to interpret the format based on hundreds of examples. It also relies heavily on information obtained by others who have done similar investigations. It is not based on any official documentation of the format, as such documentation is not publicly available. Therefore the contents of this document may change to adjust for newly-discovered information, but the format itself is unlikely to change due to its deprecation. Distribution of this document is unlimited.Abstract This document describes the MusicMatch tagging format present in some digital audio files. This format, like other tagging specifications, provides a method for storing information about an audio file within itself to document its contents. This format was developed by MusicMatch and used exclusively by older versions of Jukebox, their popular, "all-in-one" MP3 application.1. Table of contents Status of this document Abstract 1. Table of contents 2. Introduction 3. Conventions in this document 4. Tagging format 4.1. Header 4.2. Image extension 4.3. Image binary 4.4. Unused 4.5. Version information 4.6. Audio meta-data 4.6.1. Single-line text fields 4.6.2. Non-text fields 4.6.3. Multi-line text fields 4.6.4. Internet addresses 4.6.5. Padding 4.7. Data offsets 4.8. Footer 5. Identifying and parsing a MusicMatch tag 6. Converting to ID3v2 7. Copyright 8. References 9. Author's Address2. Introduction The following document describes the structure of the tagging format used by MusicMatch (TM) Jukebox, prior to version 4.0 of that application. This program is a so-called "All-In-One" MP3 program and provides a CD-Ripper, WAV-to-MP3 converter, database, and MP3 player. The MusicMatch tagging format has gone through several incremental iterations in its format, although the basic structure has remained fairly constant throughout its history. The various formats of the MusicMatch tag have been tightly coupled with the version of Jukebox that created it. As such, this document will refer to the Jukebox version and tagging format version interchangeably. As of version 4.0, MusicMatch has deprecated the use of this format in their own Jukebox application, transitioning instead to ID3v2, an open standard for tagging digital audio. Unfortunately, despite repeated requests, MusicMatch has not provided to the public any documents describing this format, and the MusicMatch Jukebox is the only widely-distributed software application that can read and write these tags. As such, this text may not be completely accurate and is surely incomplete, but it covers enough to the format to enable one to write robust software to find and parse tags in this format. For example, the id3lib tagging library's MusicMatch parsing routines were written solely based on the information found in this document. However, the authors cannot be held responsible for any inaccuracies or any harm caused by using this information. One can assume that the specifition is unlikely to change, given MusicMatch's own abandonment of the format. It should also be noted that incoporating functionality into applications to write tags in this format is discouraged, as the format has been officially deprecated by MusicMatch themselves.3. Conventions in this document This document borrows heavily from specifications written by Martin Nillson, author of the ID3v2 tagging standard. Much of the structure, formatting, and other such conventions used in the ID3v2 specifications are carried over into this document. Text within "" is a text string exactly as it appears in a tag. Numbers preceded with $ are hexadecimal and numbers preceded with % are binary. $xx is used to indicate a byte with unknown content.4. Tag overview The MusicMatch Tagging Format was designed to store specific types of audio meta-data inside the audio file itself. As the format was used exclusively by the MusicMatch Jukebox application, it is used only with MPEG-1/2 layer III files encoded with that program. However, its tagging format is not inherently exclusive of other audio formats, and could conceivably be used with other types of encodings. MusicMatch tags were originally designed to come at the very end of MP3 files, after all of the MP3 audio frames. Starting with Jukebox version 3.1, the application became more ID3-friendly and started placing ID3v1 tags after the MusicMatch tag as well. In practice, since very few applications outside of the MusicMatch Jukebox are capable of reading and understanding this format, it is not unusual to find MusicMatch tags "buried" within mp3 files, coming before other types of tagging formats in a file, such as Lyrics3 or ID3v2.4.0. Such "relocations" are not uncommon, and therefore any software application that intends to find, read, and parse MusicMatch tags should be flexible in this endeavor, despite the apparent intentions of the original specification. Although various sections of a MusicMatch tag are fixed in length, other sections are not, and so tag lengths can vary from one file to another. A valid MusicMatch tag will be at least 8 kilobytes (8192 bytes) in length. Those tags with image data will often be much larger. The byte-order in 4-byte pointers and multibyte numbers for MusicMatch tags is least-significant byte (LSB) first, also known as "little endian". For example, $12345678 is encoded as $78 56 34 12. Overall tag structure: +-----------------------------+ | Header | | (256 bytes, OPTIONAL) | +-----------------------------+ | Image extension (4 bytes) | +-----------------------------+ | Image binary | | (var. length >= 4 bytes) | +-----------------------------+ | Unused (4 bytes) | +-----------------------------+ | Version info (256 bytes) | +-----------------------------+ | Audio meta-data | | (var. length >= 7868 bytes) | +-----------------------------+ | Data offsets (20 bytes) | +-----------------------------+ | Footer (48 bytes) | +-----------------------------+ This document will describe the various sections of the tag in the order listed above (that is, in the sequential order that they appear when reading the tag from beginning to end). However, due to the nature of the tag's format, in practice the tag's sections will often be parsed in the reverse order. A robust parsing algorithm will be suggested and described later in the document.4.1. Header An optional tag header often precedes the tag data in a MusicMatch tag. Although the rules that determine this header's required presence are unknown, the header is usually found in tag versions up to and including 2.50, and is usually lacking otherwise. Luckily, its format is rigid and therefore its presence is easy to determine. The data in the header are not vital to the correct parsing of the rest of the tag and can thus be discarded. The header is the only optional section in a MusicMatch tag. All other sections are required to consider the tag valid. The header section is always 256 bytes in length. It begins with three 10-byte subsections, and ends with 226 bytes of space ($20) padding. Each of the first three subsections contains an 8-byte ASCII text string followed by two bytes of null ($00) padding. The first subsection serves as a sync string: its 8-byte string is always "18273645". The second subsection's 8-byte string is the version of the Xing encoder used to encode the mp3 file. The last four bytes of this string are usually '0' ($30). An example of this string is "1.010000". The third and final 10-byte subsection is the version of the MusicMatch Jukebox used to encode the mp3 file. The last four bytes of this string are usually '0' ($30). An example of this string is "2.120000". Sync string "18273645" Null padding $00 00 Xing encoder version <8-byte numerical ASCII string> Null padding $00 00 MusicMatch version <8-byte numerical ASCII string> Null padding $00 00 Space padding 226 * $204.2. Image extension MusicMatch tags can contain at most one image. This first required section is the extension of the image when saved as a file (for example, "jpg" or "bmp"). This section is 4 bytes in length, and the data is padded with spaces ($20) if the extension doesn't use all 4 bytes (in practice, 3-byte extensions are the most prevalent). Likewise, tags without images have all spaces for this section (4 * $20). Picture extension $xx xx xx xx4.3. Image binary When an image is present in the tag, the image binary section consists of two fields. The first field is the size of the image data, in bytes. The second is the actual image data. Image size $xx xx xx xx Image data <binary data> If no image is present, the image binary section consists of exactly four null bytes ($00 00 00 00).4.4. Unused This section is never used, to the best of the author's knowledge. It is always 4 null ($00) bytes. Null padding $00 00 00 004.5. Version information This section of the tag has the exact same format as the header. Unlike the header, this section is required for the tag to be considered valid. Sync string "18273645" Null padding $00 00 Xing encoder version <8-byte numerical ASCII string> Null padding $00 00 MusicMatch version <8-byte numerical ASCII string> Null padding $00 00 Space padding 226 * $204.6. Audio meta-data The audio meta-data is the heart of the MusicMatch tag. It contains most of the pertinent information found in other tagging formats (song title, album title, artist, etc.) and some that are unique to this format (mood, prefernce, situation). In all versions of the MusicMatch format up to and including 3.00, this section is always 7868 bytes in length. All subsequent versions allowed three possible lengths for this section: 7936, 8004, and 8132 bytes. The conditions under which a particular length from these three possibilities was used is unknown. In all cases, this section is padded with dashes ($2D) to achieve this constant size. Due to the great number of fields in this portion of the tag, they are divided amongst the next four sections of the document: single-line text fields, non-text fields, multi-line text fields, and internet addresses. This clarification is somewhat arbitrary and somewhat inaccurate (some of the fields described as "non-text" are indeed ASCII strings). However, the clarification does allow for easier description of the meta-data as a whole. At any rate, the actual fields in this section of the tag appear sequentially in the order presented. 4.6.1. Single-line text fields The first group entries in this section of the tag are variable-length
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -