Exchange character set for use in UKMARC and MARC 21 records
The character set used in UKMARC records created by the British Library is in many respects the same as that used in MARC 21.
It is set out as follows:
1. The HEX value assigned to a character; where UKMARC and MARC 21 differ, the MARC 21 value is given as well (in bold red in the original layout).
2. The graphic representation of the particular character.
3. The description of the character in UKMARC.
4. The description of the character in MARC 21.
Because of its wider repertoire of characters, MARC 21 often has more specific descriptions than UKMARC, and therefore both are given. An asterisk * after the description refers to a note at the foot of the table, e.g. Greek small letter alpha*. Characters that are diacritics are identified by means of the symbol †.
Graphics to represent the end of record character (HEX value 1D), the end of field character (1E) and the subfield delimiter (1F) are determined by the user's system. Therefore, no graphic representation of these characters has been included.
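Since the graphics for these three control characters are left to the user's system, a display routine has to supply its own. The following is a minimal sketch of that idea; the placeholder strings and the function name `show_controls` are hypothetical choices made for illustration, not part of either format.

```python
# Hypothetical display graphics for the three control characters.
# Neither UKMARC nor MARC 21 prescribes these; a system picks its own.
DEFAULT_GRAPHICS = {
    0x1D: "<EOR>",  # end-of-record character / record terminator
    0x1E: "<EOF>",  # end-of-field character / field terminator
    0x1F: "<SF>",   # subfield delimiter
}


def show_controls(data: bytes, graphics=DEFAULT_GRAPHICS) -> str:
    """Return a printable string with control bytes replaced by graphics.

    All other bytes are passed through as single characters, which is
    only adequate for the plain ASCII range used in this illustration.
    """
    return "".join(graphics.get(b, chr(b)) for b in data)


print(show_controls(b"\x1faTitle\x1e\x1d"))
```

A system that prefers other symbols can pass its own dictionary as the second argument.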
For further information, please refer to The UKMARC Exchange Record Format and to the MARC 21 Specifications for Record Structure, Character Sets and Exchange Media.
HEX VALUE | GRAPHIC | UKMARC DESCRIPTION | MARC 21 DESCRIPTION | MARC 21 HEX VALUE (where different)
1D | [see above] | End-of-record character | Record terminator |
1E | [see above] | End-of-field character | Field terminator |
1F | [see above] | Subfield delimiter | Subfield delimiter |
20 | | Blank | Space |
21 | ! | Exclamation mark | Exclamation mark |
22 | " | Double prime | Quotation mark |
24 | $ | Dollar sign (as currency) | Dollar sign (as currency) |
25 | % | Percent sign | Percent sign |
26 | & | Ampersand | Ampersand |
27 | ' | Single prime | Apostrophe |
28 | ( | Left parenthesis | Opening parenthesis |
29 | ) | Right parenthesis | Closing parenthesis |
2A | * | Asterisk | Asterisk |
2B | + | Plus sign | Plus sign |
2C | , | Comma | Comma |
2D | - | Hyphen; minus sign | Hyphen; minus sign |
2E | . | Full stop; decimal point | Period; decimal point |
2F | / | Slash (solidus) | Slash (solidus) |
30 | 0 | Numeric | Digit zero |
31 | 1 | Numeric | Digit one |
32 | 2 | Numeric | Digit two |
33 | 3 | Numeric | Digit three |
34 | 4 | Numeric | Digit four |
35 | 5 | Numeric | Digit five |
36 | 6 | Numeric | Digit six |
37 | 7 | Numeric | Digit seven |
38 | 8 | Numeric | Digit eight |
39 | 9 | Numeric | Digit nine |
3A | : | Colon | Colon |
3B | ; | Semicolon | Semicolon |
3C | < | Less-than; left angle bracket | Less-than sign |
3D | = | Equals sign | Equals sign |
3E | > | Greater-than sign; right angle bracket | Greater-than sign |
3F | ? | Question mark | Question mark |
40 | @ | Commercial at sign | Commercial at sign |
41 | A | Upper case alphabetic | Latin capital letter A |
42 | B | Upper case alphabetic | Latin capital letter B |
43 | C | Upper case alphabetic | Latin capital letter C |
44 | D | Upper case alphabetic | Latin capital letter D |
45 | E | Upper case alphabetic | Latin capital letter E |
46 | F | Upper case alphabetic | Latin capital letter F |
47 | G | Upper case alphabetic | Latin capital letter G |
48 | H | Upper case alphabetic | Latin capital letter H |
49 | I | Upper case alphabetic | Latin capital letter I |
4A | J | Upper case alphabetic | Latin capital letter J |
4B | K | Upper case alphabetic | Latin capital letter K |
4C | L | Upper case alphabetic | Latin capital letter L |
4D | M | Upper case alphabetic | Latin capital letter M |
4E | N | Upper case alphabetic | Latin capital letter N |
4F | O | Upper case alphabetic | Latin capital letter O |
50 | P | Upper case alphabetic | Latin capital letter P |
51 | Q | Upper case alphabetic | Latin capital letter Q |
52 | R | Upper case alphabetic | Latin capital letter R |
53 | S | Upper case alphabetic | Latin capital letter S |
54 | T | Upper case alphabetic | Latin capital letter T |
55 | U | Upper case alphabetic | Latin capital letter U |
56 | V | Upper case alphabetic | Latin capital letter V |
57 | W | Upper case alphabetic | Latin capital letter W |
58 | X | Upper case alphabetic | Latin capital letter X |
59 | Y | Upper case alphabetic | Latin capital letter Y |
5A | Z | Upper case alphabetic | Latin capital letter Z |
5B | [ | Left square bracket | Opening square bracket |
5C | \ | Back slash | Reverse slash or reverse solidus |
5D | ] | Right square bracket | Closing square bracket |
5F | ß | Eszett | converts to 'ss' | 73 73
60 | # | Music sharp sign | Music sharp sign | C4
61 | a | Lower case alphabetic | Latin small letter a |
62 | b | Lower case alphabetic | Latin small letter b |
63 | c | Lower case alphabetic | Latin small letter c |
64 | d | Lower case alphabetic | Latin small letter d |
65 | e | Lower case alphabetic | Latin small letter e |
66 | f | Lower case alphabetic | Latin small letter f |
67 | g | Lower case alphabetic | Latin small letter g |
68 | h | Lower case alphabetic | Latin small letter h |
69 | i | Lower case alphabetic | Latin small letter i |
6A | j | Lower case alphabetic | Latin small letter j |
6B | k | Lower case alphabetic | Latin small letter k |
6C | l | Lower case alphabetic | Latin small letter l |
6D | m | Lower case alphabetic | Latin small letter m |
6E | n | Lower case alphabetic | Latin small letter n |
6F | o | Lower case alphabetic | Latin small letter o |
70 | p | Lower case alphabetic | Latin small letter p |
71 | q | Lower case alphabetic | Latin small letter q |
72 | r | Lower case alphabetic | Latin small letter r |
73 | s | Lower case alphabetic | Latin small letter s |
74 | t | Lower case alphabetic | Latin small letter t |
75 | u | Lower case alphabetic | Latin small letter u |
76 | v | Lower case alphabetic | Latin small letter v |
77 | w | Lower case alphabetic | Latin small letter w |
78 | x | Lower case alphabetic | Latin small letter x |
79 | y | Lower case alphabetic | Latin small letter y |
7A | z | Lower case alphabetic | Latin small letter z |
7B | ¡ | Inverted exclamation mark | Inverted exclamation mark | C6
7C | ¿ | Inverted question mark | Inverted question mark | C5
7D | | Greek small letter alpha* | Greek small letter alpha* | 1B 67 61 1B 73
7E | | Greek small letter beta* | Greek small letter beta* | 1B 67 62 1B 73
7F | | Greek small letter gamma* | Greek small letter gamma* | 1B 67 63 1B 73
A1 | | Upper case Polish letter L | also known as Latin capital letter L with stroke |
A2 | Ø | Upper case Scandinavian letter O | also known as Latin capital letter O with stroke |
A3 | Ð | Upper case Serbo-Croat D | Upper case D with crossbar or Latin capital letter D with stroke |
A4 | Þ | Upper case Icelandic thorn | also known as Latin capital letter thorn |
A5 | Æ | Upper case digraph AE | also known as Latin capital letter AE |
A6 | Œ | Upper case digraph OE | also known as Latin capital letter digraph OE |
A7 | | Miagkii Znak | Soft sign, prime or modifier letter prime |
A8 | · | Middle dot | Middle dot |
A9 | | Music flat sign | Music flat sign |
AE | | Hamza (Alif) | Alif, modifier letter right half ring |
B0 | | Ain | Ayn, modifier letter turned comma |
B1 | | Lower case Polish letter l | also known as Latin small letter l with stroke |
B2 | ø | Lower case Scandinavian letter o | also known as Latin small letter o with stroke |
B3 | | Lower case Serbo-Croat d | Lower case d with crossbar, Latin small letter d with stroke |
B4 | þ | Lower case Icelandic thorn | also known as Latin small letter thorn |
B5 | æ | Lower case digraph ae | also known as Latin small ligature ae |
B6 | œ | Lower case digraph oe | also known as Latin small ligature oe |
B7 | | Tverdyi Znak | Hard sign, double prime or modifier letter double prime |
B8 | | Lower case Turkish i | also known as Latin small letter dotless i |
B9 | £ | British pound sign | British pound sign |
BA | ð | Lower case Icelandic letter eth | also known as Latin small letter eth |
E0 | | High tone diacritic | Pseudo question mark, combining hook above |
E1 | ` | Grave accent † | Grave accent † |
E2 | | Acute accent † | Acute accent † |
E3 | ^ | Circumflex † | Circumflex † |
E4 | ~ | Tilde † | Tilde † |
E5 | | Macron † | Macron † |
E6 | | Breve † | Breve † |
E7 | | Dot above † | also known as Superior dot |
E8 | | Umlaut † (Diaeresis) | Umlaut † (Diaeresis) |
E9 | | Hacek † | also known as Combining caron |
EA | ° | Degree | Degree sign | C0
EB | | Ligature 1 or first half † | also known as Combining ligature left half |
EC | | Ligature 2 or right half † | also known as Combining ligature right half |
ED | | High comma off right † | also known as High comma off centre or above right |
EE | | Double acute accent † | Double acute accent † |
EF | | Candrabindu † | Candrabindu † |
F0 | ¸ | Cedilla † | Cedilla † |
F1 | | Hook right † | also known as ogonek |
F2 | . | Dot below † | Dot below † |
F3 | .. | Double dot below † | Double dot below, combining diaeresis below † |
F4 | o | Circle below † | also known as Combining ring below |
F5 | = | Double underscore † | also known as Combining double low line |
F6 | _ | Underscore † | also known as Combining low line |
F7 | | Hook left † | also known as Comma below |
F8 | | Rude † | Right cedilla, combining half ring below † |
FE | | High comma centre † | also known as Combining comma above |
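The rows in the table above where the MARC 21 value differs from the UKMARC value can be read as a byte-level substitution map. The following is a minimal sketch of such a conversion, assuming the input is a raw UKMARC byte string; the names `UKMARC_TO_MARC21` and `ukmarc_to_marc21` are hypothetical. It does not handle HEX 5E (which the notes say has no MARC 21 equivalent) or any field-specific usage such as the 008 field, and it leaves all other bytes unchanged.

```python
# UKMARC byte value -> MARC 21 byte sequence, copied from the table rows
# that list a different MARC 21 HEX value. Everything else passes through.
UKMARC_TO_MARC21 = {
    0x5F: bytes([0x73, 0x73]),                    # Eszett -> 'ss'
    0x60: bytes([0xC4]),                          # music sharp sign
    0x7B: bytes([0xC6]),                          # inverted exclamation mark
    0x7C: bytes([0xC5]),                          # inverted question mark
    0x7D: bytes([0x1B, 0x67, 0x61, 0x1B, 0x73]),  # Greek small letter alpha
    0x7E: bytes([0x1B, 0x67, 0x62, 0x1B, 0x73]),  # Greek small letter beta
    0x7F: bytes([0x1B, 0x67, 0x63, 0x1B, 0x73]),  # Greek small letter gamma
    0xEA: bytes([0xC0]),                          # degree sign
}


def ukmarc_to_marc21(data: bytes) -> bytes:
    """Substitute each differing UKMARC byte with its MARC 21 sequence."""
    out = bytearray()
    for b in data:
        out += UKMARC_TO_MARC21.get(b, bytes([b]))
    return bytes(out)


# The Eszett (5F) becomes the two bytes 73 73, i.e. 'ss'.
print(ukmarc_to_marc21(b"Stra\x5fe"))
```

A real converter would also need to decide what to do with HEX 5E and with context-dependent bytes; this sketch only covers the one-to-one rows.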
Notes
1. In MARC 21, Greek characters are placed in a separate character set. They are accessed by means of an Escape character and an ASCII graphic character. Because the Escape character is locking, all characters following it are designated as being part of the Greek character set. Therefore, it is necessary to unlock it by means of a follow-on Escape sequence in order to return to the Basic and Extended Latin set. For example, the Greek small letter alpha α is accessed as follows:
Escape character: 1B 67
ASCII graphic: 61
Follow-on Escape sequence: 1B 73
For further details see "Accessing alternate character sets" in the MARC 21 exchange character set specification.
2. In UKMARC, HEX value 5E represents a single dagger †, but this has no equivalent in MARC 21.
3. In MARC 21, HEX value 7C represents a vertical bar |, where a blank would be used in the UKMARC 008 field.
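The locking behaviour described in note 1 can be illustrated with a small decoder. This is a sketch under stated assumptions, not a full MARC 21 character set implementation: it recognises only the two escape sequences quoted in the note (1B 67 and 1B 73), maps only the three Greek letters listed in the table, and treats every other byte as a plain single character. The function name `decode_with_greek` is hypothetical.

```python
ESC = 0x1B
GREEK_ON = 0x67  # second byte of 1B 67: switch to the Greek character set
LATIN_ON = 0x73  # second byte of 1B 73: return to Basic and Extended Latin

# Only the three Greek letters that appear in the table are mapped.
GREEK = {0x61: "\u03b1", 0x62: "\u03b2", 0x63: "\u03b3"}


def decode_with_greek(data: bytes) -> str:
    """Decode bytes, honouring the locking Greek escape sequence."""
    out = []
    in_greek = False
    i = 0
    while i < len(data):
        b = data[i]
        # An escape sequence changes the active set and emits nothing.
        if b == ESC and i + 1 < len(data) and data[i + 1] in (GREEK_ON, LATIN_ON):
            in_greek = data[i + 1] == GREEK_ON
            i += 2
            continue
        if in_greek and b in GREEK:
            out.append(GREEK[b])
        else:
            # Simplification: other bytes are emitted as single characters.
            out.append(chr(b))
        i += 1
    return "".join(out)


# 61 is 'a' in the Latin set but alpha while the Greek set is locked in.
print(decode_with_greek(b"a" + bytes.fromhex("1b6761621b73") + b"a"))
```

Because the escape is locking, the bytes 61 and 62 between the two escape sequences are both read as Greek, and the final 61 after 1B 73 is read as a Latin letter again.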