<<

Exchange set for use in UKMARC and MARC 21 records

The character set, used in UKMARC records created by the British Library, is in many respects the same as that used in MARC 21.

It is set out as follows:

1. The HEX value assigned to a character; where UKMARC and MARC 21 differ, MARC 21 values are shown in bold red 2. The graphic representation of the particular character 3. The description of the character in UKMARC 4. The description of the character in MARC 21.

Because of its wider repertoire of characters, MARC 21 often has more specific descriptions than UKMARC, and therefore both are given. An * after the description refers to a at the foot of the table, .. Greek small letter alpha*. Characters that are are identified by means of the †.

Graphics to represent the end of record character (HEX value 1D), the end of field character (1E) and the subfield (1F) are determined by the user' system. Therefore, no graphic representation of these characters has been included.

For further information, please refer to The UKMARC Exchange Record Format and to the MARC 21 Specifications for Record Structure, Character Sets and Exchange Media.

GRAPHIC DESCRIPTION HEX VALUE UKMARC MARC 21 1D [see above] End-of-record character Record terminator 1E [see above] End-of-field character Field terminator 1F [see above] Subfield delimiter Subfield delimiter 20 Blank 21 ! Exclamation mark 22 " Double 24 $ Dollar sign (as currency) Dollar sign (as currency) 25 % Percent sign 26 & Ampersand 27 ' Single prime 28 ( Left parenthesis Opening parenthesis 29 ) Right parenthesis Closing parenthesis 2A * Asterisk Asterisk 2B + Plus sign Plus sign 2C , Comma 2D - ; minus sign Hyphen; minus sign 2E . ; decimal point Period; decimal point 2F / (solidus) Slash (solidus) 30 0 Numeric Digit zero 31 1 Numeric Digit one 32 2 Numeric Digit two 33 3 Numeric Digit three 34 4 Numeric Digit four 35 5 Numeric Digit five 36 6 Numeric Digit six 37 7 Numeric Digit seven 38 8 Numeric Digit eight 39 9 Numeric Digit nine 3A : Colon 3B ; Semicolon 3C < Less-than; left angle Less-than sign bracket 3D = Equals sign Equals sign 3E > Greater-than sign; right Greater-than sign angle 3F ? Question mark 40 @ Commercial Commercial at sign 41 A Upper case alphabetic Latin capital letter A 42 Upper case alphabetic Latin capital letter B 43 Upper case alphabetic Latin capital letter C 44 Upper case alphabetic Latin capital letter D 45 E Upper case alphabetic Latin capital letter E 46 Upper case alphabetic Latin capital letter F 47 G Upper case alphabetic Latin capital letter G 48 Upper case alphabetic Latin capital letter H 49 I Upper case alphabetic Latin capital letter I 4A Upper case alphabetic Latin capital letter J 4B Upper case alphabetic Latin capital letter K 4C Upper case alphabetic Latin capital letter L 4D Upper case alphabetic Latin capital letter M 4E Upper case alphabetic Latin capital letter N 4F Upper case alphabetic Latin capital letter O 50 P Upper case alphabetic Latin capital letter P 51 Upper case alphabetic Latin capital letter Q 52 Upper case alphabetic Latin capital letter R 53 S Upper case alphabetic Latin capital letter S 54 Upper case alphabetic Latin capital letter T 55 Upper case alphabetic Latin capital letter U 56 Upper case alphabetic Latin capital letter V 57 Upper case alphabetic Latin capital letter W 58 Upper case alphabetic Latin capital letter X 59 Upper case alphabetic Latin capital letter Y 5A Upper case alphabetic Latin capital letter Z 5B [ Left square bracket Opening square bracket 5C \ Back slash Reverse slash or reverse solidus 5D ] Right square bracket Closing square bracket 5F ß Eszett converts to '' 73 73 60 # Music sharp sign Music sharp sign C4 61 a Lower case alphabetic Latin small letter a 62 b Lower case alphabetic Latin small letter b 63 c Lower case alphabetic Latin small letter c 64 d Lower case alphabetic Latin small letter d 65 e Lower case alphabetic Latin small letter e 66 f Lower case alphabetic Latin small letter f 67 g Lower case alphabetic Latin small letter g 68 h Lower case alphabetic Latin small letter h 69 i Lower case alphabetic Latin small letter i 6A j Lower case alphabetic Latin small letter j 6B k Lower case alphabetic Latin small letter k 6C l Lower case alphabetic Latin small letter l 6D m Lower case alphabetic Latin small letter m 6E n Lower case alphabetic Latin small letter n 6F o Lower case alphabetic Latin small letter o 70 p Lower case alphabetic Latin small letter p 71 q Lower case alphabetic Latin small letter q 72 r Lower case alphabetic Latin small letter r 73 s Lower case alphabetic Latin small letter s 74 t Lower case alphabetic Latin small letter t 75 u Lower case alphabetic Latin small letter u 76 v Lower case alphabetic Latin small letter v 77 w Lower case alphabetic Latin small letter w 78 x Lower case alphabetic Latin small letter x 79 y Lower case alphabetic Latin small letter y 7A z Lower case alphabetic Latin small letter z 7B ¡ Inverted exclamation mark Inverted exclamation mark C6 7C ¿ Inverted question mark Inverted question mark C5 7D Greek small letter alpha* Greek small letter alpha* 1B 67 61 1B 73 7E Greek small letter beta* Greek small letter beta* 1B 67 62 1B 73 7F Greek small letter Greek small letter 1B 67 63 gamma* gamma* 1B 73 A1 Upper case Polish letter L also known as Latin capital letter L with stroke A2 Ø Upper case Scandinavian also known as Latin capital letter O letter L with stroke A3 Ð Upper case Serbo-Croat D Upper case D with crossbar or Latin capital letter D with stroke A4 Þ Upper case Icelandic thorn also known as Latin capital letter thorn A5 Æ Upper case digraph AE also known as Latin capital letter AE A6 Œ Upper case digraph OE also known as Latin capital letter digraph OE A7 Miagkii Znak Soft sign, prime or

modifier letter prime A8 · Middle Middle dot A9 Music flat sign Music flat sign

AE Hamza (Alif) Alif, modifier letter right

half B0 Ain Ayn, modifier letter turned

comma B1 Lower case Polish letter l also known as Latin small

letter l with stroke B2 ø Lower case Scandanavian also known as Latin small letter o letter o with stroke B3 Lower case Serbo-Croat d Lower case d with

crossbar, Latin small letter d with stroke B4 þ Lower case Icelandic thorn also known as Latin small letter thorn B5 æ Lower case digraph ae also known as Latin small ae B6 œ Lower case digraph oe also known as Latin small ligature oe B7 Tverdyi Znak Hard sign, double prime or

modifier letter double prime B8 Lower case Turkish i also known as Latin small

letter dotless i B9 £ British pound sign British pound sign BA ð Lower case Icelandic letter also known as Latin small eth letter eth E0 High tone diacritic Pseudo question mark,

combining above E1 ` † Grave accent † E2 † Acute accent †

E3 ^ † Circumflex † E4 ~ † Tilde † E5 † Macron †

E6 † Breve †

E7 Dot above † also known as Superior

dot E8 Umlaut † () Umlaut † (Diaeresis)

E9 Hacek † also known as Combining

EA ° Degree Degree sign C0 EB Ligature 1 or first half † also known as Combining

ligature left half EC Ligature 2 or right half † also known as Combining

ligature right half ED High comma off right † also known as High

comma off centre or above right EE † Double acute accent †

EF Candrabindu † Candrabindu †

F0 ¸ † Cedilla † F1 Hook right † also known as

F2 . Dot below † Dot below † F3 .. Double dot below † Double dot below, combining diaeresis below † F4 Circle below † also known as Combining o ring below F5 Double † also known as Combining = double low line F6 Underscore † also known as Combining _ low line F7 Hook left † also known as Comma below

F8 Rude † Right cedilla, combining half ring below †

FE High comma centre † also known as Combining

comma above

Notes

1. In MARC 21 Greek characters are placed in a separate character set. They are accessed by means of an Escape character and an ASCII graphic character. Because the Escape character is locking, all characters following it are designated as being part of the Greek character set. Therefore, it is necessary to unlock it by means of a follow-on Escape sequence in order to return to the Basic and Extended Latin set. For example, the Greek small letter alpha a is accessed as follows:

Escape character 1B 67 ASCII graphic 61 Follow-on Escape sequence 1B 73

For further details see "Accessing alternate character sets" in the MARC 21 exchange character set specification.

2. In UKMARC, HEX value 5E represents a single †, but this has no equivalent in MARC 21. 3. In MARC 21, HEX value 7C represents a vertical |, where a blank would be used in the UKMARC 008 field.