IAEA-INIS-8(Rev.1) INIS: PAPER TAPE SPECIFICATIONS AND RECORD FORMAT

INTERNATIONAL ATOMIC ENERGY AGENCY. VIENNA, FEBRUARY 1971 Other reports in the INIS "Reference Series available in February 1971*

IAEA-INIS-(Rev.l) INIS: Descriptive Cataloguing Rules IAEA-INIS-2(Rev.l) INIS: Descriptive Cataloguing Samples IAEA-INIS-3(Rev.l) INIS: Subject Categories and Scope Descriptions IAEA-INIS-4(Rev.) IKTS: Instructions for Submitting Abstracts IAEA-lNlS-5(Rev.l) INIS: Terminology and Codes for Countries and International Organizations IAEA-INIS-6(Rev.2) INIS: Authority List for Corporate Entries IAEA-INIS-7(Rev.l) INIS: Magnetic and Punched Paper Tape Codes and Character Sets IAEA-INIS-9(Rev.l) INIS: Magnetic Tape Specifications and Record Format IAEA-INIS-1O(Rev.O) INIS: Transliteration Rules ior Selected Non-Roman Characters IAEA-INIS-11(Rev. О) INTS* Authority List for Journal Titles IAEA-INIS-12(Re . 0) INIS: Manual for Indexing IAEA-INIS-13(Rev.0) INIS: Thesaurus IAEA-lNlS-13A(Rev. 0) INIS: Terminology Charts

Stocks of the following forms may be ordered by centres participating in INIS:

INIS Form l(Rev,2) (INIS Worksheet) INIS Form 2(Rev/l) (INIS Abstract)

Organizations participating in the INIS network may obtain forms and reasonable quantities of reports in the INIS Reference Series on application to:

INIS Section International Atomic Energy Agency P.O. Box 590 A-1011 Vienna Austria

Other interested parties may order copies of these reports in accordance with the instructions and prices given on the inside back cover. INIS: PAPER TAPE SPECIFICATIONS AND RECORD FORMAT

TABLE OF CONTENTS

Page PREFACE 5

1. NON—PRINTABLE CHARACTERS , , 1.1 Notational Conventions for Non-printable Characters .... 7

2. FORMT OF BIBLIOGRAPHIC RECORDS 9

2.1 Order of Data Elements 9 2.2 General Format of Data Elements 9 2.3 Continuation Lines 9 2.4 Word Breaking at End of Line 9 2.5 Punctuation 11 2.6 Subject Descriptors (Section 3 of INIS Worksheet) 11 2.7 Omitted Data Elements 12

3. WORKSHEET HEADER 13

3.1 First Worksheet л 13 3.2 Subsequent Worksheets 14

4. SPECIAL CONTROL SEQUENCES 15 4.1 Beginning of Record 15 4.2 Beginning of Data Element 15 4.3 Continuation Line 15 4.4 End of Record 15 4.5 Null Sequences 16

5. ENCODING OF SPECIAL CHARACTERS 17 5.1 Encoding of Group 1 Characters 18 5.2 Encoding of Group 2 Characters (superscripts and subscripts) , .... l8 5.3 Summary of Encoding Rules » 20

6. QUALITY AND DIMENSIONS OF PAPER TAPE 21

7. INPUT BATCHES 22

TABLE 1. COMPARISON OF INIS VERSUS PAPER-TAPE CHARACTER SETS ... 23

TABLE 2. TRANSCRIPTION OF SPECIAL CHARACTERS 24

APPENDIX 1. INIS FLEXOWRITER KEYBOARD ARRANGEMENT 25

- 3 -

PREFACE

As recommended by the Panel on the Preparation of the Final Proposal for INIS, Vienna, 28-51 October 1968, INIS will accept machine-readable input in either of two media: magnetic tape or punched paper tape.

This report describes the rules for the preparation of machine-readable INIS input on punched paper tape. The TNIS character set, described in INIS: Magnetic and Punched Paper Tape Codes and Character Sets (TAEA-INIS-7), consists of 119 different graphics and a "space" character. The ideal paper tape device would therefore be one that permits the direct encoding of the INIS character set. However, existing paper tape devices, although in theory able to produce I28 different punched codes (8 channels including parity), are not, in general, able to display 128 different graphics. (The TNIS Friden Plexowriter Model 2^03i keyboard arrangement is shown in Appendix 1 ) •

This means that a number of characters in the INIS character set cannot be directly encoded, but need special handling. Paper tape devices, however, have two major advantages as compared to punched-card devices : a) the ability to encode directly a significantly larger number of characters, and ) the ability to produce a hard copy which can be used immediately for proofreading.

Ease of encoding and of proofreading has, in fact, been the guiding principle in the design of the rules contained in this report. An attempt has been made to achieve these goals as follows:

- 5 - 1) A paper-tape character set (a subset of the INIS character set) was chosen so that the most frequent characters are directly encodable, and a coding scheme was devised for the remaining characters which is both easy to memorize and to proofread.

2) The encoding rules were devised so that the hard copy always reflects a "true image" of the information actually recorded on the punched paper tape.

- 6 - 1. NON-PRINTABLE CHARACTERS

It is essential that the hard copy produced by the paper tape device gives a "true linage" of the information actually recorded on the paper tape; the hard copy must convey to the proofreader all the information recorded. This is trivial, of course, for text characters: if the hard copy reads .. "reactor", it is certain that the paper tape will actually contain the letters -e-a---o-r.

In the encoding process, however, the operator uses special control keys to which no printable graphics are assigned. Therefore there will be characters which are (and must) be present on the paper tape, but which are not visible on the hard copy.

The number of such characters has been limited to four, viz.: tabulator (TAB), carriage return (CR), upper-case shift (UC) and lower­ case shift (LC).

All other control characters are "printable" characters (?.g.@. #). The presence of a CR code on the tape can be inferred by the fact that a new line must have been initiated by depressing the CR key. The presence of UC and LC codes can also, of course, be deduced by the text characters actually printed.

The printed output does not, however, give unequivocal indications of the use of the TAB key since the same effect could be produced by multiple depressions of the "space" key. It is, therefore, most important to instruct keypunchers to use the TAB key where demanded by the instructions. Multiple use of the "space" key would in these cases result in erroneous interpretations of the records.

For the purpose of INIS records the tabulator stop is set at least 6 positions to the right of the left margin (the reason for this will be clear from the sequel).

- 7 - 1.1 Notational conventions for non-printable characters

To improve the readability of this report the following conventions have been adopted:

1) In the description of the general forms of bibliographic elements, the non-printable control characters are indicated by R, A, к, 4 in order to distinguish them from actual в text characters.

2) In the examples, a carriage return (CR) is indicated by the i=.

- 8 - 2. FORMAT OF BIBLIOGRAPHIC RECORDS

2.1 Order of data elements

Data elements are punched in the order in which they are given on the worksheet.

2.2 General format of data elements

Each data element must begin on a new line. The general form of a data element is as follows: T TAG A TEXT? в R 20 Continuation lines

If a data element extends beyond more than one line, each continuation line must begin with a tabulation (TAB) immediately followed by the text, i.e. the tag of the data element must not be repeated in each line. Note, however, that for data elements with a tag which is less than 100 no continuation line is allowed.

Example :

left margin tab setting Ï I 200 The application of spectral data from ^ isotopically substituted molecules to the determination of anharmonic potential energy constants 7*= 300 BNL--9684 ft

2.4 Word breaking at end of line

The length of a line of text is at the discretion of the operator. Whenever it Is necessary to break the text at the end of the line a simple convention must be followed, so that the computer can correctly handle continuation lines.

- 9 - As a rule the computer will automatically insert a space immediately before the first text character of a continuation line except in the following two cases: a) when the last text character of the previous line is a-hyphen (-); in this case the computer will assume that a hyphenated word has been broken between two lines; and b) when the last text character of the previous line is an equal sign (=); in this case the computer will delete the equal sign so that, in the final INIS record, the first text character of the continuation line will follow immediately the character preceding the equal sign. (Note that in order to have an actual equal sign at the end of the line, two equal signs are required, since one will be deleted by the computer).

Examples : left margin tab setting i I 200 Automatic machine for milling^ fins on cylindrical par=^

Computer printout: Automatic machine for milling fins on cylindrical parts

200 Triple-axis neutron spectrometer-Ф computer interfaced

Computer printout: Triple-axis neutron spectrometer-computer interface

200 Collective effects in the phosphorus-31= ф (helium-3, proton)sulfur-Ф 31 reaction^

Computer printout: Collective effects in the phosphorus-J51(helium-3, proton)sulfur-31 reaction

200 Cross-sections for A=242 and v== ф 2200 / ф

Computer printout: Cross-sections for A=242 and v=2200 m/s

- 10 - 2.5 Punctuation

Since it Is Impossible for the computer to distinguish between a grammatical and the full stop used at the end of an abbreviated word, the following rule must be strictly followed to ensure consistency in the computer produced indexes: Do not enter a full stop at the end of a data element unless the last word is an abbreviated word or an author's initial.

Examples :

Phillips, L.P.; Scalsky, E.. Symposium on Accelerator Radiation Dosimetry Brookhaven National Lab., Upton, .. . Appl. Polymer Sei. Nguyen Van Thieu

2.6 Subject descriptors (Section 3 of INTS worksheet)

Descriptors are recorded in upper case and must be separated by a and a space. The split number, if present, is enclosed in square and precedes the first descriptor of the split. The closing square must be followed by a space. The last descriptor of all but the last split must be followed by a semicolon. The last descriptor of the last, or only, split must not be followed by a semicolon. Each descriptor may be on a separate ]ine. Each split must begin on a new line.

Examples. a) No split indication left margin tab setting 009 9 800 TECHNICAL SPECIFICATIONS; EFFICIENCY; ECONOMICS; REACTOR CORE; COOLANTS; STEAM

- 11 - b) Split document OO9 800 [pi] TECHNICAL SPECIFICATIONS; EFFICIENCY; ECONOMICS; REACTOR CORE; COOLANTS; STEAM; GRAPHITE MODERATE! REACTOR; LP§] TECHNICAL SPECIFICATIONS; EFFICIENCY; ECONOMICS; REACTOR CORE; GAS COOLED REACTORS; HEAVY WATER MODERATED REACTORS

Alternatively example a) above could have been entered« 009 9 800 TECHNICAL SPECIFICATIONS; EFFICIENCY; ECONOMICS; REACTOR CORE; COOLANTS; STEAM

Proposed descriptors (tag 8l0) are also separated by a semicolon, but no semicolon follows the last one. Note that the information in the right hand column of tag 810 in the worksheet must not be punched, and that no split indication applies to tag 8l0. Example: 810 STEAM COOLANT; EFFICIENCY

2.7 Omitted data elements

In general, not all data elements are completed on the Worksheet. The absence of a data element can be indicated in one of two ways:

a) by simply omitting the data element; or

T b) by the sequence TAGA? (i.e. the tag is entered but no text is в given); this technique will be used if the data are punched with the help of a program tape which, e.g. contains all possible tags for any given type of bibliographic record; in this case the tags as well as TAB and CR can be prepunched on the program tape, thus eliminating one possible source of error.

- 12 - 3- WORKSHEET HEADER

3.1 First Worksheet

Particular care should be exercised when punching the following elements from the Worksheet Header:

Tag 002 should consist of two numbers separated by a (/) (the slash is preprinted on the Worksheet)

Tag ООЗ should be punched only if either R or is circled; then only the circled character should be punched

Tag 004 should be punched only if either N, С or D is circled; only the circled character should be punched

Tag 005 should be punched only if either T or / is circled; then only the circled character should be punched

Tag 007 may consist of б or 9 characters; if present, only the indicated characters should be punched, i.e. if it consists of 6 characters these should not be followed by 3 spaces

Tag 008 consists of a 3-character subject category code; a 2-digit entry for total number of index terms; a 1-oharacter type of record code (the one which is circled); 1 to 3 biblio­ graphic level codes (the ones circled), and, optionally, a variable number of literary indicators (the ones circled); each of these fields is separated from the preceding one by a slash (/) which is preprinted on the Worksheet; if no literary indicator is circled, however, the bibliographic level codes should not be followed by a slash.

Although the alphabetic characters in the Header are shown in upper case, they may be keypunched in lower case mode. When using the INIS keyboard this would mean that no shift character will be needed (for the Header) after the initial lower case shift (preceding Tag 001).

- 13 - 3.2 Subsequent Worksheets

A bibliographic record may consist of several Worksheets. In this case only Tag 002 (not Tag 001) from the Header of Worksheets other than the first (i.e. 2/2, 2/3 or 3/3) must be entered. This will enable the computer program to check that all Worksheets for the bibliographic record have been correctly entered.

Examples of Worksheet Headers

left margin tab setting 1 I 1) 001 1Аб9123^5^ (first Worksheet Header) 002 1/1 * 004 N ф 008 A31/15/B/MS ? 2) 001 IA6923133 7* " 002 1/1 + 003 R t 004 N t 008 D23/08/J/AS/ *

3) 001 1Аб953531^ " 002 1/2 ^ 004 О 005 010000 ООО / + 007 009537 008 E27/35/B/AMS/U t

4) 002 2/2 i= (second Worksheet Header)

- 14 - 4. SPECIAL CONTROL SEQUENCES

A control sequence is a series of character codes on the punched paper tape which has a special meaning for the computer program that will process the paper tape. The INIS control sequences are described as follows:

4.1 Beginning of record

Each bibliographic record must begin with the following sequence :

001 A В

4.2 Beginning of data element

Each data element must begin with the following sequence : T TAG A В

4.3 Continuation line T A continuation line must begin with the single-code sequence A в

4.4 Snd of record

Bach bibliographic record must end with the following sequence:

О

Note: The end-of-record sequence must be preceded by a carriage 'return „°

e.g.

T A last line of last tag В

- 15 - 4.5 Null sequences A null sequence is a series of control characters which has no effect. There are two permissible null sequences :

a) A shift-code null sequence is a series of upper and lower case shift-codes (UC and LC) such that the text character affected by the sequence is in the same case it would have been if the sequence was not present.

Example

Text Paper tape ABCD и L и L и L и A В с D с С с С с С С

null sequences

b) A carriage return null sequence is a series of carriage return codes (CR).

Examplej. £CCC

A shift-code null sequence can be used anywhere in the bibliographic record.

A carriage return null sequence can be used anywhere a single carriage return is allowed (e.g., to provide vertical spacing, on the hard copy, between two bibliographic records).

- 16 - 5. ENCODING OP SPECIAL CHARACTERS

Special characters and symbols often are found in the title (Tag 200) of INIS records. Many of these characters are not part of the INIS character set (see IAEA-INTS-7)• A transcription table is given which indicates the preferred forms of the symbol in the English language (e.g. К becomes к ) for the most frequently occurring of these characters (see Table 2, p. 23)« There are, however, 33 special characters which are part of the INIS character set but not of the INIS paper tape subset. These need special coding, since they must subsequently be converted, by computer program, to their final form. These 35 characters are grouped as follows:

Groupl

$ -* / >f > < v Q %

Group_ 2

(superscript) 0 12)^56789+-

(subscript) 0123^56789

Those in the first group are encoded by using a different technique than those In the second group. To handle these 33 characters, three

special characters are used as control characters for encoding, viz.:

#(number sign): to delimit special characters in group 1 above

(overscore) : to indicate superscripting

__( under sc ore) : to Indicate subscripting

Note that these three control characters must only be used for the purpose indicated above, since they are not members of the INIS charac­ ter set.

- 17 - 5-1 Encoding of Group 1 characters

Character Composed as

? #female#

cT #male#

—» #yields#

/ #integral# V #root#

> #gt# < #lt#

#XI#

V #nu#

SI # OMEGA #

#percent#

5.2 Encoding of Group 2 characters (superscripts and subscripts)

A subscript (superscript) is any character or string of characters below (above) the normal line position.

a) If all the characters of a subscript (superscript) are available in the subscript (superscript) position in the full INIS character set, i.e. 0-9 (+, -, 0-9) then each subscript (superscript) character must be preceded by the underscore (overscore) character.

Note that, to facilitate proofreading, it is desirable that the underscore and overscore characters be assigned to the same key on the keyboard and that this key be non-escaping, i.e. that there be no carriage movement when it is depressed, so that subscripts and superscripts will actually appear underscored and overscored respectively on the hard copy. This feature will be assumed in the following examples:

b) When both a subscript and a superscript are present, then the subscript must be entered first.

- 18 - Examples of correct encoding:

Original Typewriter copy Paper tape

L x5 С 5

и L и —• L NÏ N2 + N 2 + с С — С С

и к K- к с -

— L L L 235 у . и о и О и и 235U о - О и с С С - С с С С

fi" #OMEGA#- # о M #

Examples of incorrect encoding:

235. 'u 23 5U (all superscript characters must be preceded by an overscore)

(i is not available in the sub­ A Ai i script position in the INIS character set)

X15 (the subscript .wast be entered first)

c) If one or more characters of a subscript (superscript) are not available in the subscript (superscript) position on the computer print train, then the entire subscript (superscript) is enclosed in parentheses and preceded by the word sub (sup). Spacing, if desirable, should be supplied by the operator (or cataloguer).

Examples of correct encoding:

Ksub(03) 03

b / #integral # sub (a) sup (b) a

- 19 - Example of incorrect encoding:

К sub (A3) Kß3 (3 should not be preceded by the underscore since this would mean that the original text read: Kg )

d) If both a subscript and a superscript are present and rule a) above does not apply to both, then both must be entered according to rule c).

Example of correct encoding:

Ksub(ir3) sup(+)

Example of incorrect encoding:

Ksub(ir3)+ 3 (the plus (+) sign should not be encoded by the overscore since rule a) does not apply to both the plus and 7ГЗ) •

5-3 Summary of encoding rules

Table 1 gives the correspondence between the INIS character set and the paper tape character set showing encoding where required (p. 22).

Table 2 gives the transcription of special characters often found in titles of scientific literature which are not contained in the INIS character set (see Table 1, p. 2^).

- 20 - 6. QUALITY AND DIMENSIONS OF PAPER TAPE

The quality of unpunched paper tape must conform to the Draft ISO Recommendation No. 1729; Properties of unpunched paper tape.

ISO/TC 97 (Secretariat-206) 3C4E} Jan. 1969.

The paper tape used for input to INIS is an 8-channel tape having the dimensions specified in the Draft ISO Recommendation No. I67I; Dimensions for punched paper tape for data interchange. ISO/TC 97 (Secretariat-l46) 221E, May 1968.

The tape must be 1-inch wide and have a feed track located at .392 (+ 0.004) inches from the reference edge and be punched with fully punched round holes in the code specified in INIS: Magnetic and Punched Paper Tape Codes and Character Sets (lAEA-INIS-7)•

- 21 - 7. INPUT BATCHES

Input to IMS Is supplied in batches consisting of a number of bibliographic records.

Each batch may consist of a number of paper tape coils.

Each coil must be submitted as follows:

a) The first meter of tape must be blank, and must be cut as indicated in the figure below:

On this piece of blank tape there must be written the name of the submitting organization, the first and last Temporary Record Number (TRN) contained on this coil, and the total number of coils in the batch.

b) Following the blank tape there should be at least 1 meter of tape feed (code 7-6-5-4-3-2-1).

c) The first character on the tape must be a carriage return (code 8), or a carriage return null sequence, followed im­ mediately by Tag 001 of the first bibliographic record.

d) The last characters on the tape must be the end-of-record

sequence (S) R of the last bibliographic record, i.e. a record must not be split between two coils. This must be followed by at least 0.5 meters of tape-feed (code 7-6-5-4-3-2-1). The tail must be cut as Indicated in the figure below:

OOOO о ООО OOOO о ООО OOOO о ООО OOOO о ООО OOOO о ООО ООООоООО ОООАоООО

- 22 - TABLE 1 COMPARISON OF INIS VERSUS PAPER-TAPE CHARACTER SETS

INIS Paper-tape INIS Paper-tape Char. Set Char. Set Char. Set Char. Set

A - Z A - Z ? # female #

a - z a - z o* #male#

0 - 9 0 - 9 —» #yields#

$ $ / # intégra 1#

= = V" #root# ff 1 > #gt# ( ( <

) ) w #XI#

[ t #nu#

] ] # OMEGA #

t % # percent #

: : 0 Ö

/ / 1 ï

- - 2 2

? 3 3

• » 4 4 ex * 5 "u» 5 и + + 6 6 supe i a a 7 7

В /3 8 8

7 У 9 9 + +

TT 7Г - —

Cd ÜJ 0 0

Д Д 1 1

Л Л 2 2

2 г 3 3 *-* — 4 л, 4

С и О

6 6

7 7

8 8

9 9

- 23 - TABLE 2 TRANSCRIPTION OF SPECIAL CHARACTERS

Character Composed as Character Composed as

& (ampersand) and e THETA

• (centred dot) . (period on line) e thêta

 (angstrom) A iota

X (multiplication) x ( lower case x) к kappa

° (degree) deg (or superscript zero lambda when no ambiguity Л anti Л results) anti (j f (division) / S xi Z L П PI r 1 P rho I vector о sigma к К к tilde К tilde T tau к К anti к anti К T UPSILON и U tilde V upsilon " " (double ) Ф PHI > - cp 0 phi < = chi approximately equal to X г or ö or PSI approximately ~ ОГ Ri psi ± + +- -+ £ Pound ô а delta reversible e epsilon Э is contained in zeta contains n eta с

- 24 - APPENDIX 1 INIS FLEXOWRITER KEYBOARD ARRANGEMENT

7654321 65432 1 2 521 3 531 532 321 4 541 742 76431 642 Л BACK TAPE а ß # @ ( S 8• 1 SPACE FEED 1 2 3 4 5 7 9 TAB 1 1 11 15 19 23 27 31 35 39 43 47 51 55 57

+43 421 754 632 76531 741 621 654 653 76541 732 75321 75421 8

AUX STOP [ W E R T Y U I 0 P CAR RET CODE CODE ]

2 6 13 17 21 25 29 33 37 4i 45 49 53 56 761 652 763 76532 76321 764 751 752 721 7 765 76542 ro STOP LOWER » LOWER H J К L READ CASE A S D F G 1 и CASE 7 14 18 22 26 30 34 38 42 46 50 54 59 76543 542 641 65321 76521 631 762 731 753 65421 76421 651 Д UPPER START UPPER N M READ CASE У CASE

12 16 20 24 28 32 36 40 44 48 52 58

SPACE The INIS Reference Series comprises the rules, standards, formats, codes, authority lists, etc. on which the system is based. It currently consists of 14 documents published by the Agency in 1969 and 1970« Some of the documents have been revised and further revisions will be issued in 1971»

Individual documents in the INIS Reference Series may be purchased at the prices indicated below or a complete set of the Series in printed form may be ordered for US $20. Minimum orders should not be for less than US $3.

The price for microfiche copies of the INIS Reference Series is US $0.65 per item.

Single documents t

IAEA-INIS-1(Rev.1) INIS: Descriptive Cataloguing Rules $ 1.10 IAEA-INIS-2(Rev.l) INIS: Descriptive Cataloguing Samples $1.00 IAEA-INIS-3(Rev.l) INIS: Subject Categories and Scope Descriptions $ 0.80 IAEA-INIS-4(Rev.O) INIS: Instructions for Submitting Abstracts $ 0.30 IAEA-INIS-5(Rev.lJ INIS: Terminology and Codes for Countries and International Organizations $ 0.50 IAEA-INIS-6(Rev,2) INIS: Authority List for Corporate Entries $ 3«5° IAEA-INIS-7(Rev.l) INIS: Magnetic and Punched Paper Tape Codes and Character Sets $ 0.40 IAEA-INIS-8 Uev.l ) INIS : Paper Tape Specifications and Record Format $ O.5O lAEA-INIS-9(Rev.lJ INIS: Magnetic Tape Specifications and Record Format $ O.4O IAEA-INIS-10(Rev.0; INIS: Transliteration Rules for Selected Non-Roman Characters $ 0.20 lAEA-INIS-ll(Rev.O; INIS: Authority List for Journal Titles $ 1.30 IAEA-INIS-12(Rev.0) INIS: Manual for Indexing $ 1.00 IAEA-INIS-13(Rev.o; INIS: Thesaurus $ 8.00 IAEA-INIS-13A(Rev.0) INIS: Terminology Charts $ 2.00