>

2.3 -- DIRECTORS --

in file people.html The file lists all directors, as well as some other movie people, as producers and cinematographers (A total of 3290 `tr/td’ table row entries, 3011 `@’ directors as of July 1999).
The directors table is similar to the actors table in that it contains intervals for when the director worked and when he/she lived. The key of the relation is the field "name", which is the name under which the director directed. Director's key names do not contain any blanks. Typically the last name is used, when needed prefaced by an initial. A secondary unique key is defined for each director, up to three letters, based on the initial letters of the first, middle, and last names. This key will provide HTML HREF linkages among many of the files. As with the actors table, this table also includes the real name of the director among its fields ("lastname" and "firstname"). It also contains importat producers, cinematographers, musicians and composers, etc.

2.4 -- STUDIOS --

in file studios.html are important studios only (203 `tr/td’ entries, sparse information).
The key of the studios relation is the name of the studio. The temporal information that is included is an interval indicating the years the studio was (or is) in operation, represented by the fields "startdate" and "enddate". This is a history relation.

2.5 -- CASTS --

in file casts.html This is a large (too large?) file of who acted as what in which movie. (46 009 tr/td entries, only partial for movies and roletypes, July 1999). Casts is an association relation, linking actors with movies. The key of the relation is the catenation of the two fields "film_id" and "actor"; no temporal information is included in this relation.
This file was too big for Netscape in 1996, so that also five working subsets were made available, however these are not kept up-to-date.

2.6 -- REMAKES --

in file remakes.html (1278 `tr/td’ entries in July 1999).
This table (which is not extensively used in the temporal DB paper) gives information about movies that are remakes of other movies. It is very useful to test recursion in databases.

2.10 -- AWARDS-RECEIVED --

is no longer a distinct file Awards received for special occasions are listed with individual entries in the files for ACTORS (actors.html).or MOVIE PEOPLE (people.html). Regular awards associated with a particular movies are given in CASTS (casts.html).

2.11 -- REFERENCES --

Books that provided material for this database are listed within this documentation file as Appendix A.

2.12 -- GEOGRAPHY --

Codes for countries and origins are listed within this documentation file as section 4.3: doc.html GEO.

2.13 -- CATEGORIES --

Codes for movie categories are listed within this documentation file as Section 4.4: doc.html CATS.

2.14 -- COLOR-CODES --

Codes for color processes used for movies are listed within this documentation file as Section 4.5: doc.html COLS.

2.15 -- ROLE-TYPES --

Codes that specify role-types for actors are listed in the preamble for casts.html ROLES.

2.16 -- FIELD-IDENTIFIERS --

Codes that identify subfields in various files are listed within this documentation file as Section 4.2: doc.html FIELDS.

2.17 - AWARD TYPES --

Lists the award types used in MAIN, ACTORS, and PEOPLE, with the organizations who award them, and the span of years they were awarded.

2.19 -- IMAGES --

there is a small collection of .tiff files for actors and directors. They are kept individually in an images subdirectory.

2.20 -- ICONS --

There are about a dozan icons to be used to identify subfiles. Some of them come from the New Yorker Magazine Jan.1993. There are kept individually in an icons subdirectory.


3. Schema Definition for the Movies Database

Here we give a detailed description of the schema of the movies database, which is used for all examples in this paper and was used to implement the temporal SQL additions. General descriptions are given in Section 2, above.
This file is being updated to desctribe the HTML version. Where updates were made, the old material is in curly {brackets}.

3.1 The MOVIES Table

Col-Name = Description
. There is a distinct table for each director (Hitchcock has multiple tables, one for early silent, one for British, one for American, and one for TV movies).
The tables are broken up p by year of first known film by the directors. There are some break and header records for each year.
Each director table has two types of records:
    > The remainder shows the format for the data records that follow below.. The note field is often used to describe the set of detail records
    For movies where the director in not known there is a dummy entry, either by topic (Unknown) or by year (UnYear), as shown in the people file. >

    3.2. The ACTORS Table

    There is one record for each actor listed, but not all actors listed in CAST are documented. There are also break and header records for each letter of the alphabet.

    Col-Name = Description

    .

    3.3 The PEOPLE Table

    Directors are the major subset of the general people.html table. Other entries are significant producers, writers, art directors and some authors. Being a director is indicated in the Pcode field, and has some effect on other fields. There are also break and header records for each letter of the alphabet.

    Col-Name = Description

    3.4 The STUDIOS Table

    Col-Name = Description ------

    3.5 The CASTS Table

    The large CAST file is broken up into sections by initial letter(s) of the directors identifying code. In each section will be a number of directors, ordered by code. There is a distinct table for each director.
    Col-Name = Description
      Each director table has two types of entries >

      3.6 The REMAKES Table

      Col-Name = Description

      3.8 -- QUOTES table

      Col-Name = Description ------

      film-id title speaker role listenr quote

      3.9 -- AWARDS table

      Col-Name = Description ------

      award agency place

      3.10 -- AWARDS table

      Col-Name = Description ------

      recepnt award year reason notes

      3.11 -- REFERENCES table

      Col-Name = Description ------

      no. author title pub-inf

      3.12 -- GEOGRAPHY table

      Col-Name = Description ------

      code . See 4.1 for the encoding used. country-name c-adjective

      3.13 -- CATEGORIES table

      Col-Name = Description


      ctcode = Four-letter code

      Movie categories
      codecategory|codecategory|codecategory |
      Ctxxuncategorized|
      Actnviolence|Advtadventure|AvGaAvant Garde|
      Campnow - camp|Cartcartoon|CnR Cops and Robbers|
      Comdcomedy|
      DisaDisaster| Docudocumentary| Dramdrama|
      Epicepic|Famlfamily |Histhistory |
      Horrhorror| Muscmusical|Mystmystery|
      Noirblack| Pornpornography|Romtromantic|
      ScFiscience fiction| Surlsureal|Suspthriller |
      Westwestern|

      3.14 -- COLOR-CODES table

      Col-Name = Description ------

      color-code full specific color processes > >

      They are listed in the preamble for movies.macros. . -->

      3.15 ROLE-TYPES table

      Col-Name = Description ------

      role-codes

      3.16 FIELD-IDENTIFIERS table

      Col-Name = Description ------

      codes Section 4.4.. Some general codes, appearing in many files, are: >


      4: CODE TABLES

      A fair amount of the information is encoded for consistency of reference. Directors' names are always treated as codes, and many other movie people as well. Names with out spaces are codes, and can be found in the PEOPLE relation. Actors names are treated as codes as well, although here first names have been retained. Many actors can be found in the ACTORS relation.
      All movies have been assigned a code by catenating a director's identifier, found in the PEOPLE relation with sequence digits.

      Several code tables appear below, other used are

      Remote code tables

        >casts.html ROLES. . -->

        Local code tables

        4.1 -- FIELD-DESIGNATORS --

        These codes are used in certain filed to further identify the contents.

        Check this, much changed when moving to HTML.

        codedefinition|
        T:film title|
        T2: redefinition of title in main.html SYNS.|
        T3: title used for locale file in MAIN |
        T4: title used for License plate list in MAIN|
        T5: title |
        T6: title used in casts.html SAYINGS.|
        T6: title used in quotes.html.|
        TS:not sure of actor spellingused in CASTS obsolete |
        TZ:title from Movies-dir.html|
        TZ:not sure if actor in this filmused in CASTS|
        P: producer in PEOPLE|
        PN: producer full name not yet classified|
        PU: unknown producer not yet classified|
        St: listed studionot yet consistent|
        SN: studio name|
        SU: studio name unknown|
        SL: country or city of studio|
        SD: distributor old codes: Dtr, Ds, Dis{\Dtr|
        R: roleused in CASTS|
        RZ: role uncertainused in CASTS|
        RU: role unknownused in CASTS|
        RN: only name in roleused in CASTS|
        RS: spelling of actor's name unsureused in CASTS|
        D:|
        DN: director with full name, may not be in PEOPLE filewas \DiF|
        DU:|

        4.2 -- FIELD-IDENTIFIERS --

        Ancillary information, and social and professional relationships is frequently given as sub-fields in notes. Below are the prefixes used for such variable information. Some refer to candidate entries in other files.

        Many occur in the MAIN file, in the final NOTES field, others in the PEOPLE and ACTORS files. There should be only one sub- field of any type in the notes field, but they may have contain multiple entries If the contents references entries in a file, it is listed, but such a reference is only assured if coded with a colon ( : ). Some of these designations are repeated with the files themselves.

        CodeContentsOcurrs inReferences what|
        Aliasother professional names PEOPLE, ACTORSnone |
        Alsoother professionsMAIN, ACTORSPEOPLE |
        Altalternative titleMAIN (SYNONYMS)none |
        BBook AuthorMAIN, ACTORSPEOPLE |
        BS was Brother or Sister of, use SiPEOPLE, ACTORSPEOPLE |
        CCinematographerMAIN, PEOPLE, ACTORSPEOPLE |
        CoDCoDirectorMAINPEOPLE |
        CostCost to make film [M/K]MAINnone|
        Chchild ofPEOPLE ACTORSPEOPLE, ACTORS |
        ChorchoreographerMAIN, ACTORSPEOPLE |
        DtrdistributorMAINSTUDIO |
        Erpossible error, verifyallnone |
        FdFounded: STUDIOS PEOPLE , ACTORS|
        FdrFounder of: PEOPLE , ACTORS STUDIOS |
        GnFPEOPLEused for Female producer or director |
        IncIncome from film [M/K]MAINnone|
        Infinfluenced PEOPLE, ACTORSPEOPLE , ACTORS|
        Inbinfluenced by PEOPLE, ACTORSPEOPLE, ACTORS |
        LwLived withPEOPLE, ACTORSPEOPLE, ACTORS |
        McomposerMAIN, PEOPLE, ACTORSPEOPLE |
        MtMarried toPEOPLE, ACTORSPEOPLE, ACTORS |
        Ntnoteallnone |
        PplaywrightMAIN, ACTORSPEOPLE |
        Ptparent ofPEOPLE, ACTORSPEOPLE, ACTORS |
        Phunusual physical characteristics,
        as height, weight, ..
        ACTORS none |
        RARt director, now VMAIN, PEOPLE, ACTORSPEOPLE |
        Seen [dd[MMM]]yyMAINnone|
        StstudioMAIN, ACTORSPEOPLE |
        Si PEOPLE, ACTORSPEOPLE, ACTORS |
        Tytypical stylePEOPLE, ACTORSTable 4.4 |
        VVisual art directorMAIN, ACTORS, PEOPLEPEOPLE |
        VT have video tape (number)MAINnone|
        WwriterMAIN, PEOPLE, ACTORSPEOPLE |
        Wwworked withPEOPLE, ACTORSPEOPLE, ACTORS |

        4.3 -- GEOGRAPHY --

        Codes for countries and origins are listed below. It is still incomplete. .
        Common countries for movie making are listed first, followed by other countries in alphabetical order
        codecountryadjective|codecountryadjective|
        AmUSAAmerican|
        Brnot usedBritish| GBGreat Britainnot used|
        FrFranceFrench|
        GeGermanyGerman|
        ItItalyItalian|
        JaJapanJapanese|
        Alphabetical by code|
        ArArgentiniaArgentine|
        AuAustraliaAustralian|
        BeBelgiumBelgian|
        BzBrazilBrazilian|
        CaCanadaCanadian|
        ChChina, PRCChinese|
        CzCzechoslovakiaCzech|
        DaDenmarkDanish|
        GrGreeceGreek|
        DuHollandDutch|
        HuHungaryHungarian|
        InIndiaIndian|
        IrIrelandIrish|
        MeMexicoMexican|
        OsAustriaAustrian|
        PePeruPeruvian|
        RuUSSR, RussiaRussian|
        SpSpainSpanish|
        SASouth-AfricaSouth-African|
        YuYugoslaviaYugoslav|
        ZwSwitzerlandSwiss|

        4.4 -- CATEGORIES --

        Codes for movie categories are listed below. Mainly used in MAIN, also content of Type fields. Multiple entries are possible.
        codecategory|
        Susp thriller|
        CnR cops and robbers|
        Dram drama|
        West western|
        Myst mystery|
        S.F. science fiction|
        Advt adventure|
        Horr horror|
        Romt romantic|
        Comd comedy |
        Musc musical|
        Docu documentary|
        Porn pornography, including soft|
        Noir black|
        BioP biographical Picture|
        TV TV show|
        TVs TV series|
        TVm TV miniseries|

        4.5 -- COLOR-CODES --

        Codes for color processes used for movies in MAIN are listed below.
        codefull namedescription|
        prcunknown|
        colcolorcolor film, common after 1955|
        bnwblack-and-whiteb-w film common before 1945|
        sbwsilentsilent black-and-white film |
        cldcoloredblack-and-white film recolored|
        CartcartoonCartoons are normally colored|
        TcolTechnicolorhigh quality color |
        EcolEastmancolor color by Kodak N\t(unstable)|
        WcolWarnercolor|
        McolMetrocolor Color by MGM|
        AcolAnscocolorcolor by Kodak? |
        AgcolAgfacolor|
        FcolFujicolor|
        DeLuxeDeLuxelow cost color|
        DuArtDuArtcolor|
        MovielabMovieLabcolor|
        CSCinemascopewidescreen, mostly color |
        Trama Technirama widescreen color |
        PanPanaVision|
        TVfilm made for TVvarious processes|
        VstVistavision|

        Appendix A: References

          Books, etc. > Electronic material
            > 1999 > > >; relates actors to Kevin Bacon with degree of seperation, April 1999. >; relates actors to Each other with degree of seperation, April 1999. . -->

            Books about Actors

              >

              Web pages

                >
                  > BOOKS I have for Movie Stories:
                    >

                    Appendix B: CONVERSION

                    This section refers to the original HTML files. The notes are still being developed.

                    To convert the source files from HTML format to another type of database:

                    (we use [] to denote HTML `french' brackets.)

                      > Multiple values in a field a separated by
                        > If both `;' and `,' appear in a field, then the `;' typically distinguishes a major group relative to the `,' value separator, as [td] island, South Pacific; court, SF, CA[td]. > TypeCode(field), as W(Eliot Stannard)
                        >

                        When names of producers, writers, etc, in fields do not contain blanks, as [td] P:A.Hughes[td], then the name exists in the people.html file, and can be used as an interfile reference.