<<

Outline The as a Concept and Tool The and its Evolution Database Applications

Day 1: Introduction to Database Theory and Design

Database Theory and Design Tyler Peterson

International Summer School on Language Documentation and Description Leiden University Centre for Linguistics, Leiden

November 26, 2011

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications My Details:

Tyler Peterson office: LUCL Van Wijkplaats 4 Room 205a telephone: 071-5272059 email: [email protected] [email protected] (for Google docs) office hours: Most afternoons until 18:00

I Please fill out the short survey, and don’t hesitate to contact me!

Database Theory and Design Day 1: Introduction to Database Theory and Design I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes). I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices.

I Cross tabulation: a front-line analytical tool. I Trigger–Target Database for Phonological Processes I Programma de Fonologia Experimental e Hist´orica

I vs. spreadsheets. Types of databases. I Different commercial and free database programs: advantages and limitations. I To familiarize you with the concepts in database theory and design.

I Looking at the practical implemetation of an analytical database in MS Access.

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes). I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices.

I Cross tabulation: a front-line analytical tool. I Trigger–Target Database for Phonological Processes I Programma de Fonologia Experimental e Hist´orica

I Different commercial and free database programs: advantages and limitations. I To familiarize you with the concepts in database theory and design.

I Looking at the practical implemetation of an analytical database in MS Access.

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes). I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices.

I Cross tabulation: a front-line analytical tool. I Trigger–Target Database for Phonological Processes I Programma de Fonologia Experimental e Hist´orica

I To familiarize you with the concepts in database theory and design.

I Looking at the practical implemetation of an analytical database in MS Access.

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases. I Different commercial and free database programs: advantages and limitations.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Cross tabulation: a front-line analytical tool. I Trigger–Target Database for Phonological Processes I Programma de Fonologia Experimental e Hist´orica

I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes). I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices. I Looking at the practical implemetation of an analytical database in MS Access.

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases. I Different commercial and free database programs: advantages and limitations. I To familiarize you with the concepts in database theory and design.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Cross tabulation: a front-line analytical tool. I Trigger–Target Database for Phonological Processes I Programma de Fonologia Experimental e Hist´orica

I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices. I Looking at the practical implemetation of an analytical database in MS Access.

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases. I Different commercial and free database programs: advantages and limitations. I To familiarize you with the concepts in database theory and design.

I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes).

Database Theory and Design Day 1: Introduction to Database Theory and Design I Cross tabulation: a front-line analytical tool. I Trigger–Target Database for Phonological Processes I Programma de Fonologia Experimental e Hist´orica

I Looking at the practical implemetation of an analytical database in MS Access.

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases. I Different commercial and free database programs: advantages and limitations. I To familiarize you with the concepts in database theory and design.

I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes). I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Cross tabulation: a front-line analytical tool. I Trigger–Target Database for Phonological Processes I Programma de Fonologia Experimental e Hist´orica

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases. I Different commercial and free database programs: advantages and limitations. I To familiarize you with the concepts in database theory and design.

I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes). I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices. I Looking at the practical implemetation of an analytical database in MS Access.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Trigger–Target Database for Phonological Processes I Programma de Fonologia Experimental e Hist´orica

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases. I Different commercial and free database programs: advantages and limitations. I To familiarize you with the concepts in database theory and design.

I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes). I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices. I Looking at the practical implemetation of an analytical database in MS Access.

I Cross tabulation: a front-line analytical tool.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Programma de Fonologia Experimental e Hist´orica

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases. I Different commercial and free database programs: advantages and limitations. I To familiarize you with the concepts in database theory and design.

I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes). I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices. I Looking at the practical implemetation of an analytical database in MS Access.

I Cross tabulation: a front-line analytical tool. I Trigger–Target Database for Phonological Processes

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases. I Different commercial and free database programs: advantages and limitations. I To familiarize you with the concepts in database theory and design.

I Theory: Terminology used in database theory; Understanding the design features of a database (entities and attributes). I Design: Assessing your goals; planning a database on paper (Entity-Relationship diagrams); Best practices. I Looking at the practical implemetation of an analytical database in MS Access.

I Cross tabulation: a front-line analytical tool. I Trigger–Target Database for Phonological Processes I Programma de Fonologia Experimental e Hist´orica

Database Theory and Design Day 1: Introduction to Database Theory and Design I Day 2: Principles of Database Theory and Design

I Day 3: Relational Databases I

I Day 4: Relational Databases II, examples and implementation

I Day 5: Implementation

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications 5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes, the applications.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Day 3: Relational Databases I

I Day 4: Relational Databases II, examples and implementation

I Day 5: Implementation

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications 5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes, the applications.

I Day 2: Principles of Database Theory and Design

Database Theory and Design Day 1: Introduction to Database Theory and Design I Day 4: Relational Databases II, examples and implementation

I Day 5: Implementation

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications 5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes, the applications.

I Day 2: Principles of Database Theory and Design

I Day 3: Relational Databases I

Database Theory and Design Day 1: Introduction to Database Theory and Design I Day 5: Implementation

Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications 5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes, the applications.

I Day 2: Principles of Database Theory and Design

I Day 3: Relational Databases I

I Day 4: Relational Databases II, examples and implementation

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications 5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes, the applications.

I Day 2: Principles of Database Theory and Design

I Day 3: Relational Databases I

I Day 4: Relational Databases II, examples and implementation

I Day 5: Implementation

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Goals for Today:

The Database as a Concept and Tool Understanding what a database is Using databases in linguistics The Database model and its Evolution The ‘Flat’ database The Database Management System (DBMS) Types of Databases Models Database Applications Choosing the right Linguistic DBMS and Interfaces Non-Linguistic DBMS and Interfaces References and Suggested Readings

Database Theory and Design Day 1: Introduction to Database Theory and Design I Index cards in a shoebox. I A table in a spreadsheet.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I The database as a concept: A structured collection of data, or structured information:

Database Theory and Design Day 1: Introduction to Database Theory and Design I A table in a spreadsheet.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I The database as a concept: A structured collection of data, or structured information:

I Index cards in a shoebox.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I The database as a concept: A structured collection of data, or structured information:

I Index cards in a shoebox. I A table in a spreadsheet.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

Word Gloss Gram. Morph. hon ‘fish’ N ROOT smax ‘bear, meat’ N ROOT algyax ‘language’ N ROOT sm-algyax’ Gitksan N STEM sm-’ ‘true’ A PREFIX siipxw ‘sick, ill’ A ROOT wii-nakw’ ‘tall’ A STEM wii- ‘long’ A PREFIX nakw’ DISTAL ROOT nakw’ EVIDENTIAL ROOT x- ‘consume’ V PREFIX iixwt ‘fish’ V ROOT witxw ‘arrive’ V ROOT bakw ‘arrive’ V ROOT litsxxw ‘read’ V ROOT =hl common noun Det. ENCLITIC =t proper noun Det. ENCLITIC =tip plural noun Det. ENCLITIC -y’ 1sg Agr. SUFFIX -n 2sg Agr. SUFFIX -t 3 Agr. SUFFIX Table: Structured Information: a Gitksan (Tsimshianic) word list

Database Theory and Design Day 1: Introduction to Database Theory and Design I Word processor: processes words (!) I Spreadsheet: processes financial, numerical and statistical information. I Database program: processes structured information.

I Different types of processors:

I The digital presentation of structured information through an application: MS Access; OpenOffice Calc; FileMaker Pro; MySQL with a PHP server; etc.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I The database as an application, or a kind of ‘processor’:

Database Theory and Design Day 1: Introduction to Database Theory and Design I Word processor: processes words (!) I Spreadsheet: processes financial, numerical and statistical information. I Database program: processes structured information.

I The digital presentation of structured information through an application: MS Access; OpenOffice Calc; FileMaker Pro; MySQL with a PHP server; etc.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I The database as an application, or a kind of ‘processor’: I Different types of processors:

Database Theory and Design Day 1: Introduction to Database Theory and Design I Spreadsheet: processes financial, numerical and statistical information. I Database program: processes structured information.

I The digital presentation of structured information through an application: MS Access; OpenOffice Calc; FileMaker Pro; MySQL with a PHP server; etc.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I The database as an application, or a kind of ‘processor’: I Different types of processors:

I Word processor: processes words (!)

Database Theory and Design Day 1: Introduction to Database Theory and Design I Database program: processes structured information.

I The digital presentation of structured information through an application: MS Access; OpenOffice Calc; FileMaker Pro; MySQL with a PHP server; etc.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I The database as an application, or a kind of ‘processor’: I Different types of processors:

I Word processor: processes words (!) I Spreadsheet: processes financial, numerical and statistical information.

Database Theory and Design Day 1: Introduction to Database Theory and Design I The digital presentation of structured information through an application: MS Access; OpenOffice Calc; FileMaker Pro; MySQL with a PHP server; etc.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I The database as an application, or a kind of ‘processor’: I Different types of processors:

I Word processor: processes words (!) I Spreadsheet: processes financial, numerical and statistical information. I Database program: processes structured information.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I The database as an application, or a kind of ‘processor’: I Different types of processors:

I Word processor: processes words (!) I Spreadsheet: processes financial, numerical and statistical information. I Database program: processes structured information.

I The digital presentation of structured information through an application: MS Access; OpenOffice Calc; FileMaker Pro; MySQL with a PHP server; etc.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Spreadsheets use functions to ask questions of numbers. “What’s the average daily rainfall for the first six months of this year?” I Databases uses functions to ask questions about structured information: “Do we have any books on designing databases in our library? If so, on which shelves are they located?”

I The primary differences between a spreadsheet and database: different types of queries.

I Retrieval, and presentation: Today’s database applications are designed to retrieve and present data through queries through specially designed forms, within a database application, or on the web.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I Spreadsheets are actually a kind of database: both organize information into tables.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Spreadsheets use functions to ask questions of numbers. “What’s the average daily rainfall for the first six months of this year?” I Databases uses functions to ask questions about structured information: “Do we have any books on designing databases in our library? If so, on which shelves are they located?”

I Retrieval, and presentation: Today’s database applications are designed to retrieve and present data through queries through specially designed forms, within a database application, or on the web.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I Spreadsheets are actually a kind of database: both organize information into tables. I The primary differences between a spreadsheet and database: different types of queries.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Databases uses functions to ask questions about structured information: “Do we have any books on designing databases in our library? If so, on which shelves are they located?”

I Retrieval, and presentation: Today’s database applications are designed to retrieve and present data through queries through specially designed forms, within a database application, or on the web.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I Spreadsheets are actually a kind of database: both organize information into tables. I The primary differences between a spreadsheet and database: different types of queries.

I Spreadsheets use functions to ask questions of numbers. “What’s the average daily rainfall for the first six months of this year?”

Database Theory and Design Day 1: Introduction to Database Theory and Design I Retrieval, and presentation: Today’s database applications are designed to retrieve and present data through queries through specially designed forms, within a database application, or on the web.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I Spreadsheets are actually a kind of database: both organize information into tables. I The primary differences between a spreadsheet and database: different types of queries.

I Spreadsheets use functions to ask questions of numbers. “What’s the average daily rainfall for the first six months of this year?” I Databases uses functions to ask questions about structured information: “Do we have any books on designing databases in our library? If so, on which shelves are they located?”

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What is a database?

I Spreadsheets are actually a kind of database: both organize information into tables. I The primary differences between a spreadsheet and database: different types of queries.

I Spreadsheets use functions to ask questions of numbers. “What’s the average daily rainfall for the first six months of this year?” I Databases uses functions to ask questions about structured information: “Do we have any books on designing databases in our library? If so, on which shelves are they located?”

I Retrieval, and presentation: Today’s database applications are designed to retrieve and present data through queries through specially designed forms, within a database application, or on the web.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Cross-linguistic and typological research. I Tools for verifying and evaluating contrasting empirical and theoretical claims. I Specialized queries that can yield new insights into data.

I Archiving massive amounts of language/linguistic data. I Lexicography and dictionary making. I Enables collaboration through client-server applications over a network. I Database applications are particularily well-suited to linguistic research (cf. Nerbonne 1997; Everaert et al 2009):

I Consistency and integrity: imposing a structure on information can help reduce inaccuracies and redundancies.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

Database Theory and Design Day 1: Introduction to Database Theory and Design I Cross-linguistic and typological research. I Tools for verifying and evaluating contrasting empirical and theoretical claims. I Specialized queries that can yield new insights into data.

I Lexicography and dictionary making. I Enables collaboration through client-server applications over a network. I Database applications are particularily well-suited to linguistic research (cf. Nerbonne 1997; Everaert et al 2009):

I Consistency and integrity: imposing a structure on information can help reduce inaccuracies and redundancies.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

I Archiving massive amounts of language/linguistic data.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Cross-linguistic and typological research. I Tools for verifying and evaluating contrasting empirical and theoretical claims. I Specialized queries that can yield new insights into data.

I Enables collaboration through client-server applications over a network. I Database applications are particularily well-suited to linguistic research (cf. Nerbonne 1997; Everaert et al 2009):

I Consistency and integrity: imposing a structure on information can help reduce inaccuracies and redundancies.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

I Archiving massive amounts of language/linguistic data. I Lexicography and dictionary making.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Cross-linguistic and typological research. I Tools for verifying and evaluating contrasting empirical and theoretical claims. I Specialized queries that can yield new insights into data.

I Database applications are particularily well-suited to linguistic research (cf. Nerbonne 1997; Everaert et al 2009):

I Consistency and integrity: imposing a structure on information can help reduce inaccuracies and redundancies.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

I Archiving massive amounts of language/linguistic data. I Lexicography and dictionary making. I Enables collaboration through client-server applications over a network.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Cross-linguistic and typological research. I Tools for verifying and evaluating contrasting empirical and theoretical claims. I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure on information can help reduce inaccuracies and redundancies.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

I Archiving massive amounts of language/linguistic data. I Lexicography and dictionary making. I Enables collaboration through client-server applications over a network. I Database applications are particularily well-suited to linguistic research (cf. Nerbonne 1997; Everaert et al 2009):

Database Theory and Design Day 1: Introduction to Database Theory and Design I Tools for verifying and evaluating contrasting empirical and theoretical claims. I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure on information can help reduce inaccuracies and redundancies.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

I Archiving massive amounts of language/linguistic data. I Lexicography and dictionary making. I Enables collaboration through client-server applications over a network. I Database applications are particularily well-suited to linguistic research (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure on information can help reduce inaccuracies and redundancies.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

I Archiving massive amounts of language/linguistic data. I Lexicography and dictionary making. I Enables collaboration through client-server applications over a network. I Database applications are particularily well-suited to linguistic research (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research. I Tools for verifying and evaluating contrasting empirical and theoretical claims.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Consistency and integrity: imposing a structure on information can help reduce inaccuracies and redundancies.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

I Archiving massive amounts of language/linguistic data. I Lexicography and dictionary making. I Enables collaboration through client-server applications over a network. I Database applications are particularily well-suited to linguistic research (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research. I Tools for verifying and evaluating contrasting empirical and theoretical claims. I Specialized queries that can yield new insights into data.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

I Archiving massive amounts of language/linguistic data. I Lexicography and dictionary making. I Enables collaboration through client-server applications over a network. I Database applications are particularily well-suited to linguistic research (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research. I Tools for verifying and evaluating contrasting empirical and theoretical claims. I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure on information can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design I A Linguistic database: contains data from language research (i.e. words, phonemes, grammatical categories, fundamental frequencies, etc.) I A Metalinguistic database: contains data about language research (i.e. names of speakers, locations, recording details, etc.)

I Both are concieved, designed and implemented using the same principles.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What can databases be used for in linguistics?

I Two broad types of databases in linguistics:

Database Theory and Design Day 1: Introduction to Database Theory and Design I A Metalinguistic database: contains data about language research (i.e. names of speakers, locations, recording details, etc.)

I Both are concieved, designed and implemented using the same principles.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What can databases be used for in linguistics?

I Two broad types of databases in linguistics:

I A Linguistic database: contains data from language research (i.e. words, phonemes, grammatical categories, fundamental frequencies, etc.)

Database Theory and Design Day 1: Introduction to Database Theory and Design I Both are concieved, designed and implemented using the same principles.

Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What can databases be used for in linguistics?

I Two broad types of databases in linguistics:

I A Linguistic database: contains data from language research (i.e. words, phonemes, grammatical categories, fundamental frequencies, etc.) I A Metalinguistic database: contains data about language research (i.e. names of speakers, locations, recording details, etc.)

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The Database as a Concept and Tool Understanding what a database is The Database model and its Evolution Using databases in linguistics Database Applications What can databases be used for in linguistics?

I Two broad types of databases in linguistics:

I A Linguistic database: contains data from language research (i.e. words, phonemes, grammatical categories, fundamental frequencies, etc.) I A Metalinguistic database: contains data about language research (i.e. names of speakers, locations, recording details, etc.)

I Both are concieved, designed and implemented using the same principles.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications A common starting point: the ‘Flat’ database

Word Gloss Gram. Morph. hon ‘fish’ N ROOT smax ‘bear, meat’ N ROOT algyax ‘language’ N ROOT sm-algyax’ Gitksan N STEM sm-’ ‘true’ A PREFIX siipxw ‘sick, ill’ A ROOT wii-nakw’ ‘tall’ A STEM wii- ‘long’ A PREFIX nakw’ DISTAL ROOT nakw’ EVIDENTIAL ROOT x- ‘consume’ V PREFIX iixwt ‘fish’ V ROOT witxw ‘arrive’ V ROOT bakw ‘arrive’ V ROOT litsxxw ‘read’ V ROOT =hl common noun Det. ENCLITIC =t proper noun Det. ENCLITIC =tip plural noun Det. ENCLITIC -y’ 1sg Agr. SUFFIX -n 2sg Agr. SUFFIX -t 3 Agr. SUFFIX Table: A ‘Flat’ Database of a Gitksan (Tsimshianic) word list

Database Theory and Design Day 1: Introduction to Database Theory and Design I Generating word lists. I Basic searches.

I Enter language data into a word processor (MS Word) or spreadsheet (MS Excel).

I One record in a paper form = One row (“record”) in computerized table of data. I Adequate for a simple applications with not a lot of data or features (i.e. categories).

I A ‘flat’ database.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement; Possibly transferred onto cards.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Generating word lists. I Basic searches.

I One record in a paper form = One row (“record”) in computerized table of data. I Adequate for a simple applications with not a lot of data or features (i.e. categories).

I A ‘flat’ database.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement; Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) or spreadsheet (MS Excel).

Database Theory and Design Day 1: Introduction to Database Theory and Design I Generating word lists. I Basic searches.

I Adequate for a simple applications with not a lot of data or features (i.e. categories).

I A ‘flat’ database.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement; Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) or spreadsheet (MS Excel).

I One record in a paper form = One row (“record”) in computerized table of data.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Generating word lists. I Basic searches.

I A ‘flat’ database.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement; Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) or spreadsheet (MS Excel).

I One record in a paper form = One row (“record”) in computerized table of data. I Adequate for a simple applications with not a lot of data or features (i.e. categories).

Database Theory and Design Day 1: Introduction to Database Theory and Design I A ‘flat’ database.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement; Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) or spreadsheet (MS Excel).

I One record in a paper form = One row (“record”) in computerized table of data. I Adequate for a simple applications with not a lot of data or features (i.e. categories).

I Generating word lists. I Basic searches.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement; Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) or spreadsheet (MS Excel).

I One record in a paper form = One row (“record”) in computerized table of data. I Adequate for a simple applications with not a lot of data or features (i.e. categories).

I Generating word lists. I Basic searches.

I A ‘flat’ database.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Identifying and eliminating incorrect entries. I Inconsistency. I Unmanageable file size (difficult to transfer), and potential memory problems.

I Inflexible. I Difficult to expand. I Many redundant data entries

I Can become overwhelming complex, and unstable along with the burden of maintaining the database.

I For language data: can obscure potentially meaningful implications, relationships and generalizations.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Limitations of a Flat database

I You find you need more out of your data:

Database Theory and Design Day 1: Introduction to Database Theory and Design I Identifying and eliminating incorrect entries. I Inconsistency. I Unmanageable file size (difficult to transfer), and potential memory problems.

I Many redundant data entries

I Can become overwhelming complex, and unstable along with the burden of maintaining the database.

I For language data: can obscure potentially meaningful implications, relationships and generalizations.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Limitations of a Flat database

I You find you need more out of your data:

I Inflexible. I Difficult to expand.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Identifying and eliminating incorrect entries. I Inconsistency. I Unmanageable file size (difficult to transfer), and potential memory problems.

I Can become overwhelming complex, and unstable along with the burden of maintaining the database.

I For language data: can obscure potentially meaningful implications, relationships and generalizations.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Limitations of a Flat database

I You find you need more out of your data:

I Inflexible. I Difficult to expand. I Many redundant data entries

Database Theory and Design Day 1: Introduction to Database Theory and Design I Can become overwhelming complex, and unstable along with the burden of maintaining the database.

I For language data: can obscure potentially meaningful implications, relationships and generalizations.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Limitations of a Flat database

I You find you need more out of your data:

I Inflexible. I Difficult to expand. I Many redundant data entries

I Identifying and eliminating incorrect entries. I Inconsistency. I Unmanageable file size (difficult to transfer), and potential memory problems.

Database Theory and Design Day 1: Introduction to Database Theory and Design I For language data: can obscure potentially meaningful implications, relationships and generalizations.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Limitations of a Flat database

I You find you need more out of your data:

I Inflexible. I Difficult to expand. I Many redundant data entries

I Identifying and eliminating incorrect entries. I Inconsistency. I Unmanageable file size (difficult to transfer), and potential memory problems.

I Can become overwhelming complex, and unstable along with the burden of maintaining the database.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Limitations of a Flat database

I You find you need more out of your data:

I Inflexible. I Difficult to expand. I Many redundant data entries

I Identifying and eliminating incorrect entries. I Inconsistency. I Unmanageable file size (difficult to transfer), and potential memory problems.

I Can become overwhelming complex, and unstable along with the burden of maintaining the database.

I For language data: can obscure potentially meaningful implications, relationships and generalizations.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Limitations of a Flat database cont.

Word Gloss Gram. Morph. hon ‘fish’ N ROOT smax ‘bear, meat’ N ROOT algyax ‘language’ N ROOT sm-algyax’ Gitksan N STEM sm-’ ‘true’ A PREFIX siipxw ‘sick, ill’ A ROOT wii-nakw’ ‘tall’ A STEM wii- ‘long’ A PREFIX nakw’ DISTAL ROOT nakw’ EVIDENTIAL ROOT x- ‘consume’ V PREFIX iixwt ‘fish’ V ROOT witxw ‘arrive’ V ROOT bakw ‘arrive’ V ROOT litsxxw ‘read’ V ROOT =hl common noun Det. ENCLITIC =t proper noun Det. ENCLITIC =tip plural noun Det. ENCLITIC -y’ 1sg Agr. SUFFIX -n 2sg Agr. SUFFIX -t 3 Agr. SUFFIX Table: A ‘Flat’ Database of a Gitksan (Tsimshianic) word list

Database Theory and Design Day 1: Introduction to Database Theory and Design I. Database Management System (DBMS) II. An application to interact with the DBMS.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications The Solution:

I Separate the flat database into two interacting systems:

Database Theory and Design Day 1: Introduction to Database Theory and Design II. An application to interact with the DBMS.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications The Solution:

I Separate the flat database into two interacting systems: I. Database Management System (DBMS)

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications The Solution:

I Separate the flat database into two interacting systems: I. Database Management System (DBMS) II. An application to interact with the DBMS.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Handles physical details of storing data efficiently I Delivers & manipulates data for applications I Security and stability

I Oracle I Microsoft SQL Server

I Efficient storage I Maintains ‘just enough’ redundancy I Principle focus: handling data

I Several “industrial-strength” DBMS:

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications I. Database Management System (DBMS)

I Keeps data in small, unique chunks

Database Theory and Design Day 1: Introduction to Database Theory and Design I Handles physical details of storing data efficiently I Delivers & manipulates data for applications I Security and stability

I Oracle I Microsoft SQL Server

I Principle focus: handling data

I Several “industrial-strength” DBMS:

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications I. Database Management System (DBMS)

I Keeps data in small, unique chunks

I Efficient storage I Maintains ‘just enough’ redundancy

Database Theory and Design Day 1: Introduction to Database Theory and Design I Oracle I Microsoft SQL Server

I Handles physical details of storing data efficiently I Delivers & manipulates data for applications I Security and stability I Several “industrial-strength” DBMS:

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications I. Database Management System (DBMS)

I Keeps data in small, unique chunks

I Efficient storage I Maintains ‘just enough’ redundancy I Principle focus: handling data

Database Theory and Design Day 1: Introduction to Database Theory and Design I Oracle I Microsoft SQL Server

I Several “industrial-strength” DBMS:

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications I. Database Management System (DBMS)

I Keeps data in small, unique chunks

I Efficient storage I Maintains ‘just enough’ redundancy I Principle focus: handling data

I Handles physical details of storing data efficiently I Delivers & manipulates data for applications I Security and stability

Database Theory and Design Day 1: Introduction to Database Theory and Design I Oracle I Microsoft SQL Server

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications I. Database Management System (DBMS)

I Keeps data in small, unique chunks

I Efficient storage I Maintains ‘just enough’ redundancy I Principle focus: handling data

I Handles physical details of storing data efficiently I Delivers & manipulates data for applications I Security and stability I Several “industrial-strength” DBMS:

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications I. Database Management System (DBMS)

I Keeps data in small, unique chunks

I Efficient storage I Maintains ‘just enough’ redundancy I Principle focus: handling data

I Handles physical details of storing data efficiently I Delivers & manipulates data for applications I Security and stability I Several “industrial-strength” DBMS:

I Oracle I Microsoft SQL Server

Database Theory and Design Day 1: Introduction to Database Theory and Design I Structured (SQL): a standardized language that uses user-defined functions to query the data. I Generates reports in form of a table or pivot table.

I The DBMS stores data and responds to queries – we don’t interact with it directly. I DBMSs are used with a “client” application: MS Access, FileMaker Pro etc. These create a graphical user interface to interact with the data through forms and reports. I A language to query data from a DBMS:

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:

Database Theory and Design Day 1: Introduction to Database Theory and Design I Structured Query Language (SQL): a standardized language that uses user-defined functions to query the data. I Generates reports in form of a table or pivot table.

I DBMSs are used with a “client” application: MS Access, FileMaker Pro etc. These create a graphical user interface to interact with the data through forms and reports. I A language to query data from a DBMS:

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:

I The DBMS stores data and responds to queries – we don’t interact with it directly.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Structured Query Language (SQL): a standardized language that uses user-defined functions to query the data. I Generates reports in form of a table or pivot table.

I A language to query data from a DBMS:

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:

I The DBMS stores data and responds to queries – we don’t interact with it directly. I DBMSs are used with a “client” application: MS Access, FileMaker Pro etc. These create a graphical user interface to interact with the data through forms and reports.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Structured Query Language (SQL): a standardized language that uses user-defined functions to query the data. I Generates reports in form of a table or pivot table.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:

I The DBMS stores data and responds to queries – we don’t interact with it directly. I DBMSs are used with a “client” application: MS Access, FileMaker Pro etc. These create a graphical user interface to interact with the data through forms and reports. I A language to query data from a DBMS:

Database Theory and Design Day 1: Introduction to Database Theory and Design I Generates reports in form of a table or pivot table.

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:

I The DBMS stores data and responds to queries – we don’t interact with it directly. I DBMSs are used with a “client” application: MS Access, FileMaker Pro etc. These create a graphical user interface to interact with the data through forms and reports. I A language to query data from a DBMS:

I Structured Query Language (SQL): a standardized language that uses user-defined functions to query the data.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:

I The DBMS stores data and responds to queries – we don’t interact with it directly. I DBMSs are used with a “client” application: MS Access, FileMaker Pro etc. These create a graphical user interface to interact with the data through forms and reports. I A language to query data from a DBMS:

I Structured Query Language (SQL): a standardized language that uses user-defined functions to query the data. I Generates reports in form of a table or pivot table.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Consists of several tables (relations) linked to each other.

I Not as a table but as a tree structure, similar to folders and subfolders in an operating system: each unit “belongs” to some larger unit, and contains smaller units.

I Data are modeled as objects of various types that share or inherit properties according to their type I For example, a database about word classes could let objects of the type transitive verb inherit properties of the type verb.

I A Relational database

I A Hierarchical database

I An Object-Oriented database database

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Four types of database models:

I A Flat database I Made of a single table, or “file”. I Each row corresponds to some object (e.g., a language) being described, and each column represents a property (attribute), such as name, location, or word order etc..

Database Theory and Design Day 1: Introduction to Database Theory and Design I Not as a table but as a tree structure, similar to folders and subfolders in an operating system: each unit “belongs” to some larger unit, and contains smaller units.

I Data are modeled as objects of various types that share or inherit properties according to their type I For example, a database about word classes could let objects of the type transitive verb inherit properties of the type verb.

I A Hierarchical database

I An Object-Oriented database database

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Four types of database models:

I A Flat database I Made of a single table, or “file”. I Each row corresponds to some object (e.g., a language) being described, and each column represents a property (attribute), such as name, location, or word order etc.. I A Relational database I Consists of several tables (relations) linked to each other.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Data are modeled as objects of various types that share or inherit properties according to their type I For example, a database about word classes could let objects of the type transitive verb inherit properties of the type verb.

I An Object-Oriented database database

Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Four types of database models:

I A Flat database I Made of a single table, or “file”. I Each row corresponds to some object (e.g., a language) being described, and each column represents a property (attribute), such as name, location, or word order etc.. I A Relational database I Consists of several tables (relations) linked to each other. I A Hierarchical database I Not as a table but as a tree structure, similar to folders and subfolders in an operating system: each unit “belongs” to some larger unit, and contains smaller units.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline The ‘Flat’ database The Database as a Concept and Tool The Database Management System (DBMS) The Database model and its Evolution Types of Databases Models Database Applications Four types of database models:

I A Flat database I Made of a single table, or “file”. I Each row corresponds to some object (e.g., a language) being described, and each column represents a property (attribute), such as name, location, or word order etc.. I A Relational database I Consists of several tables (relations) linked to each other. I A Hierarchical database I Not as a table but as a tree structure, similar to folders and subfolders in an operating system: each unit “belongs” to some larger unit, and contains smaller units. I An Object-Oriented database database I Data are modeled as objects of various types that share or inherit properties according to their type I For example, a database about word classes could let objects of the type transitive verb inherit properties of the type verb.

Database Theory and Design Day 1: Introduction to Database Theory and Design I The network database: WordNet

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Two types of database applications:

I Stand-alone desktop databases: MS Access

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Two types of database applications:

I Stand-alone desktop databases: MS Access

I The network database: WordNet

Database Theory and Design Day 1: Introduction to Database Theory and Design I A stand-alone software with a graphical user interface for both the database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up, sent by email, etc.

I The ony requirement is a desktop computer with the database application; software is easy to install or already present, and it is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Stand-alone desktop databases

I Suitable for the one-person research project.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up, sent by email, etc.

I The ony requirement is a desktop computer with the database application; software is easy to install or already present, and it is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for both the database configuration, and to create forms and queries.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Everything fits in one file or folder, and can be backed up, sent by email, etc.

I The ony requirement is a desktop computer with the database application; software is easy to install or already present, and it is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for both the database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

Database Theory and Design Day 1: Introduction to Database Theory and Design I The ony requirement is a desktop computer with the database application; software is easy to install or already present, and it is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for both the database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up, sent by email, etc.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for both the database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up, sent by email, etc.

I The ony requirement is a desktop computer with the database application; software is easy to install or already present, and it is not necessary to set up a server.

Database Theory and Design Day 1: Introduction to Database Theory and Design I MS Access, FileMaker Pro, OpenOffice Calc.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for both the database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up, sent by email, etc.

I The ony requirement is a desktop computer with the database application; software is easy to install or already present, and it is not necessary to set up a server.

I Internet collaboration possible but not required.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for both the database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up, sent by email, etc.

I The ony requirement is a desktop computer with the database application; software is easy to install or already present, and it is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Database Theory and Design Day 1: Introduction to Database Theory and Design I A web-based interface (i.e. a web browser) I A server. (running PHP to manage the queries and generate the web pages) I The database. (MySQL)

I A modular system of three parts:

I Most of the same functions with stand-alone databases can be used in network databases.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Network databases

I Ideal when multiple people must collaborate on data entry.

Database Theory and Design Day 1: Introduction to Database Theory and Design I A web-based interface (i.e. a web browser) I A server. (running PHP to manage the queries and generate the web pages) I The database. (MySQL)

I Most of the same functions with stand-alone databases can be used in network databases.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Network databases

I Ideal when multiple people must collaborate on data entry. I A modular system of three parts:

Database Theory and Design Day 1: Introduction to Database Theory and Design I Most of the same functions with stand-alone databases can be used in network databases.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Network databases

I Ideal when multiple people must collaborate on data entry. I A modular system of three parts:

I A web-based interface (i.e. a web browser) I A server. (running PHP to manage the queries and generate the web pages) I The database. (MySQL)

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Network databases

I Ideal when multiple people must collaborate on data entry. I A modular system of three parts:

I A web-based interface (i.e. a web browser) I A server. (running PHP to manage the queries and generate the web pages) I The database. (MySQL)

I Most of the same functions with stand-alone databases can be used in network databases.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Pros: Free, with more or less the same functionality as a stand-alone, proprietary database. I Cons: Extensive computer knowledge required (i.e. setting up a server, making the connections, knowledge of HTML)

I Network databases

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Comparing Pros and Cons

I Stand-alone databases

I Pros: Can be implemented quickly and easily. I Cons: Can be expensive and proprietary.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Comparing Pros and Cons

I Stand-alone databases

I Pros: Can be implemented quickly and easily. I Cons: Can be expensive and proprietary. I Network databases

I Pros: Free, with more or less the same functionality as a stand-alone, proprietary database. I Cons: Extensive computer knowledge required (i.e. setting up a server, making the connections, knowledge of HTML)

Database Theory and Design Day 1: Introduction to Database Theory and Design I Ability to import and export data (i.e. text, XML files). I Are the pre-defined and/or user-defined options helpful? Can they be easily modified? I Is the application scalable? I Is it relational?

I Unicode compatibility, special character input methods, and the ease of character input. I Ability to handle texts and texts, interlinearized material. I Allows you to follow the best practices for archiving linguistic data (i.e. XML, E-MELD emeld.org).

I Technical:

I Linguistic:

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Criteria

I General: I Who produced the software, which platforms the software runs on? Is other software needed? I Is it easy to use? Is it well-supported/documented? Cost?

Database Theory and Design Day 1: Introduction to Database Theory and Design I Unicode compatibility, special character input methods, and the ease of character input. I Ability to handle texts and texts, interlinearized material. I Allows you to follow the best practices for archiving linguistic data (i.e. XML, E-MELD emeld.org).

I Linguistic:

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Criteria

I General: I Who produced the software, which platforms the software runs on? Is other software needed? I Is it easy to use? Is it well-supported/documented? Cost? I Technical: I Ability to import and export data (i.e. text, XML files). I Are the pre-defined and/or user-defined options helpful? Can they be easily modified? I Is the application scalable? I Is it relational?

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Criteria

I General: I Who produced the software, which platforms the software runs on? Is other software needed? I Is it easy to use? Is it well-supported/documented? Cost? I Technical: I Ability to import and export data (i.e. text, XML files). I Are the pre-defined and/or user-defined options helpful? Can they be easily modified? I Is the application scalable? I Is it relational? I Linguistic: I Unicode compatibility, special character input methods, and the ease of character input. I Ability to handle texts and texts, interlinearized material. I Allows you to follow the best practices for archiving linguistic data (i.e. XML, E-MELD emeld.org).

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings Databases designed for linguistics

I Stand-alone: SIL Shoebox 5.0 with Toolbox 1.2

I Runs on both Windows and Mac. Proprietary, but not too expensive. I Not very well supported, problems exporting XML files. I A native environment for text interlinearization and analysis. I Uses filter-type searches, not structured queries.

Database Theory and Design Day 1: Introduction to Database Theory and Design I MS Access: powerful and customizable form and query tools. Proprietary and not cheap. I FileMaker Pro: also with customizable form and query tools. Proprietary and not cheap. I OpenOffice Calc: less features than Access or FileMaker, but has the same core functionality. Open source (free), but somewhat unstable.

I Network: MySQL (http://www.mysql.com/); Apache server with PHP; Google Chrome – all free.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Stand-alone, relational databases:

Database Theory and Design Day 1: Introduction to Database Theory and Design I FileMaker Pro: also with customizable form and query tools. Proprietary and not cheap. I OpenOffice Calc: less features than Access or FileMaker, but has the same core functionality. Open source (free), but somewhat unstable.

I Network: MySQL (http://www.mysql.com/); Apache server with PHP; Google Chrome – all free.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Stand-alone, relational databases:

I MS Access: powerful and customizable form and query tools. Proprietary and not cheap.

Database Theory and Design Day 1: Introduction to Database Theory and Design I OpenOffice Calc: less features than Access or FileMaker, but has the same core functionality. Open source (free), but somewhat unstable.

I Network: MySQL (http://www.mysql.com/); Apache server with PHP; Google Chrome – all free.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Stand-alone, relational databases:

I MS Access: powerful and customizable form and query tools. Proprietary and not cheap. I FileMaker Pro: also with customizable form and query tools. Proprietary and not cheap.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Network: MySQL (http://www.mysql.com/); Apache server with PHP; Google Chrome – all free.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Stand-alone, relational databases:

I MS Access: powerful and customizable form and query tools. Proprietary and not cheap. I FileMaker Pro: also with customizable form and query tools. Proprietary and not cheap. I OpenOffice Calc: less features than Access or FileMaker, but has the same core functionality. Open source (free), but somewhat unstable.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Stand-alone, relational databases:

I MS Access: powerful and customizable form and query tools. Proprietary and not cheap. I FileMaker Pro: also with customizable form and query tools. Proprietary and not cheap. I OpenOffice Calc: less features than Access or FileMaker, but has the same core functionality. Open source (free), but somewhat unstable.

I Network: MySQL (http://www.mysql.com/); Apache server with PHP; Google Chrome – all free.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Access, FileMaker Pro, and Calc do not handle texts well. I Access is more constrained than FileMaker or Calc – less possibility for introducing errors or inconsistencies. I FileMaker and Calc are suited to smaller, less-complex projects. I All are XML compatible and network ready.

I Some more pros/cons and comparisons:

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Access, FileMaker Pro, and Calc are all suitable databases for linguistic analysis: they are all relational and can handle SQL queries.

Database Theory and Design Day 1: Introduction to Database Theory and Design I Access, FileMaker Pro, and Calc do not handle texts well. I Access is more constrained than FileMaker or Calc – less possibility for introducing errors or inconsistencies. I FileMaker and Calc are suited to smaller, less-complex projects. I All are XML compatible and network ready.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Access, FileMaker Pro, and Calc are all suitable databases for linguistic analysis: they are all relational and can handle SQL queries. I Some more pros/cons and comparisons:

Database Theory and Design Day 1: Introduction to Database Theory and Design I Access is more constrained than FileMaker or Calc – less possibility for introducing errors or inconsistencies. I FileMaker and Calc are suited to smaller, less-complex projects. I All are XML compatible and network ready.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Access, FileMaker Pro, and Calc are all suitable databases for linguistic analysis: they are all relational and can handle SQL queries. I Some more pros/cons and comparisons:

I Access, FileMaker Pro, and Calc do not handle texts well.

Database Theory and Design Day 1: Introduction to Database Theory and Design I FileMaker and Calc are suited to smaller, less-complex projects. I All are XML compatible and network ready.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Access, FileMaker Pro, and Calc are all suitable databases for linguistic analysis: they are all relational and can handle SQL queries. I Some more pros/cons and comparisons:

I Access, FileMaker Pro, and Calc do not handle texts well. I Access is more constrained than FileMaker or Calc – less possibility for introducing errors or inconsistencies.

Database Theory and Design Day 1: Introduction to Database Theory and Design I All are XML compatible and network ready.

Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Access, FileMaker Pro, and Calc are all suitable databases for linguistic analysis: they are all relational and can handle SQL queries. I Some more pros/cons and comparisons:

I Access, FileMaker Pro, and Calc do not handle texts well. I Access is more constrained than FileMaker or Calc – less possibility for introducing errors or inconsistencies. I FileMaker and Calc are suited to smaller, less-complex projects.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings The Players

I Access, FileMaker Pro, and Calc are all suitable databases for linguistic analysis: they are all relational and can handle SQL queries. I Some more pros/cons and comparisons:

I Access, FileMaker Pro, and Calc do not handle texts well. I Access is more constrained than FileMaker or Calc – less possibility for introducing errors or inconsistencies. I FileMaker and Calc are suited to smaller, less-complex projects. I All are XML compatible and network ready.

Database Theory and Design Day 1: Introduction to Database Theory and Design Outline Choosing the right Database Application The Database as a Concept and Tool Linguistic DBMS and Interfaces The Database model and its Evolution Non-Linguistic DBMS and Interfaces Database Applications References and Suggested Readings

I There are countless resources on the web on , theory, and implementation. I Specific references on linguistic databases:

I Ferrara, M. & Moran, S. 2004. Review of DBMS for Linguistic Purposes. Proceedings of E-MELD 2004. Online publication, at http://www.linguistlist.org/emeld/workshop/2004/proceedings.html. I Nerbonne, John. 1998. Linguistic Databases, CSLI, Stanford. I Everaert, Musgrave, Dimitriadis (eds) 2009. The Use of Databases in Cross-Linguistic Studies. Empirical Approaches to Language Typology (EALT) 41. Mouton de Gruyter.

Database Theory and Design Day 1: Introduction to Database Theory and Design