NATURAL RESOURCES ANALYSIS PROGRAM (NRAP)

FLORA DATA AND MODELLING FOR CAPE YORK PENINSULA

Maria Cofinas, M.P. Bolton, A.J. Bryett, D.C. Crossley and A.L. Bull Environmental Resources information Network Department of the Environment, Sport and Temtories 1995

CYPLUS is a joint initiative of the Queensland and Commonwealth Governments CAPE YORK PENINSULA LAND USE STRATEGY (CYPLUS)

Natural Resources Analysis Program

FLORA DATA AND MODELLING FOR CAPE YORK PENINSULA

Maria Cofmas, M.P. Bolton, A.J. Bryett, D.C. Crossley and A.L. Bull Environmental Resources Information Network Department of the Environment, Sport and Territories 1995

CYPLUS is a joint initiative of the Queensland and Commonwealth Governments

Final report on project:

NRlS - FLORA DATA AND MODELLING Recommended citation:

Cofinas, Maria, Bolton, M.P., Bryett, A.J., Crossley, D.C., and Bull, A.L. (1995). 'Flora Data and Modelling for Cape York Peninsula'. (Cape York Peninsula Land Use Strategy, Office of the Co-ordinator General of Queensland, Brisbane, Department of the Environment, Sport and Territories, Canberra.)

Note:

Due to the timing of publication, reports on other CYPLUS projects may not be fully cited in the REFERENCES section. However, they should be able to be located by author, agency or subject.

ISBN 0 7242 6214 8

@ The State of Queensland and Commonwealth of 1995.

Copyright protects this publication. Except for purposes permitted by the Copyright Act 1968, no part may be reproduced by any means without the prior written permission of the Office of the Co-ordinator General of Queensland and the Australian Government Publishing Service. Requests and inquiries concerning reproduction and rights should be addressed to:

Office of the Co-ordinator General, Government of Queensland PO Box 185 BRISBANE ALBERT STREET Q 4002

The Manager, Commonwealth Information Services GPO Box 84 CANBERRA ACT 2601 CAPE YORK PENINSULA LAND USE STRATEGY STAGE I

PREFACE TO PROJECT REPORTS f.: '! Cape York Peninsula Land Use Strategy (CYPLUS) is an initiative to provide a basis for public participation In phmhg for the ecologWy sustaimb1e development of Cape York Peninsula. It is jointly funded by the Queensland and Commonwealth Governments and is being cdout in thee stages:

Stage I - information gathering; Stage II - development of principles, policies and processes; and Stage 111 - implementation and review.

The project dealt with in this report is a part of Stage I of CYPLUS. The main components of Stage I of CYPLUS consist of two data collection programs, the development of a Geographic Information System (GIs) and the establishment of processes for public participation.

The data collection and collation work was conducted within two broad programs, the Natural Resources Analysis Program (NRAP) and the Land Use Program (LUP). The project reported on here forms part of one of these programs.

The objectives of NRAP were to collect and interpret base data on the natural resources of Cape York Peninsula to provide input to:

evaluation of the potential of those resources for a range of activities related to the use and management of land in line with economic, environmental and social values; and formulation of the land use policies, principles and processes of CYPLUS.

Projects examining both physical and biological resources were included in NRAP together with Geographic Information System (GIs) projects. NRAP projects are listed in the following Table.

1 Physical ResourceIGIS Projects I Biological Resource Projects 1 Bedrock geological data - digitising and Vegetation mapping (NR01) integration (NR05) Airborne geophysical survey (NR15) Marine (seagrasslmangrove) distribution (NR06) Coastal environment geoscience survey Insect fauna survey (NR17) (NR14) Mineral resource inventory (NR04) Fish fauna survey (NR10) Water resource investigation (groundwater) Terrestrial vertebrate fauna survey (NR03) (NR16) Regolith terrain mapping (NR12) Wetland fauna survey (NR09) I Physical ResourceIGIS Projects Biological Resource Projects Land resource inventory (NR02) Flora data and modelling (NR18) Environmental region analysis (NR11) Fauna distribution modelling (NR19) CYPLUS data into NRIC database FINDAR Golden-shouldered parrot conservation (NR20) management (NR2 1) Queensland GIs development and maintenance (NR08) * GIS creationlmaintenance (NR07)

* These projects are accumulating and storing all Stage I data that is submitted in GIs compatible formats.

Research priorities for the LUP were set through the public participation process with the objectives of:

collecting information on a wide range of social, cultural, economic and environmental issues relevant to Cape York Peninsula; and highlighting interactions between people, land (resource use) and nature sectors.

Projects were undertaken within these sector areas and are listed in the following Table.

People Projects Land Projects Nature Projects

Population Current land use Surface water resources Transport services and Land tenure Fire infrastructure Values, needs and aspirations Indigenous management of land Feral and pest animals and sea Services and infrastructure Pastoral industry Weeds Economic assessment Primary industries (non-pastoral, Land degradation and soil non-forestry) erosion Secondary and tertiary industries Forest resources Conservation and natural heritage assessment Traditional activities Commercial and non commercial Conservation and National Park fisheries management Current administrative structures Mineral resource potential and mining industry Tourism industry TABLE OF CONTENTS

LIST OF FIGURES

LIST OF TABLES

LIST OF MAPS

ABBREVIATIONS

EXECUTIVE SUMMARY vii

1.0 INTRODUCTION

1.1 Identification of user requirements 1.2 Status of flora data from Cape York Peninsula 1.3 Aims of the project

2.0 METHODS

2.1 Point data versus polygon data 2.2 Overview of methods 2.3 Collation 2.3.1 Identification 2.3.2 EvaluationfNegotiation 2.3.3 Specification 2.3.4 ConversiodDatabasing 2.3.5 Supply (to ERIN) 2.3.6 Feedback to custodian 2.4 Taxon Listing 2.5 Integration 2.5.1 Loading data into ENS 2.5.2 Duplication control 2.6 Packaging 2.6.1 Database design 2.6.2 Database building 2.6.3 Data documentation 2.6.4 Geocode data validation 2.6.5 Additional (value-added) data 2.6.6 Data extraction and GIs tools 2.7 Analysis 2.7.1 Species distribution mapping 2.7.2 Development of adequacy of sampling tools 2.7.3 VISTR analyses 2.7.4 SPAM analyses 2.7.5 Species distribution modelling tools 2.7.6 Species distribution modelling analyses 3.0 RESULTS - DATABASE BUILDING

3.1 Data acquisition 3.1.1 Identified datasets 3.1.2 Acquired datasets 3.1.3 Excluded datasets 3.2 Documentation (metadata) on source datasets 3.3 Value-adding and packaging 3.3.1 Geocode data validation 3.3.2 Applied categories 3.3.3 Data extraction and GIs tools 3.4 Summary of consolidated CYP flora dataset

4.0 RESULTS - ANALYSIS

4.1 Species distribution mapping 4.1.1 Endangered and Vulnerable 4.1.2 Life forms 4.2 VISTR analyses 4.2.1 Total flora versus 5 group regionalisation 4.2.2 Life forms versus 5 group regionalisation 4.3 SPAM analyses 4.4 Distribution modelling 4.4.1 BIOCLIM 4.4.2 GARP

5.0 DISCUSSION

5.1 Data provided to CYPLUS Flora & Fauna GIs 5.1.1 Data quality 5.1.2 CYPLUS Flora & Fauna GIs 5.1.3 Value-added data 5.2 Analyses 5.2.1 Species distributions 5.2.2 Species modelling 5.2.3 Identifying gaps in the data 5.3 Potential Uses in CYPLUS Stages I LUP and II 5.3.1 CYPLUS StageILUP 5.3.2 CYPLUS Stage I. 5.4 Significance of results 5.4.1 Evaluation against objectives 5.4.2 Scientific or methodological advances 5.4.3 General community benefits

6.0 ACKNOWLEDGEMENTS

7.0 REFERENCES 8.0 APPENDICES

Appendix I Cape York Peninsula Flora Data: Expressions of Interest Ia Advertisement Ib Guidelines and Selection Criteria Appendix II Project Information Brochure Appendix III ERIN Contract for CYPLUS Data Appendix IV Data File Format Appendix V Data LoadingNalidation Process at ERIN Appendix VI Data Acquisition Contracts Appendix VII CYP Floramauna Metadata Instructions Appendix VIIIDocumentation (Metadata) on Source Flora Datasets Vma CYPLUS - ANBG CBG Flora 1.0 VIIIb CYPLUS - Gitay Flora : Archer 1.0 VIIIc LANDCOVER - Atherton herbarium 1.0 VIIId LANDCOVER - Aust Nat Bot Gard 1.0 VIIIe CYPLUS - Bunt Mangroves 1.0 VIIIf CYP - Comalco Herbarium Plants 1.0 Vmg LANDCOVER - CSIRO PI Herb Pt 2 1.0 Vmh CYPLUS - Gitay Flora : Jardine 1.0 VIIIi LANDCOVER-CSIRO PI Herb CANE3 1.0 VIIIj LANDCOVER - Melb. herb. - Eucs 1.0 VIIIk MANGROVE ATLAS 1.0 VIIIl CYP - QDEH Detailed Veg Survey 1.0 VIIIm CYP - QDEH Obs Veg Survey 1.0 VIIIn CYP-NSWHERB PLANTS 1.0 VIIIo NT Herbarium 1.0 VIIIp LANDCOVER - Queensland Herb. 1.0 VIIIq ROTAP - Queensland Herbarium 1.0 VIIIr LANDCOVER - CSIRO Tree Seed C. 1.0 VlIIs ROTAP - CSIRO PI Herb CANB 1.0 VIIIt CYPLUS - Gitay Flora : Weipa 1.0 Appendix IX Documentation of Whole Flora Dataset Appendix X Flora data summaries Appendix XI Species list of Palms in each Region LIST OF FIGURES

Figure 1: Legend for schema in Figures 2 to 5. 4 Figure 2: Overview scheme for production of flora component of CYPLUS GIs. 5 Figure 3: Scheme for Collation of each point-based dataset. 7 Figure 4: Scheme for Taxon Listing for the flora of Cape York Peninsula. 10 Figure 5: Scheme for Integration of point-based flora datasets. 11 Figure 6: Species accumulation curves for Palm records. 3 1 Figure 7: Histogram and table of percentage area and sites across the annual mean temperature classes in CYP. 32 Figure 8: Visualisation of predicted distribution model ofMacarmgapolyadenia using BIOCLlM rules in GARP. after 34 Figure 9: Visualisation of first distribution model ofMacarangapolyadenia using GARP rules. after 34 Figure 10: Visualisation of second distribution model ofMacarangapolyadenia using GARP rules. after 36

LIST OF TABLES

Table 1: Flora dataset identification and acquisition summary - point data. 19 Table 2: Summary of flora dataset by life forms. 23 Table 3: Summary of Rare or Threatened plants in CYP. 24 Table 4: Endangered plant taxa in CYP. 25 Table 5: Vulnerable plant taxa in CYP. 25 Table 6: High level summary of components of flora dataset as in ERIS. 27 Table 7: Acquired data summarised by source dataset (as named in ERIS DDC). 27 Table 8: Climate profile for Macaranga polyadenia. 33

LIST OF MAPS

Map 1: Gridded analysis of Trees - based on number of specimens per cell. after 26 Map 2: Gridded analysis of Trees - based on number of taxa per cell. after 26 Map 3 : Distribution of endangered flora. after 3 0 Map 4: Distribution of all vulnerable flora. after 30 Map 5: Distribution of Palm records against 5 group regionalisation. after 30 Map 6: Inadequately sampled climate classes - Annual mean temperature by vegetation sites. after 32 Map 7: BIOCLIM modelling - Macaranga polyadenia. after 32 ABBREVIATIONS

ANCA Australian Nature Conservation Agency

CSIRO Commonwealth Scientific and Industrial Research Organisation

CYP Cape York Peninsula

CYPLUS Cape York Peninsula Land Use Strategy

DWE CSlRO Division of Wildlife and Ecology

DDC Data Dictionary and Catalogue

ERIN Environmental Resources Information Network

ERIS Environmental Resources Information System

GIs Geographic Information Systems

JCrJNQ James Cook University of North Queensland

LUP Land Use Program (of CYPLUS)

NRAP Natural Resources Analysis Program (of CYPLUS)

RDBMS Relational database Management System

ROTAP Rare or Threatened Australian Plants

SIS Spatial Information Systems

vii

EXECUTIVE SUMMARY

Introduction

The Flora Data and Modelling Project (NR18) was one of three projects undertaken by ERIN for the Natural Resources Analysis Program of CYPLUS Stage I.

At the commencement of this project, existing datdinformation about the flora on Cape York Peninsula was widely scattered across numerous agencies and individuals in many different formats. This information was seen as a valuable resource not utilised to its full potential and it would provide a base upon which to undertake further surveys across CYP. The purpose of this project was therefore to make information on the flora of Cape York Peninsula (CYP) available to environmental decision-makers in the State and Commonwealth in digital form through Relational Database Management Systems @DBMS) and Geographic Information Systems (GIs).

The objectives of the project were:

1. to collate existing (pre-Cape York Peninsula Land Use Strategy (CYPLUS) data on the locations of CYP plant species from a variety of sources;

2. to integrate existing plant data with newly collected data from NRAP project NROl Vegetation Survey of North ~ueensland;

3. to perform some simple analyses to identify gaps in the data and predict the likely distribution of selected species; and

4. to provide suitable computer tools for accessing flora data in the CYPLUS Geographic Information System (GIs).

Methods

The major sources of flora data were identified and targeted to assess their suitability for inclusion into the flora database. Using a working list of CYP flora datasets and likely custodians of same, ERIN project staff contacted each data custodian and established the general particulars of each dataset. If the dataset was point-based, ERIN commenced negotiation for access to the data for CYPLUS and asked more detailed questions as to the precise nature and coverage of the data.

Each dataset was evaluated against specifications and priorities already established, and if suitable, access was negotiated with each Custodian.

Flora data was thus collated from existing paper records, digital versions in other databases, such as the Queensland Herbarium, and integrated with new data collected on the Cape from project NROl: Vegetation Survey of Cape York Peninsula. The existing plant species locations and other associated information were databased (in most cases by the data custodian) in a standardised Relational Database Management System (RDBMS) format and loaded into ERIN'S Environmental Resources Information System (ERIS). The data were spatially referenced and hence suitable for use in a GIs.

Loading data into ERIS involved a number of standard tests for taxonomic and locational accuracy. To check the taxonomic accuracy of the flora data, the current list of plants from the Queensland Herbarium was used and to check locational accuracy, a mainland/offshore test for each record was run at ERIN.

Data documentation

The characteristics, strengths and weaknesses of each source dataset were documented progressively in the ERIS Data Dictionary & Catalogue @DC) program. This documentation enabled management of the data at ERIN by project staff and would be provided to users of the data.

Additional value-added data

The second geocode validation test (BIOCLIM outlier detection test) was not compulsory; it was performed only on selected taxonomic groups whose records were classed as 'mainland' by the test mentioned above. The BIOCLIM outlier detection test was run for all records across a particular species. Those records that were found to be an acceptable climatic domain of the set received a validation status of 'V' (for valid) and outliers were assigned 'St (for suspect).

Where available, additional information on each species was loaded into the database from existing digital sources or keyed in by ERIN project staff using paper sources. Data such as common names, growth form and conservation status were entered in separate fields or tables, as appropriate. This additional information was designed to enhance user-friendly retrieval of data from the CYPLUS Flora databaselGIs and to provide various non-taxonomic aggregations of the integrated dataset, for summary purposes.

CYPLUS Flora & Fauna GIs

A stand alone GIs, running with ARCDIWO software was required to store the flora and fauna data collated across CYP. Tools designed to access and perform simple analyses of the data were planned and constructed.

Analysis

Species Distribution Mapping. Some flora species considered to be important or of priority for management were selected for plotting known distributions. Simple distribution maps of these species could be used to identifjl and target areas of biological importance on the Cape.

Development of adequacy of sampling tools. Tools to assist with the analysis of the flora data were developed at ERIN. The two developed, help assess biases in the data collected to date and integrate GIs, RDBMS, statistics packages and other tools on the ERIN UNIX platform. The Visualisation of Samples, Taxa and Regions (VISTR) is an analysis tool which produces species accumulation curves for point-based samples of flora or fauna in a given region (or set of polygons of like class in a GIS). VISTR assists with assessing whether a key taxonomic group (such as the genus ) or ecological category (such as mangroves) is adequately sampled in the region(s) of interest. The Site Placement & Analysis Module (SPAM) was developed in response to a request by J. Neldner of Queensland Herbarium, Mareeba, to assist with optimising the placement of new vegetation survey sites in relation to existing sites against each abiotic attribute in turn.

Species Distribution Modelling Tools and Analyses. It has been found that specimen based collection data do not always give an accurate representation of the range of distribution of a species. Information is required on the extents of a species range, which is a function of its ecology. Biogeographic models have been developed to provide a more realistic estimate of the potential distribution of species populations and hence compensate for biases in sampling observed in existing datasets.

One plant species was selected for distribution modelling using BIOCLIM (Bioclimate Prediction System) and GARP (Genetic Algorithm for Rule-set Production) in order to demonstrate the methodology. The same methodology is applicable to other plant and animal species. The vulnerable plant species, Macaranga polyadenia (Family Euphorbiaceae) was chosen for analysis.

Results

Datasets in CYPLUS Flora & Fauna GIs

A subset of the identified but acquired datasets was included in the flora database. The number of datasets totalled 19, including data from the Queensland Herbarium, the Australian National Herbarium (Canberra & Atherton), the NSW Herbarium, the CSlRO Australian Tree Seed Centre, the Australian Institute of Marine Science, the Comalco Herbarium, Queensland Department of Environment & Heritage, Dr Habiba Gitay and the Australian National Botanic Gardens.

A summary of the flora data collated (based on records in ERIS) is presented in the table below.

No. of Families The metadata corresponding to each collated dataset was made available to users in hardcopy and digital format. Excluded datasets were iternised and reasons for their exclusion given.

The following groups of taxa were validated using the BIOCLIM outlier test: Acacia spp., Allocasuarina & Casuarim spp., Callitris spp., Eucalyptus spp., Hakea spp., Melaleuca spp., Family Poaceae, Mangrove taxa , Tree taxa, Rare and Threatened species and many other smaller genera.

Records in the database were attributed with information on life forms and conservation status.

CYPLUS Flora & Fauna GIs

Records from the 19 datasets were included in the GIs with a number of analysis tools. These included the ability to create ARCWO coverages from selected data, concentrations of taxa and samples across CYP, text files for selected taxa and analyses based on applied categories. An analysis of the concentration of trees with reference to the number of species and number of samples was completed in the report.

Analyses

Individual maps of the distribution of selected endangered species on CYP were produced as summaries of each species. A map of the distribution of all vulnerable taxa summarised the status of vulnerable species on CYP in terms of their spatial distribution.

Example VISTR analyses were run on the whole flora dataset and palms on CYP and included maps of the distribution of the records against the 5 group regionalisation of CYP and species accumulation curves. Regions 1 to 4 appear undersarnpled, whereas Region 5 appears to be adequately sampled where the curve has flattened. A SPAM analysis using NROl's vegetation survey sites and annual mean temperature was completed and included histogram and table summaries of the analysis and a map showing undersampled areas.

Distribution models were produced for Macaranga polyadenia using BIOCLIM and GARP.

Discussion

This project has brought together most of the existing data about the presence of plants on CYP in a format suitable for general use. These formats include color plots of one or more species maps, gridded maps summarising tree distributions, results of analyses to identi@ gaps in the data and modelled distributions of plant taxa. The resulting database has provided the most comprehensive overview of the current knowledge of flora data across CYP. With the results of the NROl project, it covers one of the key natural resource datasets of CYP. The data provided in the CYPLUS Flora & Fauna GIs is the best available dataset which could be collated. The taxonomic accuracy of the data was checked against the most current list provided for CYP by the Queensland Herbarium and geocode accuracy determined after passing through a series of validation tests. Suspect records were flagged in the database and passed to custodians for corrections. Records passing all tests can be extracted for analyses such as species distribution modelling, which require the highest quality records.

AU records were provided as received fiom the data custodians except for sensitive taxa, such as rare and threatened plant taxa. These were gridded to 3 minutes due to the sensitivity in releasing their locations.

At this stage, the maintenance of the CYPLUS Flora & Fauna GIs is unclear and it needs to be reviewed by users of the data for its ease of use and contents. It is anticipated that this project be used as a model to indicate the practical aspects of standardising data collection in the field and taking it the next step to incorporating the data into a standard digital format, usable by various agencies.

The flora database can be used for any analysis within the Land Use Programs where there is a requirement for information on the flora of CYP, in particular the Conservation & Natural Heritage Assessment project. It is also applicable to information relevant to Stages I1 and 111 of CYPLUS.

Assessment of objectives

All of the project's aims were met. This included the collation of datasets, integrating existing and newly-collected survey data, completing simple analyses of species to identifjr distributions of taxa, identify gaps in the data and predicting the distribution of plants. The database was incorporated into the CYPLUS Flora & Fauna GIs with computer tools developed to access and analyse the data.

Scientific or methodological advances

The collation of primary data covering CYP collected to date to provide a base upon which to add data fiom new surveys. The availability and maintenance of such a dataset will enhance the knowledge of the flora in the long-term; encourage reuse of data, thus reducing the need for costly field surveys; allow priority areas to be selected for survey; provide a sound baseline for monitoring the flora of CYP and provide a near to complete dataset for analysis and modelling.

Standards for the collation of point based flora data were applied when databasing the data. A minimum set of attributes were defined for GIs applications.

Heavy emphasis was placed on extensive documentation of the datasets for the convenience of users.

The approach taken by ERIN and other agencies contrasts with the common project- based approach where datasets are compiled for a single use and then discarded. General community benefits

The major benefits to the community were the collation of historical surveys; availability of historical and current information on one system; an increase in the knowledge base of the flora occurring on CYP; access to data to allow informed decisions to be made about the management of the region; data to support land-use conflicts that may arise; and the potential for local government and individual access to the data about the region in which they live. 1.0 INTRODUCTION

1.1 Identification of user requirements

The report by Connell Wagner (1989) was completed to identifjr and analyse the resources of CYP using existing data sources and to provide a planning framework for future resource allocation in CYP. The extent to which this was used as a guide to determine requirements for datdinformation for the CYPLUS is not known.

The three projects supported by ERIN were proposed as a result of a consultancy commissioned by ERIN. A consortium consultant (Kestal Research, CSlRO Centre for Spatial Information Systems and Computer Power Pty. Ltd 1990), was commissioned to assess the environmental data needs of potential users within the (then) ASETT Portfolio to suggest system solutions and to recommend projects. At the time, several projects and priority areas were suggested. The region which met most of the criteria set by the consultants was CYP. As finds were available for only one pilot project, it was decided to proceed with CYP. The ERIN CYP project was to be run in conjunction with the proposed joint Cornrnonwealth/Queensland land use studies, which later became known as the Cape York Peninsula Land Use Study and then as the Cape York Peninsula Land Use Strategy (CYPLUS).

From this, ERIN proceeded with an assessment of the Environment Portfolio needs for CYP in July 1990, Vegetation (plant communities) and plant distributions were in the category of highest priority; rare and endangered plants and ecosystemsfland systems were considered a high priority. Proposals to incorporate these needs into CYPLUS were accepted by the Environment Portfolio and Queensland Government and became the three projects in NRAP. The project summary for the flora project was distributed to interested parties and is found in Appendix 11.

1.2 Status of flora data from Cape York Peninsula

At the beginning of this project, existing data on the flora of Cape York Peninsula (CYP) was widely scattered, held by numerous agencies throughout Australia and in a variety of formats. Data could be found in the form of field notes, survey sheets and in internal and published reports. In some cases, relevant data already existed in digital format, particularly in major herbarium collections and data held by ERIN from other sources.

The Queensland Government was a major data holder through the Qld Herbarium and the Qld Department of Environment and Heritage (QDEH). Significant data holdings also existed in the CSIRO, universities, other herbaria and with private individuals. These sources were to be investigated and where suitable, collated into the flora dataset of CYP.

Besides the existing flora data collected in CYP, new data were generated concurrent to this project in the CYPLUS NRAP project NRO1: Vegetation Mapping of CYP. The challenge was to make the old and new flora data available in a common format on the CYPLUS Flora & Fauna GIs to support planning and decision-making. 1.3 Aims of the project

The purpose of this project was to make information on the flora of Cape York Peninsula (CYP) available to environmental decision-makers in the State and Commonwealth in digital form through Relational Database Management Systems @DBMS) and Spatial Information Systems (SIS).

The more detailed objectives of the project were:

1. To collate existing (pre-Cape York Peninsula Land Use Strategy (CYPLUS) data on the locations of CYP plant species from a variety of sources.

2. To integrate existing plant data with newly collected data fiom the Natural Resources Analysis Program (NRAP) project NRO 1.

3. To perform some simple analyses to identifjr gaps in the data and predict the like distribution of selected species.

4. To provide suitable computer tools for accessing flora data in the CYPLUS Geographic Information System. 3

2.0 METHODS

2.1 Point data versus polygon data

Thackway & Cresswell (1992) compare four scenarios where vegetation datafinformation were used for environmental investigations. These included the use of pre-classified data and primary data. The advantages identified in the primary data approach to environmental information have been targeted by ERIN as important in the long-term storage and use of data. Basically, the systematic collection and storage of primary data according to agreed standards will enable easy access to the data and allow reuse unlimited times without affecting the quality of the original data (Thackway & Cresswell 1992).

This reasoning underpins the Flora Project. CYP covers a very large spatial area and numerous natural environments. Over time, field collections have been patchy and disjointed without a long-term view of collating the data into one large useable dataset for a variety of applications. The opportunity to draw these datasets together for CYP would provide a valuable resource for hture investigations.

2.2 Overview of methods

This project was mainly concerned with converting existing, paper records of plant species locations into a digital format. In some cases, particularly major herbarium collections and other data held by ERIN, relevant data on CYP flora already existed in digital format.

These existing data were integrated with new data (from NRO1: Vegetation Survey of Cape York Peninsula) to form a spatially-referenced database of plant species locations ("presences") in a standardised RDBMSIGIS format. Documentation of data quality of these source datasets was a high priority in this project. ERIN also performed some simple analyses to identifjr gaps in the data and demonstrate the usehlness of distribution modelling.

A broad outline of the methods for gathering flora data from existing sources is shown in Figure 2, with a legend for the schema in Figure 1. The establishment of priorities, standards and protocols is not shown in Figure 2, but occurred soon after "Start".

Each of the point-based flora datasets supplied to the CYPLUS Flora & Fauna GIs went through the Collation, Integration and Packaging phases. Construction of an authoritative list of CYP plant taxa (Taxon Listing) was conducted somewhat separately to the gathering of point-based data, but was nevertheless vital for the final product. Only portions of the data went through the Analysis phase, but hrther progress is expected here through the Conservation and Natural Heritage Assessment project of the CYPLUS Land Use Program. Some datasets did not reach the CYPLUS Flora & Fauna GIs; work on these ceased at some stage of the Collation phase. (3 StaWFin ish

Dataset

[or other physiml entity)

I Operation or procedure

Dataor information fluw =ww

- s Data or information fluw [with string feedback)

----- Possible [or past) linkagti,

Figure 1: Legend for schema in Figures 2 to 5. I I Taxon Listing. )

I Analysis. 1 modelling, etc. and transfer la ( h.musms) J

Figure 2: Overview scheme for production of flora component of CYPLUS GIs. 2.3 Collation

A detailed scheme for collation of each point-based flora dataset is given in Figure 3. In brief, this section describes the steps undertaken by ERIN from identification of the existence of a dataset, through its evaluation and databasing (usually by the relevant Custodian), to loading of the raw data into an Oracle source table at ERIN.

Priorities for evaluating and acquiring existing flora data were established early in the project (Appendix I). Standards & protocols for databasing existing records were also set up early in the project (Appendix IV). These standards evolved somewhat during the course of the project.

2.3.1 Identification

One of the earliest tasks at ERIN for this project was to identifjr sources of existing point- based plant species data, i.e. herbarium specimens, ecological site survey observations and incidental records. Formal literature searches were also conducted. However, the task was also ongoing, in that discussions with Custodians and others often unearthed datasets previously unknown to ERIN. Bibliographies and a table of identified datasets were also created.

Using a working list of CYP flora datasets and likely Custodians of same, ERIN project staff contacted each Custodian and established the general particulars of each dataset. The first question was whether the Custodian held datasets relating to CYP. If so, the next question was whether the dataset was point-based (resulting from either survey observations or herbarium specimens) or polygon (map) based. If the dataset was point-based, ERIN commenced negotiation for access to the data for CYPLUS and asked more detailed questions as to the precise nature and coverage of the data, viz: What was the original purpose for collecting the data? Was it for ecological survey, taxonomic survey or otherwise? Which taxa were surveyed? Were the data in paper or digital format? What is the area covered by the dataset? How big is the dataset? What is the quality of the data in the dataset? How good are the taxonomic determinations? How accurate and precise are the geocodes (if any)? What was the design of the sampling strategy? What is the availability of the dataset to ERIN and CYPLUS? What resolution can the data be released to the CYPLUS Flora & Fauna GIs? What resources are required to convert the data into a suitable format? How long would it take? etc. etc. Evduatiand CNegotiation

Feedback b astodim; camctions

1 From Integration.

Figure 3: Scheme for Collation of each point-based flora dataset. Thus began the documentation of the strengths and weaknesses of each source dataset. Each dataset was evaluated against specifications and priorities, already established (see above and Appendix I). Given a willingness of the Custodian to make the data available to CYPLUS, the next step was to formalise the process.

2.3.3 Specification

In this step, negotiations with the data Custodian were formalised. In some cases, such as small datasets already in digital format, only an exchange of letters was needed between the data Custodian and ERIN.

In the case of large, datasets in paper format, contracts were established between ERIN and the data Custodian to convert the data to RDBMS format and supply the data to CYPLUS via ERIN (Appendix 111). In general, the Custodian usually knew their data best, so ERIN preferred to supply resources to the Custodian to upgrade their data in a format suitable for ERIN and the CYPLUS Flora & Fauna GIs.

In some cases, a third party (with consent of Custodian) was in a better position to upgrade and supply the data, so contracts were set up between ERIN and the third party. In yet other cases, ERIN project officers were in a better position to upgrade public domain CYP flora datasets directly. Here, the Custodian was usually informed by letter.

Once negotiations were formalised, the Custodian or agent set about converting the data into a standardised format suitable for loading into the ERIN System ("ERIS" - Environmental Resources Information System). In most cases, data suitable for loading into the ENS "Specimen" Module was obtained fiom the Custodian. In general terms, each specimen or observational record had locational details as well as date of collection, determination, etc. Many herbaria store their label data in this format. Data meeting this minimum requirement was destined for supply to the CYPLUS Flora & Fauna GIs.

In some cases, data suitable for loading into the ERIS "Site SurveyyyModule was obtained. Several to many species were recorded by the Custodian at each site, which has a single set of habitat data associated with it.

Custodians supplied CYP flora data to ERIN in RDBMS or structured text format with a locational reference on each record. Data were supplied in ASCII, .DBF, Microsoft Access or other format. In other words, source data were not in GIs format.

A variety of physical formats were employed, data were usually supplied on MS-DOS 3.5" floppy disks, but Exabyte tapes or electronic mail were occasionally used.

All relevant incoming datasets were loaded into "source tables" of raw data in the Oracle RDBMS at the ERIN Unit. At this stage, the incoming source dataset was documented on the ERIN Data Dictionary and Catalogue @DC) system. Basic contact details and dataset specifications were entered along with some indication of data quality. 2.3.6 Feedback to custodian

The results of ERIN'S data loading and standard validation tests were fed back to the Custodian for correction andlor comment. In some cases, the likely turnaround time for corrections by the Custodian meant that ERIN project staff made corrections (where possible fiom other context) and informed the Custodian of changes.

Corrections from error lists sent from processing large datasets such as the Queensland Herbarium, could not be incorporated into the dataset because the turnaround from these was beyond the timing for the Flora Project.

2.4 Taxon Listing

The philosophy of this phase was for ERIN to always use the best list of CYP flora taxa as it became available throughout the project (Figure 4). Throughout the first two thirds of the project, Census of Australian Vascular Plants (Hnatiuk 1990) was the best available to ERIN.

In July 1993, J.R. Clarkson, QDEH, Mareeba, supplied a draft list of CYP flora taxa.

In May 1994, J.R. Clarkson, QDEH, Mareeba supplied a hrther draft list of CYP Flora taxa. All data supplied to the CYPLUS Flora & Fauna GIs on 29/6/94 conforms to this list.

At the time of this project, the Census of Queensland Plants (Queensland Herbarium Staff, 1993) was not available in electronic fo'mat for enhancing the Integration process.

2.5 Integration

In this phase (Figure 5), all relevant source datasets were integrated into a common RDBMS format - the ERIS "SpecimenyyModule which is implemented in Oracle with a user-friendly interface via Oracle Forms. Although the Module is called "specimenyy,it can also hold data from ecological site surveys and incidental observations, as well as label data for botanical herbarium specimens.

2.5.1 Loading data into ENS

Loading data into ENS Specimen involves a series of standard tests for taxonomic and locational accuracy (Appendix V). In this project, the last step of that process was performed only on a selected range of taxa.

Flora data from other ERIN continental projects, especially ERIN'S Landcover Project and the Rare and Threatened Australian Plants database, were already available in the same ERIS "SpecimenyyModule. CYP flora data fiom CYPLUS and non-CYPLUS sources were thus accessible at the ERIN Unit through SQL or SQR queries and through Oracle Forms.

One of the fields in the Specimen table indicated the source dataset for each record. In this way, selected datasets meeting some minimum quality criterion, could be recovered from the Specimen table for particular purposes. Use best available [digtd) authoriMve l~stfor CYP.

A##Y 1952$--J7e 199.3 Census of Australian . Txon Vascular Plants - - \ me-53 ( HrTatirk, lgm). -+-A List for CYP. w /

/ Ju& 1993t- 19- # WV / John Uarkson's [QDEH J Mareeba) CYP flora # 1 checklist[IkaRl arrd Australian Plant 1 d arne i Index (Chapman, 1991 ). 1

John UarWon7s(QDEH Mareeba) CYPRora chmldist CImft2)

John Qarkson's (QDEH Census of Queensland Mmeba) wpnota Plants (not cumntly checklist (Wblishd wvsion] Havailable, dgitAly).

Figure 4: Scheme for Taxon Listing for the flora of Cape York Peninsula. (Draft numbers are not those of Custodian) To mllation & . Taxon Listing. 1,

90urce Tables

h-p

Figure 5: Scheme for Integration of point-based flora datasets. 2.5.2 Duplication control

During the conduct of this project, there was poor control of duplicate records in ERIS Specimen module. An automatic procedure to detect duplicates had not been established. Essentially the same record may have been databased independently and then loaded from several source datasets. Sometimes, different views of the Custodian's dataset may have sent multiple copies of the same record.

2.6 Packaging

2.6.1 Database design

The CYP Flora database was designed using the relational model @ate 1990). For an Entity:Relationship diagram of the CYP Flora data system, see Bryett et al. (1994).

Lists of fields in each table are in Bryett et al. (1994). Note especially the two sets of geocode fields - one for a 3 minute grid and another for point data, where Custodian allowed release to CYPLUS.

The ERIS Site Survey Module was not used, as it was not yet in production mode during this project.

2.6.2 Database building

The procedure for cross-loading of CYP flora (and fauna) data from ERIS Specimen into the CYPLUS Flora & Fauna GIs is given in Bryett et al. (1994).

Specimen-format data were cross-loaded at the best available resolution allowed for release to the CYPLUS Flora & Fauna GIs, as specified by the Custodian of each dataset. The standard maximum resolution aimed at was a 3 minute grid.

2.6.3 Data documentation

The strengths and weaknesses of each source dataset were documented progressively in the ERIS DDC program. This documentation enabled management of the data at ERIN by project staff and gave detailed information to users not involved in the data acquisition process and not familiar with the datasets.

Towards the end of the project, text listings from ENS DDC were produced by modification of an existing SQR reporting program on EMS. Metadata entries for each source dataset were modified to be directly useful to users of the CYPLUS Flora & Fauna GIs. This involved rewriting text from the ENS DDC entries and the creation of additional codes. See Glossary of CYP metadata terms (Appendix VII) for details.

2.6.4 Geocode data validation

A series of standard geocode "data validation" checks were performed on selected data in ERIS Specimen. See Appendix V for details. 2.6.5 Additional (value-added) data

Where available, additional information on each species was loaded into the database fiom existing digital sources or from paper sources keyed in by ERIN project staff. Data such as common names, growth form and conservation status were entered in separate fields or tables, as appropriate.

This additional information was designed to enhance user-friendly retrieval of data fiom the CYPLUS Flora databaselGIs and to provide various non-taxonomic aggregations of the integrated dataset, for summary purposes.

ERIN also performed some preliminary attribution of CYP flora taxa as to whether each was endemic to CYP or had a disjunct distribution. This will be reported in the Land Use Program project Conservation and Natural Heritage Assessment.

Plant Life Forms. Species can be described by their growth form or life form on the basis of their similarities in structure and hnction. The life form of a species is usually constant but may assume a different form when growing under different environmental conditions. Therefore, a species with a particular life form in CYP may have a different life form in another region.

The strong relationship between a species life form and the environment in which they grow means that the distribution of life forms across an area can be used as an indication of environment types found across a large area of the continent. For example, an area with a high proportion of trees adjacent to an area with a high proportion of shrubs gives some indication of the environmental heterogeneity of the landscape.

Rare or Threatened Plants. These include plant taxa considered to be extinct (X), endangered (E) or vulnerable (V) grouped as threatened and rare (R) or poorly known (K).

There are a number of sources of information on the conservation status of plants in Cape York Peninsula. The two most current sources which were used are:

1. HERBRECS (Staff of the Queensland Herbarium 1994) and the

2. ANZECC List of Threatened Australian Flora, (ANZECC Endangered Flora Network 1993).

The HERBRECS database holds information on the status of rare and threatened plants for Queensland whereas the ANZECC list is a national list of threatened (endangered and vulnerable) plants which relies on input from State and Commonwealth agencies. The definition of the conservation status of most plant species is a result of an ongoing process and is continually changing as more data is collected. The ANZECC list is current to June 1993, whereas the HERBRECS database is continually being updated.

Definitions of the conservation status codes from the two sources are in the ANZECC listing (ANZECC Endangered Flora Network 1993) and Thomas & McDonald (1989).

2.6.6 Data extraction and GIs tools

A series of data extraction tools were set up for operation on the "Monster" specimen coverage supplied to the CYPLUS Flora & Fauna GIs (l3ryett et al., 1994). 2.7 Analysis

2.7.1 Species distribution mapping

Flora species considered to be important or of priority for management were selected for plotting known distributions. An important group of such species are the threatened plants of CYP, comprising endangered and vulnerable species. The list of threatened plants used here was obtained fiom John Clarkson's plant list for CYP. Simple distribution maps of these species can be used to identifjr and target areas of biological importance on the Cape. 2.7.2 Development of adequacy of sampling tools

Biases in data collected in the field occurs almost by default unless the survey is small andlor fbnded generously. There are number of causes of bias including: taxonomic bias, seasonal bias, sampling bias, accessibility bias, and popularity bias.

A number of computer-based tools were developed by ERIN, as part of ERIN'S CYPLUS projects, to assist with the analysis ofthe flora and fauna data. The tools integrate ARCmJFO and GRID, ERIN'S Oracle database, the S-PLUS statistics package and UNlX utilities (Bryett et al., 1994). These programs are potentially applicable to any area of the continent and to appropriate datasets.

They have been developed to assess sampling bias but can also be used to determine accessibility (spatial distribution of records according to access) and seasonal (records sorted by date) bias. Taxonomic and popularity bias should be determined in conjunction with other datasets.

Visualisation of Samples, Taxa and Regions (VISTR) and the Site Placement Analysis Module (SPAM) are two such tools. The primary aim of these tools is to provide an overview of existing environmental data and assign priorities to target areas for objects of interest. They assist with determining whether the environment has been adequately sampled and aid in the effective and efficient design of surveys of large areas such as Cape York Peninsula.

Visualisation of Samples. Taxa and Regions (VISTR). VISTR is an analysis tool which produces species accumulation curves for point-based samples of flora or fauna in a given region (or set of polygons of like class in a GIs). VISTR assists with assessing whether a key taxonomic group (such as the genus Eucalyptus) or ecological category (such as mangroves or snakes (Richards et al., 1990)) is adequately sampled in the region(s) of interest.

The program and associated documentation has been developed to a high standard. (See Appendix 111 of Bryett et al. (1994).

Site Placement & Analysis Module @PAM). SPAM was developed in response to a request (in February 1993) by J. Neldner of QDEH, Mareeba, to assist with optirnising the placement of new survey sites in relation to existing sites. VISTR was seen as insufficient for this purpose, because of its emphasis on polygon-based regions and the typical use of aggregations the abiotic environment (e.g. Cofinas & Bolton in prep). SPAM was designed to assess site placement against each abiotic attribute in turn, where the continuum of each abiotic attribute is divided into many equal-interval classes (Neldner et al., in press).

SPAM has not been developed to general purposes, or to an error free standard, and is more for use "in house" at ERIN for specific purposes.

2.7.3 VISTR analyses

Total flora versus 5 ~OUVregions. Point records of the whole flora dataset corresponding to the NRO1 plant list were assessed for adequacy of sampling against the 5 group regionalisation (Cofinas & Bolton in prep).

Life forms versus 5 Group Reaionalisation. Point records belonging to the "lifeform" applied categories were each assessed for adequacy of sampling against the 5 Group Regionalisation (Cofinas & Bolton in prep) using VISTR. The life forms run were: Forb, Grarninoid, Palmid, Shrub, Subshrub, Tree, Vine. The applied category "Mangrove" was also run through VISTR.

2.7.4 SPAM analyses

SPAM was applied to the Queensland Herbarium (NRO1) detailed observational sites, in April 1993. This was to test the representativeness of existing sites and to suggest undersampled environmental types requiring extra sites.

SPAM analyses were run for each of the following abiotic data layers:

(1) a 1140th degree BIOCLM prediction grid of 16 climate variables (8 temperature and 8 rainfall) - see Cofinas & Bolton (1994) for details;

(2) six soil finctional attributes (McKenzie and Hook, 1992) attached to the digital version of the CSIRO Atlas of Australian Soils (NNC & CSIRO 1991); and

(3) landform attributes interpreted using the Explanatory Notes accompanying the Atlas of Australian Soils (Northcote et al. 1960-68).

Terrain data, used in the generation of the BIOCLM grid, was not directly used in this analysis.

Further details of the approach are given in Neldner et al. (in press).

2.7.5 Species distribution modelling tools

Site and specimen based collections do not give an accurate representation of the range of distribution of a species. To determine the significance or importance of a species, it is necessary to establish or understand its population dynamics and ecology. Therefore, information is required on the extents of its range which is a finction of its ecology. Where well-sampled surveys of species are not available, modelling potential distribution £tom known sites and information about the environment at these sites is required. Biogeographic models have been developed to provide a more realistic spatial pattern of the potential distribution of species populations and hence compensate for biases in sampling observed in existing datasets (Cresswell et al. 1993). The two methods available at ERIN and applied to species in CYP were BIOCLIM and GARP.

2.7.6 Species distribution modelling analyses

BIOCLIM. BIOCLIM (Bioclimate Prediction System) (Busby 1991) is a program developed to model the potential distributions of species at continental and regional scales. BIOCLIM predicts potential distribution of taxa based on the correlation of locations of a set of species data (presence) and selected climatic attributes. The BIOCLlM model establishes a climatic envelope for the set of locations which the model uses to search all other sites in the data space to locate those sites which share the same climate attributes as the species dataset. This prediction is an indicator of the climatic limits for the distribution of a particular species.

An explanation of the limitations of BIOCLIM is found in the Atlas of Elapid Snakes of Australia (Longmore 1986) and Busby (1991).

GARP. GARP (Genetic Algorithm for Rule-set Production) uses a genetic algorithm (general purpose optimisation techniques based on biological principles) to automate the use of environmental data collected through field surveys to produce distribution maps and models. The genetic algorithm allows a number of forms of model to be developed and tested simultaneously, rather than requiring a separate algorithm for each form of model. GARP currently uses BIOCLIM type rules, GARP rules, logistic regression rules and atomic rules (David Stockwell pers.com 1994).

Some features of GARP are:

* use of multiple models, allowing prediction with higher accuracy than any single model;

* the use of rules to develop models using arbitrary species distributions including bimodal (two-peaked) responses to underlying environmental scalars;

* the use of categorical, ordinal and continuous data, whereas BIOCLIM is based on quantitative climatic data;

* rigorous testing of models for significance and accuracy using sampling of the dataset (Stockwell 1994).

The method has been developed on the WWW based on 1120th degree continental data, and can also be used with a command line driven method on 1140th degree CYP data used in NRl1 (Cofinas & Bolton in prep).

The physical environmental data collated for the Environmental Region Analysis (NR11) project, was used as the basis for the modelling. These were collated fiom datasets part of ERIN'S Continental Data Base Project comprising climate, terrain, geology, and soil information, a summary of which can be found in Cofinas & Bolton (1994). Demonstration Analysis. One plant species was selected for distribution modelling using BIOCLIM and GARP in order to demonstrate the methodology. It was impractical to model all species populations on CYP because of the large number of species in the region. The methodology can be applied to hrther species as required.

A vulnerable plant species, Macaranga polyadenia (Family Euphorbiaceae) was chosen on the basis of its conservation status, an adequate number of records in the database and its distribution down the eastern coast of CYP. Many other endangered and vulnerable species had very restricted distributions and would not demonstrate the modelling techniques adequately. Records which had passed through ERIN'S geocode validation program (valid records (Bryett et al. 1994)) were extracted (this was 23 records out of 43 in total), as the inclusion of suspect records would decrease the accuracy of the predicted distributions. 3.0 RESULTS - DATABASE BUILDING

3.1 Data acquisition

This section documents the datasets identified for acquisition by ERIN'S CYPLUS/NRAP project Flora Data and Modelling (NR18).

3.1.1 Identified datasets

Table 1 contains a list of flora datasets relating to Cape York Peninsula. Some idea of data type, formatting and availability was usually obtained at the data 'Identification' stage, for each dataset. The column headed "Type7' has several allowable values, viz: Observational - Incidental observations Site - multiple species and their habitat recorded at a locality Specimen - data relating to botanical herbarium specimens Taxon List - standard lists of taxa for a geographic area

3.1.2 Acquired datasets

The number of records referred to for each dataset (Table 1) indicates ERIN'S knowledge of the dataset at the time of enquiry. The number thus combines both datasets restricted to CYP and continent wide datasets containing records in CYP. Accurate record numbers from datasets supplied to the CYPLUS Flora & Fauna GIs can be obtained from the dataset summaries in Appendix VIII. The column headed "Collation Stage Reached" in Table 1 indicates the stage reached in acquiring the dataset - see Figures 2 and 3 for phases and stages. Note, in particular, whether the data has reached the CYPLUS Flora & Fauna GIs (or in more detailed terms Version 1.0 of the CYPLUS Flora & Fauna GIs (l3ryett et al., 1994)).

Appendix VI gives a summary of the contracts entered into by ERIN for this project. Also listed in Appendix VI are separate ERIN contracts for continent-wide datasets, which have records from Cape York Peninsula.

Note should be taken of the "??" in Table 1 which indicates that the status of the attribute for the particular dataset was unknown or approximate. 19

Table 1: Flora dataset identification and acquisition summary - point data.

Format at start Collation Data/Custodian7s Stage Reached

aper (forms) Most in Mareeba Far North CYPLUS (Neldnertclarkson) Queensland. FLORA & a) Detailed Sites FAUNA GIs

FLORA & FAUNA GIs - * CYPLUS FLORA & FAUNA GIs --- EvaluatiodNe from 1913 - gotiation recent publication at end of NR18 Specimen Various (new Identified data needed!)

FLORA & FAUNA GIs;

FLORA & Canberra: ERIN FAUNA GIs; 'Landcover" groups

FLORA & FAUNA GIs

orms/Digital PORA & --- 'FAUNA GIS NSW Herbarium, Data from RGSQ 500 CYPLUS Sydney excursion to FLORA & "'Heathlands" FAUNA GIs Station BundAIMS Mangrove data (East 5,430 Site Paper CYPLUS coast) (transect) FLORA & FAUNA GIs MesselVWells Mangrove data 77- . Text? Paper? EvaluatiodNe (Wenlock & Escape gotiation ----. ----. Various herbaria Mangrove Atlas 644 in Specimen Digital CYPLUS (1986), Australia- CYP FLORA & wide FAUNA GIs ----< ----< CSIRO Div. Water Mitchell Normanby 2000? Site Paper (see text EvaluatiodNe Resources land system survey (218 sites) for details) gotiation ---- Comalco herbarium Weipa district and 1800 Specimen Paper (labels) CYPLUS , Weipa beyond FLORA & FAUNA GIs Dept Weipa district 1300136 Site Digital Conversion {transect)

Gitay Archer Bend and 100 Site Paper CYPLUS ~ardineRiver: (notebooks) FLORA & orchids and FAUNA GIs vegetation sites Paper (forms CYPLUS system survey sites; FLORA & Godwin' s herbarium FAUNA GIs

Paper; being Identified redone by StantodFell

cIlwraith Range, sites (notebooks) FLORA & FAUNA GIs

FLORA & FAUNA GIs (in above datasets) Australian National berbarium data ? Specimen PaperDigital Part in Botanic Gardens CYPLUS FLORA & FAUNA GIs Stantonhiell NRCPI CYPLUS 200 sites? Site Paper (forms) - Part in (CYPLUS, NRCP) NRO 1 Rainforest > Digital CYPLUS Vegetation Project (CORVEG) FLORA & FAUNA GIs Extra-Australian L! ? Paper; should CYPLUS weeds (emphasis on be in 1994 FLORA & quarantine) HERBRECS FAUNA GIs update Batavia Downs; veg. ? Site Digital CYPLUS & soils FLORA & FAUNA GIs? (through NRO 1) aplidAbraharns Western CYP -1,000 Site Paper - needed Conversion dryland (129 sites) ID'S (Requires some I geocoding & cleaning) QDEH pestern CYP 332 sites Site Paper? EvaluationNe ------(B1ackman)ITaplin wetlands gotiation Peter Sutton, Aurukun ? ? Transect Not known Identified

Pdelaide -

3.1.3 Excluded datasets

In Table 1 above, several datasets did not reach the CYPLUS FLORA & FAUNA GIs. Some discussion on each of these is given, below.

Census of Old Vascular Plants. This dataset was in preparation at the Queensland Herbarium during project NR18. It was published in October 1993 (Queensland Herbarium Staff), but a digital version is not currently available, except as a document.

Redetermination of specimens - Old Herbarium. Brisbane. No CYPLUS or ERIN resources were identified for this activity. Some redeterminations are often part of herbarium activities, anyway.

MesselVWells manmove data. Dr Bolton contacted Prof Messell by phone; he referred ERIN to Dr G. Wells. Dr Bolton talked on phone and wrote to Dr Wells, but received no reply.

CSIRO Mitchell Nonnanbv land svstem survev sites - vegetation. There is an extensive history of consultation with Bob Story, but little success in establishing a sub-project. Progress was made in a number of areas which led to the identification of yet other problems: * survey cards recording vegetation site data were obtained from QDPI; these were difficult to decipher due to the recorder's handwriting; number identifier on cards did not correspond to site numbers; geographic coordinates for site locations were not on the cards. * aerial photographs used during survey were located at CSIRO, Department of Water Resources, Canberra and used with 1:100,000 topographic maps to derive geocodes for sites. * these site locations were compared with the locations of specimens held in CSIRO's Australian National Herbarium (CANB); the purpose was to link the specimens to other observational data on the survey cards; the geocodes from the two were found to be different and would need to be redetermined for greater accuracy (Roppola unpubl. data); this would need to proceed in conjunction with Bob Story and CANB as records in the Herbarium's database would need to be altered. * a taxon list which would link the survey card number to site numbers was located with Bob Story; a vital link to databasing the records on the survey cards. * little money and time was available for the development of this project and would depend on a substantial proportion of Bob Story's time and ERIN stafftime.

Alex Thomas: Weipa district. Initial consultations were held with Alex Thomas in December 1992.. Mr Thomas indicated a strong willingness to supply the data to ERIN, but due to other commitments by both parties, the data were not supplied.

Hynes, Stanton and Gitay. The site data collected for Lakefield National Park is with Ross Hynes, Brisbane. Data for the area was recollected as part of NROl by Peter Stanton and Dave Fell and therefore superseded the existing dataset.

Operation Ralei~h.A proposal by ERIN and Dr Gitay to database records from this expedition met with widespread opposition from North Queensland botanists. Apparently, the quality of flora determinations available from the expedition records was such that their databasing was not justified.

TaplinIAbrahams: western CYP dryland. ERIN finded a determination of a number of specimens collected in CYP dryland sites by Mr H. Abrahams and Mr A. Taplin in association with wetland surveys by Dr Blackrnan (see below). Corrections to errors were not made by the data holders nor ERIN during project NR18.

QDEH 03lackman)lTaplin: western CYP wetlands. This large project commenced prior to CYPLUS, using ANPWS and QDEH finding. Major components of the project included the regionalisation of the Gulf of Carpentaria plains using rectified LANDSAT MSS imagery, the collection of detailed (ground-based) transect data on wetland characteristics (including vegetation and flora) and systematic, aerial transects for assessing migratory birds.

In the early days of CYPLUS, this project was to have been extended across the remainder of CYP, through additional CYPLUS finding as NR09. However, after cuts to NRAP finds, QDEH redeployed NR09 to a desktop exercise in Brisbane.

Peter Sutton. Adelaide. This involved a collection of culturally important plant species which was not followed up due to insufficient resources to resolve any potential sensitivities involved with collation of Aboriginal data.

3.2 Documentation (metadata) on source datasets

See Appendix VIIIa to WIIt for documentation on each individual source dataset. These are stand alone summaries intended for use by users of the CYPLUS Flora & Fauna GIs Pryett et al. 1994). The documentation follows conventions introduced by the U.S. Spatial Data Transfer Standard (SDTS). 3.3 Value-adding and packaging

3.3.1 Geocode data validation

Geocode data validation began during project due to the need for valid data during the lifetime of the project. Ideally, all datasets should have been loaded before the species validation. The data validation process was initiated in October 1993 with data loaded to the date, processed. Species validated and occurring in datasets loaded after this validation phase have not been validated.

The following groups of taxa are part way though ERIN'S geocode validation process:

Acacia spp. Allocasuarina & Casuarina spp. Callitris spp. Eucalyptus spp. Hakea spp. Melaleuca spp. Family Poaceae Mangrove taxa Tree taxa Rare and Threatened species and many other smaller genera.

The species processed through the geocode validation process are flagged as such in the CYPLUS Flora & Fauna GIs. See Bryett et al. (1994) for details of relevant database codes. Lists and maps of suspect records were sent to the custodians of each dataset for correction or confirmation of validity.

3.3.2 Applied categories

Life forms. Table 2 summarises the flora data of CYP in terms of their life forms. The life forms used were as defined by the Queensland Herbarium (Mareeba), except the 'Mangrove Taxa' which was defined at ERIN. The table is an indication of the component of each life form in CYP and the figures should be used as a guide.

Note needs to be taken that a particular species may have more than one life form and that many species in EMS are at this stage not associated with any life form attribute.

Table 2: Summary of CYP flora dataset by life-forms.

Life-form No.of Records No.of Distinct Taxa No.of Unique Locations ---- Tree 42,739 743 8,791 Shrub 17,132 391 5,263

Scandent Shrub 876 22 55 1

Subshrub 4,855 186 2,074

Mallee eucalypts 80 5 66

Palrnid 1721 5 1 542

Forb 11,317 813 3,040

Vine 5,441 303 1,884

Mangrove Taxa 2,82 1 23 1,105

I Graminoid 9,884 552 2,749 I

Rare or Threatened Plants. A summary of the conservation status of plant species in CYP as provided by the Queensland Herbarium is in Table 3. Note must be taken that this list is only for species occurring North of 16degS in CYP.

Table 3: Summary of Rare or Threatened plants in CYP.

As an example, the endangered and vulnerable taxa were looked at in more detail (Tables 4 and 5). The taxa in the tables were derived from the list of R&T's obtained from the Queensland Herbarium and compared to the taxa in the ANZECC list (ANZECC Endangered Flora Network 1993), a category within each table. The taxa in each were found differ somewhat, due to the currency of the two sources. The list from the Queensland Herbarium is a more accurate reflection of the current status of threatened plants in Queensland as it is being updated as information becomes available.

Table 4: Endangered plant taxa in Cape York Peninsula.

Table 5: Vulnerable plant taxa in Cape York Peninsula. 3.3.3 Data extraction and GIs tools

Various examples of data extraction, analysis and map production were created from the CYP flora dataset and tools provided to the CYPLUS Flora & Fauna GIs (Maps 1 and 2). Details of the data structures in the GIs and its capabilities are available in Bryett et al. (1994). Mapl: Gridded Anelpis of Trees 3 minute gdeize CYPLU8 is a ~IIinilialive d the Ouwnalend and Camwealth Dwurnments II "Based dh number of specirnerls per cell"

-rn # Prepared and produced at the CYPLUS GIS using ERIN tadtitles II December 07, 1994

Copvight (cj Cornmanwealth Government Custodians Source datasets listed below,

LEGEND 39715 data points I I % of total samples

0 - 1 '% 0f fota! S~IT!P!BS . 1 - 3 % ol total smples 3 - 7 YO af total samples I I - I 7 - 15 Yo oaf tdal samples I I I @ > 15 Ya of total samples

I I I Study Area (NRIC & QDoL)

141 E. 142 E l47E IUE Projection : Geographical Representation APPROXIMATE SCALE IN KILOMETRES Spheroid : ANS II Nominal Scale - 1 : 4000000 EDITION NO 1

Custodians of data : 'ACT Australian National Botanic Gardens'. 'ACT CSlRO Division of Forestry, Tree Seed Centre'. 'ACT CSlRO Mvision of Plant Indwq, AuGralian National He&ariur?'.,'ACT ERIN Unit'. 'Dr. Habiba Gitag. 'Dr. Mi& Godwin'. 'Dr. William Laverack'. 'NT , ybarlum,, Canservabon CMnmlsson of the NP. 'QU) AWallan lnstltute of Marine Science'. 'QLD Cornalcp Aluminium, Welpa'. 'Qld Queensland He,rbariurnl. 'Qld Queensland Herbarium, Mareeba'. Study Area : coastline - Nabanal Resources Informahon Centre, prowsional southern boundary - Queensland Department Caveats. ... Users are requested to acknowledpe CWLUS, primary sources, and the Environmental Resources Information Network (EF N when using Information conmned In this document Custodian supplied point data gridded to a common resalution;please check precision of individual records

-. -

Map 2: Oridded AnaSysls of Trees WLUS

CWLW,,,,, 1s a Innkwe of the mr 3 minute gM size Dueenalsnd and tomlrrmwea!th davernmente of taxa mr cell"

Prepared and produced at the CYPLUS GiS using ERIN fadllties II December 12, 1994

Copyright (c) Commonwealth Government 2nd Custodians Source datasets I~stdbalaw, 1994. LEGEND 39715 data po~nts

Uo oh of total taxa

7 - 15 % of total tma

> 15 % of total taxa I r------' I I Study Area (NRlC & QDaL)

E ' Roads (AUSLIG, 1988) 8 I ---r * ------r---b I I

141 E lJlE 143E 1UE Projection : Geographical Representation APPROXIMATE SCALE IN KiLOMETRES Spheroid : ANS 0 100 209 Nominal Scale - 1 : 4000000 EDITION NO 1 Sources.... - ' Custodians of data : 'ACT Australian National Botanic Gardens'. 'ACT CSlRO Division of Foteqry, Tree Seed Centre'. 'ACT CSlRO Division of Plant Indust Australian Nationa! He&ariuml.,'ACT ERIN Unit'. 'Dr. Hab~baG~tay'. 'Dr. M~ck G+winn. 'pr. Wlliam Lavarack'. 'NT gbbanurn,, Consswat~onCanrn~won of the FIT. 'QLD Austral~aninstitute of Mar~neScience'. 'QLD Comalco Alumlnlum, Welpa'. 'Qld Queensland tje.&ariumi. Qld Queensland Herbarium, Mareeba'. Study Area : caastline - National Resaurces lnformatlon Centre, pfovlslonal southern boundav - Queensland Departmen Caveats.,,, Users are request* to acknowledge CYPLUS, primary sources, and the Environmental Resources Information Network (E when uslng lnforrnatlon contained In thls document Custodian supplied point data gridded to a common resolution;please check precision of individual records

Various visuaVspatia1 summaries of the data can be found in the following sections. These include species distribution maps, results from the VISTR analyses, including the distribution of records for the whole dataset and distribution of records for applied categories such as trees, shrubs etc. 4.0 RESULTS - ANALYSIS

4.1 Species distribution mapping

4.1.1 Endangered and Vulnerable plants

Maps 3% 3b and 3c were derived from Table 4 for those species with records in ERIS.

Map 4 shows the distribution of all vulnerable flora plants for which there are records in ERIS.

4.1.2 Life forms

Results fiom the VISTR analyses for life forms are described below. These also include GIs coverages of the distribution of the taxon or taxonomic group in question (Map 5).

4.2 VISTR analyses

4.2.1 Total flora versus 5 group regionalisation

The WSTR example was reported under the CYPLUS project NR11 report (Cofinas & Bolton in prep) to demonstrate an application of the environmental regionalisation work.

4.2.2 Life forms versus 5 group regionalisation

There are a number of products resulting from the VISTR analyses.

1. Map 5 shows the results of mapping the records for the palms of CYP against the background of the environmental regions of CYP. A point coverage of the records resulting from running VISTR and a polygon coverage of the regions was used.

2. Species accumulation curves per region (cumulative counts of number of taxa collected versus the number of samples) (Figure 6).

3. Plant taxon lists

a) list of taxa by date collected

b) list of taxa by geocode (latitudeflongitude)

c) list of unique taxa

d) list of taxa and number of records in each region

As an example, the taxon list by region has been included as Appendix XI.

The species accumulation curves in Figure 6 show that in Regions 1 to 4, new taxa are still being collected with additional surveys. The curves are very steep, indicating a high rate of new taxa collected from a small number of samples. In Region 5, there have been a higher number of samples collected (65.9% of total) where initially this resulted in a high number of new taxa collected. Over time the rate of new taxa collected decreased quite rapidly with the curve levelling oE Map 5 shows the distribution of palm specimens collected within each Region and also gives an indication of the number of collections within a Region. Most collections tend to be concentrated in small clumps along the coastal strip around CYP.

4.3 SPAM analyses

One of the outputs fiom Site Placement & Analysis Module for 'Annual Mean Temperature' ('annual mean temperature' from BIOCLIM) is provided (Map 6 and Figure 7) as an example of the technique. A4-size display books with SPAM maps and histograms of all variables were sent to Mr John Neldner (Queensland Herbarium, Mareeba) and other display books are available for perusal at the ERIN Unit, in Canberra.

The graph and table (Figure 7) are used to determine which classes of 'annual mean temperature' are oversampled (>loo%) or undersampled (

4.4 Distribution modelling Early on in the project, ERIN performed many modelling exercises on CYP data, in particular using BIOCLIM, to develop automated procedures for generating species predictions and hardcopy map outputs. These analyses were based on selected flora and fauna species as requested from other NRAP projects. Since these were based on data received early on in the project, the results are not included in the report.

BIOCLIM

The results from the BIOCLIM analysis for Macaranga polyadenia are presented in Map 7. A number of text files were produced from the analysis including: 1. list of 'suitable' sites for the species 2. list of 'marginal' sites for the species 3. monthly mean minimum temperatures at each site (specimen records) 4. monthly mean maximum temperatures at each site (specimen records) 5. monthly mean temperatures at each site (specimen records) 6. monthly mean precipitation at each site (specimen records) 7. means, standard deviations and coefficients of variation of monthly temperatures and precipitation at each site (specimen records) 8. sixteen derived parameters for each site (specimen records) 9. climate profile for the taxon. CYPLUS is a joint initiative of the (Cucurbitaceae) () Queenshnd and Commonwealth Qanrrnment 2. Dendrobium antennatum 4. Dendrobium nindii (Orchidaceae)

* Location records Projection: Geographical representation

Sources.... Atherton Herbarium, CSlRO Division of Plant lndust , Australian National Herbarium Q~eensIandHerbar~um. Queensland Department of pnvironment and Heritage Austrajian National Botanic Gardens Dr. W~ll~amLaverack, QDEH , The species are classed as endangered by the Queendand Herbarium (1994) I Caveats.... The data has been assumed by ERlN to be correct as received from the data supplier. Location records have been compiled from both specimen and observational data.

Map prepared thmugh the facilities of the Envinmmental Remurces lnforrnation Network ERlN (December 12, 19941 5. pictum 7. Malaxis lawleri (Orc hidaceae) 6. Habenaria macraithii 8. Phalaenopsis rosenstromtt (Orchidaceae)

?U E 145 E

f Location records

Sources.... Atherton Herbarium, CSlRO Division of Plant lndustr Australian National Herbarium Queenrland Herbarum, Queensland Department of l!hironrnent and Heritage AustralIan Nat~onalBotanic Gardens Dr. William Laverack, QDEH The species are classed as endangered by the Queensland Herbarium (1994) Caveats.... The data has been assumed by ERIN to be correct as received from the data supplier. Location records have been compiled from both specimen and observational data. I

Map prepared thmugh the facilities of the Emimnrnental Remurces Information Network ERIN IDecember 12. 1994) 9. Eremochloa muricata 1 1 .Lindsaea repens CYPLUS is a joint initiative of the (Lindsaeaceae) Queenshnd and Commonweallh Govern 1 0 Cyathea exilis 12Huperzia carinata (Lycopodiaceae)

* Location records Projection: Geographical representation . .

The data has been assumed by ERIN to be correct as received from the data Supplier. Locarion records have been compiled from both specimen and observational data.

Map prepared thmugh the facilitks of the Emimnrnental Remurces Informtian Network ERIN !December 12, 19941 1 CYPLUS is a joint initiative of the Queenfihnd and Commonwealth GDvernmem

Location records Projection: Geographical representation

Sources .... ACT Australian National Botanic Gardens - ACT CSlRO Division of Plant Industry, Aust. Nat. Herbarium Dr. William Laverack - Mr. Mick Godwin - NT Herbarium, Conservation Commiss~onof the NT QLD Comalco Aluminium Weipa - Queensland Herbarium, Brisbane & Mareeba - NSW Royal Botanic Gardens & Nat~onalHerbarlum of NSW. The taxa are classed as vulnerable by the Queensland Herbarium (1994). Caveats.... The data has been assumed to be correct as received from the data supplier. Location records have been compiled from both specimen and observational data. Map shows the spatial distribution of vulnerable flora, there is no indication of the number of species at each point.

Map prepared through the facilities of the Envimnrnenial Resources Infunnation Network ERIN (December 19, 1994) I LEGEND

Reglan 2

Region 3

Rqkn 4 Region 5

I@ CYPLUS Study Area

Rara Rwards Figure 6: Species accumulation curves for Palm records

REUON 4 REGION 2

C

0' 0 100 00 600 RWON5

Number of Samples Figure 7: Histogram and table of percentage area and sites across the "annual mean temperature" classes in Cape York Peninsula. Annual Mean Temperature ... '-. .* . ..,../. .%.. ., .. .* ,. .> ..II.~,~., ...,, ,.,. .> ., ,,.< ..,....,, .,, ...... ,,.,..<. . ,".,. <..I..I. .,..* ...... , . . . .> .".*..,".,, *-... .l,.,.l.,..,.. . - ...*... .., .*.. > - _ ...... 1. i

., . '. i 5 - . L!

.'..' . <. . . . . r . . :"

: ,, - .

. --

a :. U) $ :: !+a a '.-. ,

8 ,. ... ,; . ,.

... .

.. .

, -

.. :, .:::. ,.. .,..: ,! , ,.... :. :,.,>,,:. ,.: .:.. , .. ,: :, .. , .: ., , , :. ....,: j:: . :" . .' a. , - , ' . . I . - - . tan" class umber

I Bars of histogram indicate percentage of whole study area in each class (% Area). Points on line graph indicate percentage of all sample sites located in each class (% Sites).

Tabular data for above figure plus sample percentage in each class (Sample % = % Sites * 100 / % Area). Percentages Percentages Percentages Class Area Sites Sample Class Area Sites Sample Class Area Sites Sample Class 1 0.01 0.41 4100 Class 19 0.03 0.00 0 Class 37 2.21 2.18 98 Class 2 0.00 0.00 0 Class 20 0.09 0.27 300 Class 38 1.50 0.41 27 Class 3 0.00 0.00 0 Class 21 0.14 0.41 292 Class 39 4.36 5.72 131 Class 4 0.00 0.00 0 Class 22 0.12 0.54 450 Class 40 8.08 14.17 175 Class 5 0.01 0.14 1400 Class 23 0.16 0.14 87 Class41 6.21 9.81 157 Class 6 0.02 0.14 700 Class 24 0.27 0.41 151 Class 42 15.58 20.71 132 Class 7 0.01 0.00 0 Class 25 0.23 0.54 234 Class 43 17.27 18.53 107 Class 8 0.00 0.00 0 Class 26 0.21 0.68 323 Class 44 7.13 4.36 61 Class 9 0.02 0.00 0 Class 27 0.39 0.68 174 Class 45 7.66 1.36 17 Class 10 0.00 0.00 0 Class 28 0.39 0.54 138 Class 46 6.48 0.82 12 Class 11 0.04 0.00 0 Class 29 0.22 0.95 431 Class 47 2.87 0.95 33 Class 12 0.02 0.00 0 Class 30 0.59 1.23 208 Class 48 4.5 1 0.27 5 Class 13 0.01 0.00 0 Class 31 0.66 1.09 165 Class 49 3.72 1.50 40 Class 14 0.05 0.00 0 Class 32 0.37 0.14 37 Class 50 3.71 5.72 154 Class 15 0.04 0.00 0 Class 33 0.79 0.27 34 Class 16 0.00 0.00 0 Class 34 1.17 1.09 93 Class 17 0.07 0.54 771 Class 35 0.64 0.82 128 Class 18 0.10 0.00 0 Class 36 1.84 2.45 133 m - MAP 6: Inadequately Sampled Climate Classes I hupl Mesn Temperature by Vegetation Sites fzhss Tevrm(kg CI u lowe qpr am% 25 &.*3 ~tw3.126 ... 26 a,& Qassclf' 26 ... 26 I we 4 I ~clW3.M35 ...x t : - 4 I .* I I t t ..- -. Class47 26 ... 26 "' 1'--*------i6------1I ~ass4326 ,..27 =f.----.---*-*;-- I I . .- I 1%..& .I. I WOm49 27 ... 27 b 'I I I ;* I I t ' I I t ' -y I t I .-$ ' I 1 I I 1. 11 S ;------+a tI tI I 1 1 I I 1 I I I t I 1

I I t I

1 t 1 1 t I t I I 1

t --,-- p. ; I 1 "8 .---'1 I t I r--.------I t 11. +---,. t I L I I Vegetation &my Sites ~~.ovlsional~~~~~~~rrsdyh 1'1 RE& (AWG, 1988)

Fml.aiw rn~May la 1993 -- Reduction Mo.... 1:4500060 t I 4 I I don Alhqlal t r 141 E t ~~els:18ard%~~ I& E 14 E Gntral Mcliaa. 132 degrees East 1d I% Sphicl: Chke 1866

Source.... ~~~~~fittedtodata~~ Site ~CK&QJIS of ~~W~CXIsuw Nelm* ~visidoxst,i~ -National li&ms Mmm -QidBptofLands. Caveats.... Usersare~to~wl*m~and~~~wbes ~mqontainedh~-t htogmn b* Umt (Ap& 1993). ~yO~~~P~b~~k%~~mt ~XWX&fm ah& sacnple mndas nxded pertlb dm.

The output from these files has not been included in the report due to the number and size of them. The climate profile for Macaranga polyadenia which defines the climatic limits of the presence of the species is in Table 8.

Table 8: Climate profile for Macaranga polyadenia.(see Cofinas & Bolton (in prep) for a definition of acronyms.

a) Temperature values

b) Precipitation values

11max 3 167 711 63 210 253 1414

The following figures demonstrating the results from GARP, were screen captured and thus appear as they would after the analysis was run.

BIOCLIM tpe rules. GARP was run while restricted to BIOCLIM type rules. The predicted potential distribution of Macaranga polyadenia is shown in Figure 8. The overall accuracy of the predicted distribution is 83.6%. The predicted presence of the species tends to be clustered around the existing specimen records (blue areas), surrounded by areas of high conflict where both presence and absence of the species is predicted. The majority of CYP shows a definite absence of the species (red).

Because a rule only predicts a single outcome (presence or absence), two rules are needed to simulate a BIOCLIM model. The rules used to make predictions are listed in the following in decreasing frequency of use by GARP...... No-rule number, Sig-significance, Pr-accuracy, Cov-coverage, Use-usage

No Type Prior Post Sig Cov Use

IF NOT TANN= [25,26]degC AND TMNCM= [18,20]degC AND TMXWM= [32,34]degC AND TSPAN = [12,16]degC AND TCLQ = [22,25]degC AND TWMQ= [27,28]degC AND lWETQ=[27,27]degC AND TDRYQ=[24,26]degC AND RANN=[1285,2858]mm AND RWETM=[305,713]mm AND RDRYM=[ 3,661- AND RCVAR=[86,118]mm2 AND RWETQ= [888,1854]mm AND RDRYQ= [17,219]mm AND RCLQ=[30,265]m AND RWMQ=[508,1413]mm AND TMEL= [20,185]m AND TMXEL=[37,246]m AND TMNEL= [-I, 115]m AND TREL= [11,233]m AND TCOEF=[18,194] % THEN TAXON=-1

1 d 0.46 0.96 34.73 0.48 0.05 IF TANN= [25,26]degC AND TMNCM = [18,20]degC AND TMXWM= [32,34]degC AND TSPAN= [12,16]degC AND TCLQ= [22,25]degC AND TWMQ= [27,28]degC AND TWETQ=[27,27]degC AND TDRYQ=[24,26]degC AND RANN=[1285,2858]mm AND RWETM= [305,713]m AND RDRYM= [ 3,66]mm AND RCVAR= [86,118]mm2 AND RWETQ= [888,1854]mm AND RDRYQ= [17,219]mm AND RCLQ=[30,265]mm AND RWMQ= [508,1413]mm AND TMEL= [20,185]m AND TMXEL= [37,246]m AND TMNEL= [-I, 115]m AND TREL= [I 1,233lm AND TCOEF=[18,194] % THEN TAXON=254

The first rule is used 95% of the time of determine the absence of the species (TAXON=-1). The second rule, used 5% of the time determines the presence of the species (TAXON=254). The individual rules list which variables have been used and the range of values in each variable to define presence and absence.

GARP rules. GARP was run a number of times for Macaranga polyadenia, resulting in a number of different predictions due to the nature of the program. Results from two of the predictions are presented in this report to illustrate GARP. The green areas are cells where rules only predict presence, the red only prediction of absence. Yellow areas indicate cells where there are rules that predict presence and rules predicting absence - called conflict areas.

First Prediction The predicted potential distribution resulting fiom GARP rules is shown in Figure 9. The predicted presence of the species is restricted to the north-eastern side of CYP and to the south-east around Cape Melville. There is a high proportion of high conflict areas (in the model) with a core of predicted absence of the species in the southern-central and western area. The accuracy of the predicted distribution is much greater than BIOCLIM rules only, at 97.4%.

The rules used to make predictions are listed in the following. Visualisation

The image below is a visualisation of model ~utput.Blue areas are the recorded locations. Predicti~nssf presence are green and predictions of absence are red. Yellow and brown are areas where rules predict both presence and absence sf a species.

w

1

I

Legend

low conflict absence I high con£lict - presence

Figure 8: Visualisation of predicted distribution model of Macaranga golydettia using BPOCLIM rules in GARP. - -

Visualisation

The image below is a visualisation of model output. Blue areas are the recorded locations. Predictions of presence are green and predictions of absence are red. Yellow and brown are areas where rules predict both presence and absence of a species,

L

I

Legend

absence low conflict I

high conflict presence

. . . .- - . . .- -.

Figure 9: Visualisation of first distribution model of Macaranga polyadeniu using GARP rules, No-rule number, Sig-significance, Pr-accuracy, Cov-coverage, Use-usage

No Type Prior Post Sig Cov Use

0 ! 0.55 1.00 22.90 0.51 0.95 IF NOT TANN=[25,26]degC AND TMNCM=[18,20]degC AND TMXWM=[32,34]degC AND TSPAN= [12,16]degC AND TCLQ= [22,25]degC AND TWMQ=[27,28]degC AND TWETQ = [27,27]degC AND TDRYQ= [24,26]degC AND RANN= [1285,2858]mm AND RWETM=[305,713]mm AND RDRYM=[ 3,661- AND RCVAR=[86,118]mm2 AND RWETQ=[888,1854]mm AND RDRYQ=[17,219]mm AND RCLQ=[30,265]mm AND RWMQ= [508,1413]mm AND TMEL= [20,185]m AND TMXEL=[37,246]m AND TMNEL=[-1,115]m AND TREL=[11,233]m AND TCOEF=[18,194] % THEN TAXON=-1

15 d 0.45 0.93 23.63 0.49 0.04 IF TANN= [25,26]degC AND TMNCM= [18,20]degC AND TMXWM= [32,34]degC AND TSPAN= [12,16]degC AND TCLQ = [22,25]degC AND TWMQ =[27,28]degC AND TWETQ=[27,27]degC AND TDRYQ=[24,26]degC AND RANN=[1285,2858]mm AND RWETM=[305,713]mm AND RDRYM=[ 3,66]mm AND RCVAR=[86,118]mrn2 AND RWETQ=[888,1854]mm AND RDRYQ=[17,219]mm AND RCLQ=[30,265]mm AND RWMQ= [508,1413]mm AND TMEL=[20,185]m AND TMXEL=[37,246]m AND TMNEL=[-1,115lm AND TREL=[11,233]m AND TCOEF=[18,194] % THEN TAXON=254

In this case it is apparent that the BIOCLIM type rules within GARP have high accuracy and significance and are most important in determining the model. The BIOCLlM variables and ranges are identical to the above prediction.

Second Prediction The predicted potential distribution resulting from the second run of GARP to the same dataset is shown in Figure 10. There are few areas where the probability of presence is high (green). These are surrounded by areas where the probability of presence is smaller (light green to yellow). Most of CYP has been classified to be high conflict where predicted presence and absence overlap. The accuracy of the predicted distribution is slightly higher than the previous prediction, at 97.4%.

The rules used to make the predictions are listed below.

No-rule number, Sig-significance, Pr-accuracy, Cov-coverage, Use-usage

No Type Prior Post Sig Cov Use

0 ! 0.53 1.00 23.58 0.49 0.95 IF NOT TANN=[25,26]degC AND TMNCM=[18,2O]degC AND TMXWM=[32,34]degC AND TSPAN=[12,16]degC AND TCLQ=[22,25]degC AND TWMQ=[27,28]degC AND TWETQ=[27,27]degC AND TDRYQ=[24,26]degC AND RANN=[1285,2858]mm AND RwETM=[305,713]mmAND RDRYM=[3,661mm AND RCVAR=[86,118]mm2 AND RWETQ=[888,1854]mm AND RDRYQ=[17,219]mm AND RCLQ=[30,265]mm AND RWMQ=[508,1413]mmAND TMEL,=[20,185]mAND TMXEL=[37,246]m AND TMNEL=[-1,1151mAND TREL=[11,233]mAND TCOEF=[18,194]% THEN TAXON-1

13 d 0.47 0.92 22.670.52 0.02 IE" TANN=[25,26]degCAND TMNCM=[18,20]degCAND TMXWM=[32,34]degC AND TSPAN=[12,16]degC AND TCLQ=[22,25]degC AND TWMQ=[27,28]degC AND TWETQ=[27,27]degC AND TDRYQ=[24,26]degCAND RANN=[1285,2858]mm AND RWETM=[305,713]mmAND RDRYM=[ 3,66]mm AND RCVAR=[86,118]mm2 AND RWETQ=[888,1854]mm AND RDRYQ=[17,219]mmAND RCLQ=[30,265]mm AND RWMQ=[508,1413]mm AND TMEL=[20,185]m AND TMXEL=[37,246]m AND TMNEL=[-1,115]mAND TREL=[11,233]m AND TCOEF=[18,194]% THEN TAXON=254

9 r 0.50 0.95 17.92 0.31 0.02 IF + TWMQ*0.20degC + RWETM*0.04mm - RCVAR*0.35mm2 + SDEP*0.02c THEN TAXON=254

This model resulted in 3 prediction rules. The first rule is an absence rule used 95% of the time and is the same as the two first rules above. The second rule is also the same as the two above and is a presence rule used 2% of the time. Both are again BIOCLIM type rules. The third rule is a logistic regression and was used in 2% of cases to predict presence.

In the second prediction, GARP has discovered one extra presence rule for the predicted presence of the species compared to the first prediction. Visualisation

The image below is a visualisati~nof model output. Blue areas are the recorded locations. Predictions of presence are green and predictions of absence are red. Yell~wand brown are mas where lules predict both presence and absence of a species. r . .

i r. . L Legend absence - low sonf lict 1

high conflict I presence

Figure 16: Visualisation of second distribution model of Mucaranga polyadenia using GARP rules. 5.0 DISCUSSION

A project to collate information on the flora of CYP was seen to be one part of a collection of projects under NRAP to build a natural resource information base. Data and information about,the flora and fauna across CYP would take us one step fbrther to understanding the biological diversity of the area which is important when considering land use plans and decisions on the Cape that will affect the land in the long-term. The many years of time and money put into these collections makes them a very valuable resource so that maintenance of one complete flora database available to users and decision-makers seems a logical step.

In addition to having a flora database in a standard format and available for use in a GIs to a wide variety of users, the other NRAP projects collating and making new flora data available in standard formats have the capability to be incorporated into a similar environment for analysis with the flora data. The availability of the data to CYPLUS users will allow relationships between various aspects of the environment to be explored across the whole Cape not possible previous to CYPLUS except at a very coarse scale.

Before commencement of the project, it was known that data about the flora of CYP was scattered across a number of agencies in the Commonwealth and States and some in private collections. The challenge was to identify the datasets, make some assessment of their suitability and worth/contribution to the database and attempt to obtain the data and integrate it into one system useable by many and for many different purposes.

5.1 Data provided to CYPLUS Flora & Fauna GIs

Access to flora data involved a great deal of liaison with the Custodians of the data as well as determining the time and effort involved into getting the data into ENS. The summary of the datasets incorporated into the database (Table 7, Appendix VIII) indicates that this aspect of the project was achieved with success. Data were obtained fiom large existing databases, such as the Queensland Herbarium, to small specific collections such as Dr William Laverack's orchid data, which was databased from field notes.

Every effort was made to identify as many datasets as possible containing data from CYP. However, time and cost constraints did not allow all important datasets to be identified and/or databased. The datasets excluded fiom the database as identified in Section 3.1.3 should be targeted in the fbture for incorporation into the CYPLUS Flora & Fauna GIs.

It is anticipated that this project be used as a model to indicate the practical aspects of standardising data collection in the field and taking it the next step to incorporating the data into a standard digital format, useable by various agencies (Bolton 1992). Thus, maintenance of the CYPLUS Flora & Fauna GIs over time is critical to supporting and maintaining the currency of the database in the long-term.

It quickly became apparent that much of the data existing about the flora of CYP was concentrated in particular areas or followed particular themes. For example, many collections were concentrated along major roads due to access and others were fiom popular areas such as the McIlwraith Ranges. Therefore, the project served the purpose of collating existing data as well as providing the opportunity to identify gaps in collections in a spatial and taxonomic sense. The bias is particularly obvious in Map 12 of Cofinas and Bolton (in prep) where a VISTR analysis depicting locations of flora specimens in the database clearly shows the locations of major roads. A similar bias in botanical and zoological collections has been noticed in many other datasets, eg. elapid snakes (Longmore 1986) and the Rare or Threatened Australian Plants (ROTAP) database.

This simple overview of the dataset provided by the distribution map could immediately be used to identi@ gross spatial gaps in the flora collections across CYP.

Once a large proportion of the data had been loaded into the database, and longer term contracts were established, a different approach to the data was taken. It was recognised that while the specimen data would provide a very good base to the dataset, more was needed.

Enhancements to the data identified included an evaluation of the quality of the data as well as classifications of the data into themes such vegetation communities (NRO1 project), lists of rare and threatened (R&T) plants, endemic plants, determining the variety of life forms across the Cape, distributions of records across environments of CYP, regions of high floral diversity, the importance of site data as opposed to only specimen data and the link to habitat information.

Most of these essentially required a complete or near complete flora dataset of CYP to make any conclusions about the region's floralvegetation. Some results were obtained within this project and others will be completed within the CYPLUS Land Use Program project: Conservation and Natural Heritage Assessment.

5.1.1 Data quality

Taxonomv. Obtaining an accurate, up-to-date list of plant taxa within the study area proved to be very time consuming. A number of problems were identified which had to be overcome to produce the coherent flora dataset.

1. The taxon list for CYP at the Queensland Herbarium was not up to date because of the work undertaken by the NRO1 Vegetation Mapping Project. The list was continuously being updated during the project by officers in Mareeba, and a final standard list was not available fiom this source until the latter half of 1994.

2. Separate lists are maintained by the Queensland Herbarium, Brisbane and the Australian National Botanic Gardens (CBG), Canberra. ENS is based on the taxonomy maintained at CBG, based on the Australian Plant Name Index (APNI) (Chapman 1991). This means that data received fiom Queensland was rejected by ERIS due to taxon name clashes. It was known that the APNI list was out of date compared to the list maintained in Queensland, therefore a separate loading list had to be included in ERIS for the CYP data.

All records loaded into ERIS that corresponded to the taxon list provided by Queensland were included in the NR18 flora database supplied to the CYPLUS Flora & Fauna GIs (Bryett et al. 1994). The examples provided, Maps 1 and 2, show all the tree records in the database, both offshore and mainland. The validity of the offshore records needs to be checked by the Custodians of the data (see below). Geocodina. All records were run through the mainlandoffshore test and are flagged in the database with respective codes. Only selected taxa were validated using BIOCLIM as this had to be run for each species and was time consuming. An additional (not CYPLUS) hnded staff member was allocated to this task for six months. In order for the BIOCLIM component of the validation to be accurate, a complete or near complete database was required. It was not practical to wait until the end of the project to validate, therefore, once the majority of the data was received the BIOCLIM validation was run only on the largest taxonomic groups. This is why there are still many records in the database without a valid or suspect code.

'Offshore' records identified by the maidandoffshore test were not included in analyses of the flora data. The use of 'mainland' records is outlined below:

* All mainland records were used for VISTR analyses as it is an exploratory data analysis tool used to identi@ gaps in the data. Checks with valid records is recommended to highlight differences between the two. * In this report only 'valid' records were used to display the results from species distribution mapping but 'suspect' records could be included if necessary. * 'Valid' records only were included in species distribution modelling. This is the recommended standard approach, as the inclusion of suspect records is most likely to decrease the accuracy of predicted distributions.

The geocode validation program developed at ERIN is continually being refined and updated and includes work to increase the accuracy of the mainlandoffshore test. This test is important with respect to records occurring along the coastline, where the definition of what the coastline is often hzzy.

To resolve the problem of validating data as it flows in during a project, ERIN needs an automated system which allows data to be validated as it is received from the Custodian. This is currently in the initial stages of development. Corrections to suspect records made by the custodian and updates of databases need to be incorporated into the general maintenance of the flora database to keep it current.

5.1.2 CYPLUS Flora & Fauna GIs

The data provided to CYPLUS is detailed in Bryett et al. (1994). The majority of the data were provided as received fiom the data Custodians (that is, point locations) except for sensitive taxa, such as the rare and threatened plant taxa. These were gridded to 3 minutes due to the sensitivity in releasing their locations. A number of tools to extract the data were incorporated into the CYPLUS Flora & Fauna GIs based on commonly asked questions of the data. The examples (Maps 1 and 2) were very quick to produce and give a quick view of the entity analysed.

At this stage, the maintenance of the CYPLUS Flora & Fauna GIs is unclear. ERIN does not plan to maintain the consolidated database as a stand-alone since superior tools and data providing a wider context are already available in ERIS. The GIs also needs to be reviewed by users of the data for its ease of use and contents. Advantages and disadvantages identified by users will provide valuable feedback to ERIN, other government organisations constructing such databases, scientists, local government and community groups. 5.1.3 Value-added data

The applied categories determined for the flora have proved to be very usehl and should be developed further. The idea of applied categories is to allocate species to one or more higher- level non-taxonomic groups (eg. hnctional groups or life forms). Meaningful summaries of species location data can then be made against each applied category relevant to the application. When the underlying primary species location data are changed, there is no need to alter the applied categories (unless the species name changes) to enable repeated standard analyses. As a summary, they have been invaluable in providing a broad picture of the flora of CYP and provide aggregations of the data usehl for many purposes. They have also been used for VISTR analyses to determine where data gaps lie.

The lists of taxa determining the applied categories also need to be maintained as new data becomes available and hrther research sees taxa added, removed or changing in status.

5.2 Analyses

5.2.1 Species distributions

The range of a taxon's distribution can be determined for any individual taxon or collections of taxa. An example of this can be seen in Maps 3 and 4. Such analyses are a very important input into an informed decision about the management of CYP and provide a very simple picture of various aspects of the flora. Emphasis on distribution, association with vegetation communities and the distribution of the vegetation communities across CYP will help build a picture of biologically important regions of CYP.

One analysis presented in this report concentrated on R&T's because of their role in the assessment of impacts of disturbance resulting from activity within a region. R&T plants generally have restricted distributions, some covering only a few square kilometres. Just under 10% of the plant taxa of CYP have a conservation status and the limited distributions of endangered plants is highlighted by the maps in Map 3.

Application of distribution maps. An investigation of R&T's within a region may follow these steps:

1. Determine distribution of all R&T's within the region;

2. Determine distribution of individual groups, such as endangered and vulnerable flora (Map 4);

3. Plot distributions of targeted individual taxa (Map 3) and relate these to their presence outside the region (that is, place them into a wider context); and

4. Collate information, perform overlay analyses, determine priority for conservation based on R&T1sand make decisions on the importance of the region with reference to R&T1s. This scenario covers only one aspect of the flora. R&T1s,and indeed any one theme should not be looked at in isolation. Other important themes as mentioned above will include centres of endemism, areas representative of vegetation communities, relict flora, aesthetically important areas etc. The list of attributes of the flora considered to play an important role on the Cape is potentially very long and needs to be prioritised within the context of CYP and within the context of Australia.

The importance of placing these analyses within the Australian context cannot be underestimated. The national significance of a particular taxon or group of taxa will play an important role in determining decisions made within a region. For example, an endangered species found only in CYP and nowhere else in Australia will have more weight in a decision than a species found in adjacent regions also, especially where they are already conserved in other regions. The national context helps to sort out priorities in an otherwise complex environment.

5.2.2 Species modelling

The next step in identifLing a taxon's distribution from existing data is to model its potential distribution within the region. Where little is known about the taxon, this type of analysis can aid in gaining a region-wide or continent-wide perspective on the potential environments it can occupy, based upon its known distribution.

An intensive sampling survey across the Cape would remove the need for species modelling, but across such a large area this would never be practical or cost effective. Therefore, modelling provides the best solution at this stage to providing information on important species.

Its application is important with both common and uncommon species and can work in two ways.

1. The model can be used to target potential sites where the species can be found and thereby restrict surveys to smaller areas where the chance of collection is higher,

2. The taxon's environmental profile can be used to locate other regions with similar environment's. Where these occur for example, in another State, specimen records held within that State could be targeted and included in the modelling procedure. This is particularly usefbl if data is only obtained for the region in question and it does not cover a biogeographic region but is defined by, for example, a local government area (LGA). Taxa are usually not restricted to any LGA but is more likely to be restricted by biogeographic elements. This is where the environmental regions generated for CYP may be used. The new and more complete dataset can then be modelled to support targeted field surveys (see 1 above).

The results of modelling Macaranga polyadenia present many questions to the analyst. The three models applied showed similar yet different patterns. The BIOCLIM model is the most simple to apply. Based on a climatic envelope it defines suitable and marginal sites where the species may occur. Because of the algorithm behind the model, it produces a conservative model, where all potential sites are identified without significant restriction. It is highly likely that where the presence of the species is not predicted, it will not occur. The two models developed using GARP, produce similar distribution maps to the BIOCLIM model. To an extent this is to be expected since the variables used in BIOCLIM are also used in GARP, and BIOCLIM type rules incorporated into GARP simulate BIOCLIM analyses. A comparison of the climate profiles and ranges for each variable for the three models substantiates this. GARP goes one step fbrther in modelling with additional forms of model and variables, with a greatly increased overall accuracy.

The addition of other variables, such as vegetation communities, which affect a species distribution, would increase the accuracy of the prediction models produced by GARP. Macaranga polyadenia is restricted to lowland rainforest and swampy sites on Cape York Peninsula. The incorporation of vegetation into the model could potentially increase the accuracy of the prediction and decrease the degree of conflict in the model.

The validity of the algorithms and models produced by GARP have yet to be tested scientifically or in the field. The technique is still in its early phases of development and should be used with care. The results presented here are encouraging, but the reasons for the differences between the first and second prediction need to be identified and explained. The interpretations require a knowledge of GARP as well as the ecology of the species in question.

In terms of designing a field survey, areas of absence can be masked out. High conflict areas where the species may be present should be noted but the probability of finding it in these areas is low. Areas where the species is predicted as being present should be a priority for field survey. The results from the different models, where they essentially result in similar potential distributions, should be overlayed and the union of these identified for fbrther survey. If this amounts to large area, then the model with the most restricted prediction of presence could be targeted first, moving on to other areas where required.

5.2.3 Identieing gaps in the data

VISTR. The results from VISTR are not only useful in determining spatial gaps in the flora data, they produce usefil information about the flora as sumrnarised by the groups/regions provided. In this case, summaries of the flora for each Region in the 5 group regionalisation were produced (Cofinas & Bolton in prep, Table 7, Map 12).

A number of problems were encountered in the interpretation of the species accumulation curves which will be addressed in the next version of VISTR. These include: * the length of time to run the program across large datasets; * an anomaly in curve fitting in S-PLUS; * the steepness of the curve varies with axis scales for each analysis, axes need to be standardised across analyses; and

* explicit criteria for determination of curve shape needs to be implemented such as fitting curves to explicit models to assess whether an asymptote has been reached (Soberon and Llorente 1993). Patterns in the records for the Palms of CYP are evident from Map 5 (distribution of Palm records) and the curves in Figure 6. The shape of the curves may be interpreted in a number of ways where:

* Regions 1 to 4 are truly undersarnpled for Palms and require more collection; * the unique Palm taxa within Region 5 have been sampled by surveys to date; * Region 5 is a region where most Palm taxa occur naturally due to some environmental attribute or vegetation type;

* the distribution of Palm taxa in Regions 1 to 4 is restricted due to some natural constraint whether physical or biological (the distribution of Palms tends to be within wetter, low areas where rainforest occurs; they also occur hrther inland where there are wet areas); and

* the explanation could also be a combination of the above, for example, palms are restricted by some factors in Regions 1 to 4 but are still undersampled within these Regions.

At this stage, the curves should only be regarded as indicators because the dataset is not complete and the methodology is in a developmental stage. To make conclusions about what the curves represent, the curves need to be fitted against a model and a knowledge of the ecology of Palm taxa is necessary.

The application of the entire flora dataset to VISTR can be found in the NR11 report (Cofinas & Bolton in prep).

It should be emphasised that this analysis is not restricted to specimen records nor to an environmental regionalisation. The method was also applied by the Queensland Herbarium, Mareeba (NR01 project) using the vegetation survey sites and vegetation mapping units to determine whether the mapping units were adequately sampled (Neldner et al. in press).

SPAM. The application of SPAM across many variables which influence the distribution of taxa helps isolate particular attributes of the environment which have not been sampled by existing surveys. The example in this report should not be used in isolation to other variables but for the sake of illustration the results were interpreted based on the 'annual mean temperature' variable.

The analysis was completed by interactively specifling the variable of interest and the point locations (sites) to intersect with the variable. On the screen, a map and histograms were displayed; the user interpreted the undersampled classes from the histogram and specified them interactively on SPAM. The results were then saved to postscript files resulting in Map 6 and Figure 7.

The area considered to be a priority for sampling can be examined against other information such as existing sites, road network, accessibility etc. to determine the most suitable locations for sampling within the undersampled area. In this example, undersampled classes fell within the major road network, an ideal area to sample, because of accessibility. The method can be applied to any number of attributes as discussed in Section 2.7.4, and the overlap of undersampled areas can be used to target areas by priority as determined by the number of attributes undersampled. For example, an area with 5 environmental attributes undersampled should be given priority for survey than an area with only two undersampled. This will increase the efficiency of the surveys (saving time and money on repeated surveys within one area) and reduce the number required. See Neldner et al. (in press) for further discussion of SPWrelative to other approaches.

5.3 Potential uses in CYPLUS Stages I LUP and 1L

5.3.1 CYPLUS Stage I LUP

The flora database can be used for any analysis within the Land Use Programs where there is a requirement for information on the flora of CYP. In particular, the data will be used for the Conservation and Natural Heritage Assessment in conjunction with the flora and fauna data collated from other NRAP projects.

Conservation and Heritage Assessment. The flora database will be used to identify sites of significance on CYP. The ease with which the flora data can now be used for a complete analysis of CYP is a definite advantage at this stage of CYPLUS. The major datasets have been collated into one database with the minimum set of attributes associated with them. The present database will be used as a base to build upon, add more data and attribute existing records.

The following attributes will be derived from the flora dataset for the Conservation and Heritage Assessment (CNHA) using methodology and criteria recently applied to the Central Highlands of Victoria (Anon 1994):

* primitive, endemic, disjunct and relictual species; * flora community diversity; * plant diversity; and * rare plants (lists and concentrations).

These attributes will form the basis of the delineation of areas of high value. The combination of floral values with faunal values will be used to show how areas and features with conservation and heritage values overlap. These areas could also be intersected with the areas presently conserved to determine species and communities at risk.

The CNHA is currently developing an overall assessment of conservation values with reference to obligations under international treaties or conventions, national programs, state responsibilities and regional initiatives.

5.3.2 CYPLUS Stage I1

Stage I1 of CYPLUS will see the development of principles, policies and mechanisms to formulate a land use strategy for CYP using the results from Stage I of CYPLUS (NRAP, PPP and LUP). The strategy is aiming to provide a direction and a framework within which specific issues can be systematically addressed.

The potential use of the flora database in Stage 11 may include the following:

* defining a baseline of data for monitoring through time; * the basis for biotic regionalisations (plant communities); * greenhouse scenarios; * assessment and preservation of plants as cultural resources; * assessments of disturbances, such as weeds and fire; * resource assessments (e.g. ecotourism, horticultural potential); * results from the Conservation and Heritage Assessment program will identi@ key conservation and heritage sites, a direct input into Stage II;

* basis for pasture assessment, an input into decisions about the pastoral industry on CYP; and

* more sound base for putting CYP into a continental context as a result of management decisions having implications in other regions.

5.4 Significance of results

5.4.1 Evaluation against objectives

There were four objectives identified: * To collate existing pre-CYPLUS data on the locations of CYP plant speciesffom a variety of sources.

The first objective was met. Approximately 17 sources of flora data across CYP were collated with many other sources identified. A large amount of literature on the floralvegetation of CYP was also collected during the project. * To integrate existing plant data with newly collected data porn the hR4Pproject NROl .

The second objective was met. ERIN encountered a number of problems whilst incorporating the NROl data into ERIS because CORVEG is still under development and fields are coded but this was overcome. All the data received during the project was loaded into ERIS and presented as a single database. * To pe@orm some simple analyses to idenh3 gaps in the data and predict the likely distribution of selected species.

This objective was partially met. Techniques were developed within ERIN to identifjl gaps in the data and predict species distributions. These were run on test data during the project to develop the techniques, but the complete loading of the database was not completed until the end of the project.

This left no time to complete thorough investigations to determine gaps in the flora database. A few analyses were completed and presented in the report to highlight the use of these techniques.

The modelling component of the objective was met and two techniques were used to model the distributions of selected species. A number of species were modelled during the project upon request. Little time was able to be devoted to a detailed assessment of the taxon modelled.

* To provide suitable computer toolsfor accessing flora data in the CYPLUS GIS.

This objective was met. A number of tools were provided with the flora database for accessing the data within the GIs. Details of these are in Bryett et al. (1994).

Most of the time on the project was spent on Objectives 1 and 2, followed by Objective 4 then Objective 3.

5.4.2 Scientific or methodological advances

Primary data collected for a variety of floralvegetation surveys are often overlooked once it has been interpolated to polygon mapping units. It is often the case that the primary data are lost, so that reinterpretation of the polygon information is not possible at a later date. This project located the data from a number of such surveys and databased the site information from these. Therefore, these datasets need no longer be used in isolation of other datasets, both historical and new.

The collation of the backlog of flora data for CYP was essentially a one off event and once completed for data collected to date should be maintained and added to as new surveys are completed. This will enable a very comprehensive dataset for CYP to be developed over time from the wealth of information being collected. The maintenance of such a dataset will enhance any flora work on CYP for a number of reasons:

* repeated data collections initiated by different surveys over one area will not be necessary unless extra attributes for collection are identified;

* large areas where no data collection has taken place can be investigated as possible new sources of information;

* comprehensive analyses can be run at any time on the most current dataset available for CYP;

* flora and fauna surveys could run in conjunction after analysing the two sets of records in conjunction; and * other surveys (eg. soils) covering other disciplines can be added on to new flora surveys or vice versa to increase overall efficiency.

A 'minimum set' of data was defined for databasing the flora records as required for GIs applications. The minimum dataset can always be worked upon to add as many attributes necessary for particular applications eg. addition of habitat information to individual records or linking site information to records. The 'minimum set' of data collected should form the basis of all new surveys to allow full integration of the data.

The dataset provides the most comprehensive overview to date on the collection of flora data across CYP. With the results of the NROl project it covers one of the most important natural resources of CYP.

The comprehensiveness of the dataset will support the testing of tools developed to analyse the data. For example, well-sampled species could be used to test new modelling techniques developed such as GARP. The results of analyses have already been used by surveyors in the field to target particular areas on CYP. The approach to the problem clearly has wider applicability to other regions in Australia. This is also true of the analysis tools which have been developed to run using any datasets. Point-based species records (especially survey records NRO1) are compatible with emerging theories about the balance of nature (eg. Walter & Paterson 1994). New data collected from surveys can be used in addition to existing data to refine vegetation classifications of CYP and to illustrate the variability within polygons on vegetation maps.

5.4.3 General community benefits

One of the major achievements of this project has been the collation of flora data fiom historical surveys on CYP which would otherwise have been lost, buried and stored on paper due to lack of money and time to database them. Two examples include QDEHs site data fiom the land unit survey of the Weipa area (Godwin 1985) and orchid data collected by Dr William Laverack.

The current flora database is by no means a final product but is well developed and easily maintained. It is accessible to the community of CYP but more work is necessary before it is totally available because of constraints placed upon the data by the Custodians of some datasets. Access to the CYP flora information by electronic means is already technical feasible (Carrington 1991; Boston and Stockwell 1994; Chapman and Busby 1994; Stockwell 1994).

The data can be seen as an increase in the knowledge base of the flora occurring on CYP and will allow more informed decisions to be made about the management of the region as a whole and particular areas where conflicts may arise. Decisions about the Cape, made on the basis of improved and current information will directly benefit the community and potentially decrease the impacts on the community which would arise fiom lack of access to existing data.

The knowledge gained from such a database is consistent with Ecologically Sustainable Development (ESD) which is a national strategy to enable decisions to be made about an area aimed at a sustainable land-use strategy for the region. The database is available to sub-regional interest groups to use for their own investigations, such as species lists for a small area of interest, as well as looking at the basis of management and land-use decisions made about the area.

Local government authorities, responsible for planning in the region will have access to the data and can use it in conjunction with other information to develop and assess land-use strategies proposed for the Cape.

Landholders interested in obtaining detailed information about their property or even an overview can extract existing information about their property. Information about adjacent properties will also aid in assessing decisions made about their property. Therefore, information can be extracted at a number of levels from property, to local area, to region wide. 6.0 ACKNOWLEDGEMENTS

Funding for the project was provided by CYPLUS, NRAP under NR18, Flora Distribution and Modelling; administration and systems support fiom ERIN, ANCA and later DEST.

We thank and acknowledge all Custodians of the datasets incorporated into the CYPLUS Flora & Fauna GIs for giving permission to use their data. In particular the following providers of datasets; John Neldner and John Clarkson (Qld Herbarium, Mareeba), all Herbaria and in particular the Queensland Herbarium and CSIRO Herbarium (Atherton and Canberra), Habiba Gitay, Mick Godwin, Bill Laverack, John Bunt, and the Comalco Herbarium, Weipa.

A number of staff at ERIN worked on the project. Andrew Vanspall provided great support in loading the data, validation and BIOCLIM modelling and solving lots of problems with the data. Barbara Barnsley worked on correcting and updating the taxon list for CYP and Helen Lawley was invaluable in running the geocode validation test for the data. Thanks to David Stockwell with advice on modelling with GARP.

Much support was received fiom John Neldner and John Clarkson in providing the data fiom surveys across CYP (NROI)and access to previous surveys for databasing. The high quality data fiom NRO1 was invaluable to the flora database. Many discussions were held with them regarding the analysis of the data and methods to identifl gaps in the data with valuable feedback provided to ERIN.

We also thank Mr Ian McNaught, NRIC and Mr Graham McColm, QLands for their help in developing the data standards used in the CYPLUS Flora & Fauna GIs. 7.0 REFERENCES

Anon (1994) Methodpapers: Central Highlancls Joint Forests Project. Vol 1- Natural Values. Australian Heritage Commissions and Dept. of Conservation & Natural Resources, Victoria. August 1994, 106pp.

ANZECC Endangered Flora Network (1993) ANZECC List of 7hreatenedAustralian Flora. June 1993. Prepared for the Australian and New Zealand Environment and Conservation Council (ANZECC). Australian Nature Conservation Agency, Canberra.

Bolton M.P. (1992) Core attributes for terrestrial biological survey data in Australia. In: Bolton M.P. (Ed.) Vegetation:From Mapping to Decision Support. ERIN, Canberra.

Boston, A.N. and Stockwell, R.D.B. (1994) Interactive species distribution reporting, mapping and modelling using the World Wide Web. ERIN WWW Server, URL: http://www. erin.gov.auldatabase/www-FALL94/speciesqaper.html.

Bryett A.J., M.P. Bolton and D.C. Crossley (1994) Guidelinesfor the use offlora and fauna data on the CYPLUS FLORA & FAWA GIS, Version 1.0, August, 1994. ERIN Unit, Canberra.

Busby, J.R. (199 1) BIOCLIM - A Bioclimate analysis and prediction system. In ed.Margules, C.R & Austin, M.P.) Nature Conservation: Cost Efective Surveys and Data Analysis. p.64-68. CSIRO, Australia.

Carrington, P. (199 1) Biology standing at the river edge. Natural Science 1:4-9.

Chapman, A.D. (1991) Australian Plant Name Index. Aust. Flora and Fauna Series Nos 12- 15, Aust. Biol. Resources StudyIAGPS,, Canberra, 3055pp. (in 4 vols).

Chapman, A.D. and Busby, J.R. (1994) Linking plant species information to continental biodiversity, inventory, climate modelling and environmental modelling. In ed. Miller, R.I.) Mqping the Diversity of Nature. pp179-195. Chapman & Hall, London.

Cofinas, M. and Bolton, M.P. (1994) NRII Environmental Region Analysis. unpubl. NRAP/CYPLUS report.

Connell Wagner (1989) Cape York Peninsual Resource Analysis. Prepared for Premier's Department. Cairns.

Cresswell, I.D., Thackway, R., Malafant, K., and Bull, A.L. (1993) Assessing the conservation of the major Murray-Darling Basin ecoystems. Phase 2, Part I: Environmental regions and mode IZing of endangered and vulnerable species. Final report to the Murray-Darling Basin Commission. Environmental Resources Information Network, Australian Nature Conservation Agency, National Resources Information Centre, Canberra. Report No.2. Date, C. J. (1 990) An Introduction to Database Systems. Volume I. Fifth edition. The Addison-Wesley Systems Programming Series. Addison-Wesley Publishing Company.

Godwin, M.D. (1985) Land Units of the Weipa Region of Ausfralia's Cape York Peninsula. Report compiled by Research and Planning Branch for Comalco Ltd. Queensland ,NationalParks and Wildlife Service.

Hnatiuk, R. (1990) Census of Australian Vascular Plants. Aust. Flora and Fauna Series No. II, Aust. Biol. Resources StudyIAGPS, Canberra, 650pp.

Kestal Research, CSIRO Centre for Spatial Information Systems and Computer Power Pty Ltd. (1990) User Requirements of the Arts, Sport, the Environment, Tourism and Territories Portfolio for the Environmental Resources Information Network. Final Report. 28 February 1990.

Longmore, R. (1986) Atlas of Elapid Snakes of Australia. Australian Flora and Fauna Series Number 7. Bureau of Flora and Fauna, Canberra. AGPS, Canberra.

McKenzie, N. and Hook, J. (1992) Interpretations of the Atlas of Australian Soils. Draft Consulting Report to the Environmental Resources Information Network (ERIN). Technical Report 9411992. CSIRO Division of Soils, Canberra.

Neldner, V.J.,Crossley, D.C. and Cofinas & Bolton, M. (in press) Using Geographic Information Systems (GIs) to determine the adequacy of sampling in vegetation surveys. Biological Conservation.

Northcote, K.H. et al. (1 960-68) Atlas of Australian Soils. Explanatoly data sheets for Sheets 1-10. CSlRO Division of Soils, Adelaide.

NRIC & CSIRO (1991) Digital Soil Atlas: after Northcote, K.H. et al. (1960-68).

Richards, B.N., Bridges, R.G., Curtin, R.A., Nix, H.A., Shepherd, K.R., and Turner, J. (1990) Biologi'cal Conservation of the South-East Forests. Report of the Joint Scientific Committee to the Hon. Alan Griffiths, M.P. and the Hon. Ian Causley, M.L.4 Canberra.

Soberon, J.M. and Lorente, J.B. (1993) The Use of Species Accumulation Functions for the Prediction of Species Richness. Conservation Biology. 7(3):480-488.

Staff of the Queensland Herbarium (1994) Queensland Vascular Plants. Names and Distribution. Qld Dept of Environment and Heritage, Queensland Herbarium, Brisbane, October 1993.

Stockwell, D.R.B. (1994) Genetic Algorithm for Rule-set Production (GARP). ERIN WWW Server, URL: http:lkaos.erin.gov.au/general/biodiv~mode1/ERINlGARP/h0me~html.

Thackway, R. and Cresswell, I.D. (1992) Environmental Regionalisations of Australia. A User-Oriented Approach. Environmental Resources Information Network, Canberra. Thomas, M.B., and McDonald, W.J.F. (1989) Rare and Threatened Plants of Queensland. 2nd edition. Department of Primary Industries, Queensland Government, Brisbane.

Walter, G.H. and Paterson, E.H. (1994) The implications of paleontological evidence for theories of ecological communities and species richness. Awt. J. Ecol. 19:24-250. 8.0 APPENDICES

APPENDM I

Cape York Peninsula Flora Data: Expressions of Interest

APPENDIX Ia Advertisement

The following advertisement was placed in i%e Weekend Australian newspaper on 252614192:

Australian National Parks and Wildlife Service

EXPRESSIONS OF INTEREST

Cape York Peninsula Flora Data

The Environmental Resources Information Network has funds for three projects: Flora Data and Modelling, Fauna Distribution Modelling, and Environmental Region Analysis, as part of the Natural Resources Analysis Program of the Cape York Peninsula Land Use Strategy (CYPLUS). CYPLUS is a joint initiative of the Commonwealth and Queensland Governments.

The aim of the Flora Data and Modelling project is to make information on the flora of Cape York Peninsula (CYP) available to environmental decision-makers through spatial information systems. Expressions of interest are sought from agencies and individuals who wish to collaborate with ERIN to collate existing data on the flora of CYP.

Data sought include site survey data with comprehensive species lists and associated environmental data such as landform, soils etc., locations of rare and endangered species and the common "land cover" taxa on CYP.

Interested parties can request "Guidelines and Selection Criteria" from Dr AM Bull (06) 250 0279 @h), (06) 250 0360 (fax). The closing date for expressions of interest is 30 April 1992. APPENDIX Ib Guidelines and Selection Criteria

Expressions of Interest

CAPE YORK PENINSULA FLORA DATA

Guidelines and Selection Criteria

INTRODUCTION

The Cape York Peninsula Land Use Strategy

The Cape York Peninsula Land Use Strategy (CYPLUS), a joint initiative of the Commonwealth and Queensland Governments, is intended to provide a sound basis for land use decision-making in the future in the Peninsula. In Stage 1 of CYPLUS the emphasis is on gathering a large amount of primary data; much of this through 19 projects in the Natural Resource Analysis Program (NRAP) of CYPLUS. The following State and Commonwealth agencies are implementing NRAP projects:

Qld Dept of Environment and Heritage (including Qld National Parks and Wildlife Service) Qld Dept of Primary Industries Qld Dept of Resource Industries Qld Dept of Lands Environmental Resources Information Network (ERIN) Bureau of Mineral Resources, Geology and Geophysics National Resource Information Centre (NRIC).

ERIN Cape York Peninsula Projects

ERIN'S mission is to provide geographically-related environmental information of an extent, quality and availability required for planning and decision making. For further details regarding ERIN, please see enclosed sheet.

ERIN has three projects in NRAP, entitled:

Flora Data and Modelling Fauna Distribution Modelling Environmental Region Analysis

The objective of all three ERIN projects is to coordinate existing data from many sources and make the aggregated data available to environmental decision-makers. For further details, see enclosed sheets. ERIN is also contributing to a directory of sources of data on Cape York Peninsula (CYP). Data from specimen collections (museums, herbaria, etc.) and site survey data will be integrated into spatial information systems, through the databasing of spatially-referenced attributes. The specific aim of the Flora Data and Modelling project is to make information on the flora of CYP available to environmental decision-makers through spatial information systems.

This document sets out criteria for selecting flora and flora-related datasets to contribute to the ERIN "Flora Data and Modelling" project, and provides guidelines for collaborative arrangements to make these data available to ERIN and CYPLUS.

SELECTION PROCEDURE

Individuals or agencies interested in making their Cape York Peninsula flora data available to ERIN (and ultimately CYPLUS) should reply, addressing the selection criteria below, in no more than three pages. Supplementary material may be attached for further information. Expressions of interest should include:

1. Regional coverage of data. 2. Scope of data i.e. Taxa covered. 3. The number of siteslrecords covered by the data. 4. Input data format i.e. Field Notebooks, Free text, Structured text, GIs Coverage, Map etc. 5. Details of proposal. 6. Project outputs 7. Proposed timetable. 8. Required resources and budget (if applicable).

Funds will progressively be available over the next two years to collate flora site data and Expressions of Interest may be held over for further development over this period. Individual datasets will be prioritized according to the selection criteria, below. Selected applicants may be invited to develop more detailed proposals, which could involve funding by contract with ERIN for specific outputs. Applicants should therefore indicate the preferred timing of their proposed activitiesJsub-projects.

SELECTION CRITERIA

Ideal Data Set Characteristics

1. Data from within the CYPLUS Study Area (see Operational Definitions 1).

2. Accurate and precise site locations, preferably geocoded (e.g. latitude and longitude). In particular, individuals or agencies wishing to database CYP herbarium specimens should indicate the percentage of records meeting this criterion, and how the estimate was reached.

3. Data collected with explicit, systematic sampling methodologies.

4. Data quality control on taxon identifications, e.g. vouchers and/or determinavit by Qld Government Botanist or other professional botanist referring to a major herbarium. In particular, individuals or agencies wishing to database CYP survey data should indicate the identification source(s) for their data.

5. Data in, or capable of being entered into, a relational database system with good importJexport facilities. (Oracle preferred, but not essential; flat-file ASCII text negotiable).

6. Attribute data collected to documented standards (e.g. " Yellow book, " McDonald et al., 1990) are preferred.

7. Data are interpreted with analyses, maps and publications available to ERIN and CYPLUS.

Priorities (for integration into ERIN CYP flora spatial information system)

1. Flora site data with a comprehensive overview of CYP for the whole plant taxon (family, genus or species), particular biomeJvegetation type, or functional group (e.g. vines).

2. Rare and endangered species (Thomas & McDonald, 1989: Rare and Threatened Plants of Queensland, 2nd Edition), and newly-discovered taxa.

3. Site survey data with comprehensive species lists and environmental attributes, such as elevation, landform, hydrology, soils, vegetation structure and species' abundances.

4. Flora site data for land cover taxa, especially species from the following plant families or genera, in approximately descending order of priority:

Myrtaceae - particularly Eucalyptus, Melaleuca, Leptospermum, Sinoga, Thryptomene, Tristania s.l., Fenzlia, Syzygium Mimosaceae - particularly Acacia Poaceae Euphorbiaceae - particularly Petalostigma Caesalpiniaceae -particularly Erythrophleum, Lysiphyllum ( & Piliostigma ) Combretaceae - particularly Terntinalia Proteaceae Lecythidiaceae Rubiaceae CyperaceaelRestionaceae Casuarinaceae Chenopdiaceae Fabaceae Sterculiaceae Xanthorrhoeaceae Pandanaceae Ebenaceae Moraceae Callitris Polycarpaea Verbenacae - Avicennia, Clerodendrum, Lantana & Premna Rutaceae - Boronia Anacardiaceae - esp. Blepharocarya Araucariaceae Sapindaceae - esp. Cupaniopsis, Dodonaea Cycadaceae

5. Locally-prominent species in other taxa or categories not in 4 (above), e.g.: Bombax, Alphitonia, Parinari, Bursaria, Atalaya, Grewia, Calophyllum, Byblis Epacridaceae Convolvulaceae non-chenopod salt-marsh taxa wetland species exotic species - esp. Cryptostegia, Calotropis, Hyptis woody mangrove species

6. Indicators of environmental health and/or land degradation (please justify selection).

7.Other woody perennial angiosperm taxa not listed above.

8. Aquatic taxa, herbaceous taxa, ferns and fern allies, and non-vascular plants

9. Taxa not in groups listed above.

Collaboration

Willingness to collaborate with ERIN, where appropriate on a day to day basis, is an essential requirement for access to CYP funds.

All primary data collected, databased (or analysed) using CYPLUS funds must be made available for use by all agencies involved in CYPLUS. The timing of this availability is negotiable, with February 1994 the latest possible date allowable. Special arrangements will be made for sensitive datasets e.g. rare and threatened plants, or data which is being prepared for publication.

In the initial stages, emphasis will be on entering handwritten and textual data into RDBMS. Where individuals or agencies have existing digital data (RDBMS or GIs) , projects could involve further analysis and interpretation of those data in collaboration with ERIN. In addition to primary site data, ERIN is interested in making interpretations of CYP data available to CYPLUS, through appropriate technology. This could include text-retrieval systems, image databases for photographs and GIs for polygon maps.

All arrangements are subject to negotiation to attempt to meet the needs of all parties (collaborators, ERIN and CYPLUS-family of agencies) involved.

Field projects to collect new flora data will not be considered for funding until early 1993. ERIN would expect such surveys to be heavily targeted ( according to gaps in the database as these become apparent ) to make maximum use of limited funds.

While applicants may also wish to make physical environmental and/or fauna data available to ERIN and CYPLUS, almost no funds are presently available for this purpose. However, individuals and agencies interested in collaborating with ERIN in any of the three topic areas are invited to submit Expressions of Interest.

All interested agencies and individuals are invited to contribute to the development and refinement of national data standards for biological site surveys. These are presently being coordinated by ERIN. OPERATIONAL DEFINITIONS/GLOSSARY

1. The CYPLUS Study Area comprises all terrestrial areas North of the southern boundary of Cook Shire; i.e North of about 16.5 degrees S. Additional areas include Carpentaria Shire North of the Nassau River, Wujal Wujal and three mainland islands: Thursday, Horn and Prince of Wales. The marine boundary extends to the three-mile limit.

2. 'Flora' is defined as all taxa in the plant kingdom - vascular and non-vascular.

3. 'Land cover' taxa are those which could beexpected to be readily visible on satellite imagery in some parts of their distribution. Such taxa would normally be widespread and abundant perennial or dominant annual species.

4. 'Sites' are plots or small polygons of up to approximately 1 hectare.

5. 'Sub-Projects' are individuals or agencies (or groups of same) working on specific topics relevant to the ERIN Flora Data and Modelling Project on CYP.

6. RDBMS: Relational Database Management System.

7. GIs: Geographic Information System.

8. SIS: Spatial Information System.

REFERENCES

McDonald, R.C., Isbell, R.F., Speight, J.G., Walker, J. and Hopkins, M.S. (1990) Australian Soil and Land Survey. Field Handbook. 2nd edn. Melbourne : Inkata Press.

ERIN Contact Officers: Dr Matt Bolton - Scientific Coordinator -Ecological Processes Environmental Resources Information Network GPO Box 636 Canberra ACT 2601 (06) 250 0375 (ph) (06) 250 0360 (fax)

Dr Ann Bull - Project Officer - Cape York Peninsula Environmental Resources Information Network GPO Box 636 Canberra ACT 2601 (06) 250 0279 (ph) (06) 250 0360 (fax)

Closing dates:

Expressions of Interest for 1991-92 funding: 30 April 1992. Expressions of Interest for collaboration: up to 1 March 1994.

General CYPLUS Enquiries

For further information regarding CYPLUS, please contact:

Tracey Fitzpatrick Natural Resources Management Branch . Department of Arts, Sport, the Environment and Territories GPO Box 787 Canberra Am 2601 phone: (06) 274 1978 59

APPENDIX II Project Information Brochure

PROJECT TITLE:

NR18: FLORA DATA AND MODELLING

Project aims

The purpose of this project is to make information on the flora of Cape York Peninsula (CYP) available to environmental decision-makers through Spatial Information Systems.

Significance

The project will assemble a large proportion of existing site data on the flora of CYP and associated habitat data where available. These data will be turned into ARCIINFO coverages at the best possible scale (mostly 1:l million). These will be analysed to target particular areas for further field sampling, and provide an overview of our current knowledge of the CYP flora and its conservation. The predicted distributions of target taxa, particularly rare, threatened and newly-discovered plant species will be modelled. As part of a national effort to establish national standards for vegetation site data, ERIN will collaborate with botanists to refine and test survey attributes for vegetation, whilst at the same time producing useful data for CYPLUS.

Research Plan

Presence/absence flora species data (and habitat data where available) will be collated into GIS/RDBMS format from existing paper-based records. Where computer-readable data already exists, access rights are being negotiated. The Queensland Government is the major data holder through the Qld Herbarium and the National Parks and Wildlife Service. Significant data holdings also exist in GIRO, universities, other herbaria and with private individuals. Consultancies will be let to collate existing data sets. Data will be amalgamated with CYP flora data from other ERIN continental projects, especially ERIN'S Land Cover Project.

The data will be analysed to assess the adequacy of sampling of various taxonomic and functional groups in relation to the environmental regions (domains) of CYP created in the Environmental Region Analysis Project. One approach will be to determine the existence (or otherwise) of an asymptote in the species-area curve for each taxonomic group in each region. These analyses should give a useful overview of the flora of CYP, and should indicate regions requiring further field surveys. Surveys in the 1993 dry-season will be targetted at inadequately-sampled Environmental Regions. Site data will also be assessed in relation to other regionalisations of CYP, as these become available from CYPLUS/NRAP.

The context and significance of rare, threatened and newly-discovered species will be assessed. Knowledge of species indicating particular habitats or environmental processes will also be useful to decision-makers. The potential distribution of selected species will be modelled from the existing site data and the physical environmental data from the Environmental Region Analysis project, using the latest computer systems available. Results will be supplied to field workers to target particular areas on CYP.

Budget

ERIN will obtain funds for one full-time position to coordinate the collation of flora site data and to perform analyses. Consultancy funds will go to collaborators for collation of specific data sets.

Project Dates

1 March 1992 to 1 March 1994

Planned Outputs By March 1993: Textual digital databases of existing flora presencelabsence data and associated habitat information; data quality checks; ARCIINFO point coverages for locations of sampling sites; ARCflNFO coverages, distribution modelling and adequacy of sampling analyses for rare and threatened, and newly- discovered plant taxa; other material for use in field surveys.

By September 1993: Adequacy of sampling analyses for each life-form group in each Environmental Region; ARCflNFO coverages and distribution modelling for dominant and 'indicator' species;

By March 1994: Some field validation of previous analyses; recent field data integrated into digital databases; sites of high botanical diversity; final predicted distributions and adequacy of sampling analyses; revised conservation codings and key sites for rare and threatened plant species.

Project Supervisor: Dr M.P. Bolton Telephone: (06) 250 0375 Fax No.: (06) 250 0360 6 1

APPENDIX III ERIN Contract for CYPLUS Data

File #

Dear

Re : Contract for Supply of Cape York Peninsula Flora Survey Data by

The Director of National Parks and Wildlife ("the Director"), a corporation established under the National Parks and Wildlife Conservation Act 1975, is prepared to accept the proposal of of to supply flora data for Cape York Peninsula.

CONSULTANCY SERVICES

The consultancy services to be provided are described in Annexure A, B and C to this letter.

TIMING

Data are to be supplied to the ERIN Unit of ANCA (the Australian Nature Conservation Agency, also known as the Australian National Parks and Wildlife Service) progressively as available. The data outlined in Annexure A: Items 1, 2 and 3 are to be supplied in full by 30 September 1993,31 January 1994 and 30 April 1994, respectively. Data in Item 5 is to be supplied two weeks after the Contract is signed.

FEES

An initial fee of $x shall be payable by the Director within 30 days of the commencement of this Consultancy Agreement. A progress payment of $y shall be payable by the Director 30 days after receipt of items scheduled for delivery by 31 January 1994. A final fee of $z shall be payable within 30 days subject to acceptance of the Consultancy Services by the Director, and upon receipt by the ERIN Unit of all stipulated data and correctly invoiced to the Director.

LIAISON OFFICER

The person holding, occupying or performing the duties of the Associate Director, Spatial Information Section in the ERIN Unit of ANCA shall be the liaison officer with the responsibility for supervision of the Contract on behalf of the Director and authority to issue and receive any written notification under the contract.

TERMS AND CONDITIONS OF CONSULTANCY

The attached terms and conditions marked "Annexure B" and entitled GENERAL CONDITIONS FOR CONSULTANCY SERVICES" shall form part of this agreement, as shall the documents titled "Annexure A" and "Annexure C".

ACCEPTANCE

A duplicate of this letter is enclosed with an endorsement that provides for notification of acceptance. If you agree to provide the Consultancy Services as set out in this letter and Annexures A, B and C, your acceptance must be notified by signing, dating and returning the enclosed duplicate letter to the Director. Receipt by the Director of acceptance in writing of these terms and conditions will constitute the entire agreement for the provision of the Consultancy Services.

Yours faithfully ERIN and Corporate Services Australian Nature Conservation Agency

Signed:

For and on behalf of the .

Dated this day of 199x.

Name:

Title: ANNEXURE A

Contract for Supply of Cape York Peninsula Flora Survey Data by

AIM

To makk further data from the available to the Cape York Peninsula Land Use Strategy (CYPLUS) through the ERIN CYPLUS project NR18 "Flora Data and Modelling".

SCOPE OF WORK

1. By , the Custodian will supply ERIN with a digital detailed site records and approximately bobservational sites.

2. By, the Custodian will supply ERIN with c digital detailed site records and approximately d observational sites.

3. By , the Custodian will supply ERIN with e detailed site records and approximately f observational sites.

4. Data for Items 1, 2 and 3 to be sent on a DOS 3.5" disk in a format agreed by the two parties to this Contract.

5. Written progress reports on the project will be received at ERIN by , , and .

ANNEXURE B GENERAL CONDITIONS FOR CONSULTANCY SERVICES

1INTERPRETATION

(1) In these Conditions: (a) "the Director" means the Director of National Parks and Wildlife; (b) "the Contract" means the Contract under which the Consultancy Services are to be provided to the Director including these general conditions; (c) "the Consultancy Services" means the services to be performed under the Contract; (d) "the Consultant" means the party who by the Contract undertakes to provide the Consultancy Services; (e) "Custodian" means the person or body nominated by the Consultant to be responsible for the development and/or management of data or information and who has the right to determine the conditions on which those data or information may be used or released. (9 "Australian National Parks and Wildlife Service" means the Australian National Parks and Wildlife Service established by Section 33 of the National Parks and Wildlife Conservation Act 1975; (g) "ERIN" means the Environmental Resources Information Network established within the Commonwealth Government to provide geographically related environmental information of an extent, quality and availability for planning and decision making by on environmental issues. (2) Words importing a gender include any other gender. Words in the singular number include the plural and words in the plural number include the singular.

2. VARIATION OF AGREEMENT

(1) This Contract constitutes the entire agreement between the parties and supersedes all communications, negotiations, arrangements and agreements, either oral or written, between the parties with respect to the subject matter of this Contract. (2) No agreement or understanding varying or extending this Contract, including in particular the scope of the Services to be provided or fees to be paid, shall be legally binding upon either party unless in writing and signed by both parties.

3. CONTRACT MATERIAL

(1) The title and intellectual property (including copyright) in the record of site data to be supplied under this Consultancy Agreement shall vest with . The intellectual property in all the other reports to be supplied under this Consultancy Agreement shall be considered by both parties to be in the public domain. (2) On the expiration or earlier termination of the Contract, the Consultant shall deliver to the Director a copy of all specified data or data gathered up until the time of such termination.

4. ENHANCEMENTS AND DERIVATIONS

(1) ERIN may create enhanced or derived products from the data for the purposes of ERINICYPLUS. (2) The publication of agglomerated data shall acknowledge the custodians of the original data. (3) The Director undertakes not to publish or transfer to a third party original data without permission of the Custodian. In giving such permission the Custodian shall advise the Director, or his nominee, in writing of any conditions to be observed in the publication or transfer of that data.

5. ACKNOWLEDGEMENT OF FUNDING

Any publication by the Consultant of the material relating to the Contract shall contain an acknowledgement of the funding of the project by ERINICYPLUS.

6. CONFLICT OF INTEREST

The Consultant warrants that, at the date of entering into the Contract, no conflict of interest exists in the performance of its obligations under the Contract. If, during the term of the Contract, a conflict or risk of conflict of interest arises, the Consultant undertakes to notify the Director immediately in writing of that conflict or risk.

7. SECURITY

The Consultant shall, when using the Director's premises or facilities, comply with all security and office regulations in effect at those premises or in regard to those facilities, as notified by the Director.

8. NEGATION OF EMPLOYMENT AND AGENCY

(1) The Consultant shall not represent itself, and shall ensure that its employees do not represent themselves, as being employees or agents of the Director. (2) The Consultant shall not by virtue of this Contract be or for any purpose be deemed to be an employee or agent of the Director.

9. TERMINATION OF CONTRACT

(1) The Director may, by written notice, terminate or constrict the Contract or any part of the Contract and upon such notice being given the Consultant shall cease or reduce work according to the tenor of the notice and shall forthwith immediately do everything possible to mitigate consequential losses. (2) In that event the Consultant may submit a claim for compensation and the Director shall pay to the Consultant such sums as are fair and reasonable in respect of any loss sustained by the Consultant in unavoidable consequence provided that: (a) the Consultant shall not be entitled to compensation for loss of prospective profits; and (b)The Director shall not be liable to pay any sum which, in addition to any amounts paid or due or becoming due to the Consultant under the Contract, would together exceed the full price of the Consultancy Service ordinarily payable under the Contract.

10. DEFAULT If the Consultant fails within 14 days after receipt of written notice, to remedy any default in performance of the following obligations, namely: (a) to commence or to proceed at the rate of progress strictly in accordance with the Contract; or (b) to perform or observe the terms and conditions of the Contract, The Director may, by written notice, terminate the Contract and recover from the Consultant any loss or damage suffered by the Director.

11. UNAVOIDABLE DELAY

A party to this Contract shall not be entitled to exercise its right and remedies upon the default of the other party (whether at common law) or under the Default clause) if that default:

(1) is caused by an act or event beyond the reasonable control of that other party; (2) continues for less than one (1) month; and (3) was not reasonably foreseeable at the time this Contract was entered into.

12. INDEMNITY

(1) The Consultant shall indemnify and keep indemnified the Director from and against any claim, demand, action, suit or proceeding that may be made or brought against the Director or the servants or agents of the Director or any of them in respect of personal injury to or death of any person whomsoever or loss of or damage to any property or any other loss or damage whatsoever arising out of or as a consequence of an unlawful act or a negligent act or omission in the performance of the Consultancy Services by the Consultant or its employees or agents and also from any costs and expenses that may be incurred with any such claim, demand, suit or proceeding. (2) Any such injury, death or damage that is attributable to negligence, omission or default of the Director, officers of the Australian National Parks and Wildlife Service or other servants or agents of the Director shall be specifically excluded from the provisions of the indemnification.

13. REQUIREMENTS FOR PERMITS, LICENCES OR OTHER AUTHORISATIONS

The Consultant shall ensure that any permits, licences or other authorisations are obtained that are required by itself, its employees or agents in undertaking the Consultancy Services.

14. APPLICABLE LAW

(1) The Contract shall be governed by and construed in accordance with the law for the time being in force in the Australian Capital Territory and the parties submit to the jurisdiction of the courts of that Territory. (2) The Consultant shall ensure that the work done under the Contract complies with the laws from time to time in force in the State or Territory in which the Consultancy Services, or any part thereof, are to be carried out. (3) Where the Consultancy Services provided under this Contract may result in income being derived, provision shall be made between the parties for an equitable share of the income to be paid to either party, having due regard to the respective contributions and interests of the bodies and persons contributing or associated with the consultancy services.

ANNEXURE C

1. Data supplied to the ERIN Unit under the terms of this agreement will be made available to the "CYPLUS- family" of agencies through the National Resource Information Centre (NRIC). The "CYPLUS-family" of agencies presently comprises Commonwealth and Queensland Government Departments and Statutory Authorities, including:

C'wealth Dept of Arts, Sport, the Environment and Territories (DASET) Qld Dept of Premier, Economic and Trade Development (QDPETD) C'wealth Dept of Primary Industries and Energy (DPIE) Australian Heritage Commission (AI-IC) Aboriginal and Torres Strait Islanders Commission (ATSIC) Qld Dept of Environment and Heritage (QDEH) Qld National Parks and Wildlife Service (QNPWS) Qld Dept of Primary Industries (QDPI) Qld Herbarium Qld Dept of Resource Industries Qld Dept of Lands Environmental Resources Information Network (ERIN) Australian Geological Survey Organisation (formerly BMR) National Resource Information Centre (NRIC) CSIRO Divsion of Entomology (Australian National Insect Collection) CYPLUS Taskforce, Cairns

These are largely the lead Departments of CYPLUS and agencies involved in the 19 projects of the CYPLUS Natural Resources Analysis Program (NRAP).

2. The custodian undertakes to inform ERIN in writing of any data considered to be sensitive, and of release instructions for same to the "CYPLUS-family" of agencies. APPENDIX IV Data file format

* the order of the fields does not matter as long as we know what the fields represent (provide a listing of the fields and what they represent) * record delimiter can be any character not used in the text * one record per line terminated by carriage return * text fields have a limit of 255 characters * save the document as an ASCII text file * Data to be sent on 3.5" disk in DOS or Mac format or in UNIX tar format on an exabyte tape.

Specimen Data ------Suggested format for each specimen ~ecord......

custodian$custodian~specimen~id$genus$species$infraspecific~rank$ infraspecies$authority$lat~deg$lat~min$latsec$longdeg$longmin$long -sec$ altitude$geocode~accuracy$geocode~determination$altitude~ac~uracy$ altitude~determination$date~collected~day$date~collected~month$ date~collected~year$collector~name$locality$ + any additional fields separated by delimiter

Site Data (if provided) ------...... ------

Site data should comprise two files: one holding the site description data, and the other holding the specimen observation data. These files are linked by a common field 'custodian-site-id' which is a unique number of the site.

For more complete information please refer to the document: 'Core Attributes for Terrestrial Biological Survey Data in Australia' edited by M.P. Bolton, ERIN, version 3.0 October, 1992.

Suggested format for each site description record ------custodian$custodian~site~id$site~recorder$lat~deg$lat~min$lat~sec$long~deg$ long~min$long~sec$altitude$geocode~accuracy$geocode~determination$ altitude~accuracy$altitude~determination$altitude~accuracy$locality$ date~collected~day$date~collected~month$date~collected~year$ + any additional fields separated by delimiter

Suggested format for each observation record ...... custodian~site~id$custodian~obse~ation~id$genus$species$infraspecific~rank$infraspecies$authority$ + any additional fields separated by delimiter

custodian char(7) the custodian holding the specimen record custodian-specimen-id char(l0) custodian's unique number of the specimen custodian-site-id char(l0) custodian's unique number of the site genus char(30) species char(40) species-author char(100) infraspecific-rank char(7) infraspecifies char(40) infraspecies-author char(100) lat-deg number(2) lat-min number(2) lat-sec number(2) long-deg number(3) long-min number(2) long-sec number(2) altitude number(5,l) geocode-accuracy number(7) estimate of accuracy of geocode in metres e.g. '1000' ...... radius of 1000 metres geocode-determination char(30) how the geocode was determined e.g. gps, top0 map altitude-accuracy number(4) estimate of accuracy of altitude in metres e.g. '10' ...... +/- 10 metres altitude-determination char(30) how the altitude was determined e.g. altimeter, gps, dem, top0 map date-collected-day number(2) day as a 2 digit number date-collected-month char(3) month as a 3 letter code e.g. 'MAR' date-collectedyear number(4) year as a 4 digit number e.g. '1991' locality char(255) free text description. 255 character limit. If need more space then use 'locality-1 ' and 'locality-2' fields. collector-name char(65) name of the collector of the specimen site-recorder char(65) name of the recorder of the site information. 69

APPENDIX V Data Loadinglvalidation Process at ERIN

Purpose

This document explains the data loading and validation procedures at the ERIN Unit.

Contents

Where validation reports fit into the scheme of things

What is the Purpose of the reports? What changes does ERIN make to the specimen records? Who do I contact in ERIN regarding these reports and changes? How does ERIN want to be notified of errors?

Exulanation of validation tests and codes

Order of validation events and allocation of validation codes Detailed description of validation tests Data loading tests Offshore test Bioclim outlier detection test

Tvves of reports and how to use them

Data loading report - one report for the whole data set Validation map, Specimen Detail report and Suspect Specimen report - a set for each species in the data set. When each report is produced How to use these products

Introduction

The quality of geocode and elevation information attached to specimen records varies within a custodian's database and between custodians databases. To use information from all these sources, ERIN conducts quality checks. Records that pass these tests can then be used in analyses with a baseline confidence in the data. Records that do not pass these tests may not necessarily be in error. As environmental information is a valuable resource we are interested in assisting custodians improve the quality of their data. Although ERIN finds errors with specimen records we are reticent to change the original information provided by the custodian. This policy avoids the otherwise possible situation of you seeing your records presented in an analysis, identifying the source as your database, but in a location that is not the same as in your database.

ERIN does alter the original custodian records in the following circumstances. ERIN will add an elevation from a Digital Elevation Model (DEM) if the custodian has not supplied one. Many custodian's databases put a zero value for altitude when in fact they do not know the altitude. This problem renders a lot of records unusable for modeling species distributions. To get around this problem the best we can, we update all records where the custodian's elevation is zero with an elevation from the DEM, if the value from the DEM is greater than zero. Whenever this is done, a flag is placed in the database to indicate that the altitude value was added by ERIN from the DEM.

ERIN produces a report (with maps) for sending to each custodian, with the intent that they may prove instrumental in the identification and correction of records in error in their databases.

Explanation of validation tests and codes

Order of validation events and allocation of validation codes Point data records go through a series of validation tests at ERIN.

When a set of data is received from a custodian, it is loaded into a database table just created for that data set and some cleaning procedures conducted, to ensure all of the data is in a consistent format. After this, the data is loaded from the data set table into the central ERIN Specimen table, which contains records from all data sets. During this process, the taxonomic names are checked against the names in the Australian Plant Name Index (APNI) (i.e. originally obtained from the Census ofAustralian Vascular Plants (CAVP) and updated by the ANBG) and the Census ofAustralian Vertebrate Species (CAVS). At the same time, records are tested for presence of a latitude and longitude and a collection date inside valid ranges. Records which fail any one of these tests are listed in an error report and are not loaded into the Specimen table. Records that pass enter the Specimen table and at this stage have no value in the validation status field.

More tests are conducted once records are in the Specimen table. The first test checks if the position of the record is on mainland Australia or Tasmania. This is referred to as the "Offshore test". At this point elevation is derived from the 1140th of a degree Digital Elevation Model (DEM) if one has not been supplied by you, or if you originally supplied a zero value for elevation and the DEM had a non zero value. The validation status field is updated for each record checked, with a "M' for those that passed and "0"for those that failed.

The second test conducted after records reach the Specimen table is the BIOCLIM outlier detection test. This test is not conducted per data set, but across data sets, targeting all the records in the Specimen table for each species. Species that have less than three records are not tested as the outlier detection procedure is invalid for this number of participating records. All participating records are updated at the end of the test. Those that were found to be an acceptable climatic domain of the set receive a validation status of "V" (for valid) and outliers are assigned "S' (for suspect).

This test is conducted for each species, using all the records of that species in the ERIN Specimen table. Latitude, longitude and elevation of records are input to the test. BIOCLIM (Bioclimatic Prediction System) determines the values of 15 climatic variables for each record and then the statistics on each variable for the species. An additional subroutine, developed in ERIN then compares the climate variable values for each record with the mean value and declares records which are outliers as potentially in error or "suspect". A record only needs to be an outlier according to one climate variable to be considered suspect.

The outlier detection subroutine produces information for each suspect record, describing which climate variables the record failed the test on and if the climate variable value of that record was far above or far below that of the rest of the records in the set. This information is presented in the suspect specimen report.

Tvues of revorts and how to use them

This section provides explanatory remarks to assist your interpretation of the information on maps and in reports.

Data Loadina Report - one report for the whole data set

Obvious spelling errors have been corrected and the error report annotated:

aus buss has been altered to aus bus

This is followed by a list of records that have been altered in the form:

ERIN-id species name (as mispelt, e.g. aus buss) your-id ERIN-id species name (as mispelt, e.g. aus buss) your-id ERIN-id species name (as mispelt, e.g. aus buss) your-id ERIN-id species name (as mispelt, e.g. aus buss) your-id etc.

Errors that are not easily corrected remain in the report as: No match for aus bus

This is again followed by a list of records for which there is no match:

ERIN-id species name your-id ERIN-id species name your-id ERIN-id species name your-id ERIN-id species name your-id etc.

A third category of changes is where a name is automatically altered because of built in synonymy in APNI or CAVS. This will produce a message such as:

WARNING - Taxon name changed, because specified name not current

This is again followed by a list of records which have been altered:

ERIN-id species name your-id ERIN-id species name your-id ERIN-id species name your-id ERIN-id species name your-id etc.

The report also contains records for which there is no latitude or longitude, or for which the date is outside valid ranges, etc.

Validation man, Specimen Detail report and Suspect Specimen renort - a set for each species in the data set

These products are designed to give you a complete picture of the species records which are in the ERIN Specimen table from your data set. Records that did not pass the data loading tests were not loaded into the specimen table, so did not participate in further validation tests and therefore do not appear in the following map or reports. Apart from the specimen report, products are produced for a species in the following circumstances.

When each nroduct is produced

Produced for species with specimens that are offshore or suspect and are located in the geographic window defined by latitude -45 to -10 degrees and longitude 112.5 to 155 degrees. Therefore, a map is not produced for species with only valid records and if a species only has suspect or offshore records outside the geographic window.

Specimen Reporl

Produced for each species

Suspect Specimen Report

Produced for species with suspect records (records with a validation code of "S").

How to use these products

Each map includes species records from your data set that were found to be valid, suspect or offshore and this species records from other data sets that are valid. All points on the map are inside the geographic window of latitude -45 to -10 degrees and longitude 112.5 to 155 degrees.The key on the map contains the species name and the data set the maps were produced for. The reason why records from other data sets are also on the map is to provide more information for you to understand why records were determined suspect. On the map suspect records are ones marked with a cross and are not in the ocean or on islands. These records failed the outlier detection test. As this test is conducted on all records for the species (including those from other data sets) and the determination of suspect and valid records is dependent on all the records in the set, it is useful to know the number and location of records from other data sets that were not considered outliers. This is especially useful on the rare occasion when a number of records in your data set are suspect and none or very few are valid.

The map is scaled to include some contextural features (roads, state boundary, coastline) but be a large enough scale to distinguish individual sites. Primary and Secondary roads data used on these maps was supplied to us by AUSLIG at a scale of 1:2.5 million. AUSLIG are currently working on a finer resolution (1:100 000) data set of roads Australia wide as part of their Australian Geographic Data Base project. This data will be used instead when it is available.

On the map, suspect and offshore records are labelled with an ERIN database ID that you can use to look up the information on these sites in the specimen detail and suspect specimen reports. Both these reports have the records ordered on this ID. The ERIN database ID is unique for each record in the Specimen table and is a number allocated for the purposes of managing these records in the ERIN relational database schema. We would prefer to use your institution ID, but as yet they are unreliably managed by some custodians. In some cases an institution ID is not allocated to all records and in others is not unique for each record. To be able to definitely link points on the map with records in the reports we have used the ERIN ID.

The specimen report contains all records for a species in your data set that were loaded into the ERIN Specimen table. Your institution record ID is given on this report, along with the latitude, longitude, elevation and locality. The ERIN validation code is provided so you know which checks each record passed or failed (see Explanation of validation events and codes section). Some species specimen records have no "V" or "S" validation codes as the species had less than 3 records on the mainland or in Tasmania (validation code "M") and so could not be checked with the BIOCLIM outlier detection test.

The latitude and longitude are in decimal degrees that were derived from the values you supplied to us in degrees minutes and seconds. We realise that it would be better for you to see your original values on this report and will endeavour to supply this in the next version of the report.

Records that are suspect on the specimen report are listed on the suspect specimen report. Use the ERIN LD to associate records on this report with records on the specimen detail report and the map. The 15 codes given for each suspect record came from the BIOCLIM outlier detection test. On the suspect specimen report, a code is given for each climatic variable for each record. The code indicates the placement of that records value for a climatic variable with respect to the rest of the records in the set. An "L" code indicates the value was an outlier, far below the rest of the set. A "U" code indicates the record was an outlier, far above the rest of the set. The "-" code means the climate variable value was not unlike that of mist records in the set. We thought this information would help you to detect if a record in error by giving you background on why the record failed the test. Status of the validation system at ERIN and planned improvements

The process of conducting validation, producing reports and handling updates is new and will have some problems. This system was devised to assist in improving the quality of specimen data for the benefit of both custodians and other data users. Therefore we are keen to see the system work well for your organisation. Suggestions on improvements on any part of the system from methods to layout of reports will be well received. APPENDIX VI Data Acquisition Contracts

In the CYPLUS NR18 'Flora Data and Modelling' project, several contracts (Table 1) were let to database existing (pre-CYPLUS) data from plant specimens in herbaria and/or from site surveys. Several relevant datasets were acquired under other ERIN projects (Table 2).

Table 1: CYPLUS funded flora contracts let by ERIN under project NR 18.

Table 2: ERIN funded contracts directly relevant to CYPLUS project NR 18.

Data on the northern and southeastern areas QLD Herbarium 26 000 1990-91 of the State, etc. 10 000 1991-92 Database 26 000 eucalypts and 16 000 grass National Herbarium of 85 000 1990-91 records NSW 35 000 1991-92

grasses, native pines and sheoaks 28 000 1991-92 l!Btabase records collected from 6 000 seed Australian Tree Seed 8 000 1990-91 collection sites Centre, CSIRO Forestry 1 500 1991-92 Databasing of Eucalyptus collections National Herbarium of 31 000 1992-93

Callitris and Melaleuca Herbarium 12 806 1993-94 APPENDIX VII CYP FLORA/FAUNA METADATA INSTRUCTIONS

High-level Instructions

The overall procedure is to:

1. Edit DD&C entry for source datasets (optional).

2. Perform a report into a text file, using Matt's SQR script a. Create a new subdirectory; move to it b. At the command prompt, type: sqr cyp-meta -MERIS-sqrsize.dat c. log onto the database as prompted d. Enter exact dataset name; version no. is usually 1.0; press return for a dozen or so irrelevant prompts.

3. Rename and edit the output file called cyp-meta.lis; say regions to r-xxxxxx.lis (813 characters to go over to DOS). [This procedure was a precaution because of ERIN'S move from Belconnen to Civic, at the time of implementation.]

Edit according to instructions and priorities (in files). The idea here is to do it according to needs a user of the CYPLUS GIs, not ERIN.

4. Copy file to PC, using FTP or soft-PC. Further edit during move!

5. Supply text file to CYPLUS GIs for use with data.

6. Import text file into word-processor to tart up and print out as part of NRAP project reports.

7. Updatelupgrade DDC, as required, perhaps parsing text file for reloading into DDC.

Structure of Metadata File

The CYP floralfauna metadata file for each source dataset is a text file structured with two data types, viz: Coded fields Text fields (free text or pick list)

For coded fields, use the format "Field Name = Code Value" or "Value". The codes often derive from short pick lists on ERIS DDC, but many were modified to suit a stand-alone CYPLUS GIs.

For text fields, start "Field Name:" on one line and begin the text on the next line. Some of these fields derive from a long pick list in ERIS DDC but the possibility of editing these for the CYPLUS GIs is left open, here, because of possible misspellings, etc. Other text fields are free text, either derived from ERIS DDC or created de novo by ERIN project staff. 'x: Put one blank line between each field, whether coded or free-text, except before headings, where two blank lines should be added.

Headings for major sections of metadata are in capital letters, with or without underlining in the various text and word-processor versions.

Legend for following section

{Matt's discussion about field contents.) [../../../..I valid values - select one code, only. Field Names and Allowable Values

Date: e.g. [I8June 19941 {Automatically-generated by SQR program)

Fixed header Information [Environmental Resources Information Network

DATA SET REPORT {Automatically-generated by SQR program)

Data Set Name and Version Number: {Centered under DATA SET REPORT) {Automatically-generated by SQR program)

Data Set Name = Coded Value {Short title - as currently used within ER1N;need better standardisation, but difficult to change this one because other tables linked to it.)

Version = Number Value Allowable values = [1.0/2.0/etc.]

Theme = Coded Value Allowable values = [FLORA/FAUNA/REGIONS/OTHER/UNKNOWN] {*Not* as DD&C!) Definitions - FLORA - data or information on the plants or botany of a region, often sampled at various point locations or small areas FAUNA - data or information on the animals or zoology of a region, often sampled at various point locations or small areas REGIONS - division of an area into smaller regions, based on some environmental, administrative or other criterion OTHER - not as above UNKNOWN - not known at this stage.

Main Project Number = Coded Value Allowable values = [NR01/NR03/NR09/NR11/NR17/NR18/NR19] {This is a field to describe the source of the data in the CYPLUS GIs. Credit only *one* approriate CYPLUS NRAP project, since ERIN is forwarding data from other NRAP projects. Only data entered by ERIN or compiled from a non-NRAP source could be stamped with NR11, NR18 or NR19.) Definitions - NROl - Cape York Peninsula Vegetation Mapping NR03 - Terrestrial Fauna Survey NR09 - Wetland Fauna survey NRll - Environmental region Analysis NR17 - Insect Fauna survey NR18 - Flora Data and Modelling NR19 - Fauna Distribution Modelling

Secondary Project Number = Coded Value Allowable values = [NR01/NR03/NR09/NRll/NR17/NR18/NR19/N/A] {more occurrences of ERIN project #'s, here! Definitions as above.)

Data Set Title (expand "Data Set Name"): Free Text {As on DD&C; a stand-alone title; expand acronyms.) Data Set Description (Abstract): Free Text {As in DD&C under "Description"but with more-explanatory field name, here.)

Custodian : Text - Long Pick List {As in DD&C)

Address : Text - Long Pick List (multi-line.) {Text: As in DD&C)

Phone = Value {As in DD&C)

Fax = Value {As in DD&C)

Custodianship Comments : Free Text {Additional space to describe unusual custodianship arrangements, especially cases of multiple custodial interests in a dataset.)

Section Heading: DATASET SIZE AND EXTENT

Data Type = Coded Value Allowable values = [POINTPOINT & CELLlCELL/LINEPOLYGON/RASTER/TEXT/OTHER] {Similar to DD&C) Definitions - POINT - data is from a point location or small area; represented by the point data type in a GIs. POINT & CELL - both Point and Cell data in dataset CELL - complex data relates to a regular grid over the area of interest; data type could actually be Point, Raster or Polygon in a GIs LINE - linear data type POLYGON - data attached to any irregular (closed) shape RASTER - remote sensing or simple cell-based data, eg.g as in Arcnnfo GRID module TEXT - structured text as in a RDBMS or free text OTHER - not as above

Aggregation Level = Coded Value Allowable values = [PRIMARYINEAR PRIMARY/COMPOSITE/lNTERPRETED/MODELLED] Definitions - PRIMARY - as collected by person or instruments in field or by remote sensing NEAR PRIMARY - slight aggregation (e.g. geocode fuzzied to a particular resolution) COMPOSITE - several source datasets amalgamated by some logical grouping into a larger dataset (e.g. all Amphibia of CYP from all source datasets) INTERPRETED - data resulting from some manual interpretation, usually by an expert in the data and theme MODELLED - data resulting from some standard analytical procedure.

Data Set Size (as delivered to the CYPLUS GIs) : Free Text {As in DD&C, but check, because info may need updating. Include #taxa, #unique locations (sites) and #obsemations/specimens. A candidate for turning into a coded field!) Spatial Extent: Allowable values = [Cape York Peninsula Land Use Strategy Study Area/EXTRA- CONTINENTAUAUSTRALIAlQUEENSLAND/NORTHQUEENSLANDIOTHER) {As in DD&C, but with a few extra options; note field name change from Regions(s) to Spatial Extent (as suggested by CSIRO DWE (Belbin et al., undated)). Definitions - Cape York Peninsula Land Use Strategy Study Area - see McColm et al. (1993) EXTRA-CONTINENTAL - both within and outside the continental landmass of Australia AUSTRALIA - within the continental landmass of Australia QUEENSLAND - within the State of Queensland NORTH QUEENSLAND - Queensland North of the Tropic of Capricorn OTHER - not as above.

Temporal Extent : Free Text {As in DDC, under "Period-of-Observation", if present. Use some of ideas of CSIRO DWE under 'Collection Period'.)

Taxon Extent : Free Text {Enter main taxonomic groups involved in the source dataset.)

Fields Supplied : Free Text {See Appendix VIb??)

Other fields : Free Text {List of fields in source dataset provided by Custodian, but not passed on to CYPLUS GIs)

Section Heading: DATASET USAGE

Data Set Use Code = Coded Value Allowable values = [PUBLIC DOMAINICYPLUS FAMILY/RESTRICTED/VARIOUS] Definitions - PUBLIC DOMAIN - available to the public without restriction, other than due acknowledgement; use this value instead of "ALL" in ERIN'S DDC. CYPLUS FAMILY - agencies which have signed the CYPLUS GIs Memorandum of Understanding or a Land Use Program contract RESTRICTED - release to parties, even in the CYPLUS family, has been restricted by the data Custodian VARIOUS - parts of the dataset are restricted.

Release Instructions: Free Text {Similar to DD&C; rewrite from point of view of CYPLUS GIs - e.g. "Point data not to be released to third parties without permission of Custodian."}

More-Detail at Custodian = Coded Value Allowable values = [YESINO] {Does the Custodian have a more-detailed version of this data set or are there further explanatory details available?}

Basis of ERIN involvement = Coded Value Allowable values = [ERIN-CYPLUS CONTRACTEXCHANGE LETTERSINRAP PROJECTERIN CONTRACTJERIN PROJECT/OTHER/UNKNOWN] {Basis of ERIN involvement in the dataset in relation to the Custodian.) Definitions - ERIN-CYPLUS CONTRACT - all or most of the data subject to a standard ERIN contract with CYPLUS Attachment C. (see Appendix lb) EXCHANGE LETTERS - Custodian and ERIN reached a formal agreement through an exchange of letters for supply of CYP data to the CYPLUS GIs NRAP PROJECT - data collected by a non-ERIN CYPLUS NRAP project and forwarded to CYPLUS GIs through ERIN ERIN CONTRACT - all or most of the data subject to a standard ERIN contract for a non-CYPLUS ERIN project ERIN PROJECT - data from an ERIN project not subject to a formal contract OTHER - not as above UNKNOWN - not known, at this stage.

Acknowledgement : Free Text {As in DD&C. This field is very important! Suggest text for acknowledgement of Custodian, individuals and ERIN.}

Copyright = Coded Value Allowable values = [WITH CUSTODIANJOTHER,SEE CUSTODIAN] Definitions - WITH CUSTODIAN - Custodian specified under "Custodian", above, holds copyright. OTHER, SEE CUSTODIAN - Some other copyright arrangement applies to this dataset.

Caveats : Free Text {As in DD&C - Maybe here, put text to be displayed on output maps.)

Section Heading: DATA QUALITY

Reliability = Coded Value Allowable values = [VERY GOOD/GOOD/POOR/SUSPECT/UNKNOWN/VARIABLE] Definitions (c.f. Arthur's in ERIN'S experimental WWWeb) - COMPREHENSIVE - meets its specification over a whole topic area VERY GOOD - recommended for a wide range of uses, e.g. assemblage analysis GOOD - recommended for several uses, e.g. species-based modelling POOR - recommended as a source of hypotheses to further test with better-quality data SUSPECT - quality is suspect - similar to VARIABLE UNKNOWN - quality not known VARIABLE - dataset includes patches of high and low quality data, subsetting would normally be required peior to particular analyses.

Reliability Comments : Free Text {As in DD&C. Make sure to include something on general identification methods used, here: e.g. "vouchered survey by experienced botanists"}

Sampling Unit = Coded Value Allowable values = [LARGE AREA BASED, > 1 SQ KMJSITE BASEDISMALL AREA BASED, < 1 SQ KMJSPECIMEN BASEDJVARIOUS] {As in DD&C) Definitions - LARGE AREA BASED, > 1 SQ KM - data attributed to a large spatial unit SITE BASED - survey data from plots or small transects within a lanform element; see McDonald et a1 (1991) for definition of a "site" SMALL AREA BASED, < 1 SQ KM - similar to site, but may cover more than one landform element SPECIMEN BASED - single specimen from a point locality INCIDENTAL OBSERVATION - single observation from a point locality.

Sampling Code = Coded Value Allowable values = [NOT APPLICABLE/OPPORTUNISTIC/TAXONOMIC PURPOSES/ECOLOGICAL SURVEYISTRATIFIED SURVEY/EXPERIMENTAL/RANDOM/STRATIFIEDRANDOM/REGULAR/VARIOUS/OTHER] {New field suggested, under another name, by CSIRO DWE.) Definitions - NOT APPLICABLE -self explanatory OPPORTUNISTIC - occasional sightings, often accidental, such as road kills of fauna. TAXONOMIC PURPOSES - main purpose of collecting dataset is for taxonomic purposes; may be some vouchering of ecological survey and experimental material. ECOLOGICAL SURVEY - Ecological survey of a region; sampling approach not specified. STRATIFIED SURVEY - Ecological survey of a region; sampling of representative vegetation types subsampling not specified or selective. STRATIFIED RANDOM - Ecological survey of a region; sampling of representative vegetation or habitat types. RANDOM - Ecological survey of a region; true random sampling. REGULAR - sampling by a regular pattern, such as points on a grid. EXPERIMENTAL - Data from ecological experiments; could be from a Long Term Ecological Research site. VARIOUS - One or more of the above sampling strategies in the dataset.)

Sampling Methodology : Free Text {As in DD&C)

Section Heading: LINEAGE

Supplier : Text - Pick List {Shorter version of Custodian, as in DD&C)

Received @ ERIN = Value {Date field as in DD&C)

Lineage Comments : Free Text { Statement of what we have done to the source table at ERIN. e.g. "-112193: Loaded only those specimens matching CAVs version 8.1. 15/6/94: Fuzzied point locations to nearest 3 minutes (approx. 5.5 km)" See also CYPLUS GISIQLD Dept of Lands guidelines.)

Section Heading: POSITIONAL ACCURACY

Scale = Value {Suggested output scale; usually not supplied for point-based floralfauna data.)

Positional Accuracy Comments : Free Text {As in DD&C under Geocode Comments, Geocode Determination and Geocode Precision. (Edit into readable text!) See also CYPLUS GISIQLD Dept of Lands guidelines.)

Section Heading: ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Free Text {This is a tricky one. See CYPLUS GISIQLD Dept of Lands guidelines. Maybe refer elsewhere in metadata, e.g. "See under Reliability Comments". "No particular tests performed." would be a common entry!)

Section Heading: LOGICAL CONSISTENCY

Consistency Comments : {Text: See CYPLUS GISIQLD Dept of Lands guidelines. "No particular tests performed." would be a common entry!)

Section Heading: COMPLETENESS

Completeness Comments : Free Text { See CYPLUS GISIQLD Dept of Lands guidelines.)

Section Heading: OTHER

Data Set Comments : Free Text {Keep this for extra comments. Check that contents of DD&C is current and useful, here, for CYPLUS GIs.) APPENDIX VIII Documentation (Metadata) on Source Flora Datasets

APPENDIX VIIIa CYPLUS - ANBG CBG Flora 1.0

12 July 1994 Environmental Resources Information Network

DATA SET REPORT CYPLUS - ANBG CBG Flora 1.0

Data Set Name = CYPLUS - ANBG CBG Flora

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = N/A

Data Set Title (expand "Data Set Name") : Cape York Peninsula Land Use Strategy - Australian National Botanic Gardens Canberra Botanic Gardens Flora

Data Set Description (Abstract): Plant specimen records for the Cape York Penriinsula Area from IBIS, the database of the Canberra Botanic Gardens at ANBG.

Custodian : ACT Australian National Botanic Gardens

Address : Australian National Botanic Gardens GPO Box 1777 CANBERRA ACT 2601

Phone = (06) 250 0450

Fax = (06) 250 9599

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 1730 specimen records in total; 791 loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 791 specimen records.

Spatial Extent (in source data set) = AUSTRALIA Temporal Extent : 1962 - 1993

Taxon Extent : All records in CYPLUS study area.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = PUBLIC DOMAIN

Release Instructions : The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = OTHER

Acknowledgement : Cite "Australian National Botanic Gardens Herbarium Collection, ANCA."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Records seem to have been databased to a high standard, but the reliability of the original specimen records is, as awlays, questionable.

Sampling Unit = SPECIMEN BASED

Sampling Code = N/A

Sampling Methodology : N/A

LINEAGE

Supplier : ACT Australian National Botanic Gardens

Date Recieved @ ERIN = 27-MAR-1994

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; all records north of 16.5 deg. S supplied to CYPLUS.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination unknown; geocode sources from collectors and unknown sources; geocode precision is mostly 1,000m. No records have been processed through ERIN's geocode validation procedure.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : All records for CYPLUS study area provided a,geocode was available. APPENDIX Vmb CYPLUS - Gitay Flora :Archer 1.0

22 June 1994 Environmental Resources Information Network

DATA SET REPORT CYPLUS - Gitay Flora : Archer 1.0

Data Set Name = CYF'LUS - Gitay Flora : Archer

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondaq Project Number = N/A

Data Set Title (expand "Data Set Name") : Cape York Peninsula Land Use Strategy - Dr Habiba Gitay Flora for Archerbend and Jardine National Park, Cape York Peninsula

Data Set Description (Abstract): Vegetation information gathered for the Archerbend and Jardine National Park area and the surrounds on a site by site basis collected by Dr Habiba Gitay and by Mr Mick Godwin.

Custodian : Dr. Habiba Gitay

Address : Ecosystem Dynamics Group Research School of Biological Sciences Institude of Advanced Studies Australian National University CANBERRA ACT 260 1

Phone = (06) 249 5593

Fax = (06) 249 5095

Custodianship Comments : Mr Mick Godwin, Queensland Department of Environment & Heritage is joint custodian of the Jardine National Park part of the data set.

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRJMARY

1049 specimen records in total; 167 sites; 780 records loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 780 specimen records; number of sites unknown. Spatial Extent (in source data set) = Cape York Peninsula Land Use Strategy Study Arm Queensland

Temporal Extent : 1986 - 1987

Taxon Extent : Vascular (higher) plant taxa (emphasis on trees and shrubs). For Archerbend sites, grasses and herbs were rarely collected.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : For Archerbend, other fields include density of trees and size. For Jardine, only the number of trees per hectare was estimated. Please contact ERIN or the Custodian for details and other fields.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to third party without permission from the Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = Em-CYPLUS CONTRACT

Acknowledgement : Cite "Dr Habiba Gitay, Australian National University, Canberra and Mr Mick Godwin, Queensland Department of Environment and Heritage, Cairns. Databasing funded by the Environmental Resources Information Network."

Copyright = OTHER, SEE CUSTODIAN

Caveats: The data has been assumed by ERIN to be correct as received from the data supplier. Nomenclature has been updated where possible in 1993-1994.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Field data collected and databased by experienced professionals.

Sampling Unit = SlTE BASED

Sampling Code = ECOLOGICAL SURVEY

Sampling Methodology : Aerial photographs and geology maps were used to preselect the sites, although advantage was taken to note any unusual species or assemblages. The sites were 20x20m and most were surveyed on foot.

LINEAGE Supplier : Dr. Habiba Gitay

Date Recieved @ ERIN = 01-MAY-1994

Lineage Comments : This data set was supplied as two separate data sets (1.Archerbend National Park, 2.Jardine National Park) which were combined into one ORACLE table and loaded into ERIS. Taxon names checked against John Clarkson's plant list for CYP.

POSITIONAL ACCURACY

Positional Accuracy Comments : Location of sites estimated by counting paces from a known point. The site location was determined by the position on a 1:100,000 map and where possible in conjunction with aerial photographs. Geocode precision varies from 100m (756 records), 200m (9 records) and 500m (15 records). Most geocodes given in degrees, minutes, seconds. No records have been processed through ERIN'S geocode validation procedure.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Material identified in field and by collecting herbarium samples which were identified later by Mick Godwin and verified by the Atherton Herbarium. Some held in Mick's herbarium with copies in Atherton and Queensland Herbarium.

LOGICAL CONSISTENCY

Consistency Comments : The data was collected primarily to produce a map based on land units or broad vegetation types. Infrequently encountered species or assemblages were recorded.

COMPLETENESS

Completeness Comments : Other data exists and is based on vegetation types. Ross Hynes (QLands) also has some data for the area. Other data is held by Godwin and Gitay for 1993 collection period.

OTHER

Data Set Comments : Recieved as part of the CYPLUS / Strie Pty Ltd. contract. APPENDIX VIIIc LANDCOVER - Atherton herbarium 1.0

20 June 1994 Environmental Resources Information Network

DATA SET REPORT LANDCOVER - Atherton herbarium 1.0

Data Set Name = LANDCOVER - Atherton herbarium

Version = 1.0

Main Project Number = NR18

Secondary Project Number = N/A

Data Set Title (expand "Data Set Name") : Plant specimen records from the CSIRO Australian National Herbarium - Atherton.

Data Set Description (Abstract): Specimen records form the Atherton Herbarium.

Custodian : ACT CSIRO Division of Plant Industry, Australian National Herbarium

Address : Australian National Herbarium Division of Plant Industry CSIRO GPO Box 1600 CANBERRA ACT 2601

Phone = (06) 246 5113

Fax : (06) 246 5249

Custodian Comments : Bernie Hyland at the CSIRO Tropical Forest Research Centre, Atherton, is the effective Custodian of this dataset.

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = NEAR PRIMARY

Data Set Size (as delivered to ERIN) : 94,366 specimen records in total; 65,629 records loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 22,062 specimen records

Spatial Extent (in source data set) = Australia Temporal Extent : 1802 - 1992

Taxon Extent : Rainforest plant taxa.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contect ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to any third party without permission of Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN CONTRACT

Acknowledgement : Cite "Australian National Herbarium, Atherton, CSIRO Division of Plant Industry."

Copyright = WITH CUSTODIAN

DATA QUALITY

Reliability = UNKNOWN

Reliability Comments : Probably typical of most herbarium data - more reliable location data and identifications from the 1950's onwards.

Sampling Unit = SPECIMEN BASED

Sampling Code = TAXONOMIC PURPOSES

Sampling Methodology : Specimens of interest to taxonomists were collected, dried and pressed and lodged in Herbarium. May be some vouchers from ecological surveys and experiments.

LINEAGE

Supplier : QLD CSIRO Division of Plant Industry, Atherton Herbarium.

Date Recieved @ ERIN = 01-MAY-1992

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN'S geocode validation performed on tree taxa; all records north of 16.5 deg. S supplied to CYPLUS. POSITIONAL ACCURACY

Positional Accuracy Comments : The age of many of the records leads to more unreliable geocode information. These were determined using maps; geocode sources is unknown; geocode precision is 1,000m for all records.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Taxon name from identification of specimen in herbarium by an experienced taxonomist. Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson7sList. The taxon names may be out of date, due to recent taxonomic revisions, descriptions and redeterminations. Data quality summary: mainland and islands 20,391 records; offshore 1,090 records; of the mainland and island records, 12,905 were valid and 1,298 were suspect.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Current contract with ERIN complete. No particular tests performed at ERIN. APPENDIX VIIId LANDCOVER - Aust Nat Bot Gard 1.0

12 July 1994 Environmental Resources Information Network

DATA SET REPORT LANDCOVER - Aust Nat Bot Gard 1.0

Data Set Name = LANDCOVER - Aust Nat Bot Gard

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NIA

Data Set Title (expand "Data Set Name") : Plant specimen records of Landcover taxa from the herbarium of the Australian National Botanic Gardens

Data Set Description (Abstract): Landcover data from the Australian National Botanic Gardens. incl. Eucalyptus/Angophora; Poaceae; Melaleuca; Callitris; Casuarinaceae; Tasmanian taxa; Chenopodiaceae pp.

Custodian : ACT Australian National Botanic Gardens

Address : Australian National Botanic Gardens GPO Box 1777 CANBERRA ACT 2601

Phone = (06) 250 0450

Fax = (06) 250 9599

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 15,062 specimen records in total; 12,784 loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 99 specimen records.

Spatial Extent (in source data set) = Australia Temporal Extent : 1950 - 1991

Taxon Extent : Casuarinaceae; Myrtaceae; Poaceae

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = PUBLIC DOMAIN

Release Instructions : The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN CONTRACT

Acknowledgement : Cite "Australian National Botanic Gardens, Centre for Plant Biodiversity Research, ANCA."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Well curated.

Sampling Unit = SPECIMEN BASED

Sampling Code = N/A

Sampling Methodology : N/A

LINEAGE

Supplier : ACT Australian National Botanic Gardens

Date Recieved @ ERIN = 20-JAN-1992

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN'S geocode validation performed on tree taxa; all records north of 16.5 deg. S supplied to CYPLUS. POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination unknown; geocode sources from collectors and compilers of data set; geocode precision mainly to 1,000m and 10,000m.

ATIlUBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching EFUN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Identifications generally carried out by taxaonomic specialist. Data quality summary: 91 mainland records; 8 offshore records; of mainland records validated, 78 were valid, 7 were suspect.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Current contract with ERIN complete. No particular tests performed at ERIN. APPENDIX VIIIe CYPLUS - Bunt Mangroves 1.0

21 June 1994 Environmental Resources Information Network

DATA SET REPORT CYPLUS - Bunt Mangroves 1.0

Data Set Name = CYPLUS - Bunt Mangroves

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NR06

Data Set Title (expand "Data Set Name") : CYPLUS - Australian Institute of Marine Science mangrove survey data, collected and databased by Dr John Bunt.

Data Set Description (Abstract): CYPLUS - Australian Institute of Marine Science mangrove survey data, collected and databased by Dr John Bunt. Data was collected from surveys of eastern CYP. custodian : QLD Australian Institute of Marine Science

Address : Australian Institute of Marine Science PMB No 3 TOWNSVILLE MAIL CENTRE QLD 4810

Phone = (077) 78 92 11

Fax = (077) 72 5852

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 5430 specimen records in total; 30 species; 5082 records loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 5,082 specimen records

Spatial Extent (in source data set) = Cape York Peninsula Land Use Strategy Study Area Temporal Extent : 1978 - 1984

Taxon Extent : Mangrove vascular (higher) plant species

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : The data cannot be given away or resold without permission of the Australian Institute of Marine Science. If the data are used to produce maps and other products, a statement acknowledging that the data were supplied by AIMS will be displayed with the product.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN-CYPLUS CONTRACT

Acknowledgement : Cite "The Australian Institiute of Marine Science are Custodians of the data.Data collection and databasing by Dr J. Bunt. Databasing was funded by CYPLUS, through the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Source of data is John Bunt's fieldwork whilst employed at AIMS over the period between 1978 and 1984.

Sampling Unit = SITE BASED

Sampling Code = ECOLOGICAL SURVEY

Sampling Methodology : The surveys were undertaken by recording all species present within sites at measured distances along compass transects from open water to tidal limits at locations within each riverine estuary. More detail on survey design is probably available from publications by Dr Bunt, or from contacting him directly (Bexhill Consultancies). The size of the sites is variable due to the method of collection adopted.

LINEAGE Supplier : Bexhill Consultancies Pty. Ltd.

Date Recieved @ ERIN = 20-DEC-1992

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN's geocode validation performed on tree taxa.

POSITIONAL ACCURACY

Positional Accuracy Comments : The accuracy of the sample sites up from the start of the transects is 10m and the accuracy of the transects is 100m as determined from 1:100,000 topographic maps. Altitude precision is O.lm for sites surveyed by the Australian Survey Office and lm for other sites.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Identified by Prof. P.B.Tomlinson, N.C.Duke in collaboration with Dr Betsy Jackes, JCUNQ. Data quality summary: mainland and island 5,081 records; offshore 1 record; of mainland and island records, 4,567 were valid, 40 were suspect.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : No particular tests performed at ERIN. APPENDIX VIUf CYP - Comalco Herbarium Plants 1.0

21 June 1994 Environmental Resources Information Network

DATA SET REPORT CYP - Comalco Herbarium Plants 1.0

Data Set Name = CYP - Comalco Herbarium Plants

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NIA

Data Set Title (expand "Data Set Name") : CYP - Comalco Herbarium Plants

Data Set Description (Abstract): CYP Plant specimen records from Cornalco Herbarium, Weipa in site & specimen format.

Custodian : QLD Comalco Aluminium, Weipa

Address : Comalco Aluminium Weipa North Queensland 4874

Phone = (070) 698 924

Fax = (070) 698 409

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 2,030 specimen records in total, 1709 loaded into ENS; - 1,053 sites

Data Set Size (as delivered to CYPLUS GIs) 1,709 specimen records

Spatial Extent (in source data set) = Cape York Peninsula Land Use Strategy Study Area

Temporal Extent 1977 - 1992 Taxon Extent : Many taxa from Weipa district and fiuther afield on CYP

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to any third party without permission from the Custodian. The publication of this and agglomerated data shall acknowlege the custodians of the original data.

More Detail At Custodian = NO

Basis of ERM Involvement = ERIN CYPLUS CONTRACT

Acknowledgement : Cite "Data and databasing by Comalco Herbarium, Weipa. Databasing funded by CYPLUS through the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

Cavetas : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Taxonomic identifications should be of a good standard as done using herbarium specimen; however, not all were referenced directly with the Qld Herbarium in Brisbane. Spatial reliability may be poor, as with most datasets derived from specimen card.

Sampling Unit = SITE BASED

Sampling Code = VARIOUS

Sampling Methodology : A large number of specimens were collected by Ann Gunness (nee Morton) for revegetation purposes and by Mick Godwin, QDEH Cairns for his land system survey work.

LINEAGE

Supplier : QLD Comalco Aluminium, Weipa

Date Recieved @ ERIN = 18-MAY-1993 Lineage Comments

Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN's geocode validation performed on tree taxa.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination was from locality names, method of determination unknown. Geocode sources were mainly from the collectors and data set compiler. Geocode precision varies from 180m to 540,000m (majority between 180x11 and 9,000m).

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Data quality summary: mainland 1,582 records; offshore 115 records; of mainland records validated, 182 were valid, 31 were suspect.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Specimen records within the Herbarium not already lodged with the Queensland Herbarium were databased.

OTHER

Data Set Comments : Databasing work at Weipa performed by Alex Thomas, Dept of Zoology, James Cook University of North Queensland, Townsville. Data has been loaded to ERIN's specimen table so that it may be validated. This is a temporary situation until implementation of ERIN's site module and loader. APPENDIX VIIIg LANDCOVER - CSIRO PI Herb Pt 2 1.0

12 July 1994 Environmental Resources Information Network

DATA SET REPORT LANDCOVER - CSIRO PI Herb Pt 2 1.0

Data Set Name = LANDCOVER - CSIRO PI Herb Pt 2

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = N/A

Data Set Title (expand "Data Set Name") : Plant specimen records of Eucalyptus from the Australian National Herbarium, Canberra.

Data Set Description (Abstract): Eucalyptus (p.p.) from CANB.

Custodian : ACT CSIRO Division of Plant Industry, Australian National Herbarium

Address : Australian National Herbarium Division of Plant Industry CSIRO GPO Box 1600 CANBERRA ACT 2601

Phone = (06) 246 5113

Fax = (06) 246 5249

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 14,000 specimen records in total; 11,501 loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 53 specimen records

Spatial Extent (in source data set) = Australia Temporal Extent : 1960 - 1992

Taxon Extent : Eucalyptus specimen records.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to third party without permission of Australian National Herbarium. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN CONTRACT

Acknowledgement : Cite "Australian National Herbarium, Centre for Plant Biodiversity Research, CSIRO Division of Plant Industry. Databasing funded by the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Specimens identified by expert on Eucalypts (MIH Brooker) and the collection is regularly curated.

Sampling Unit = SPECIMEN BASED

Sampling Code = N/A

Sampling Methodology : NIA

LINEAGE

Supplier : ACT CSIRO Division of Plant Industry, Australian National Herbarium.

Date Recieved @ ERIN = 06-OCT-1993 Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN'S geocode validation performed on tree taxa; all records north of 16.5 deg. S supplied to CYPLUS.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination is unknown; geocode sources were from collectors, compilers of data set and unknown sources; geocode precision varies from 1,000m (33 records) to 10,000m (19 records).

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Specimens identified by taxonomic expert (MIH Brooker). Data quality summary: mainland 51 records; offshore 1 record; of mainland records validated, 48 were valid, 3 were suspect.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Complete data set for Eucalyptus in CYPLUS study area. No particular tests performed at ERIN. APPENDIX VIlIh CYPLUS - Gitay Flora : Jardine 1.0

22 June 1994 Environmental Resources Information Network

DATA SET REPORT CYPLUS - Gitay Flora : Jardine 1.0

Data Set Name = CYPLUS - Gitay Flora : Jardine

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NIA

Data Set Title (expand "Data Set Name") : Cape York Peninsula Land Use Strategy - Dr. Habiba Gitay Flora : Jardine and McIlwraith Range area, Cape York Peninsula.

Data Set Description (Abstract): Site based vegetation information with a great deal of detailed information about orchids in the McIlwraith and Jardine area.

Custodian : Dr. P.S. Laverack, Queensland Department of Environment & Heritage

Address : QNPWS Norhern Regional Centre Pallarenda Townsville 0000

Phone = (077) 74 12 11

Fax = (077) 21 1742

Custodianship Comments : Dr Habiba Gitay is joint Custodian. The specimens were collected by Dr Laverack and databased by Dr Gitay.

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = NEAR PRIMARY

Data Set Size (as delivered to EF3N) : 664 specimen records in total; 59 sites; 606 records loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 606 specimen records; number of sites unknown.

Spatial Extent (in source data set) = Cape York Peninsula Land Use Strategy Study Area, Queensland Temporal Extent : 1973 - 1986

Taxon Extent : Vascular (higher) plants, especially orchids (475 records).

Fields Supplied : See data dictionary file: monster-dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : VERY SENSITIVE DATA SET. Point data may only be released with the permission of the Custodian. Aggregated data not to be released to third party without permission from the Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN-CYPLUS CONTRACT

Acknowledgement : Cite "Dr P.S. Laverack, Queensland Department of Environment and Heritage, Townsville and Dr H. Gitay, Australian National University, Canberra. Databasing funded by the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUKITY

Reliability = GOOD

Reliability Comments : Field data collected and databased by experienced professionals.

Sampling Unit = SITE BASED

Sampling Code = PURPOSIVE

Sampling Methodology : Aerial photographs were used to identi@likely orchid rich areas. Base camps were set up and teams walked in to identified areas, to collect site data.

LINEAGE Supplier : Dr. Habiba Gitay

Date Recieved @ ERIN = 01-APR-1994

Lineage Comments : Data set loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocodes precision between 100m and 1000m. Often the site geocode was approximate as the survey was conducted in a general area around that point.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Material identified in field by Bill Laverack. He is an experienced, well known collector. Many specimens vouchered and others checked against specimens in Herbaria.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : No particular tests performed at ERIN.

OTHER

Data Set Comments : Recieved as part of the CYPLUS / Strie Pty Ltd. contract.

Publications : Laverack, P.S. (1989) Orchids of Torres Strait. A report to the Australian Orchid Foundation. August 1989. Department of Environment and Conservation, Queensland.

Laverack, P.S. (1977) Orchids of the Iron Range Area. Results of the Australian Orchid Foundation-National Parks and Wildlife Service of Queensland Expedition to the Iron Range Area of Cape York Peninsula. Queensland National Parks and Wildlife Service.

Laverack, P.S. (1980) Orchids of McIlwraith Range. Results of the Australian Orchid Foundation-National Parks and Wildlife Service of Queensland joint project to study the orchids of the McIlwraith Range area of Cape York Peninsula. Queensland National Parks and Wildlife Service. January 1980. APPENDIX VILIi LANDCOVER - CSIRO PI Herb CANB 1.0

12 July 1994 Environmental Resources Information Network

DATA SET REPORT LANDCOVER - CSIRO PI Herb CANB 1.0

Data Set Name = LANDCOVER - CSIRO PI Herb CANB

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NIA

Data Set Title (expand "Data Set Name") : Plant specimen records of landcover taxa from the Australian National Herbarium, Canberra.

Data Set Description (Abstract): Eucalyptus @.p.) Poaceae @.p.) & Melaleuca records from CANB Herbarium.

Custodian : ACT CSIRO Division of Plant Industry, Australian National Herbarium

Address : Australian National Herbarium Division of Plant Industry CSIRO GPO Box 1600 CANBERRA ACT 2601

Phone = (06) 246 5113

Fax = (06) 246 5249

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 45,959 specimen records in total; 38,959 loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 905 specimen records.

Spatial Extent (in source data set) = Australia Temporal Extent 1770 - 1991

Taxon Extent : 838 Myrtaceae (Eucalyptus & Melaleuca) specimen records; 67 Poaceae specimen records.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to third party without permission of Australian National Herbarium. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN CONTRACT

Acknowledgement : Cite "Australian National Herbarium, Centre for Plant Biodiversity Research, CSIRO Division of Plant Industry. Databasing funded by the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = VARIABLE

Reliability Comments : Generally good. Melaleucan specimens are reliable due to recent revisions; Eucalypt validation to proceed; others not known.

Sampling Unit = SPECIMEN BASED

Sampling Code = N/A

Sampling Methodology : N/A

LINEAGE

Supplier : ACT CSIRO Division of Plant Industry, Australian National Herbarium

Date Recieved @ ERIN = 15-MAY-1992 Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN's geocode validation performed on tree taxa; all records north of 16.5 deg. S supplied to CYPLUS.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination unknown; geocode sources from collectors (374 records), compilers of data set (132 records) and unknown sources (399 records); geocode precision mainly 1,000m with some records to 10,000m.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Identifications by taxonomic expert (MIH Brooker). Data quality summary: mainland 810 records; offshore 43 records; of mainland records validated, 748 were valid, 52 were suspect; 52 records have yet to be processed.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Current contract with ERIN complete. Updating in progress. No particular tests performed at ERIN.

OTHER

Data Set Comments : Eucalypts (19191 records) - mainly northern, some SE; Grasses of the genera Astrebla, Dichanthium, Stipa, Triodia, Zygochloa, Bothriochloa, Enneapogon, Monodia & Sorghum + some others (10297 records), all Melaleuca records (5266 records). APPENDIX VIIIj LANDCOVER - Melb. herb. - Eucs 1.0

12 July 1994 Environmental Resources Information Network

DATA SET REPORT LANDCOVER - Melb. herb. - Eucs 1.0

Data Set Name = LANDCOVER - Melb. herb. - Eucs

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NIA

Data Set Title (expand "Data Set Name") : Plant specimen records of Eucalyptus from the National Herbarium of Victoria, plus one record of Callitris databased under "LANDCOVER - Melbourne Herb. (Version) 1.0"

Data Set Description (Abstract): Eucalypt data from Melbourne Herbarium

Custodian : Vic Royal Botanic Gardens & National Herbarium of Victoria

Address : National Herbarium of Victoria Royal Botanic Gardens Birdwood Ave SOUTH YARRA VIC 3141

Phone = (03) 655 2300

Fax = (03) 655 2350

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 10,700 specimen records in total; 8,152 loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 62 specimen records.

Spatial Extent (in source data set) = Australia

Temporal Extent : Taxon Extent : Eucalyptus specimen records and one record of Callitris macleayana.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = PUBLIC DOMAIN

Release Instructions : The Callitris reocord is not to be supplied to a third party without the permission of the National Herbarium of Victoria. The publication of this and agglomerated data shall acknowledge the custodians of the original data. All ROTAP specimens to be released to an accuracy of 10 minutes.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN CONTRACT

Acknowledgement : Cite "Royal Botanic Gardens and National Herbarium of Victoria, Melbourne. Databasing funded by the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

DATA QUALI'I*Y

Reliability = GOOD

Reliability Comments : All records have had there identity checked by M.I.H. Brooker (CANB)

Sampling Unit = SPECIMEN BASED

Sampling Code = N/A

Sampling Methodology : NIA

LINEAGE

Supplier : Vic Royal Botanic Gardens & National Herbarium of Victoria

Date Recieved @ ERIN = 05-MAY-1994

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN7sgeocode validation performed on tree taxa; all records north of 16.5 deg. S supplied to CYPLUS. POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination unknown; geocode sources from collectors, compilers of data set and unknown; geocode precision is between 1,000m and 500,000m and is 10,000m for the Callitris record.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Identifications checked by taxonomic expert (MIH Brooker). Data quality summary: 58 mainland records; 3 offshore records; of the mainland records validated, 55 were valid, 2 were suspect. The Callitris record was a valid specimen.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Only specimen records of Eucalyptus were obtained from the National Herbarium. Of the Calltris records from the Herbarium, only one was in the CYPLUS study area. APPENDIX VIIIk MANGROVE ATLAS 1.0

12 July 1994 Environmental Resources Information Network

DATA SET REPORT MANGROVE ATLAS 1.0

Data Set Name = MANGROVE ATLAS

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NIA

Data Set Title (expand "Data Set Name'?) : Preliminary Atlas of Mangrove Species in Australia.

Data Set Description (Abstract): Data prepared for the "Preliminary Atlas of Mangrove Species in Australia", published by Dr John Busby and Dr Peter Bridgewater in Australian Flora & Fauna Series No. 5. (1986)

Custodian : ACT ERIN Unit

Address : ERIN Unit GPO Box 787 Canberra, ACT 2601

Phone = (06) 274 1111

Fax = (06) 274 1333

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 190Kb, c. 40 species. 2,420 specimen records in total and loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 644 specimen records.

Spatial Extent (in source data set) = Australia

Temporal Extent : Unknown. Taxon Extent : At least mangrove vascular plant species.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : To Bona fide researchers and interested parties on application to the Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = OTHER (ABRS PROJECT)

Acknowledgement : Cite "Data provided by the Western Australian Herbarium, Queensland Herbarium, National Herbarium of NSW, Northern Territory Herbarium, and herbarium of the Australian Institute of Marine Science.''

Copyright = WITH CUSTODIAN

Caveats : Data current as published and may be out of date. Data set includes data collated from selected herbaria.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Data points checked against topographic maps at the time.

Sampling Unit = SPECIMEN BASED

Sampling Code = NIA

Sampling Methodology : NIA

LINEAGE

Supplier : ACT Australian Biological Resources Study

Date Recieved @ ERIN = 30-JAN-1992

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN'S geocode validation performed on tree taxa; all records north of 16.5 deg. S supplied to CYPLUS. POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination by maps; geocode sources unknown; geocode precision to 1,000m.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Specimens originally from State and Commonwealth Herbaria and checked by taxonomic experts. Data quality summary: 562 mainland records; 82 offshore records; of the mainland records validated, 395 were valid, 13 were suspect.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Complete as published from the Preliminary Atlas of Mangrove Species in Australia. APPENDIX VIlll CYP - QDEH Detailed Veg Survey 1.0

21 June 1994 Environmental Resources Information Network

DATA SET REPORT CYP - QDEH Detailed Veg Survey 1.0

Data Set Name = CYP - QDEH Detailed Veg Survey

Version = 1.0

Theme = FLORA

Main Project Number = NRl8

Secondaq Project Number = NROl

Data Set Title (expand "Data Set Name") : CYP - Qld Herbarium, Mareeba (Qld Dept of Environment and Heritage) Detailed sites from Vegetation Survey of Cape York Peninsula N of 16 degrees S.

Data Set Description (Abstract): Detailed CYP flora survey sites and specimen from John Neldner and John Clarkson; "Queensland Herbarium Far North Queensland Vegetation Mapping Project". Each site has an exhaustive species list and comprehensive structural data.

Custodian : Qld Queensland Herbarium, Mareeba

Address : Queensland Herbarium, Mareeba Off. Qld Dept of Environment & Heritage PO Box 1054 MAREEBA QLD 4880

Phone = (070) 921 555

Fax : (070) 923 593

Custodianship Comments

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = NEAR PRIMARY

Data Set Size (as delivered to ERTN) : 24,898 specimen records in total; 18,052 records loaded into ENS.

Data Set Size (as delivered to CYPLUS GIs) 18,052 specimen records.

Spatial Extent (in source data set) = Cape York Peninsula Land Use Strategy Study Area Temporal Extent : 1986 - 1993

Taxon Extent : All major vascular (lugher) plant tam. Probably better collections of perennial taxa (e.g. woody plants) in dry season observations, especially in drought times.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contect ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to any third party without permission fiom the Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN-CYPLUS CONTRACT

Acknowledgement : Cite "Queensland Herbarium, Queensland Department of Environment & Heritage, Mareeba. Databasing funded by CYPLUS through the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = VERY GOOD

Reliability Comments : Contains only taxa from each site that have been positivley identified. Taxon identifications are by two highly-experienced botanists; specimen (vouchers) were collected whenever necessary to perform positive determinations and also routinely for "known" taxa.

Sampling Unit = SITE BASED

Sampling Code = STRATIFIED SURVEY

Sampling Methodology : Aerial photographs marked up with vegetation patterns. Sites representative of each pattern selected, visited and recorded. Most sites near roaWtracks, but some helicopter work. See Ecological Soc. Aust. Symp. draft paper for Sept. 1993. LINEAGE

Supplier : Supplied by the Queensland Herbarium, Mareeba at a number of stages during the project.

Date Recieved @ ERIN = 08-FEB-1994

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN's geocode validation performed on tree taxa.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode precision is 300m (14,802 records) and 1,000m (3,190 records). Geocode determination using GPS (15,052 records) and topographic maps (3,000 records) by the collectors.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Data quality summary: 17,558 mainland records; 472 offshore records; of the mainland records validated, 2,485 were valid, 50 were suspect; 22 records have yet to be processed.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Apparently, about 70% of the sites collected by Neldner/Clarkson were covered by the ERIN-CYPLUS Contract. Data from these sites are supplied, here. The remaining 30% will probably be supplied to CYPLUS GIS, through ERIN, covered by the CYPLUS GIs MOU. Contact Custodian for timing.

OTHER -

Data Set Comments : Data is in temporary site tables waiting for loading into ERIN's site module. APPENDIX VIIIm CYP - QDEH Obs Veg Survey 1.0

21 June 1994 Environmental Resources Information Network

DATA SET REPORT CYP - QDEH Obs Veg Survey 1.0

Data Set Name = CYP - QDEH Obs Veg Survey

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NROl

Data Set Title (expand "Data Set Name") : Cape York Peninsula - Queensland Department of Environment and Heritage, Observational Vegetation Survey.

Data Set Description (Abstract): Observational CYP flora survey sites and specimens as part of the Queensland Herbarium Far North Queensland Vegetation Mapping Project.

Custodian : Qld Queensland Herbarium, Mareeba.

Address : Queensland Herbarium, Mareeba Off. Qld Dept of Environment & Heritage PO Box 1054 MAREEBA QLD 4880

Phone = (070) 921 555

Fax : (070) 923 593

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 11,533 specimen records in total; 10,571 loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) : 10,571 specimen records.

Spatial Extent (in source data set) = Cape York Peninsula Land Use Strategy Study Area Temporal Extent : 1989 - 1993

Taxon Extent : Predominant (i.e. dominant species, species in flower, etc.) taxa observed, not comprehensive.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contect ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to any third party without permission from the Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN-CYPLUS CONTRACT

Acknowledgement : Cite "Queensland Herbarium, Queensland Department of Environment & Heritage, Mareeba. Databasing funded by CYPLUS, through the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Taxon identifications are by two highly-experienced botanists.

Sampling Unit = SMALL AREA BASED, c 1 SQ KM

Sampling Code = OPPORTUNISTIC

Sampling Methodology : Observations made by collector whilst driving along roads speaking into tape recorder.

LINEAGE

Supplier : The data was supplied by the Queensland Herbarium, Mareeba.

Date Recieved @ ERIN = 15-JUL-1993 Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN's geocode validation tests performed on tree taxa.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode precision is 300m. Most geocodes determined by GPS from vehicle position by collectors.

A'TTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Taxon identifications are field observations by two highly-experienced botanists, Mr John Clarkson and Mr John Neldner. Data quality summary: 7,850 mainland records; 65 offshore records; of the mainland records validated, 737 were valid, 2 were suspect. Mainlandloffshore test has not been run on 2,656 records.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Not sure how complete this dataset is. All data supplied to ERIN by 21 June 1994 is included, here. Please contact Custodian for further details. See also related detailed site data and vegetation maps.

OTHER

Data Set Comments : Data is in temporary site tables waiting for loading into ERIN's site module. APPENDIX VIIIn CYP-NSWHERB PLANTS 1.0

21 June 1994 Environmental Resources Information Network

DATA SET REPORT CYP-NSWHERB PLANTS 1.0

Data Set Name = CYP-NSWHERB PLANTS

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = N/A

Data Set Title (expand "Data Set Name") : CYPLUS - Royal Geographic Society of QLD - New South Wales Herbarium Plants from Heathlands and district.

Data Set Description (Abstract): Plant data from expedition to "Heathlands" of the Royal Geographic Society of Queensland in 1992 supplied by National Herbarium of New South Wales

Custodian : NSW Royal Botanic Gardens & National Herbarium of NSW

Address : National Herbarium of NSW Mrs Macquaries Road SYDNEY NSW 2000

Phone = (02) 231 8111

Fax = (02) 251 7231

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 511 specimen records in total; 354 records loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 354 specimen records.

Spatial Extent (in source data set) = Queensland

Temporal Extent : March 1992.

Taxon Extent : Vascular plant taxa.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to any third party without permision from the Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN-CYPLUS CONTRACT

Acknowledgement : Cite "Royal Botanic Gardens and National Herbarium of New South Wales. Databasing funded by the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Taxonomic identifications should be of a high standard as done using herbarium specimens. Spatial reliability may be poor, as with most data sets where a GPS was not used and derived from a specimen card.

Sampling Unit = SPECIMEN BASED

Sampling Code = TAXONOMIC PURPOSES

Sampling Methodology : Generally between 3-12 specimens in each site.??????

LINEAGE

Supplier : NSW Royal Botanic Gardens & National Herbarium of NSW

Date Recieved @ ERIN = 01-JUN-1993

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN'S geocode validation performed on tree taxa.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination mainly GPS and also topographic maps, but determination for each record not given. Geocode sources from collector and compiler of the data set. Most records have a geocode precision of 1000m.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Data quality summary: mainland 350 records; no offshore records; of mainland records validated, 219 were valid, 34 were suspect; 4 records have yet to be processed.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : All records held by the Herbarium of NSW for the Heathlands Expedition were supplied. APPENDIX VIIIo NT Herbarium 1.0

21 June 1994 Environmental Resources Information Network

DATA SET REPORT NT Herbarium 1.0

Data Set Name = NT Herbarium

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = N/A

Data Set Title (expand "Data Set Name") : Plant specimen records from the Northern Territory Herbarium

Data Set Description (Abstract): Plant specimen records from the Northern Territory Herbarium

Custodian : NT Herbarium, Conservation Commission of the NT

Address : Northern Territory Herbarium Conservation Commision of the NT PO Box 496 PALMERSTON NT 083 1

Phone = (089) 89 4400

Fax : (089) 32 3849

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 151,873 total specimen records; 131,336 records loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 2,526 specimen records.

Spatial Extent (in source data set) = AUSTRALIA

Temporal Extent : 1802 - 1993 Taxon Extent : All major vascular (higher) plant taxa in NT. Maybe some lower plants.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contect ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to any third party without written permission from the Conservation Commission of the Northern Territory. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN PROJECT

Acknowledgement : Cite "Northern Territory Herbarium, Conservation Commission of the Northern Territory."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = UNKNOWN

Reliability Comments : Validation tests may ascertain reliability.

Sampling Unit = SPECIMEN BASED

Sampling Code = TAXONOMIC PURPOSES

Sampling Methodology : Specimens of interest to taxonomists were collected, dried and pressed and lodged in Herbarium. May be some vouchers from ecological surveys and experiments.

LINEAGE

Supplier : NT Herbarium, Conservation Commission of the NT

Date Recieved @ ERIN = 20-SEP-1993

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN's geocode validation performed on tree taxa; all records north of 16.5 deg. S supplied to CYPLUS.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode supplied by collectors & compilers, mainly from maps; geocode precision 1000m (2358 specimens) and 60,000m (96 records).

A?TRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson7sList. Identification of specimen in herbarium by experienced taxonomist - in this case, often Clyde Dunlop. Data quality summary: mainland and islands 2,263 records; offshore 191 records; of mainland records validated, 1,019 were valid, 105 were suspect.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Current contract with ERIN complete. No particular tests performed at ERIN.

OTHER

Data Set Comments : Apparently does not meet full HISPID standards as started before HISPID agreed to. APPENDIX VIIIp LANDCOVER - Queensland Herb. 1.0

21 June 1994 Environmental Resources Information Network

DATA SET REPORT LANDCOVER - Queensland Herb. 1.0

Data Set Name = LANDCOVER - Queensland Herb.

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NROl

Data Set Title (expand "Data Set Name") : Plant specimen records for areas of Queensland and particular "landcover" taxa, from the Queensland Herbarium (BRI).

Data Set Description (Abstract): Queensland Herbarium data for BURKE, COOK, DARLING DOWNS, MORETON, NORTH KENNEDY & WIDE BAY DISTRICTS plus Acacia, Callitris, Casuarinaceaee, Chenopodiaceae, Eucalyptus, Melaleuca and Poaceae. Also known as HERBRECS from the system used to extract records.

Custodian : Qld Queensland Herbarium

Address : Queensland Herbarium Meiers Road INDOOROOPILLY QLD 4068

Phone = (07) 877 9321

Fax : (07) 371 6655

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = NEAR PRIMARY

Data Set Size (as delivered to ERIN) : approx. 220,000 specimen records in total; 200,731 loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 37,342 records

Spatial Extent (in source data set) = Queensland Temporal Extent 1770 - 1994

Taxon Extent : All vascular (higher) plant taxa in QLD.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contect ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Data not to be released to any third party without permission of the Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN CONTRACT

Acknowledgement : Cite "Queensland Herbarium, Queensland Department of Environment & Heritage, Brisbane. Databasing funded by the Australian Biological Resources Study initially and upgrading in part by the Environmental Resources Information Network."

Copyright = WITH CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier.

DATA QUALITY

Reliability = VARIABLE

Reliability Comments : Probably typical of most herbarium data - more reliable location data and identifications from the 1950's onwards.

Sampling Unit = SPECIMEN BASED

Sampling Code = TAXONOMIC PURPOSES

Sampling Methodology : Specimens of interest to taxonomists were collected, dried and pressed and lodged in Herbarium.

LINEAGE

Supplier : Qld Queensland Herbarium

Date Recieved @ ERIN = 30-DEC-1991 Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN'S geocode validation performed on tree taxa; all records North of 16.5 degrees S supplied to CYPLUS.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination is unknown, geocode sources are from the collectors, and precision varies: 1,000m (25,829 records); 10,000m (9,401 records); 60,000m (2,112 records).

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Taxon name from identification of specimens in the herbarium by an experienced taxonomist. Data quality summary: mainland and islands 24,879 records; offshore 4,036 records; of mainland records validated, 11,887 were valid, 1,196 were suspect. 8,427 records have yet to be processed.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : All records for CYPLUS study area received from Custodian.

OTHER

Data Set Comments : Somewhat out of date, due to recent taxonomic revisions, descriptions and redeterminations. Reload of next version of HERBRECS in progress at ERIN. APPENDIX VIIIq ROTAP - Queensland Herbarium 1.0

12 July 1994 Environmental Resources Information Network

DATA SET REPORT ROTAP - Queensland Herbarium 1.0

Data Set Name = ROTAP - Queensland Herbarium

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = N/A

Data Set Title (expand "Data Set Name") : Rare or Threatened Australian Plant specimen data from Queensland

Data Set Description (Abstract): Queensland Herbarium Rare or Threatened Australian Plants

Custodian : Qld Queensland Herbarium

Address : Queensland Herbarium Meiers Road INDOOROOPILLY QLD 4068

Phone = (07) 877 9321

Fax = (07) 371 6655

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = NEAR PRIMARY

Data Set Size (as delivered to ERIN) : 9,521 specimen records in total; 8,260 records loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 1,477 records.

Spatial Extent (in source data set) = Queensland

Temporal Extent : 1770 - 1990

Taxon Extent : Plant species considered to be rare or threatened.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Point data not to be released to anyone. Aggregated data not to be released to third party without permission of the Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ANCA CONTRACT

Acknowledgement : Queensland Herbarium

Copyright = WITH CUSTODIAN

DATA QUALITY

Reliability = VARIABLE

Reliability Comments : Geocodes are suspect in some circumstances and taxon names may be out of date.

Sampling Unit = SPECIMEN BASED

Sampling Code = N/A

Sampling Methodology : N/A

LINEAGE

Supplier : ACT CSIRO Division of Plant Industry, Australian National Herbarium

Date Recieved @ ERIN = 30-SEP-1991

Lineage Comments : Data set loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN'S geocode validation performed on tree taxa; all records N of 16.5 deg. S supplied to CYPLUS.

POSITIONAL ACCURACY

Positional Accuracy Comments Geocode determination by map; geocode precision 5,000m (1,017 records), 10,000m (460 records); geocode source unknown.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Most specimens identified by taxonomic specialists. Data quality summary: mainland and islands 1,374 records; offshore 87 records; of mainland records validated, 1,018 were valid, 254 were suspect; 16 records have yet to be processed.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Data supply complete from Herbarium for ROTAP'S.

OTHER

Data Set Comments : Dataset is a result of contract let by ANCA to CSIRO Division of Plant Industry. APPENDIX Vmr LANDCOVER - CSlRO Tree Seed C. 1.0

22 June 1994 nvironrnental Resources Information Network

DATA SET REPORT LANDCOVER - CSIRO Tree Seed C. 1.0

Data Set Name = LANDCOVER - CSIRO Tree Seed C.

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NIA

Data Set Title (expand "Data Set Name") : Plant collection records from the CSIRO Tree Seed Centre, Canberra.

Data Set Description (Abstract): Plant collection records from CSIRO Tree Seed Centre in the Division of Forestry, Canberra.

Custodian : ACT CSIRO Division of Forestry, Tree Seed Centre.

Address : Tree Seed Centre Division of Forestry CSIRO PO Box 4008 Queen Victoria Terrace, CANBERRA ACT 2600

Phone = (06) 281 8211

Fax = (06) 281 8266

Custodianship Comments :

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 8,785 records in total; 7,923 records loaded into ENS.

Data Set Size (as delivered to CYPLUS GIs) 3 85 records

Spatial Extent (in source data set) = Australia Temporal Extent : 1966 - 1991

Taxon Extent : Vascular (Iigher) plants. Associate (mostly prominent species) and habitat recorded at seed collection sites.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = PUBLIC DOMAIN

Release Instructions : The Tree Seed Centre reserves the right to impose restrictions on particular sitelspecies combinations if they see the need. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN CONTRACT

Acknowledgement : Cite "CSIRO Tree Seed Centre, CSIRO Division of Forestry, Canberra. Databasing funded by the Environmental Resources Information Network."

Copyright = WlTH CUSTODIAN

DATA QUALITY

Reliability = GOOD

Reliability Comments : Taxonomy fairly reliable because of nature of the seed collections, but not always vouchered. Locality information reliable.

Sampling Unit = SITE BASED

Sampling Code = SELECTIVE

Sampling Methodology : Sites selected to maximise usell provenances of target species for seed collection. Habitat and associated species recorded at most collection sites.

LINEAGE

Supplier : ACT CSIRO Division of Forestry, Tree Seed Centre

Date Recieved @ ERIN = 06-AUG-1991

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN's geocode validation performed on tree taxa; all records north of 16.5 deg. S supplied to CYPLUS.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination unknown; geocode sources from collectors; geocode precicion vaires from 1,000m (377 records) to 60,000m (8 records).

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Field identification by collectors. Some records vouchered and held in CANB. Vouchers usually held while seed still in stock, discarded when seed no longer available. Data quality summary: 376 mainland records; 9 offshore records; of the mainland records validated, 350 were valid, 12 were suspect.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : No particular tests performed at ERIN. APPENDIX VIIIs ROTAP -CSIRO PI Herb CANB 1.0

12 July 1994 Environmental Resources Information Network

DATA SET REPORT ROTAP - CSIRO PI Herb CANB 1.0

Data Set Name = ROTAP - CSIRO PI Herb CANB

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = N/A

Data Set Title (expand "Data Set Name") : Rare or Threatened Australian Plants specimen data from the Australian National Herbarium, Canberra.

Data Set Description (Abstract): Specimen records for Rare or Threatened Australian Plants.

Custodian : ACT CSIRO Division of Plant Industry, Australian National Herbarium

Address : Australian National Herbarium Division of Plant Industry CSIRO GPO Box 1600 CANBERRA ACT 2601

Phone = (06) 246 5113

Fax = (06) 246 5249

Custodianship Comments

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = NEAR PRIMARY

Data Set Size (as delivered to ERIN) : 4,244 specimen records in total; 4,134 loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 334 specimen records.

Spatial Extent (in source data set) = Australia Temporal Extent : 1943 - 1988

Taxon Extent : ROTAP specimen records.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Point records not to be released without permission. Generalised data not to be released to third party without permission of Australian National Herbarium. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ANCA CONTRACT

Acknowledgement : Cite "Australian National Herbarium, Centre for Plant Biodiversity Research, CSIRO Division of Plant Industry. Databasing funded by the Australian Nature Conservation Agency."

Copyright = WITH CUSTODIAN

DATA QUALITY

Reliability = VARIABLE

Reliability Comments : The list was compiled by John Briggs for the ANZECC list of ROTAP's. It is probably better than a lot of the other ROTAP data sets.

Sampling Unit = SPECIMEN BASED

Sampling Code = N/A

Sampling Methodology : N/A

LINEAGE

Supplier : ACT CSIRO Division of Plant Industry, Australian National Herbarium

Date Recieved @ ERIN = 14-OCT-1992

Lineage Comments : Data loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP; ERIN'S geocode validation performed on all taxa; all records north of 16.5 deg. S supplied to CYPLUS.

POSITIONAL ACCURACY

Positional Accuracy Comments : Geocode determination is mainly unknown; geocode sources are unknown; geocode precision varies from 50m to 25,000m.

AITRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Most specimens identified by taxonomic specialists. Data quality summary: 326 mainland records; 8 offshore records; of mainland records validated, 242 were valid, 74 were suspect.

LOGICAL CONSISTENCY

Consistency Comments : No particular tests performed at ERIN.

COMPLETENESS

Completeness Comments : Complete data set from Custodian. APPENDIX VIIIt CYPLUS - Gitay Flora: Weipa 1.0

22June1994 Environmental Resources Information Network

DATA SET REPORT CYPLUS - Gitay Flora : Weipa 1.0

Data Set Name = CYPLUS - Gitay Flora : Weipa

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = N/A

Data Set Title (expand "Data Set Name") : Cape York Peninsula Land Use Strategy - Dr. Habiba Gitay Flora : Weipa Area

Data Set Description (Abstract): Vegetation information gathered for Weipa area on a site by site basis collected by Mick Godwin.

Custodian : Dr. Mick Godwin, Qld Dept of Environment & Heritage.

Address : Qld Dept of Environment & Heritage National Parks & Wildlife Service PO Box 2066 CAIRNS QLD 4870

Phone = (070) 53 43 10

Fax = (070) 52 3080

Custodianship Comments : Dr Gitay is joint custodian of the data, responsible for databasing. Mr Godwin was the collector.

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = PRIMARY

Data Set Size (as delivered to ERIN) : 3,108 specimen records in total; 276 sites; 2,583 records loaded into ERIS.

Data Set Size (as delivered to CYPLUS GIs) 2,583 specimen records; number of sites unknown.

Spatial Extent (in source data set) = Cape York Peninsula Land Use Strategy Study Area, Queensland

Temporal Extent : Taxon Extent : Vascular (higher) plant taxa (emphasis on trees and shrubs).

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodian for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Point data not to be released to third parties without the permission of the Custodian. The publication of this and agglomerated data shall acknowledge the custodians of the original data.

More Detail At Custodian = YES

Basis of ERIN Involvement = ERIN-CYPLUS CONTRACT

Acknowledgement : Cite "Mr Mick Godwin, Queensland Department of Envionrnent and Heritage, Cairns and including private collections; Dr Habiba Gitay, Australian National University, Canberra. Databasing funded by the Environmental Resources Information Network."

Copyright = SEE CUSTODIAN

Caveats : The data has been assumed by ERIN to be correct as received from the data supplier. Nomenclature has been updated where possible in 1993-1994.

DATA QUALITY

Reliability = GOOD

Reliability Comments : Field data collected and databased by experienced professionals.

Sampling Unit = SITE BASED

Sampling Code = ECOLOGICAL SURVEY

Sampling Methodology : Aerial photographs and geology maps were used to preselect the sites, although advantage was taken to note any unusual species or assemblages. A site representing the typical vegetation for a particular "land unit" was chosen and the species listed. In most cases, data on the soil was also collected.

LINEAGE

Supplier : Dr. Habiba Gitay Date Recieved @ ERIN = 01-APR-1994

Lineage Comments : Data set loaded into ERIS and taxon names checked against John Clarkson's plant list for CYP.

POSITIONAL ACCURACY

Positional Accuracy Comments : The site location was determined by the position on a 1:100,000map and in conjunction with aerial photographs. Most geocodes given in degrees, minutes, seconds. Accurate to about 500m. No records have been processed through ERIN's geocode validation procedure.

ATTRIBUTE ACCURACY

Attribute Accuracy Comments : Only those records matching ERIN's Taxon Table have been sent to the CYPLUS GIs. For plants, this is John Clarkson's List. Material identified in field by Mick Godwin and by collecting herbarium samples. He is an experienced, well known collector. Specimens are held in Mick's herbarium with some copies in the Atherton Herbarium.

LOGICAL CONSISTENCY

Consistency Comments : The data was collected primarily to produce a map based on land units or broad vegetation types. Infrequently encountered species or assemblages were recorded.

COMPLETENESS

Completeness Comments : No particular tests performed at ERIN.

OTHER

Data Set Comments : Recieved as part of the CYPLUS / Strie Pty Ltd. contract.

Publications : Godwin, M.D. (1985) Land Units of the Weipa Region of Australia's Cape York Peninsula. Queensland National Parks & Wildlife Service. Report compiled by Research and Planning Branch for Comalco Ltd. 141

APPENDIX IX Documentation of Whole Flora Dataset

21 September 1994 Environmental Resources Infornmation Network

DATA SET REPORT CYPLUS - All CYP Flora (NR18) 1.0

Data Set Name = CYPLUS - All CYP Flora (NR18)

Version = 1.0

Theme = FLORA

Main Project Number = NR18

Secondary Project Number = NROl

Data Set Title: All Flora Specimen and Observation Data from Cape York Peninsula Land Use Strategy Project NR18.

Data Set Description (Abstract): ERIN was responsible for collating existing point-based location data on the flora of Cape York Peninsula. This included data from herbarium specimens and observations (including systematic and incidental records.) This work often involved converting data from a paper- based format to a more-useable digital form - spatially referenced RDBMS records.

Data from new survey work, mainly by two teams from the Dept of Environment and Heritage, were also received at ERIN and integrated with the existing data. All source datasets were packaged into a common format. A user interface was written in ARCIINFO am1 language to select subsets of the data relevant to a wide variety of uses. Data plus interface tools were supplied to the CYPLUS GIs at NRIC on 30 June 1994.

Custodian : ACT ERIN Unit

Address : Environmental Resources Information Network DEST GPO Box 787 CANBERRA ACT 2601

Phone = (06) 274 1059

Fax = (06) 274 1333

Custodianship Comments : Conact Dr Matt. Bolton at the above address, or e-mail: [email protected] ERIN does not see itself as the long-term custodian of this dataset.

DATASET SIZE AND EXTENT

Data Type = POINT

Aggregation Level = COMPOSITE

Data Set Size (at ERIN) : There are 92,300 plant records in eris.specimen for study area (i.e.above 16 deg), 11437 distinct sites and 3979 distinct species.

Data Set Size (as delivered to CYPLUS GIs):

Spatial Extent (in source data set) = Cape York Peninsula Land Use Strategy Study Area, Queensland

Temporal Extent : from 1800's to present

Taxon Extent : All major vascular flora taxa - species of angiosperms and gymnosperms - occurring on CYP. No non-vascular plants.

Fields Supplied : See data dictionary file: monster.dct

Other Fields : Please contact ERIN Unit or Custodians of source datasets for details.

DATASET USAGE

Data Set Use Code = CYPLUS FAMILY

Release Instructions : Not to be released to third parties without express permissions of the ERIN Unit and relevant primary Custodians. Substantial portions of the database are RESTRICTED.

More Detail At Custodian = YES

Basis of ERIN Involvement = OTHER

Acknowledgement : All users of the CYPLUS flora database should acknowledge CYPLUS funding and the ERIN Unit as well as requirements specified by relevant Custodians of the relevant primary datasets (See Metadata on source datasets). Copyright = OTHER, SEE CUSTODIAN

DATA QUALITY

Reliability = VARIABLE

Reliability Comments : Data have been collated from a multitude of sources. The user should be aware of huge variations in data quality throughout the dataset. See further details, below, and in metadata for the source datasets.

Sampling Unit = VARIOUS

Sampling Code = VARIOUS

Sampling Methodology : Sampling methods range from incidental observations to systematic, stratified site sampling.

LINEAGE

Supplier : Many - see separate metadata reports on each dataset.

Lineage Comments : CYPLUS-project NR18 data first arrived at ERIN on 20-DEC-1992 (Bunt Mangroves) Relevant ERIN-funded data first arrived at ERIN on 06-AUG-1991 (CSIRO Tree Seed Centre) The last source-dataset acquisition was on 05-MAY-1994 (Melb. Herbarium) - When did Qld Herb. #2 arrive? This version of the "All CYP Flora (NR18)" dataset was supplied to the CYPLUS GIs on 30-JUN-1994

POSITIONAL ACCURACY

Positional Accuracy Comments : Except for the most-recent data from Qld Herbarium Mareeba and QDEH Cairns, geocodes (locations) were usually determined from topographic maps or (worse) locality description. Average accuracy is probably about 1 minute or 1km, but some records could be as much as 50km in error. See geocode precision field for precision of individual records. All records have been supplied at a standard resolution of 3 minutes of latitude~longitude. Where custodial permission was obtained, more-accurate point locations were also supplied to the CYPLUS GIs in a separate field in the same record.

ATTRIBUTE ACCURACY Attribute Accuracy Comments : Only those records matching ERIN'S Taxon Table have been sent.to the CYPLUS GIs. For plants, this is Clarkson's List. Taxonomic quality is variable. Observational records range from those collected by very experienced botanists to incidental records by tourists. Specimen records amenable to redetermination, if necessary, in future.

LOGICAL CONSISTENCY

Consistency Comments : Apart from records failing the Taxon Test (see Attribute Accuracy, above), individual records have been attributed according to the assessment of the record by the ERIN Offshore Test.

Only some recrods (rare and threatened plants and landcover groups) were tested by the BIOCLIM Outlier Detection Test.

COMPLETENESS

Completeness Comments : No particular tests performed at ERIN, but the collated dataset is clearly biased towards the vicinity of roads and other easily-accessible areas.

OTHER

Data Set Comments : An upgrade of the Atherton Herbarium and NROl (main vegetation survey) data are planned. Additional data from the NROl (rainforest survey) may also become available in digital format, shortly. 145

APPENDIX X Flora Data Summaries

Table 3: Acquired data summarised by botanical family.

APPENDIX XI Taxon list of Palms in each Region (5 Group Regionalisation)

----_-_____----____------...... Date Prepared ...... November 24, 1994 Date Prepared ...... November 24,1994 Started ...... l:08 PM Finished ...... l:29 PM Adequacy dataset is at ...... /chimaera/flora/vistr/palm2

------_--LL-----I----I------hd------hd--- _I---C-_--_--_-Id--I_I------d------

Applied-Category ....Palmid Number of Samples...... 1704 Regionalisation ...... / chimaera/region/region/cyp/region3/classn/test/reg5~alb Clipping with ...... /chimaera/arc/study

Region 1 Taxon-ID Taxon Name Number of Samples

18673 Agavaceae cannifolia 18887 Agavaceae Pleomele angustifolia 26903 Arecaceae Archontophoenix alexandrae 46872 Arecaceae Archontophoenix alexandrae 23912 Arecaceae Arenga microcarpa 15635 Arecaceae utan 11554 Arecaceae ramsayi 13640 Arecaceae benthamii 14276 Arecaceae Livistona muelleri 17210 Arecaceae Ptychosperma elegans 16882 Arecaceae Ptychosperma macarthurii 9716 Arecaceae Wodyetia bifurcata 18628 Cyatheaceae Cyathea exilis 2090 Cycadaceae Cycas media 7177 Pandanaceae Pandanus lauterbachii

Region 2 Taxon-ID Taxon Name Number of Samples

18673 Agavaceae Cordyline cannifolia 20831 Agavaceae 18887 Agavaceae Pleomele angustifolia 26903 Arecaceae Archontophoenix alexandrae 46872 Arecaceae Archontophoenix alexandrae 4067 Arecaceae Arenga australasica 16688 Arecaceae Caryota rumphiana 10072 Arecaceae Hydriastele wendlandiana 11554 Arecaceae Licuala ramsayi 19439 Arecaceae Linospadix minor 14276 Arecaceae Livistona muelleri 9120 Arecaceae Normanby a normanbyi 17210 Arecaceae Ptychosperma elegans 16882 Arecaceae Ptychosperma macarthurii 9716 Arecaceae Wodyetia bifurcata 12798 Cyatheaceae Cyathea rebeccae 2090 Cycadaceae Cycas media 2196 Marattiaceae Marattia oreades 31409 Musaceae Musa acuminata 8950 Osmundaceae Todea barbara 16566 Pandanaceae Pandanus conicus 14406 Pandanaceae Pandanus monticola

Region 3 Taxon-ID Taxon Name Number of Samples

18673 Agavaceae Cordyline cannifolia 18887 Agavaceae Pleomele angustifolia 26809 Arecaceae Laccospadix australasica 19439 Arecaceae Linospadix minor 8156 Cyatheaceae Cyathea cooperi 12798 Cyatheaceae Cyathea rebeccae 2090 Cycadaceae Cycas media 2196 Marattiaceae Marattia oreades 8950 Osmundaceae Todea barbara 6528 Pandanaceae Pandanus gemmifer

Region 4 Taxon-ID Taxon Name Number of Samples

18673 Agavaceae Cordyline cannifolia 20831 Agavaceae Cordyline fruticosa 18887 Agavaceae Pleomele angustifolia 46872 Arecaceae Archontophoenix alexandrae 4067 Arecaceae Arenga australasica 10072 Arecaceae Hydriastele wendlandiana 11554 Arecaceae Licuala ramsayi 26943 Arecaceae Linospadix microcarya 19439 Arecaceae Linospadix minor 14276 Arecaceae Livistona muelleri 16882 Arecaceae Ptychosperma macarthurii 8156 Cyatheaceae Cyathea cooperi 12798 Cyatheaceae Cyathea rebeccae 2090 Cycadaceae Cycas media 14506 Musaceae Musa banksii 16566 Pandanaceae Pandanus conicus 6528 Pandanaceae Pandanus gemmifer 6707 Pandanaceae Pandanus oblatus 11908 Pandanaceae Pandanus whitei 7999 Pandanaceae Pandanus zea

Region 5 Taxon-ID Taxon Name Number of Samples

18673 Agavaceae Cordyline cannifolia 20831 Agavaceae Cordyline fruticosa 18887 Agavaceae Pleomele angustifolia 26903 Arecaceae Archontophoenix alexandrae 46872 Arecaceae Archontophoenix alexandrae 4067 Arecaceae Arenga australasica 16688 Arecaceae Caryota rumphiana 3002 Arecaceae Caryota rumphiana 14674 Arecaceae Gulubia costata 10072 Arecaceae Hydriastele wendlandiana 26809 Arecaceae Laccospadix australasica 11554 Arecaceae Licuala ramsayi 19439 Arecaceae Linospadix minor 26945 Arecaceae Linospadix palmeriana 13640 Arecaceae Livistona benthamii 14276 Arecaceae Livistona muelleri 9 120 Arecaceae Normanbya normanbyi 3 126 Arecaceae Nypa fruticans 17210 Arecaceae Ptychosperma elegans 16882 Arecaceae Ptychosperma macarthurii 8 156 Cyatheaceae Cyathea cooperi 18628 Cyatheaceae Cyathea exilis 12798 Cyatheaceae Cyathea rebeccae 2090 Cycadaceae Cycas media 44023 Cycadaceae Cycas rumphii 14506 Musaceae Musa banksii 6262 Pandanaceae Pandanus cochleatus 16566 Pandanaceae Pandanus conicus 7 177 Pandanaceae Pandanus lauterbachii 6707 Pandanaceae Pandanus oblatus 11752 Pandanaceae Pandanus somersetensis 2945 Pandanaceae Pandanus sphaericus 10280 Pandanaceae Pandanus tectorius 15674 Pandanaceae Pandanus tectorius 11908 Pandanaceae Pandanus whitei 4406 Pandanaceae Pandanus yalna 7999 Pandanaceae Pandanus zea