Package 'foreign'

Package ‘foreign’

March 19, 2013

Priority recommended
Version 0.8-53
Date 2013-03-20
Title Read Data Stored by Minitab, S, SAS, SPSS, Stata, Systat, dBase, ...
Depends R (>= 2.14.0), stats
Imports methods, utils
Copyright see file COPYRIGHTS
Description Functions for reading and writing data stored by statistical packages such as Minitab, S, SAS, SPSS, Stata, Systat, ..., and for reading and writing dBase files.
ByteCompile yes
Biarch yes
License GPL (>= 2)
BugReports http://bugs.r-project.org
Author R Core Team [aut, cph, cre], Roger Bivand [ctb, cph], Vincent J. Carey [ctb, cph], Saikat DebRoy [ctb, cph], Stephen Eglen [ctb, cph], Rajarshi Guha [ctb, cph], Nicholas Lewin-Koh [ctb, cph], Mark Myatt [ctb, cph], Ben Pfaff [ctb], Frank Warmerdam [ctb, cph], Stephen Weigand [ctb, cph], Free Software Foundation, Inc. [cph]
Maintainer R Core Team <[email protected]>
NeedsCompilation yes
Repository CRAN
Date/Publication 2013-03-19 12:52:56


R topics documented:

    lookup.xport
    read.arff
    read.dbf
    read.dta
    read.epiinfo
    read.mtp
    read.octave
    read.spss
    read.ssd
    read.systat
    read.xport
    S3 read functions
    write.arff
    write.dbf
    write.dta
    write.foreign
    Index


lookup.xport    Lookup Information on a SAS XPORT Format Library

Description

    Scans a file as a SAS XPORT format library and returns a list containing information about the SAS library.

Usage

    lookup.xport(file)

Arguments

    file    character variable with the name of the file to read. The file must be in SAS XPORT format.

Value

    A list with one component for each dataset in the XPORT format library.

Author(s)

    Saikat DebRoy

References

    SAS Technical Support document TS-140: “The Record Layout of a Data Set in SAS Transport (XPORT) Format” available as http://ftp.sas.com/techsup/download/technote/ts140.html.

See Also

    read.xport

Examples

    ## Not run:
    lookup.xport("transport")
    ## End(Not run)


read.arff    Read Data from ARFF Files

Description

    Reads data from Weka Attribute-Relation File Format (ARFF) files.

Usage

    read.arff(file)

Arguments

    file    a character string with the name of the ARFF file to read from, or a connection which will be opened if necessary, and if so closed at the end of the function call.

Value

    A data frame containing the data from the ARFF file.

References

    Attribute-Relation File Format
    http://www.cs.waikato.ac.nz/~ml/weka/arff.html
    http://weka.sourceforge.net/wekadoc/index.php/en:ARFF_(3.5.1)

See Also

    write.arff


read.dbf    Read a DBF File

Description

    The function reads a DBF file into a data frame, converting character fields to factors, and trying to respect NULL fields.

Usage

    read.dbf(file, as.is = FALSE)

Arguments

    file     name of input file
    as.is    should character vectors not be converted to factors?

Details

    DBF is the extension used for files written for the ‘XBASE’ family of database languages, ‘covering the dBase, Clipper, FoxPro, and their Windows equivalents Visual dBase, Visual Objects, and Visual FoxPro, plus some older products’ (http://www.clicketyclick.dk/databases/xbase/format/). Most of these follow the file structure used by Ashton-Tate’s dBase II, III or 4 (later owned by Borland).

    read.dbf is based on C code from http://shapelib.maptools.org/ which implements the ‘XBASE’ specification. It can convert fields of type "L" (logical), "N" and "F" (numeric and float) and "D" (dates): all other field types are read as-is as character vectors. A numeric field is read as an R integer vector if it is encoded to have no decimals, otherwise as a numeric vector. However, if the numbers are too large to fit into an integer vector, the field is changed to numeric. Note that it is possible to read integers that cannot be represented exactly even as doubles: this sometimes occurs if IDs are incorrectly coded as numeric.
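
The field-type rules for read.dbf described above can be checked with a small round trip. This is only a minimal sketch, assuming the foreign package is installed; the data frame, its column names and the temporary file are made up for illustration and are not part of the package.

```r
library(foreign)

# Hypothetical data: an integer column, a column with decimals,
# a character column and a logical column.
df <- data.frame(id    = 1:3,
                 score = c(1.5, 2.25, 3),
                 name  = c("a", "b", "c"),
                 flag  = c(TRUE, FALSE, TRUE),
                 stringsAsFactors = FALSE)

# Write the data frame to a temporary DBF file.
dbf_file <- tempfile(fileext = ".dbf")
write.dbf(df, dbf_file)

# as.is = TRUE keeps character fields as character vectors
# instead of converting them to factors.
back <- read.dbf(dbf_file, as.is = TRUE)

# Collect the class of each column that came back.
col_classes <- unname(sapply(back, class))
print(col_classes)
```

Under the rules above, the column written without decimals should come back as an integer vector and the one with decimals as a numeric vector. Dropping as.is = TRUE would return the character field as a factor.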

Value

    A data frame of data from the DBF file; note that the field names are adjusted for use in R using make.names(unique=TRUE).

    There is an attribute "data_type" giving the single-character dBase types for each field.

Author(s)

    Nicholas Lewin-Koh and Roger Bivand; shapelib by Frank Warmerdam

References

    http://shapelib.maptools.org/.
    The Borland file specification via http://www.wotsit.org, currently at http://www.wotsit.org/list.asp?fc=6.

See Also

    write.dbf

Examples

    x <- read.dbf(system.file("files/sids.dbf", package="foreign")[1])
    str(x)
    summary(x)


read.dta    Read Stata Binary Files

Description

    Reads a file in Stata version 5–11 binary format into a data frame.

Usage

    read.dta(file, convert.dates = TRUE, convert.factors = TRUE,
             missing.type = FALSE, convert.underscore = FALSE,
             warn.missing.labels = TRUE)

Arguments

    file                   a filename or URL as a character string.
    convert.dates          Convert Stata dates to Date class?
    convert.factors        Use Stata value labels to create factors? (version 6.0 or later).
    missing.type           For version 8 or later, store information about different types of missing data?
    convert.underscore     Convert "_" in Stata variable names to "." in R names?
    warn.missing.labels    Warn if a variable is specified with value labels and those value labels are not present in the file.

Details

    If the filename appears to be a URL (of schemes ‘http:’, ‘ftp:’ or ‘https:’) the URL is first downloaded to a temporary file and then read. (‘https:’ is only supported on some platforms.)

    The variables in the Stata data set become the columns of the data frame. Missing values are correctly handled. The data label, variable labels, and timestamp are stored as attributes of the data frame. Nothing is done with variable characteristics.

    By default Stata dates (%d and %td formats) are converted to R’s Date class and variables with Stata value labels are converted to factors. Ordinarily, read.dta will not convert a variable to a factor unless a label is present for every level.
    Use convert.factors = NA to override this. In any case the value label and format information is stored as attributes on the returned data frame.

    Stata 8.0 introduced a system of 27 different missing data values. If missing.type is TRUE a separate list is created with the same variable names as the loaded data. For string variables the list value is NULL. For other variables the value is NA where the observation is not missing and 0–26 when the observation is missing. This is attached as the "missing" attribute of the returned value.

Value

    A data frame with attributes. These will include "datalabel", "time.stamp", "formats", "types", "val.labels", "var.labels" and "version" and may include "label.table". Possible versions are 5, 6, 7, -7 (Stata 7SE, ‘format-111’), 8 (Stata 8 and 9, ‘format-113’) and 10 (Stata 10 and 11, ‘format-114’). Stata 12 by default uses ‘format-115’, which is read as ‘format-114’ (the Stata documentation says its structure is identical).

    The value labels in attribute "val.labels" name a table for each variable, or are an empty string. The tables are elements of the named list attribute "label.table": each is an integer vector with names.

Author(s)

    Thomas Lumley and R-core members

References

    The Stata Users Manual (versions 5 & 6), Programming manual (version 7), or online help (version 8 and later) describe the format of the files. The format is also documented at http://www.stata.com/help.cgi?dta and http://www.stata.com/help.cgi?dta_113.

See Also

    A different approach is available in package memisc: see its help for Stata.file.
    write.dta, attributes, Date, factor

Examples

    data(swiss)
    write.dta(swiss, swissfile <- tempfile())
    read.dta(swissfile)


read.epiinfo    Read Epi Info Data Files

Description

    Reads data files in the .REC format used by Epi Info versions 6 and earlier and by EpiData. Epi Info is a public domain database and statistics package produced by the US Centers for Disease Control and EpiData is a freely available data entry and validation system.

Usage

    read.epiinfo(file, read.deleted = FALSE, guess.broken.dates = FALSE,
                 thisyear = NULL, lower.case.names = FALSE)

Arguments

    file                  A filename, URL, or connection.
    read.deleted          Deleted records are read if TRUE, omitted if FALSE or replaced with NA if NA.
    guess.broken.dates    Attempt to convert dates with 0 or 2 digit year information (see ‘Details’).
    thisyear              A 4-digit year to use for dates with no year. Defaults to the current year.
    lower.case.names      Convert variable names to lowercase?

Details

    Epi Info allows dates to be specified with no year or with a 2- or 4-digit year. Dates with four-digit years are always converted to Date class. With the guess.broken.dates option the function will attempt to convert two-digit years using the operating system’s default method (see Date) and will use the current year or the thisyear argument for dates with no year information.

    If read.deleted is TRUE the "deleted" attribute of the data frame indicates the deleted records.

Value

    A data frame.

Note

    Epi Info 2000, the current version, uses the Microsoft Access file format to store data. This may be readable with the RODBC or RDCOM packages.

References

    http://www.cdc.gov/epiinfo/, http://www.epidata.dk

See Also

    DateTimeClasses

Examples

    ## Not run:
    data <- read.epiinfo("oswego.rec", guess.broken.dates=TRUE, thisyear="1972")
    ## End(Not run)


read.mtp    Read a Minitab Portable Worksheet

Description

    Return a list with the data stored in a file as a Minitab Portable Worksheet.

Usage

    read.mtp(file)

Arguments

    file    character variable with the name of the file to read.
