Storage, Retrieval, and Analysis of Compacted Shale Data
Total Page:16
File Type:pdf, Size:1020Kb
14 Storage, Retrieval, and Analysis of Compacted Shale Data Dirk J. A. van Zyl, L.E. Wood, and C. W. Lovell, Department of Civil Engineering, Purdue University, West Lafayette, Indiana W. J. Sisiliano, Indiana State Highway Commission La1ge amounts of soil test data aro beir,g gonoroted by states testing materials fr-om three Joint Highway Research Project (J.HRP) reports for new highway facillties. Of particular Interest to ,omo states Is tho occur completed at Purdue University @, !, ID. A complete rence of shale os a highway motorlal. This paper describes a storage and rotrioval system for compacted lndlnnn shale data (163 setsl. Statistical analyses in summary of all the data available and descriptions of the cluded frequency analysis, bivariate correlation analysis, nnd multiple rogres, geology of Indiana shales and the physiography of Indiana sion analysis. Histograms of the frequency distributions and o summary table are given by van Zyl (§). The table below gives a summary of all the dlfferont statistical descriptors of the parameters are given. Bivariate of the data sets according to geological description. correlation analysis (ono•to-one torrolations) was done for all dota lumped and for data grouped by geologic unit. The multiple regression, concentrated on five-parameter models for predicting Califomla bearing rotio values. It was Geological No. of also found by multlple tllgrenion that the various value, of sloko durabilily in System Geological Stage Data Sets dices can be estimated by using second•ordor equotions. The analyses aro of potontia! value to practicing engineers as a first approximation in design and Pennsylvanian Shelburn formation 3 moy prove useful to engineers rosoarching compacted shale behavior. Dugger formation 2 Petersburg formation 2 Brazil formation 1 Mansfield formation 23 The usefulness of the large amounts of soil test data being 31 generated by testing of materials for new highway facili ties was not realized until recently. A number of states Mississippian Core limestone 2 Palestine sandstone 17 are now in the process of introducing storage and retrieval Watersburg sandstone 5 systems (.!., ~. The type of system to be used depends on Tar Springs formation 15 the amount of data available, but any system should be Glen Dean limestone 3 flexible enough to allow changes to be made later with the Hardinsburg formation 20 least amount of effort. The system should therefore be Haney limestone 3 easily administered, economical, dynamic, and adaptable Big Clifty formation 3 to the changing needs of its users. These qualities can be Elwren formation 1 Sample formation 4 best incorporated in a computerized system. Bethel formation 3 The statistical analysis of stored data can lead to a Borden group 16 number of very useful results such as proving or dis Locust Point formation 4 proving what the typical properties and behavior of certain New Providence shale 4 types of material indicated by experience are. Statistical New Albany shale 5 analyses will also indicate the typical range of values to 105 be expected for different parameters of the different Devonian Antrim shale 1 members in the system as well as the distribution of these New Albany shale 6 parameters. This can play a very important role in 7 assessing the ~uitabili.ty of a ce1-ta.in saw.plc before ~l~bc Urdov1c1an Whitewater forma1ion 2 rate laboratory testing is undertaken. Statistical analyses Dillsboro formation 6 should be updated regularly as more data become avail Kope formation 12 able. Results can be stated with greater confidence as the 20 available data bank grows. An additional benefit is the development of new ideas for more productive research Each data set consists of a number of parameters ob based on empirical relationships and correlation of soil tained during laboratory testing. The parameters and properties@. descriptors included in each data set are given in Table 1. Correlations among different parameters can be Not all parameters are always determined during labora developed by using stored data. Regression models can tory testing, which mealls that not all the data sets are be established to predict the parameters that are difficult complete and Ums that highly reliable conclusions cannot to measure by using a set of simple parameters. These always pe drawn. Much of the laboratory testing is done models should be reevaluated as new data become avail according to AASHTO sl-a:ndard procedures. However, the able. following parameters are determined by tentative test This paper describes a simple storage and retrieval methods of the ISHC: loss on ignition, slake durability system for data available on compacted Indiana shales. index, slaking index, fissility number, and modified Some results of a comprehensive statistical analysis of soundness. These test methods are described by van the data are also presented. Zyl (&). DATA STORAGE AND RETRIEVAL OF SHALE DATA The data for this study are ft'om testing done by the Certain descriptors should be used to identify a shale Indiana State Highway Commission (ISHC) on shales and sample in the storage and retrieval system. The choice 15 is between numerical values and codes or numerical STATISTICAL ANALYSIS values and words. As the system described here has only 163 data sets, it was decided to use the latter. The statistical analysis was done by using the routines From Table 1 it is clear that the descriptors used to available in the statistical package for the social sciences locate a sample are laboratory number, geographic loca (SPSS) (1). tion (counties), geologic unit (system, series, stage), and physiographic unit (bedrock). The numerical values are Frequency Analysis the laboratory test results. The data sets were coded on the four-card series The first step was to do a complete frequency analysis shown in Table 1. The three basic reasons for retrieving of all the data available. For this the CODEBOOK data are to get printouts of all the data available, to get routine of SPSS was used. The data can be analyzed as printouts based on some descriptor in some order, e.g., single points or as grouped data and were first analyzed geological stage in alphabetical order so that the printed as single points in order to obtain minima and maxima data sets belonging to a certain geological stage can be of each parameter. The values of each parameter were separated for study, and to use certain sets of data for then grouped into 10 intervals in order to obtain histograms. statistical manipulation. The results of the statistics of the different parameters A program was written for the CDC 6500 system of the are given in Table 2, where the first line of results for Purdue University Computer Center to achieve the first every parameter gives the values obtained by using single two. The sorting of the data, accomplished by using the points and the second line gives the results for grouped sort/1nerge package on the CDC system, can be done in an data. The definitions of standard error, kurtosis, skew ascending or descending order based on any parameter. ness, and so on are giveu in Nie and others (1). The histograms are glven in Figure 1. (Note that all dl:!nsity calculatlo)ls were done in the U.S. customary units of poUDds per cubic foot.) Table 1. Coding of four-card series. The last two columns of Table 2 give the number of valid values and also the missing values; e.g., for SDll, Symbols (for 131 data sets had values reported for this parameter but program and 32 data sets did not. The number of valid values plays a tables and Parameter figures) Format Columns very important role in the regression analysis. The low values obtained for the skewness for liquid Card 1 Laboratory number NO no 1-10 limit (QL) and percent silt (PSI) indicate that the distribu County co AlO 11-20 tions of these two parameters are almost symmetrically Geological system GSYl, GSY2 A10A5 21-35 bell shaped (see Figure 1). Low values of coefficient Geological series GSEl, GSE2 A10A5 36-50 Geological stage GSTl , GST2 A10A5 51-65 of variance were also obtained for these two variables; this Physiographio unit PHUl, PHU2 A10A5 66-80 confirms the small relative spread of values. Low values Card 2 Laboratory number NO no 1-10 of the coefficient of variation are to be e:xpected for specific Slaking index cycle 1 SDn F5.l 11-15 gravity (SG) and pH, while the low values for the coefficient Slaking index cycle 5 SDI2 F5.l 16-20 Slake durability index, SD2D F5.l 21-25 of variation obtained for the different densities reflect the 200 revolutions dry high average values and small spread of these parameters. Slake durability index, SD5D F5.l 26-30 500 revolutions dry It is interesting to note the relatively small values of Slake durability index, SD2S F5. l 31-35 coefficient of variation for the Atterberg limits, which can 200 revolutions soaked Slake durability index, SD5S F5.1 36-40 therefore be estimated with reasonable confidence if a few 500 revolutions soaked test results are available. Assuming that the plastic limit Fisslllty number FN F5. l 41-45 (PL) and QL are normally distributed (which is reaso.nable, Modified soundness MS F5.1 46-50 Card 3 as shown in Figure 1), it can be said that 68 percent of all Laboratory number NO no 1-10 values will be within one standard deviation of the mean, Textural classification TEXl, TEX2 A10A5 11-25 AASHTO classification ASH AlO 26-35 i.e., the ranges of values are 19. 9-24.1 for the PL and Plastic limit PL F5. 1 36-40 25. 4-41. 4 for the QL.