<<

360 Layers of Perception – CAA 2007

Hans-Joachim Mucha – Hans-Georg Bartel – Jens Dolata

Finding Roman Brickyards in Superior by Model-Based Cluster Analysis of Archaeometric Data

Abstract: Chemical analysis of ancient ceramics and of other archaeologically important materials has been used frequently to support archaeological research. Often the dimensionality of the measurements has been high. Therefore, multivariate statistical techniques such as cluster analysis have to be applied. The aim of the present paper is to give a review of the research on bricks and tiles from Roman military brick- yards in and to present the main results obtained by multivariate statistical analysis. In particular, new adaptive cluster analysis methods and modified model-based clustering are applied on archaeometric data (Mu c h a / Ba r t e l / Do l a t a 2002; 2003a; 2005b; in press; Ba r t e l / Do l a t a / Mu c h a 2000; 2003). The main result was the discovery of military brickyards that were not known when the project began about ten years ago. Recently, they have been discovered by the application of these multivariate sta- tistical analysis models. Newly developed visualization methods support and facilitate the interpretation of both the data set and the results of grouping. This means archaeologists can easily identify a new finding of a Roman brick or tile by comparing its chemical fingerprint with those from the detected provenances.

Introduction period of operation was from the middle of the first century AD until the end of the fourth century. About 1000 Roman stamped bricks and tiles from Archaeologists are most interested in the location the Upper area were the objects under of brickyards and in the chronology of the pro- investigation by methods of chemical and statisti- duction-marks, which are found on the building- cal analysis. Using both archaeological information material. and the results of mineralogy and chemistry allowed archaeologists to develop a complex model of brick- At present we have a long history of chemical and tile-making in the Roman period. In south-west and statistical analysis of Roman brick and tile a few large brickyards existed, of which making in Germania Superior (Do l a t a 1996; 1998; the operating authority was the Roman army. The 1999; 2000; 2001; Do l a t a / We rr 1999; We rr 1998;

Fig. 1. (left) Staatliche Museen Berlin Preußischer Kulturbesitz (Antikensammlung) TC 5778a: later (brick) with stamp of legio XXII, findspot: Neuwied-Niederbieber, provenance: ‘not yet known 1’. (right) Landesmuseum ZS 1581: tegula (tile) with stamp of legio XXII Primigenia Pia Fidelis, findspot: Mainz, provenance: -Nied. Analysing Ancient Economies and Social Relations 361

Ba r t e l / D o l a t a / M u c h a 2000; 2000 / 2001; 2001; Model-based Cluster Analysis in a Nutshell 2002; 2003; Do l a t a / M u c h a / Ba r t e l 2001; 2003a; 2003b; 2004; 2006; 2007; Mu c h a / Ba r t e l / D o l a t a The aim of clustering (grouping, unsupervised 2002; 2003a; 2003b; 2005a; 2005b; in press; Mu c h a classification) based on chemical components was e t a l . 2006; Sw a r t e t a l . 2004). both to confirm supposed sites of brickyards and to find places of those ones that are not yet identi- The data of ancient coarse ceramics from Ger- fied. Clustering was accompanied by multivariate mania Superior used here is described by 19 vari- graphical presentations of the objects and the clus- ables: nine oxides (measured in mass-%: SiO2, TiO2, ters found (see the references above and especially

Al2O3, Fe2O3, MnO, MgO, CaO, Na2O, K2O) and ten (Ba r t e l / Do l a t a / Mu c h a 2002) and (Do l a t a / Mu- trace elements (measured in ppm: V, Cr, Ni, Zn, Rb, c h a / Ba r t e l 2003b). Moreover, clustering was fol- Sr, Y, Zr, Nb, Ba). Besides the different scales of the lowed by validation of (a) the number of clusters, (b) variables, often problems with outliers and with the stability (reproducibility) of each cluster, and (c) long-tailed (skew) distributions of the variables in the reliability of the class membership of each object the archaeometric data were addressed, see recent- to its cluster (see for example Mu c h a / Ba r t e l / Do l a - ly (Ba x t e r 2006). t a in press; Do l a t a / Mu c h a / Ba r t e l 2007). Fig. 1 shows two objects from our data base that Generally, the final goal of cluster analysis is to were provided with a stamp. Consequently they find meaningful and stable clusters that can be re- appear to be very valuable documents. produced to a high degree. When done well, clus- tering techniques can support scientists of various The chemical components of the bricks and tiles research areas in their search for hypothesis. Details were measured by Gerwulf Schneider at the Free on cluster analysis can be found in numerous papers University Berlin using the X-ray fluorescence (Ev e r i t t 1980; Sp ä t h 1985; Mu c h a 1992; Ba r t e l 1996; analysis (XRF, concerning this method see for in- Pa p a g e o r g i o u e t a l . 2001). In this instance we present stance Le u t e 1987). At the start a total of 613 Roman some basic formulae based on distance measures bricks and tiles were analyzed. However, there is that were also used in the following practical prob- an ongoing research process. Day-to-day new find- lem of the identification of new findings from the ings can be reported. For example, quite substan- clusters found. tial new findings come from Boppard and Mainz. Let a sample of I independent observations Current details on both the history and the ongoing (objects) be given in the space RJ and denote by X research can be obtained from the web site http:// = (xij) the corresponding data matrix consisting of I www.ziegelforschung.de. rows and J columns (variables), where the element

xij provides a value for the jth variable describing

Fig. 2. (left) Bivariate density based on the first two principal components. (right) Several cuts of the density and the underlying objects projected as points bringing together the PCA-plot of points and the continuous density plot. 362 Layers of Perception – CAA 2007

Fig. 5. Graphical presentation of the identification of brick H239 to the provenance Straßburg-Königshofen using chemical profiles. Fig. 3. Localization of the Roman theater in Mainz. In the case of adaptive cluster analysis used here the ith object. Here the objects are archaeologi- the criterion above is modified by considering adap- cal findings. Further more, let C = {x1, …, xi, …, xI} tive weights of variables in the definition of the dis- denote the finite set of the I objects. Alternatively tance measure (Mu c h a /Ba r t e l /Do l a t a 2005a). The written shortly as C = {1, …, i, …, I}. squared weighted Euclidean distance Following the definition of the starting point of cluster analysis, then let us formalize the simplest

(elementary) solution to the clustering problem is such an adaptive distance, where sj is the pooled with a fixed number of clustersK : a partition {C1, …, standard deviation of the variable j (Mu c h a /Ba r t e l /

CK} of C, where every pair of two subsets (clusters) Do l a t a 2002).

Ck and Cl has an empty intersection, and where the union of all K clusters give the total set C. In the simplest kind of model-based Gaussian Roman Bricks and Tiles Classified cluster analysis the sum of within-clusters sum of squares criterion has to be minimized. This criterion Successful applications of simple model-based can be formulated as Gaussian clustering of Roman bricks and tiles have already been reported (Mu c h a / Ba r t e l / Do l a t a 2002). In this case new adaptive distances were ap- plied for finding provenances of production of mil- by using the squared Euclidean distance itary brickyards. As a result the following locations of military brickyards are identified: Frankfurt- Nied, Groß-Krotzenburg, Rheinzabern, Straßburg- between two objects i and l. Here nk is the number of Königshofen, Worms (initially ‘not yet known 2’) objects in cluster Ck. and two with respect to their provenience not yet

Fig. 4. (left) A brick from the Lothary brickyard from Mainz (19th century). (right) A tile with a stamp of the legio XII that was recently found in Boppard (Rhine). Analysing Ancient Economies and Social Relations 363 known ones. In Fig. 2 the ‘mountains’ (clusters) of Provenance Identification of New Findings the bivariate density estimation and several cuts of this density are shown. This estimation is based on As demonstrated above, the assignment (identifica- the first two components of the principal compo- tion) of provenance to new findings is an important nent analysis (PCA) of 613 objects. The PCA is a task in archaeometry. Beside identification of large projection technique that, hopefully, makes visible sets of objects, often individual objects have to be as- the essentials of the data in two dimensions (for signed to known brickyards or in general to identi- details see Gr e e n a c r e 1984; Ja c k s o n 1991; Mu c h a fied locations of production of any kind of artifacts. 1992). The quality of projection into two principal An identification of such objects can be based on the components is high (80% of variance). The bivari- same distance measures that are used in cluster anal- ate density looks like a main data body with two ysis. We recommend the K-nearest-neighbor tech- arms. Some of the clusters are compact ones; others nique (Ba r t e l / Do l a t a / Mu c h a 2004). are not clearly isolated from one another. The right Fig. 5 gives an additional graphical insight when arm consists of three compact clusters. In opposi- comparing chemical profiles (fingerprints). For sim- tion, the ridge at the left hand side shows neither plicity reason here the comparison is restricted to clear peaks nor clear separation into clusters. three profiles only: the pooled profile of members Usually the locations of findings are geographi- of the clusters Straßburg-Königshofen and Groß- cally different from the locations of military brick- Krotzenburg, and the chemical fingerprint of the yards. They all have in common that they are locat- finding H239 with unknown location of produc- ed nearby rivers. The transport was done by cargo tion. The numerical distance values are: d (H239, ships. Straßburg-Königshofen) = 0.37 << d (H239, Groß- Krotzenburg) = 4.13. Therefore, H239 is assigned to Straßburg-Königshofen by the nearest-neighbor Substantial New Findings in Mainz and rule. By visual inspection of Fig. 5 the profile of H239 Boppard looks much more similar to Straßburg-Königshofen.

During the excavation of a Roman theater in Mainz (Fig. 3) many bricks and tiles were found. The chem- Summary ical compositions of 70 objects were measured. The question arose: Was there a Roman military brick- During the past dozen years, a complex model of yard in Mainz? In order to answer this question a history and relations of Roman brick and tile pro- comparison with bricks from Worms, Rheinzabern, duction in south-west Germany has been devel- Frankfurt-Nied, and from the modern brickyard of oped by archaeologists. Clustering techniques are Christian Lothary was done. The latter was located able to support the archaeologists in their search in Mainz. Thirty bricks from this brickyard (Fig. 4) for hypothesis. The cluster analysis referred to were analyzed. here comes with validated results and outstanding The multivariate statistical comparison based on multivariate graphics of objects and clusters. The the chemical composition delivered the mathemati- graphics are visual methods for obtaining a better cally confirmed result that without any doubt the understanding of the statistical results obtained by objects from the theater were produced in Worms the archaeologists. (Do l a t a / Mu c h a / Ba r t e l 2006; Mu c h a e t a l . 2006). During an important excavation in Boppard, 43 additional bricks and tiles were discovered References (Fig. 4). It could be shown that these objects can be assigned to Worms with high probability (Mu c h a / Ba r t e l 1996 Ba r t e l / D o l a t a 2005a). Additionally the archaeo- H. Ba r t e l , Mathematische Methoden in der Chemie logical knowledge about the military brickyard in (Heidelberg 1996). Worms and its importance in the Rhine area could Ba r t e l / D o l a t a / Mu c h a 2000 be improved. H. Ba r t e l / J. D o l a t a / H. M u c h a , Klassifikation gestempelter römischer Ziegel aus Obergermanien. Archäometrie und Denkmalpflege, 2000, 86–88. 364 Layers of Perception – CAA 2007

Ba r t e l / D o l a t a / Mu c h a 2000/2001 Do l a t a 2001 H. Ba r t e l / J. Do l a t a / H. Mu c h a , Klassifikation von J. Do l a t a , Inventarisation und Forschung zu archäolo- 613 Proben als Referenzen für die Herstellungsprove- gischem. Römische Ziegelstempel aus Mainz – Er- nienzen römischer Baukeramik im nördlichen Ober- reichtes und Perspektiven. Denkmalpflege in Rhein- germanien. Mainzer Archäometrische Zeitschrift 7/8, land-Pfalz 52–56, 2001, 488–494. 2000/2001, 275–300. Do l a t a / Mu c h a / Ba r t e l 2001 Ba r t e l / D o l a t a / Mu c h a 2001 J. Do l a t a / H. Mu c h a / H. Ba r t e l , Untersuchungsper- H. Ba r t e l / J. Do l a t a / H. Mu c h a , Parametrische und spektiven zum kombinierten archäologischen und nichtparametrische Identifikationsmethoden, darge- materialanalytischen Nachweis römischer Ziegelher- stellt am Beispiel römischer Baukeramik aus Oberger- stellung in Mainz und Worms. Archäometrie und Denk- manien. Archäometrie und Denkmalpflege, 2001, malpflege, 2001, 107–109. 104–106. Do l a t a / Mu c h a / Ba r t e l 2003a Ba r t e l / D o l a t a / Mu c h a 2002 J. Do l a t a / H. Mu c h a / H. Ba r t e l , Statistische Untersu- H. Ba r t e l / J. Do l a t a / H. Mu c h a , Automatische Klassi- chung zur Aufklärung der Binnenstruktur römischer fikation in der Archäometrie. Berliner und Mainzer Ar- Ziegelproduktion von Frankfurt-Nied. Archäometrie beiten zu oberrheinischen Ziegeleien in römischer Zeit. und Denkmalpflege, 2003, 40–42. Berliner Beiträge zur Archäometrie 19, 2002, 31–62. Do l a t a / Mu c h a / Ba r t e l 2003b Ba r t e l / D o l a t a / Mu c h a 2003 J. Do l a t a / H. Mu c h a / H. Ba r t e l , Archäologische und H. Ba r t e l / J. Do l a t a / H. Mu c h a , Über eine Modi- mathematisch-statistische Neuordnung der Orte rö- fikation eines graphentheoretisch basierten partitio- mischer Baukeramikherstellung im nördlichen Oberger- nierenden Verfahrens der Clusteranalyse. Match 48, manien. Xantener Berichte 13, 2003, 381–409. 2003, 209–223. Do l a t a / Mu c h a / Ba r t e l 2004 Ba r t e l / D o l a t a / Mu c h a i n p r e s s J. Do l a t a / H. Mu c h a / H. Ba r t e l , Eine Anwendung der H. Bartel / J. Dolata / H. Mucha, Über Identifikations- hierarchischen Clusteranalyse. ‘Unbekannt 1’ als Pro- methoden, dargestellt am Beispiel römischer Bau- venienz einer bekannten obergermanischen Heeresziege- keramik aus Obergermanien. Berliner Beiträge zur lei? Archäometrie und Denkmalpflege, 2004, 28–30. Archäometrie 21, in press. Do l a t a / Mu c h a / Ba r t e l 2006 Ba x t e r 2006 J. Do l a t a / H. Mu c h a / H. Ba r t e l , Provenienz von M. Ba x t e r , A Review of Supervised and Unsupervised Ziegeln aus dem römischen Theater in Mainz. Archäo- Pattern Recognition in Archaeometry. Archaeometry logische Bewertung von clusteranalytischen Resultaten. 48, 2006, 671–694. Archäometrie und Denkmalpflege, 2006, 150–152. Do l a t a 1996 Do l a t a / Mu c h a / Ba r t e l 2007 J. Do l a t a , Hin zu einer archäologischen Nutzanwen- J. Do l a t a / H. Mu c h a / H. Ba r t e l , Uncovering the In- dung geochemischer Analytik römischer Baukeramik. ternal Structure of the Roman Brick and Tile Making Mainzer Archäometrische Zeitschrift 3, 1996, 105–125. in Frankfurt-Nied by Cluster Validation. In: R. De c k - Do l a t a 1998 e r / H.-J. Le n z (e d s .), Advances in Data Analysis (Berlin J. Do l a t a , Archäologische und archäometrische Unter- 2007) 663–670. suchung an römischer Baukeramik und Ziegelstem- Do l a t a / We rr 1999 peln. Archäometrie und Denkmalpflege, 1998, 93–95. J. Do l a t a / U. We rr , Wie gleich ist derselbe? – Homo- Do l a t a 1999 genität eines römischen Ziegels und Aussagegrenzen J. Do l a t a , Ingenieurtechnische Untersuchung an anti- geochemischer Analytik aufgrund von Meßtechnik und ken Ziegelsteinen aus Mainz: Interdisziplinäre Erforsch- Materialvarianz. Mainzer Archäometrische Zeitschrift ung römischer Baukeramik und Ziegelstempel. Ziegel 5/6, 1999, 129–147. Zeitschrift 6, 1999, 421–423. Ev e r i t t 1980 Do l a t a 2000 B. Ev e r i t t , Cluster Analysis (New York 1980). J. Do l a t a , Römische Ziegelstempel aus Mainz und dem Gr e e n a c r e 1984 nördlichen Obergermanien. PhD. Thesis (Frankfurt am M. Gr e e n a c r e , Theory and Applications of Correspon- Main 2000). dence Analysis (New York 1984). Ha r t i g a n 1975 J. Ha r t i g a n , Clustering Algorithms (New York 1975). Analysing Ancient Economies and Social Relations 365

Ja c k s o n 1991 Pa p a g e o r g i o u / Ba x t e r / Ca u 2001 J. Ja c k s o n , A User’s Guide to Principal Components I. Pa p a g e o r g i o u / M. Ba x t e r / M. Ca u , Model-based (New York 1991). Cluster Analysis of Artefact Compositional Data. Le u t e 1987 Archaeometry 43, 2001, 571–588. U. Le u t e , Archaeometry – An Introduction to Physical Sp ä t h 1985 Methods in Archaeology and the History of Art (Wein- H. Sp ä t h , Cluster Dissection and Analysis (Chichester heim 1987). 1985). Mu c h a 1992 Sw a r t e t a l . 2004 H. Mu c h a , Clusteranalyse mit Mikrocomputern (Berlin C. Sw a r t / B. Pa z / J. Do l a t a / G. Sc h n e i d e r / J. Si m o n / H. 1992). Ba r t e l / H. Mu c h a , Analyse römischer Ziegel mit ICP- Mu c h a 2007 MS/-OES. Methodenvergleich zwischen RFA und ICP. H. Mu c h a , On Validation of Hierarchical Clustering. In: Archäometrie und Denkmalpflege, 2004, 25–27. R. De c k e r / H.-J. Le n z (e d s .), Advances in Data Analysis We rr 1998 (Berlin 2007) 115–122. U. We rr , Grenzen der Aussagekraft chemischer Analy- Mu c h a / Ba r t e l / Do l a t a 2002 tik für römische Baukeramik. Archäometrie und Denk- H. Mu c h a / H. Ba r t e l / J. Do l a t a , Exploring Roman malpflege, 1998, 96–98. Brick and Tile by Cluster Analysis with Validation of Results. In: W. Ga u l / G. Ri t t e r (e d s .), Classification, Au- tomation, and New Media (Berlin 2002) 471–478. Hans-Joachim Mucha Mu c h a / Ba r t e l / Do l a t a 2003a H. Mucha / H. Bartel / J. Dolata, Core-Based Cluster- Weierstraß Institute for Applied Analysis and ing Techniques. Methods, Software, and Applications. Stochastics (WIAS) In: M. Sc h a d e r / W. Ga u l / M. Vi c h i (e d s .), Between Data Mohrenstr. 39 Science and Applied Data Analysis (Berlin 2003) 74–82. 10117 Berlin, Germany Mu c h a / Ba r t e l / Do l a t a 2003b [email protected] H. Mu c h a / H. Ba r t e l / J. Do l a t a , Modellbasierte Clus- teranalyse römischer Ziegel aus Worms und Rheinza- . Archäologische Informationen 26/2, 2003, Hans-Georg Bartel 471–480. Mu c h a / Ba r t e l / Do l a t a 2005a Institute for Chemistry H. Mu c h a / H. Ba r t e l / J. Do l a t a , Model-Based Cluster Humboldt University Berlin Analysis of Roman Bricks and Tiles from Worms and Unter den Linden 6 Rheinzabern. In. C. We i h s / W. Ga u l (e d s .), Classification 10099 Berlin, Germany – The Ubiquitous Challenge (Berlin 2005) 317–324. [email protected] Mu c h a / Ba r t e l / Do l a t a 2005b H. Mu c h a / H. Ba r t e l / J. Do l a t a , Techniques of Rear- rangements in Binary Trees (Dendrograms) and Appli- Jens Dolata cations. Match 54, 2005, 561–582. Mu c h a e t a l . 2006 Head Office for Cultural Heritage Rhineland- H. Mu c h a / H. Ba r t e l / J. Do l a t a / C. Sw a r t / K. Gr a u b - (GDKE) n e r , Zur Ausweisung einer Ziegelreferenz »Mainz« Große Langgasse 29 durch Bewertung der Stabilität von Klassen- und Ob- 55116 Mainz, Germany jektzuweisungen. Archäometrie und Denkmalpflege, [email protected] 2006, 153–155. Mu c h a / Ba r t e l / Do l a t a i n p r e s s H. Mu c h a / H. Ba r t e l / J. Do l a t a , Effects of Data Trans- formation on Cluster Analysis of Archaeological Data. In: C. Pr e i s a c h / H. Bu rk h a r d t / L. Sc h m i d t -Th i e m e / R. De c k e r (e d s .): Data Analysis, Machine Learning, and Applications. Proceedings of the 31st Annual Confer- ence of the Gesellschaft für Klassifikation, Berlin, 2007.