Abstract

Ching-Shui Cheng was one of the pioneers Using graphs to find optimal block designs of using graph theory to prove results about optimal incomplete-block designs. There are actually two graphs associated with an incomplete-, and R. A. Bailey either can be used. University of St Andrews / QMUL (emerita) A block design is D-optimal if it maximizes the number of spanning trees; it is A-optimal if it minimizes the total of the Special session for Ching-Shui Cheng, pairwise resistances when the graph is thought of as an 20 June 2013 electrical network. I shall report on some surprising results about optimal designs when replication is very low.

1/44 2/44

Conference on Design of Experiments, Tianjin, June 2006 Problem 1: Factorial design

Problem There are several treatment factors, with various numbers of levels. There may not be room to include all combinations of these levels. There are several inherent factors (also called block factors) on the experimental units. The inherent factors may pose some constraints on how treatment factors can be applied. So how do we set about designing the experiment? Solution Use Desmond Patterson’s design key.

3/44 4/44

Problem 2: Optimal incomplete-block design Two papers about the second problem

Problem I Ch’ing Shui Cheng:ˆ The design must be in blocks, but each block is too small to Maximizing the total number of spanning trees in a graph: contain every treatment. two related problems in graph theory and optimum design We want variance to be small, as measured by the D-criterion theory. (minimizing the volume of the ellpsoid of confidence). Journal of Combinatorial Theory, Series B 31 (1981), 240–248. How do we set about finding a D-optimal block design? I Ching-Shui Cheng: Solution Graph and optimum design theories—some connections Represent the incomplete-block design as a graph and examples. (with vertices and edges), Bulletin of the International Statistical Institute 49 (1981), and maximize the number of spanning trees in the graph. part 1, 580–590.

5/44 6/44 Some early papers Two memorial conferences for R. C. Bose, India, 1988

Year RAB CSC 1977 factorial (design key) 1978 factorial (design key) optimal IBDs Calcutta meeting on : 1979 optimal IBDs Combinatorial Mathematics and Applications CSC spoke on “On the optimality of (M.S)-optimal designs in 1980 optimal IBDs; factorial large systems”. 1981 optimal IBDs and graphs 1982 factorial Delhi meeting on Probability, Statistics and Design of Experiments: 1984 IBDs RAB spoke on “Cyclic designs and factorial designs”. 1985 factorial; IBDs IBDs and graphs; factorial with trend 1986 IBDs IBDs 1988 factorial with trend 1989 factorial

7/44 8/44

Two joint papers Some later papers and books

Year RAB CSC 1993 factorial; optimal IBDs 1995 optimal IBDs 1998 factorial; BIBDs I C.-S. Cheng and R. A. Bailey: 1999 factorial Optimality of some two-associate-class partially balanced 2000 factorial incomplete-block designs. 2001 factorial Annals of Statistics 19 (1991), 1667–1671. 2002 factorial I R. A. Bailey, Ching-Shui Cheng and Patricia Kipnis: 2003 factorial Construction of trend-resistant factorial designs. 2004 IBDs factorial Statistica Sinica 2 (1992), 393–411. 2005 IBDs factorial 2006 factorial 2007 optimal IBDs and graphs 2009 optimal IBDs and graphs factorial 2011 factorial factorial 2013 optimal IBDs and graphs factorial (design key)

9/44 10/44

With Jo Kunert, Dortmund, April 2009 What makes a block design good for experiments?

I have v treatments that I want to compare. I have b blocks. Each block has space for k treatments (not necessarily distinct).

How should I choose a block design?

11/44 12/44 Two designs with v = 5, b = 7, k = 3: which is better? Two designs with v = 15, b = 7, k = 3: which is better?

Conventions: columns are blocks; order of treatments within each block is irrelevant; 1 1 2 3 4 5 6 1 1 1 1 1 1 1 order of blocks is irrelevant. 2 4 5 6 10 11 12 2 4 6 8 10 12 14 3 7 8 9 13 14 15 3 5 7 9 11 13 15 1 1 1 1 2 2 2 1 1 1 1 2 2 2 2 3 3 4 3 3 4 1 3 3 4 3 3 4 replications differ by 1 queen-bee design 3 4 5 5 4 5 5 2 4 5 5 4 5 5 ≤

The replication of a treatment is its number of occurrences. binary non-binary A design is a queen-bee design if there is a treatment that A design is binary if no treatment occurs more than once in any occurs in every block. block.

13/44 14/44

Experimental units and matrix Levi graph

There are bk experimental units. If ω is an experimental unit, put The Levi graph G˜ of a block design ∆ has f (ω) = treatment on ω I one vertex for each treatment, g(ω) = block containing ω. I one vertex for each block, I one edge for each experimental unit, For i = 1, . . . , v and j = 1, . . . , b, let with edge ω joining vertex f (ω) to vertex g(ω). n = ω : f (ω) = i and g(ω) = j ij |{ }| It is a , = number of experimental units in block j which have with nij edges between treatment-vertex i and block-vertex j. treatment i. The v b N has entries n . × ij

15/44 16/44

Example 1: v = 4, b = k = 3 Example 2: v = 8, b = 4, k = 3

1 2 1 1 2 3 4 3 3 2 2 3 4 1 4 4 2 5 6 7 8

5 8 1 4 x ¨H @  ¨ H @ @ ¨¨ x HH x x @ ¨¨ HH  2 HH ¨¨  x H1 2¨ HxH ¨¨ x 4  H¨ @ @ x@ @ 3  x x63 x 7 x 17/44 18/44 Concurrence graph Example 1: v = 4, b = k = 3

1 2 1 The concurrence graph G of a block design ∆ has 3 3 2 I one vertex for each treatment, 4 4 2 I one edge for each unordered pair α, ω, with α = ω, 6 g(α) = g(ω) and f (α) = f (ω): 6 1 2 this edge joins vertices f (α) and f (ω). 4 ¨H ¨ H @ ¨¨ x HH x@ x There are no loops. ¨¨ HH @ H ¨  HH ¨¨ @ If i = j then the number of edges between vertices i and j is 1Hx¨2 x @ 6 HH¨¨ @ b 3x 3x 4x λij = ∑ nisnjs; s=1 Levi graph concurrence graph this is called the concurrence of i and j, can recover design may have more symmetry and is the (i, j)-entry of Λ = NN>. more vertices more edges if k = 2 more edges if k 4 19/44 ≥ 20/44

Example 2: v = 8, b = 4, k = 3 Laplacian matrices

The Laplacian matrix L of the concurrence graph G is a 1 2 3 4 v v matrix with (i, j)-entry as follows: 2 3 4 1 × I if i = j then 5 6 7 8 6 L = (number of edges between i and j) = λ ; ij − − ij I Lii = valency of i = ∑ λij. j=i 5 5 6 8 1 x ¡A The Laplacian matrix L˜ of the Levi graph G˜ is a @  ¡ xA (v + b) (v + b) matrix with (i, j)-entry as follows: @ @ 1¡ A 2 x x @ ¨ H ˜ ×  2 ¨ H I Lii = valency of i ¨ x x H x 8HH ¨¨ 6 k if i is a block xH ¨ x = 4  A ¡ @ @ 4 3 (replication ri of i if i is a treatment x@ @ xA ¡ x  A¡ I if i = j then Lij = (number of edges between i and j) 6 − x63 x 7 7x 0 if i and j are both treatments x Levi graph concurrence graph = 0 if i and j are both blocks   nij if i is a treatment and j is a block, or vice versa. 21/44 − 22/44 

Connectivity Generalized inverse

All row-sums of L and of L˜ are zero, so both matrices have 0 as eigenvalue on the appropriate all-1 vector. Under the assumption of connectivity, the Moore–Penrose generalized inverse L− of L is defined by Theorem 1 The following are equivalent. 1 − 1 L− = L + J J , v v − v v 1. 0 is a simple eigenvalue of L;   2. G is a connected graph; where J is the v v all-1 matrix. v × 3. G˜ is a connected graph; 1 (The matrix J is the orthogonal projector onto the null space 4. 0 is a simple eigenvalue of L;˜ v v of L.) 5. the design ∆ is connected in the sense that all differences between treatments can be estimated. The Moore–Penrose generalized inverse L˜ − of L˜ is defined similarly. From now on, assume connectivity. Call the remaining eigenvalues non-trivial.

They are all non-negative. 23/44 24/44 Estimation and variance How do we calculate variance?

We measure the response Yω on each experimenal unit ω.

If experimental unit ω has treatment i and is in block m Theorem (Standard linear model theory) (f (ω) = i and g(ω) = m), then we assume that Assume that all the noise is independent, with variance σ2. If x = 0, then the variance of the best linear unbiased estimator of Yω = τi + βm + random noise. ∑i i ∑i xiτi is equal to 2 (x>L−x)kσ . We want to estimate contrasts ∑ x τ with ∑ x = 0. i i i i i In particular, the variance of the best linear unbiased estimator of the simple difference τ τ is In particular, we want to estimate all the simple differences i − j τi τj. 2 − V = L− + L− 2L− kσ . ij ii jj − ij Put Vij = variance of the best linear unbiased estimator for   τ τ . i − j

We want all the Vij to be small.

25/44 26/44

. . . Or we can use the Levi graph Electrical networks

We can consider the concurrence graph G as an electrical network with a 1-ohm resistance in each edge. Connect a 1-volt battery between vertices i and j. Current flows in the network, according to these rules. Theorem 1. Ohm’s Law: In every edge, voltage drop = current resistance = The variance of the best linear unbiased estimator of the simple × current. difference τ τ is i − j 2. Kirchhoff’s Voltage Law: 2 The total voltage drop from one vertex to any other vertex V = L˜ − + L˜ − 2L˜ − σ . ij ii jj − ij is the same no matter which path we take from one to the   other. 3. Kirchhoff’s Current Law: At every vertex which is not connected to the battery, the total current coming in is equal to the total current going out. Find the total current I from i to j, then use Ohm’s Law to define the effective resistance Rij between i and j as 1/I. 27/44 28/44

Electrical networks: variance Example calculation: v = 12, b = 6, k = 3

T  vT 5  ~T 5 > T Theorem  T - [20] - T[30] - [47] The effective resistance Rij between vertices i and j in G is = T 5  10 T 17  V 47 vT  vvv T  T  T  Rij = Lii− + Ljj− 2Lij− . 5 } > } >19 − T  10 2 T    T  T  So [10] T T[28] I = 36 2 Vij = Rij kσ .  vT  vT × 5  T 10 14  T 7 > T} > T} Effective resistances are easy to calculate without  T  T 47 5 R = matrix inversion if the graph is sparse.   T -14  -7 T 36 [0] T [14] vv T  vv ~ T > 7 T  7 T  T 29/44 30/44 v . . . Or we can use the Levi graph Example 2 yet again: v = 8, b = 4, k = 3

1 2 3 4 23 V = 23 I = 8 R = 2 3 4 1 8 5 6 7 8

˜ If i and j are treatment vertices in the Levi graph G 5 ˜ ˜ 5 and Rij is the effective resistance between them in G then 8 1 - x ¡A  x ˜ 2 @ 3 R@ 1¡ A 2 Vij = Rij σ . x@  3 x 3 @ ¡ A × 2 ¨¨ HH  ¨ H x? 8H x x ¨ 6 63 3 H ¨ xxH ¨ 4 [0] [15] A ¡ R@ 5 5 R@ 4 3 x@ -5  8 @ xA ¡ x  [23] A¡

xx 63 7 7x x Levi graph concurrence graph

31/44 32/44

Average pairwise variance Optimality

The variance of the best linear unbiased estimator of the simple difference τ τ is The design is called i − j I A-optimal if it minimizes the average of the variances Vij; 2 V = L− + L− 2L− kσ . —equivalently, it maximizes the harmonic mean of the ij ii jj − ij   non-trivial eigenvalues of the Laplacian matrix L; I D-optimal if it minimizes the volume of the confidence ¯ Put V = average value of the Vij. Then ellipsoid for (τ1,..., τv); —equivalently, it maximizes the geometric mean of the 2kσ2 Tr(L ) 1 V¯ = − = 2kσ2 , non-trivial eigenvalues of the Laplacian matrix L; v 1 × harmonic mean of θ1,..., θv 1 − − over all block designs with block size k and the given v and b. where θ1,..., θv 1 are the nontrivial eigenvalues of L. −

33/44 34/44

D-optimality: spanning trees What about the Levi graph?

A spanning tree for the graph is a collection of edges of the graph which form a tree (connected graph with no cycles) Theorem (Gaffke) and which include every vertex. Let G and G˜ be the concurrence graph and Levi graph for a connected incomplete-block design for v treatments in b blocks of size k. Cheng (1981), after Gaffke (1978), after Kirchhoff (1847): Then the number of spanning trees for G˜ is equal to b v+1 k − times the number of spanning trees for G. product of non-trivial eigenvalues of L = v number of spanning trees. × So a block design is D-optimal if and only if its Levi graph maximizes the number of spanning trees. So a design is D-optimal if and only if its concurrence graph G has the maximal number of spanning trees. If v > b it is easier to count spanning trees in the Levi graph than in the concurrence graph. This is easy to calculate by hand when the graph is sparse.

35/44 36/44 Example 2 one last time: v = 8, b = 4, k = 3 Large blocks; many unreplicated treatments

1 2 3 4 2 3 4 1 5 6 7 8 k plots n plots 5 5 8 1  . . x ¡A  . .  x b blocks  whole design ∆ @ @ 1¡ A 2  x@ x ¡ A  @ 2 ¨ H   ¨¨ HH v varieties bn varieties 8H x x ¨ 6 x H ¨  all single replication xH ¨ x  4  A ¡  @ @ 4 3  x@ @ xA ¡ x   A¡ Whole design ∆ has v + bn varieties in b blocks of size k + n;

x63 x the subdesign Γ has v core varieties in b blocks of size k; 7 7x call the remaining varieties orphans. x Levi graph concurrence graph 8 spanning trees 216 spanning trees

37/44 38/44

Levi graph: 10 + 5n treatments in 5 blocks of 4 + n plots Pairwise resistance

1 2 3 4 A1 An ··· B1 H ¨ Bn H···¨ 1 5 6 7 B1 Bn An . 1x¨H 5x . C1 ··· .@ ¨ ¢ B H . .x@ ¨¨ 2 HH .x 2 5 8 9 C1 Cn A ¢ B C ··· 1 QQ x7 6 x  n A Q ¢ B  ¡ 3 6 8 0 D1 Dn x A Q¢ x B ¡ x ··· 4 A ¢ xQxB ¡ 8 4 7 9 0 E E 3 Q 9 1 ··· n xA ¢ x xQB ¡ x En   D1 . @ . .x 0x .x B1 H ¨ Bn @ H···¨ E1 Dn An 1x¨H 5x C1 .@ ¨ ¢ B H . x x .x@ ¨ ¢2B H .x ¨ ¢ B H A1 x x  Cn Resistance(A1, A2) = 2 AQQ 7 ¢ B 6 ¡ x Q x  x A Q¢ B ¡ Resistance(A1, B1) = 2 + Resistance(block A, block B) in Γ 4 A ¢ xQxB ¡ 8 3 Q 9 xA ¢ x xQB ¡ x Resistance(A1, 8) = 1 + Resistance(block A, 8) in Γ E D n .   . 1 Resistance(1, 8) = Resistance(1, 8) in Γ . 0 @ . x x @x E1 Dn x x 39/44 40/44

Sum of the pairwise variances A less obvious consequence

Theorem (cf Herzberg and Jarrett, 2007) The sum of the variances of treatment differences in ∆

= constant + V + nV + n2V , 1 3 2 Consequence where If n or b is large,

V1 = the sum of the variances of treatment differences in Γ and we want an A-optimal design, it may be best to make Γ a complete block design for k0 controls, V2 = the sum of the variances of block differences in Γ even if there is no interest in V3 = the sum of the variances of sums of comparisons between new treatments and controls, one treatment and one block in Γ. or between controls.

(If Γ is equi-replicate then V2 and V3 are both increasing functions of V1.) Consequence For a given choice of k, make Γ as efficient as possible.

41/44 42/44 Spanning trees D-optimality under very low replication

A spanning tree for the Levi graph is a collection edges which provides a unique path between every pair of vertices. Consequence B1 H ¨ Bn H···¨ The whole design ∆ is D-optimal An 1x¨H 5x C1 .@ ¨ ¢ B H . for v + bn treatments in b blocks of size k + n .x@ ¨ ¢2B H .x ¨ ¢ B H if and only if the core design Γ is D-optimal A1 x x  Cn AQQ 7 ¢ B 6 ¡ x Q x  x for v treatments in b blocks of size k. A Q¢ B ¡ 4 A ¢ xQxB ¡ 8 3 Q 9 Consequence xA ¢ x xQB ¡ x En .   . D1 Even when n or b is large, . 0 @ . x x @x D-optimal designs do not include uninteresting controls. E1 Dn x x The orphans make no difference to the number of spanning trees for the Levi graph.

43/44 44/44