Algebraic Geometry Fall 2013 & Spring 2014
Eduard Looijenga Rings are always supposed to be commutative and to possess a unit 1. A ring homomorphism will always take unit to unit. We allow that 1 = 0, but in that case we get of course the zero ring {0}. If R is a ring, then we denote the multiplicative group of invertible elements (units) of R by R×. We say that R is a domain if R has no zero divisors, equivalently, if (0) is a prime ideal. An R-algebra is a ring A endowed with a ring homomorphism φ : R → A, but if φ is understood, then for every r ∈ R and a ∈ A, the product φ(r)a is often denoted by ra. We say that A is finitely generated as an R-algebra if we can find a1, . . . , an in A such that every element of A can be written as a polynomial in these elements with coefficients in R; in other words, the R-algebra homomorphism R[x1, . . . , xn] → A which sends the variable xi to ai is onto. This is not to be confused with the notion of finite generation of an R-module M resp. which merely means the existence of a surjective homomorphism of R-modules Rn → M. Similarly, a field K is said to be finitely generated as a field over a subfield k if we can find elements b1, . . . , bn in K such that every element of K can be written as a fraction of two polynomials in these elements with coefficients in k. Contents
Chapter 1. Affine varieties5 1. The Zariski topology5 2. Irreducibility and decomposition8 3. Finiteness properties and the Hilbert theorems 14 4. The affine category 18 5. The sheaf of regular functions 23 6. The product 25 7. Function fields and rational maps 27 8. Finite morphisms 31 9. Dimension 36 10. Nonsingular points 39 11. The notion of a variety 47
Chapter 2. Projective varieties 51 1. Projective spaces 51 2. The Zariski topology on a projective space 53 3. The Segre embeddings 56 4. Projections 57 5. Elimination theory and closed projections 58 6. Constructible sets 62 7. The Veronese embedding 63 8. Grassmannians 65 9. Multiplicities of modules 70 10. Hilbert functions and Hilbert polynomials 74
Chapter 3. Schemes 79 1. Presheaves and sheaves 79 2. The spectrum of a ring as a locally ringed space 84 3. The notion of a scheme 88 4. Formation of products 94 5. Elementary properties of schemes 95 6. Separated and proper morphisms 99 7. Quasi-coherent sheaves 109 8. When is a scheme affine? 113 9. Divisors and invertible sheaves 116 10. The projective setting 124 11. Cohomology of coherent modules on a projective scheme 127 12. Ample and very ample sheaves 131 13. Blowing up 134 14. Flatness 138
3 4 CONTENTS
15. Serre duality 141
Chapter 4. Appendix 145 1. The basics of category theory 145 2. Abelian categories and derived functors 151 3. Sheaf cohomology 160
Bibliography 173 CHAPTER 1
Affine varieties
Throughout these notes k stands for an algebraically closed field k. Recall that this means that every polynomial f ∈ k[x] of positive degree has a root x1 ∈ k: f(x1) = 0. This implies that we can split off the factor x − x1 from f with quotient a polynomial of degree one less than f. Continuing in this manner we then find × that f decomposes simply as f(x) = c(x − x1) ··· (x − xd) with c ∈ k = k − {0}, d = deg(f) and x1, . . . , xd ∈ k. Since an algebraic extension of k is obtained by adjoining to k roots of polynomials in k[x], this also shows that the property in question is equivalent to: every algebraic extension of k is equal to k. A first example you may think of is the field of complex numbers C, but as we proceed you should be increasingly be aware of the fact that there are many others: it is shown in a standard algebra course that for any field F an algebraic closure1 F¯ is obtained by adjoining to F the roots of every polynomial f ∈ F [x]. So we could take for k an algebraic closure of the field of rational numbers Q, of the finite field 2 Fq, where q is a prime power or even of the field of fractions of any domain such as C[x1, . . . , xr]. 1. The Zariski topology n Any f ∈ k[x1, . . . , xn] determines in an evident manner a function k → k. In such cases we prefer to think of kn not as vector space—its origin and vector addition will be irrelevant to us—but as a set with a weaker structure. We shall make this precise later, but it basically amounts to only remembering that elements of k[x1, . . . , xn] can be understood as k-valued functions on it. For that reason it n n is convenient to denote this set differently, namely as A (or as Ak , if we feel that we should not forget about the field k). We refer to An as the affine n-space over k.A k-valued function on An is then said to be regular if it is defined by some f ∈ k[x1, . . . , xn]. We denote the zero set of such a function by Z(f) and the n n complement (the nonzero set) by Af ⊂ A . If f is not a constant polynomial (that is, f∈ / k), then we call Z(f) a hypersurface of An.
EXERCISE 1. Prove that f ∈ k[x1, . . . , xn] is completely determined by the reg- ular function it defines. (Hint: do first the case n = 1.) So the ring k[x1, . . . , xn] can be regarded as a ring of functions on An under pointwise addition and multiplica- tion. (This would fail to be so had we not assumed that k is algebraically closed: q for instance the function on the finite field Fq defined by x − x is identically zero.)
1This can not be done in one step and involves an infinite process which involves in general many choices. This is reflected by the fact that the final result is not canonical, although it is unique up to a (in general nonunique) isomorphism; whence the use of the indefinite article in ‘an algebraic closure’. 2 (qn) Since the elements of any algebraic extension of Fq of degree n ≥ 2 are roots of x − x, we only need to adjoin roots of such polynomials.
5 6 1. AFFINE VARIETIES
EXERCISE 2. Prove that a hypersurface is nonempty. It is perhaps somewhat surprising that in this rather algebraic context, the lan- guage of topology proves to be quite effective: algebraic subsets of An shall appear as the closed sets of a topology, albeit a rather peculiar one. n LEMMA-DEFINITION 1.1. The collection {Af : f ∈ k[x1, . . . , xn]} is a basis of a 3 topology on An, called the Zariski topology . A subset of An is closed for this topology if and only if it is an intersection of zero sets of regular functions.
PROOF. We recall that a collection {Uα}α of subsets of a set X is a basis for a topology if and only if its union is all of X and any intersection Uα1 ∩ Uα2 is a union of members of {Uα}α. This is here certainly the case, for we have U(0) = X n and U(f1) ∩ U(f2) = U(f1f2). Since an open subset of A is by definition a union n of subsets of the form Af , a closed subset must be an intersection of subsets of the form Z(f). EXAMPLE 1.2. The Zariski topology on A1 is the cofinite topology: its closed subsets 6= A1 are the finite subsets. EXERCISE 3. Show that the diagonal in A2 is closed for the Zariski topology, but not for the product topology (where each factor A1 is equipped with the Zariski topology). So A2 does not have the product topology. We will explore the mutual relationship between the following two basic maps:
n I {subsets of A } −−−−→ {ideals of k[x1, . . . , xn]} ∪ ∩
n Z {closed subsets of A } ←−−−− {subsets of k[x1, . . . , xn]}. n where for a subset X ⊂ A , I(X) is the ideal of f ∈ k[x1, . . . , xn] with f|X = 0 and n for a subset J ⊂ k[x1, . . . , xn], Z(J) is the closed subset of A defined by ∩f∈J Z(f). Observe that
I(X1 ∪ X2) = I(X1) ∩ I(X2) and Z(J1 ∪ J2) = Z(J1) ∩ Z(J2). In particular, both I and Z are inclusion reversing. Furthermore, the restriction of I to closed subsets defines a section of Z: if Y ⊂ An is closed, then Z(I(Y )) = Y . We also note that by Exercise1 I(An) = (0), and that any singleton {p = n (p1, . . . , pn)} ⊂ A is closed, as it is the common zero set of the degree one polyno- mials x1 − p1, . . . , xn − pn.
EXERCISE 4. Prove that I({p}) is equal to the ideal generated by these degree one polynomials and that this ideal is maximal.
EXERCISE 5. Prove that the (Zariski) closure of a subset Y of An is equal to Z(I(Y )). n Given Y ⊂ A , then f, g ∈ k[x1, . . . , xn] have the same restriction to Y if and only if f − g ∈ I(Y ). So the quotient ring k[x1, . . . , xn]/I(Y ) (a k-algebra) can be regarded as a ring of k-valued functions on Y . Notice that this k-algebra does not change if we replace Y by its Zariski closure.
3 n We shall later modify the definition of both A and the Zariski topology. 1. THE ZARISKI TOPOLOGY 7
n DEFINITION 1.3. Let Y ⊂ A be closed. The k-algebra k[x1, . . . , xn]/I(Y ) is called the coordinate ring of Y and we denote it by k[Y ].A k-valued function on Y is said to be regular if it lies in this ring. Given a closed subset Y ⊂ An, then for every subset X ⊂ An we have X ⊂ Y if and only if I(X) ⊃ I(Y ), and in that case IY (X) := I(X)/I(Y ) is an ideal of k[Y ]: it is the ideal of regular functions on Y that vanish on X. Conversely, an ideal of k[Y ] is of the form J/I(Y ), with J an ideal of k[x1, . . . , xn] that contains I(Y ), and such an ideal defines a closed subset Z(J) contained in Y . So the two basic maps above give rise to such a pair on Y : I {subsets of Y } −−−−→Y {ideals of k[Y ]} ∪ ∩ Z {closed subsets of Y } ←−−−−Y {subsets of k[Y ]}.
We ask: which ideals of k[x1, . . . , xn] are of the form I(Y ) for some Y ? Clearly, if f ∈ k[x1, . . . , xn] is such that some positive power vanishes on Y , then f vanishes on Y . In other words: if f m ∈ I(Y ) for some m > 0, then f ∈ I(Y ). This suggests: PROPOSITION-DEFINITION 1.4. Let R be a ring (as always commutative and with m 1) and let J ⊂ R be an ideal. Then the set of a ∈ R with the property√ that a ∈ J for some m > 0 is an ideal of R, called the√radical of J and denoted J. We say that J is a radical ideal if J = J. We say that the ring R is reduced if the zero ideal (0) is a radical ideal (in other words, R has no nonzero nilpotents: if a ∈ R is such that am = 0, then a = 0). √ √ m n PROOF. We show that J is an ideal. Let a, b ∈ J so that√ a , b ∈ J for m certain positive integers m,√ n. Then for every r ∈ R, ra ∈ J, since (ra) = rmam ∈ J. Similarly a−b ∈ J, for (a−b)m+n is a linear combination of monomials m n that are multiples of a or b and hence lie in J. EXERCISE 6. Show that a prime ideal is a radical ideal. Notice that J is a radical ideal if and only if R/J is reduced. The preceding shows that for every Y ⊂ An, I(Y ) is a radical ideal, so that k[Y ] is reduced. The dictionary between algebra and geometry begins in a more substantial manner with
THEOREM 1.5√ (Hilbert’s Nullstellensatz). For every ideal J ⊂ k[x1, . . . , xn] we have I(Z(J)) = J. We inclusion ⊃ is clear; the hard part is the opposite inclusion (which says that m if f ∈ k[x1, . . . , xn] vanishes on Z(J), then f ∈ J for some positive integer m). We postpone its proof, but discuss some of the consequences. n COROLLARY 1.6. Let Y ⊂ A be closed. Then the maps IY and ZY restrict to bijections: {closed subsets of Y } ↔ {radical ideals of k[Y ]}. These bijections are inclusion reversing and each others inverse. PROOF. We first prove this for Y = An. We already observed that for every closed subset X of An we have Z(I(X)) = X . The Nullstellensatz says that for a radical ideal J ⊂ k[x1, . . . , xn], we have I(Z(J)) = J. An ideal of k[Y ] is of the form J/I(Y ). This is a radical ideal if and only if J is one. So the property also follows for an arbitrary closed Y . 8 1. AFFINE VARIETIES
Let m be a maximal ideal of k[x1, . . . , xn]. Such an ideal is certainly radical as it is a prime ideal and so it is also maximal among the radical ideals that are distinct from k[x1, . . . , xn]. Hence it is of the form I(Y ) for a closed subset Y . Since n the empty subset of A is defined by the radical ideal k[x1, . . . , xn], Corollary 1.6 implies that Y will be nonempty and minimal for this property. In other words, Y is a singleton {y} (and if y = (a1, . . . , an), then m = (x1 − a1, . . . , xn − an), by Exercise4). Thus the above correspondence provides a bijection between the points of An and the maximal ideals of An. If we are given a closed subset Y ⊂ An and a point y ∈ An, then y ∈ Y if and only if the maximal ideal defined by y contains I(Y ). But a maximal ideal containing I(Y ) is the preimage of a maximal ideal in k[Y ] = k[x1, . . . , xn]/I(Y ). We conclude that points of Y correspond to maximal ideals of k[Y ]. Via this (or a very similar) correspondence, algebraic geometry seeks to express geometric properties of Y in terms of algebraic properties of k[Y ] and vice versa. In the end we want to forget about the ambient An completely.
2. Irreducibility and decomposition We introduce a property which for most topological spaces is of little interest, but as we will see, is useful and natural for the Zariski topology.
DEFINITION 2.1. Let Y be a topological space. We say that Y is irreducible if it is nonempty and cannot be written as the union of two closed subsets 6= Y . An irreducible component of Y is a maximal irreducible subset of Y .
EXERCISE 7. Prove that an irreducible Hausdorff space must consist of a single point. Prove also that an infinite set with the cofinite topology is irreducible.
EXERCISE 8. Let G1,...,Gs be closed subsets of a topological space Y . Prove that any irreducible subset of G1 ∪ · · · ∪ Gs is contained in some Gi.
LEMMA 2.2. Let Y be a topological space. (i) If Y is irreducible, then every nonempty open subset of Y is dense in Y and irreducible. (ii) Conversely, if C ⊂ Y is an irreducible subspace, then C¯ is also irreducible. In particular, an irreducible component of Y is always closed in Y .
PROOF. (i) Suppose Y is irreducible and let U ⊂ Y be open and nonempty. Then Y is the union of the two closed subspaces Y −U and U¯. Since Y is irreducible, and Y − U 6= Y , we must have U¯ = Y . So U is dense in Y . To see that U is irreducible, suppose that U is the union of two subsets that are both closed in U. These subsets will be of the form Gi ∩ U with Gi closed in Y . Then G1 ∪ G2 is a closed subset of Y which contains U. Since U¯ = Y , it follows that G1 ∪ G2 = Y . The irreducibility of Y implies that one of the Gi (say G1) equals Y and then U = G1 ∩ U. (ii) Let C ⊂ Y be irreducible (and hence nonempty). If C¯ is written as a union of two closed subsets G1,G2 of Y , then C is the union of the two subsets G1 ∩ C and G2 ∩ C that are both closed in C, and so one of these, say G1 ∩ C, equals C. ¯ ¯ This means that G1 ⊃ C and hence G1 ⊃ C. So C is irreducible. The following proposition tells us what irreducibility amounts to in the Zariski topology. 2. IRREDUCIBILITY AND DECOMPOSITION 9
PROPOSITION 2.3. A closed subset Y ⊂ An is irreducible if and only if I(Y ) is a prime ideal (which, we recall, is equivalent to: k[Y ] = k[x1, . . . , xn]/I(Y ) is a domain).
PROOF. Suppose Y is irreducible and f, g ∈ k[x1, . . . , xn] are such that fg ∈ I(Y ). Then Y ⊂ Z(fg) = Z(f) ∪ Z(g). Since Y is irreducible, Y is contained in Z(f) or in Z(g. So f ∈ I(Y ) or g ∈ I(Y ), proving that I(Y ) is a prime ideal. Suppose I(Y ) is a prime ideal, but that Y is the union of two closed subsets Y1 and Y2 that are both 6= Y . Since Yi 6= Y , there exists a fi ∈ I(Yi) − I(Y ). Then f1f2 vanishes on Y1 ∪ Y2 = Y , so that f1f2 ∈ I(Y ). The fact I(Y ) is a prime ideal implies that one of f1 and f2 is in I(Y ). Whence a contradiction.
One of our first aims is to prove that the irreducible components of any closed subset Y ⊂ An are finite in number and have Y as their union. This may not sound very surprising, but we will see that this reflects some nonobvious algebraic properties. Let us first consider the case of a hypersurface. Since we are going to use the fact that k[x1, . . . , xn] is a unique factorization domain, we begin with recalling that notion:
DEFINITION 2.4. We say that R is a unique factorization domain if R has no zero divisors and every principal ideal (a) := Ra in R which is neither the zero ideal nor all of R is in unique manner a product of principal prime ideals: (a) = (p1)(p2) ··· (ps) (so the ideals (p1),..., (ps) are unique up to order). Observe that the principal ideal generated by p ∈ R is prime if and only if for any factoring of p, p = p0p00, one of the factors must be a unit and that the identity (a) = (p1)(p2) ··· (ps) amounts to the statement that a is a unit times p1p2 ··· ps. The factors are then unique up to order and multiplication by a unit. For a field (which has no proper principal ideals distinct from (0)) the imposed condition is empty and hence a field is automatically unique factorization domain. A more substantial example (that motivated this notion in the first place) is Z: a principal prime ideal of Z is of the form (p), with p a prime number. Every integer n ≥ 2 has a unique prime decomposition and so Z is a unique factorization domain. A basic theorem in the theory of rings asserts that if R is a unique factorization domain, then so is its polynomial ring R[x]. This implies (with induction on n) that R[x1, . . . , xn] is one. This applies to the case when R is a field (think of our k): a nonzero principal ideal of this ring is prime precisely when it is generated by an irreducible polynomial of positive degree and every f ∈ R[x1, . . . , xn] of positive degree then can be written as a product of irreducible polynomials: f = f1f2 ··· fs, a factorization that is unique up to order and multiplication of each fi by a nonzero element of R. The following proposition connects two notions of irreducibility.
PROPOSITION 2.5. If f ∈ k[x1, . . . , xn] is irreducible, then the hypersurface Z(f) it defines is irreducible. If f ∈ k[x1, . . . , xn] is of positive degree and f = f1f2 ··· fs is a factoring into irreducible polynomials, then Z(f1),...,Z(fs) are the irreducible components of Z(f) and their union equals Z(f) (but we are not claiming that the Z(fi)’s are pairwise distinct). In particular, a hypersurface is the union of its ir- reducible components; these irreducible components are hypersurfaces and finite in number. 10 1. AFFINE VARIETIES
PROOF. If f ∈ k[x1, . . . , xn] is irreducible, then f generates a prime ideal and so Z(f) is an irreducible hypersurface by Proposition 2.3. If f ∈ k[x1, . . . , xn] is of positive degree and f = f1f2 ··· fs is as in the propo- sition, then it is clear that Z(f) = Z(f1) ∪ · · · ∪ Z(fs) with each Z(fi) irreducible. To see that Z(fi) is an irreducible component of Z(f), suppose that C ⊂ Z(f) is irreducible. By Exercise8, C is contained in some Z(fi). Since C is a maximal irreducible subset of Z(f), it follows that we then must have equality: C = Z(fi). It remains to observe that if any inclusion relation Z(fi) ⊂ Z(fj) is necessarily an identity (prove this yourself) so that each Z(fi) is already maximal and hence in irreducible component. The discussion of the general case begins with the rather formal
LEMMA 2.6. For a partially ordered set (A, ≤) the following are equivalent:
(i) (A, ≤) satisfies the ascending chain condition: every ascending chain a1 ≤ a2 ≤ a3 ≤ · · · becomes stationary: an = an+1 = ··· for n sufficiently large. (ii) Every nonempty subset B ⊂ A has a maximal element, that is, an element b0 ∈ B such that there is no b ∈ B with b > b0. PROOF. (i)⇒(ii). Suppose (A, ≤) satisfies the ascending chain condition and let B ⊂ A be nonempty. Choose b1 ∈ B. If b1 is maximal, we are done. If not, then there exists a b2 ∈ B with b2 > b1. We repeat the same argument for b2. We cannot indefinitely continue in this manner because of the ascending chain condition. (ii)⇒(i). If (A, ≤) satisfies (ii), then the set of members of any ascending chain has a maximal element, in other words, the chain becomes stationary. If we replace ≤ by ≥, then we obtain the notion of the descending chain condi- tion and we find that this property is equivalent to: every nonempty subset B ⊂ A has a minimal element. These properties appear in the following pair of definitions.
DEFINITIONS 2.7. We say that a ring R is noetherian if its collection of ideals satisfies the ascending chain condition. We say that a topological space Y is noetherian if its collection of closed subsets satisfies the descending chain condition.
EXERCISE 9. Prove that a subspace of a noetherian space is noetherian. Prove also that a ring quotient of a noetherian ring is noetherian.
EXERCISE 10. Prove that a noetherian space is quasi-compact: every covering of such a space by open subsets contains a finite subcovering. The interest of the noetherian property is that it is one which is possessed by almost all the rings we encounter and that it implies many finiteness properties without which we are often unable to go very far. Let us give a nonexample first: the ring R of holomorphic functions on C is not noetherian: if Ik denotes the ideal of f ∈ R vanishing on all the integers ≥ k, then I1 ⊂ I2 ⊂ · · · is a strictly ascending chain of ideals in R. Obviously a field is noetherian. The ring Z is noetherian: if I1 ⊂ I2 ⊂ · · · is an ∞ ascending chain of ideals in Z, then ∪s=1Is is an ideal of Z, hence of the form (n) for some n ∈ Z. But if s is such that n ∈ Is, then clearly the chain is stationary as of index s. (This argument only used the fact that any ideal in Z is generated by a single element, i.e., that Z is a principal ideal domain.) That most rings we know are noetherian is a consequence of 2. IRREDUCIBILITY AND DECOMPOSITION 11
THEOREM 2.8 (Hilbert’s basis theorem). If R is a noetherian ring, then so is R[x]. As with the Nullstellensatz, we postpone the proof and discuss some of its consequences first.
COROLLARY 2.9. If R is a noetherian ring (for example, a field) then so is every finitely generated R-algebra. Also, the space An (and hence any closed subset of An) is noetherian.
PROOF. The Hilbert basis theorem implies (with induction on n) that the ring R[x1, . . . , xn] is noetherian. By Exercise9, every quotient ring R[x1, . . . , xn]/I is then also noetherian. But a finitely generated R-algebra is (by definition actually) isomorphic to some such quotient and so the first statement follows. n Suppose A ⊃ Y1 ⊃ Y2 ⊃ · · · is a descending chain of closed subsets. Then (0) ⊂ I(Y1) ⊂ I(Y2) ⊂ · · · is an ascending chain of ideals. As the latter becomes stationary, so will become the former. PROPOSITION 2.10. If Y is noetherian space, then its irreducible components are finite in number and their union equals Y .
PROOF. Suppose Y is a noetherian space. We first show that every closed sub- set can be written as a finite union of closed irreducible subsets. First note that the empty set has this property, for a union with empty index set is empty. Let B be the collection of closed subspaces of Y for which this is not possible, i.e., that cannot be written as a finite union of closed irreducible subsets. Suppose that B is nonempty. According to 2.6 this collection has a minimal element, Z, say. This Z must be nonempty and cannot be irreducible. So Z is the union of two proper closed subsets Z0 and Z00. The minimality of Z implies that neither Z0 nor Z00 is in B and so both Z0 and Z00 can be written as a finite union of closed irreducible subsets. But then so can Z and we get a contradiction. In particular, there exist closed irreducible subsets Y1,...,Yk of Y such that Y = Y1 ∪ · · · ∪ Yk (if Y = ∅, take k = 0). We may of course assume that no Yi is contained in some Yj with j 6= i (otherwise, omit Yi). It now remains to prove that the Yi’s are the irreducible components of Y , that is, we must show that every irreducible subset C of Y is contained in some Yi. But this follows from an application of Exercise8. If we apply this to An, then we find that every subset Y ⊂ An has a finite number of irreducible components, the union of which is all of Y . If Y is closed in An, then so is every irreducible component of Y and according to Proposition 2.3 any such irreducible component is defined by a prime ideal. This allows us to recover the irreducible components of a closed subset Y ⊂ An from its coordinate ring: COROLLARY 2.11. Let Y ⊂ An be a closed subset. If C is an irreducible component of Y , then the image IY (C) of I(C) in k[Y ] is a minimal prime ideal of k[Y ] and any minimal prime ideal of k[Y ] is so obtained: we thus get a bijective correspondence between the irreducible components of Y and the minimal prime ideals of k[Y ].
PROOF. Let C be a closed subset of Y and let IY (C) be the corresponding ideal of k[Y ]. Now C is irreducible if and only if I(C) is a prime ideal of k[x1, . . . , xn], or what amounts to the same, if and only if IY (C) is a prime ideal of k[Y ]. It is 12 1. AFFINE VARIETIES an irreducible component if C is maximal for this property, or what amounts to the same, if IY (C) is minimal for the property of being a prime ideal of k[Y ].
EXAMPLE 2.12. First consider the set C := {(t, t2, t3) ∈ A3 | t ∈ k}. This is a 3 closed subset of A : if we use (x, y, z) instead of (x1, x2, x3), then C is the common zero set of y − x2 and z − x3. Now the inclusion k[x] ⊂ k[x, y, z] composed with the ring quotient k[x, y, z] → k[x, y, z]/(y − x2, z − x3) is easily seen to be an isomor- phism. Since k[x] has no zero divisors, (y − x2, z − x3) must be a prime ideal. So C is irreducible and I(C) = (y − x2, z − x3). We now turn to the closed subset Y ⊂ A3 defined by xy−z = 0 and y3 −z2 = 0. Let p = (x, y, z) ∈ Y . If y 6= 0, then we put t := z/y; from y3 = x2, it follows that y = t2 and z = t3 and xy = z implies that x = t. In other words, p ∈ C in that case. If y = 0, then z = 0, in other words p lies on the x-axis. Conversely, any point on the x-axis lies in Y . So Y is the union of C and the x-axis and these are the irreducible components of Y . We briefly discuss the corresponding issue in commutative algebra. We begin with recalling the notion of localization and we do this in the generality that is needed later.
2.13.L OCALIZATION. Let R be a ring and let S be a multiplicative subset of R: 1 ∈ S and S closed under multiplication. Then a ring S−1R, together with a ring homomorphism R → S−1R is defined as follows. An element of S−1R is by definition written as a formal fraction r/s, with r ∈ R and s ∈ S, with the understanding that r/s = r0/s0 if and only if s00(s0r − sr0) = 0 for some s00 ∈ S. This is a ring indeed: multiplication and subtraction is defined as for ordinary fractions: r/s.r0/s0 = (rr0)/(ss0) and r/s − r0/s0 = (s0r − sr0)/(ss0); it has 0/1 as zero and 1/1 as unit element and the ring homomorphism R → S−1R is simply r 7→ r/1. Observe that the definition shows that 0/1 = 1/1 if and only if 0 ∈ S, in which case S−1R is reduced to the zero ring. We also note that any s ∈ S maps to an invertible element of S−1R, the inverse of s/1 being 1/s. (This is also true when 0 ∈ S, for 0 is invertible in the zero ring.) In a sense (made precise in part (b) of Exercise 11 below) the ring homomorphism R → S−1R is universal for that property. This construction is called the localization away from S. It is clear that if S does not contain zero divisors, then r/s = r0/s0 if and only if s0r − sr0 = 0; in particular, r/1 = r0/1 if and only if r = r0, so that R → S−1R is then injective. If we take S maximal for this property, namely take it to be the set of nonzero divisors of R (which is indeed multiplicative), then S−1R is called the fraction ring Frac(R) of R. When R is a domain, S = R−{0} and so Frac(R) is a field, the fraction field of R. This gives the following corollary, which hints to the importance of prime ideals in the subject.
COROLLARY 2.14. An ideal p of a ring R is a prime ideal if and only if it is the kernel of a ring homomorphism from R to a field.
PROOF. It is clear that the kernel of a ring homomorphism from R to a field is always a prime ideal. Conversely, if p is a prime ideal, then it is the kernel of the composite R → R/p ,→ Frac(R/p).
Of special interest is when S = {sn | n ≥ 0} for some s ∈ R. We then usually write R[1/s] for S−1R. Notice that the image of s in R[1/s] is invertible. Note that when s is nilpotent, R[1/s] is the zero ring.
EXERCISE 11. Let R be a ring and let S be a multiplicative subset of R. (a) What is the the kernel of R → S−1R? 2. IRREDUCIBILITY AND DECOMPOSITION 13
(b) Prove that a ring homomorphism φ : R → R0 with the property that φ(s) is invertible for every s ∈ S factors in a unique manner through S−1R. (c) Consider the polynomial ring R[xs : s ∈ S] and the homomorphism of −1 R-algebras R[xs : s ∈ S] → S R that sends xs to 1/s. Prove that this homomorphism is surjective and that its kernel consists of the f ∈ R[xs : s ∈ S] which after multiplication by an element of S lie in the ideal generated the degree one polynomials sxs − 1, s ∈ S. EXERCISE 12. Let R be a ring and let p be a prime ideal of R. (a) Prove that the complement R − p is a multiplicative system. The result- ing localization (R − p)−1R is called the localization at p and is usually denoted Rp. (b) Prove that pRp is a maximal ideal of Rp and that it is the only maximal ideal of Rp. (A ring with a unique maximal ideal is called a local ring.) (c) Prove that the localization map R → Rp drops to an isomorphism of fields Frac(R/p) → Rp/pRp. (d) Work this out for R = Z and p = (p), where p is a prime number. (e) Same for R = k[x, y] and p = (x).
LEMMA 2.15. Let R be a ring. Then the intersection of all the prime ideals of R is the ideal of nilpotents p(0) of R. Equivalently, for every nonnilpotent a ∈ R, there exists a ring homomorphism from R to a field that is nonzero on a.
PROOF. It is easy to see that a nilpotent element lies in every prime ideal. Now for nonnilpotent a ∈ R consider the homomorphism R → R[1/a]. The ring R[1/a] has a maximal ideal4 and hence admits a ring homomorphism to some field F : φ : R[1/a] → F . Then the kernel of the composite R → R[1/a] → F is a prime ideal and a is not in this kernel (for its image is invertible with inverse φ(1/a)). EXERCISE 13. Let R be a ring. Prove that the intersection of all the maximal ideals of a ring R consists of the a ∈ R for which 1 + aR ⊂ R× (i.e., 1 + ax is invertible for every x ∈ R). You may use the fact that every proper ideal of R is contained in a maximal ideal. We can do better if R is noetherian. The following proposition is the algebraic counterpart of Proposition 2.10. Note the similarity between the proofs.
PROPOSITION 2.16. Let R be a noetherian ring. Then any radical ideal of R is an intersection of finitely many prime ideals. Also, the minimal prime ideals of R are finite in number and their intersection is equal to the ideal of nilpotents p(0). PROOF. Let B be the collection of the radical ideals I ( R that can not be writ- ten as an intersection of finitely many prime ideals and suppose that B is nonempty. Since R is noetherian, B contains a maximal member I0. We derive a contradiction as follows. Since I0 cannot be a prime ideal,√ there exist a1, a2 ∈ R − I0 with a1a2 ∈ I0. Consider the radical ideal Ji := I0 + Rai. We claim that J1 ∩ J2 = I0. The inclusion ⊃ is obvious and ⊂ is seen as follows: if a ∈ J1 ∩J2, then for i = 1, 2, there ni n1+n2 exists an ni > 0 such that a ∈ I0 +Rai. Hence a ∈ (I0 +Ra1)(I0 +Ra2) = I0,
4Every ring has a maximal ideal. For noetherian rings, which are our main concern, this is obvious, but in general this follows with transfinite induction, the adoption of which is equivalent to the adoption of the axiom of choice. 14 1. AFFINE VARIETIES so that a ∈ I0. Since Ji strictly contains I0, it does not belong to B. In other words, Ji is an intersection of finitely many prime ideals. But then so is J1 ∩ J2 = I0 and we get a contradiction. p We thus find that (0) = p1 ∩ · · · ∩ ps for certain prime ideals pi. We may of course assume that no pi contains some pj with j 6= i (otherwise, omit pi). It now remains to prove that every prime ideal p of R contains some pi. If that is not the case, then for i = 1, . . . , s there exists a ai ∈ pi − p. But then a1a2 ··· as ∈ p p1 ∩ · · · ∩ ps = (0) ⊂ p and since p is a prime ideal, some factor ai lies in p. This is clearly a contradiction. √ EXERCISE 14. Let J be an ideal of the ring R. Show that J is the intersection of all the prime ideals that contain J. Prove that in case R is noetherian, the prime ideals that are minimal√ for this property are finite in number and that their common intersection is still J. What do we get for R = Z and J = Zn?
EXERCISE 15. Let R be a ring, S ⊂ R be a multiplicative system and denote by φ : R → S−1R the natural homomorphism. Prove that the map which assigns to every prime ideal of S−1R its preimage in R under φ defines a bijection between the prime ideals of S−1R and the prime ideals of R disjoint with S. Prove also that if 0 ∈/ S, then the preimage of the ideal of nilpotents of S−1R is the ideal of nilpotents of R.
3. Finiteness properties and the Hilbert theorems The noetherian property in commutative algebra is best discussed in the con- text of modules, even if one’s interest is only in rings. We fix a ring R and first recall the notion of an R-module.
The notion of an R-module is the natural generalization of a K-vector space (where K is some field). Let us observe that if M is an (additively written) abelian group, then the set End(M) of group homomorphisms M → M is a ring for which subtraction is pointwise defined and multiplication is composition (so if f, g ∈ End(M), then f − g : m ∈ M 7→ f(m) − g(m) and fg : m 7→ f(g(m))); clearly the zero element is the zero homomorphism and the unit element is the identity. It only fails to obey our convention in the sense that this ring is usually noncommutative. We only introduced it in order to be able state succinctly:
DEFINITION 3.1. An R-module is an abelian group M, equipped with a ring homomor- phism R → End(M). So any r ∈ R defines a homomorphism M → M; we usually denote the image of m ∈ M simply by rm. If we write out the properties of an R-module structure in these terms, we get: r(m1 − m2) = rm1 − rm2, (r1 − r2)m = r1m − r2m, 1.m = m, r1(r2m) = (r1r2)m. If R happens to be field, then we see that an R-module is the same thing as an R-vector space. The notion of an R-module is quite ubiquitous if you think about it. A simple example is an ideal I ⊂ R. Any abelian group M is in a natural manner a Z-module. And a R[x]-module can be understood as an real linear space V (an R-module) endowed with an endomorphism (the image of x in End(V )). A more involved example is the following: if X is a manifold, f is a C∞-function on X and ω a C∞ differential p-form on X, then fω is also a C∞ differential p-form on X. Thus the linear space of C∞-differential forms on X of a fixed degree p is naturally a module over the ring of C∞-functions on X. Here are a few companion notions, followed by a brief discussion. 3. FINITENESS PROPERTIES AND THE HILBERT THEOREMS 15
3.2. In what follows is M an R-module. A map f : M → N from M to an R-module N is called a R-homomorphism if it is a group homomorphism with the property that f(rm) = rf(m) for all r ∈ R and m ∈ M. If f is also bijective, then we call it an R-isomorphism; in that case its inverse is also a homomorphism of R-modules. For instance, given a ring homomorphism f : R → R0, then R0 becomes an R-module by rr0 := f(r)r0 and this makes f a homomorphism of R-modules. A subset N ⊂ M is called an R-submodule of M if it is a subgroup and rn ∈ N for all r ∈ R and n ∈ N. Then the group quotient M/N is in a unique manner a R-module in such a way that the quotient map M → M/N is a R-homomorphism: we let r(m+N) := rm+N for r ∈ R and m ∈ M. Notice that a R-submodule of R (here we regard R as a R-module) is the same thing as an ideal of R. Given a subset S ⊂ M, then the set of elements m ∈ M that can be written as r1s1 + ··· + rksk with ri ∈ R and si ∈ S is a R-submodule of M. We call it the R-submodule of M generated by S and we shall denote it by RS. If there exists a finite set S ⊂ M such that M = RS, then we say that M is finitely generated as an R-module.
DEFINITION 3.3. We say that an R-module M is noetherian if the collection of R-submodules of M satisfies the ascending chain condition: any ascending chain of R-submodules N1 ⊂ N2 ⊂ · · · becomes stationary. It is clear that then every quotient module of a noetherian module is also noe- therian. The noetherian property of R as a ring (as previously defined) coincides with this property of R as an R-module. The following two propositions provide the passage from the noetherian prop- erty to finite generation:
PROPOSITION 3.4. An R-module M is noetherian if and only if every R-submodule of M is finitely generated as an R-module.
PROOF. Suppose that M is a noetherian R-module and let N ⊂ M be a R- submodule. The collection of finitely generated R-submodules of M contained in N is nonempty. Hence it has a maximal element N0. If N0 = N, then N is finitely generated. If not, we run into a contradiction: just choose x ∈ N −N0 and consider N0 + Rx. This is a R-submodule of N. It is finitely generated (for N0 is), which contradicts the maximal character of N0. Suppose now that every R-submodule of M is finitely generated. If N1 ⊂ ∞ N2 ⊂ · · · is an ascending chain of R-modules, then the union N := ∪i=1Ni is a
R-submodule. Let {s1, . . . , sk} be a finite set of generators of N. If sκ ∈ Niκ , and j := max{i1, . . . , ik}, then it is clear that Nj = N. So the chain becomes stationary as of index j. PROPOSITION 3.5. Suppose that R is a noetherian ring. Then every finitely gen- erated R-module M is noetherian.
PROOF. By assumption M = RS for a finite set S ⊂ M. We prove the propo- sition by induction on the number of elements of S. If S = ∅, then M = {0} and there is nothing to prove. Suppose now S 6= ∅ and choose s ∈ S, so that our induc- tion hypothesis applies to M 0 := RS0 with S0 = S − {s}: M 0 is noetherian. But so is M/M 0, for it is a quotient of the noetherian ring R via the surjective R-module homomorphism R → M/M 0, r 7→ rs + M 0. Let now N1 ⊂ N2 ⊂ · · · be an ascending chain of R-submodules of M. Then 0 0 N1 ∩ M ⊂ N2 ∩ M ⊂ · · · becomes stationary, say as of index j1. Hence we only 0 need to be concerned with the stabilization of the submodules Nk/(Nk ∩ M ) of 16 1. AFFINE VARIETIES
0 0 M/M . These stabilize indeed (say as of index j2), since M/M is noetherian. So the original chain N1 ⊂ N2 ⊂ · · · stabilizes as of index max{j1, j2}. We are now sufficiently prepared for the proofs of the Hilbert theorems. They are gems of elegance and efficiency. We will use the notion of initial coefficient of a polynomial, which we re- d call. Given a ring R, then every nonzero f ∈ R[x] is uniquely written as rdx + d−1 rd−1x + ··· + r0 with rd 6= 0. We call rd ∈ R the initial coefficient of f and denote it by in(f). For the zero polynomial, we simply define this to be 0 ∈ R. Notice that in(f) in(g) equals in(fg) when nonzero.
PROOFOF THEOREM 2.8. The assumption is here that R is a noetherian ring. In view of Proposition 3.4 we must show that every ideal I of R[x] is finitely gener- ated. Consider the subset in(I) := {in(f): f ∈ I} of R. We first show that this is an ideal of R. If r ∈ R, f ∈ I, then r in(f) equals in(rf) when nonzero and since rf ∈ I, it follows that r in(f) ∈ I. If f, g ∈ I with d := deg(f) − deg(g) ≥ 0, then in(f) − in(g) = in(f − tdg). So in(I) is an ideal as asserted. Since R is noetherian, in(I) is finitely generated: there exist f1, . . . , fk ∈ I such that in(I) = R in(f1) + ··· + R in(fk). Let di be the degree of fi, d0 := max{d1, . . . , dk} and R[x] I = R[x]f1 + ··· + R[x]fk + (I ∩ R[x] Our claim implies the theorem: R[x] R-generators of I ∩ R[x] PROPOSITION 3.6 (Artin-Tate). Let R be a noetherian ring, B an R-algebra and A ⊂ B an R-subalgebra. Assume that B is finitely generated as an A-module. Then A is finitely generated as an R-algebra if and only if B is so. Pm PROOF. By assumption there exist b1, . . . , bm ∈ B such that B = i=1 Abi. If there exist a1, . . . , an ∈ A which generate A as an R-algebra (which means that A = R[a1, . . . , an]), then a1, . . . , an, b1, . . . , bm generate B as an R-algebra. Suppose, conversely, that there exists a finite subset of B which generates B as a R-algebra. By adding this subset to b1, . . . , bm, we may assume that b1, . . . , bm also generate B as an R-algebra. Then every product bibj can be written as an A-linear combination of b1, . . . , bm: m X k k bibj = aijbk, aij ∈ A. k=1 3. FINITENESS PROPERTIES AND THE HILBERT THEOREMS 17 Let A0 ⊂ A be the R-subalgebra of A generated by all the (finitely many) coeffi- k P cients aij. This is a noetherian ring by Corollary 2.9. It is clear that bibj ∈ k A0bk P and with induction it then follows that B = R[b1, . . . , bm] ⊂ k A0bk. So B is finitely generated as an A0-module. Since A is an A0-submodule of B, A is also finitely generated as an A0-module by Proposition 3.4. It follows that A is a finitely generated R-algebra. This has a consequence for field extensions: COROLLARY 3.7. A field extension L/K is finite if and only if L is finitely gener- ated as a K-algebra. PROOF. It is clear that if L is a finite dimensional K-vector space, then L is finitely generated as a K-algebra. Suppose now b1, . . . , bm ∈ L generate L as a K-algebra. We must show that every bi is algebraic over K. Suppose that this is not the case. After renumbering we can and will assume that (for some 1 ≤ r ≤ m) b1, . . . , br are algebraically inde- pendent over K and br+1, . . . , bm are algebraic over the quotient field K(b1, . . . , br) of K[b1, . . . , br]. So L is a finite extension of K(b1, . . . , br). We apply Proposition 3.6 to R := K, A := K(b1, . . . , br) and B := L and find that K(b1, . . . , br) is as a K-algebra generated by a finite subset S ⊂ K(b1, . . . , br). If g is a common de- nominator for the elements of S, then clearly K(b1, . . . , br) = K[b1, . . . , br][1/g]. Since K(b1, . . . , br) strictly contains K[b1, . . . , br], g must have positive degree. In N particular, 1 + g 6= 0, so that 1/(1 + g) ∈ K(b1, . . . , br) can be written as f/g , with f ∈ K[b1, . . . , br]. Here we may of course assume that f is not divisible by g in N K[b1, . . . , br]. From the identity f(1+g) = g we see that N ≥ 1 (for g has positive degree). But then f = g(−f + gN−1) shows that f is divisible by g. We thus get a contradiction. EXERCISE 16. Prove that a field which is finite generated as a ring (that is, isomorphic to a quotient of Z[x1, . . . , xn] for some n) is finite. We deduce from the preceding corollary the Nullstellensatz. PROOFOFTHE NULLSTELLENSATZ√ 1.5. Let J ⊂ k[x1, . . . , xn] be an ideal.√ We must show that I(Z(J)) ⊂ J. This amounts to: for every f ∈ k[x1, . . . , xn] − J there exists a p ∈ Z(J) for which f(p) 6= 0. Consider k[x1, . . . , xn]/J and denote by ¯ ¯ f ∈ k[x1, . . . , xn]/J the image of f. Since f is not nilpotent, we have defined ¯ A := (k[x1, . . . , xn]/J)[1/f]. ¯ Notice that A is as a k-algebra generated by the images of x1, . . . , xn and 1/f (hence is finitely generated). Choose a maximal ideal m ⊂ A. Then the field A/m is a finitely generated k-algebra and so by Corollary 3.7 a finite extension of k. But then it must be equal to k, since k is algebraically closed. Denote by φ : k[x1, . . . , xn] → A → A/m = k the corresponding surjection and put pi := φ(xi) n n and p := (p1, . . . , pn) ∈ A . So if we view xi as a function on A , then xi(p) = pi = φ(xi). The fact that φ is a homomorphism of k-algebras implies that it is given as ‘evaluation in p’: any g ∈ k[x1, . . . , xn] takes in p the value φ(g). Since the kernel of φ contains J, every g ∈ J will be zero in p, in other words, p ∈ Z(J). On the other hand, f(p) = φ(f) is invertible, for it has the image of 1/f¯ in A/m = k as its inverse. So f(p) 6= 0. 18 1. AFFINE VARIETIES 4. The affine category We begin with specifying the maps between closed subsets of affine spaces that we wish to consider. DEFINITION 4.1. Let X ⊂ Am and Y ⊂ An be closed subsets. We say that a map f : X → Y is regular if the components f1, . . . , fn of f are regular functions on X (i.e., are given by the restrictions of polynomial functions to X). Composition of a regular function on Y with f yields a regular function on X (for if we substitute in a polynomial of n variables g(y1, . . . , yn) for every variable yi a polynomial fi(x1, . . . , xm) of m variables, we get a polynomial of m variables). So f then induces a k-algebra homomorphism f ∗ : k[Y ] → k[X]. There is also a converse: PROPOSITION 4.2. Let be given closed subsets X ⊂ Am and Y ⊂ An and a k-algebra homomorphism φ : k[Y ] → k[X]. Then there is a unique regular map f : X → Y such that f ∗ = φ. PROOF. Let fi := φ(yi|Y ) ∈ k[X] (i = 1, . . . , n) and define f = (f1, . . . , fn): n ∗ n X → A , so that f yi = fi = φ(yi|Y ). If j : Y ⊂ A denotes the inclusion, then we ∗ ∗ can also write this as f yi = φj yi. In other words, the k-algebra homomorphisms ∗ ∗ f , φj : k[y1, . . . , yn] → k[X] coincide on the generators yi. Hence they must be equal: f ∗ = φj∗. It follows that f ∗ is zero on I(Y ), which means that f takes its values in Z(I(Y )) = Y , and that the resulting map k[Y ] → k[X] equals φ. The same argument shows that if f : X → Y and g : Y → Z are regular maps, then so is their composite gf : X → Z. So we have a category (with objects the closed subsets and regular maps as defined above). In particular, we have a notion of isomorphism: a regular map f : X → Y is an isomorphism if is has a two-sided inverse g : Y → X which is also a regular map. This implies that f ∗ : k[Y ] → k[X] has a two-sided inverse g∗ : k[X] → k[Y ] which is also an isomorphism of k-algebras, and hence is an isomorphism of k-algebras. Proposition 4.2 implies that conversely, an isomorphism of k-algebras k[Y ] → k[X] comes from a unique isomorphism X → Y . We complete the picture by showing that any finitely generated reduced k- algebra A is isomorphic to some k[Y ]; the preceding then shows that Y is unique up to isomorphism. Since A is finitely generated as a k-algebra, there exists a sur- jective k-algebra homomorphism φ : k[x1, . . . , xn] → A. If we put I := Ker(φ), ∼ n then φ induces an isomorphism k[x1, . . . , xn]/I = A. Put Y := Z(I) ⊂ A . Since A is reduced, I is a radical ideal and hence equal to I(Y ) by the Nullstellensatz. It follows that φ factors through a k-algebra isomorphism k[Y ] =∼ A. We may sum up this discussion in categorical language as follows. PROPOSITION 4.3. The map which assigns to a closed subset of some An its coor- dinate ring defines an anti-equivalence between the category of closed subsets of affine spaces (and regular maps between them as defined above) and the category of reduced finitely generated k-algebras (and k-algebra homomorphisms). EXAMPLE 4.4. Consider the regular map f : A1 → A2, f(t) = (t2, t3). The image of this map is the hypersurface Z defined by x3 − y2 = 0. This is a home- omorphism on the image for it is a bijection of sets that have both the cofinite topology. The inverse sends (0, 0) to 0 and is on Z − {(0, 0)} given by (x, y) 7→ y/x. 4. THE AFFINE CATEGORY 19 In order to determine whether the inverse is regular, we consider f ∗. We have k[Z] = k[x, y]/(x3 − y2), A(A1) = k[t] and f ∗ : k[x, y]/(x3 − y2) → k[t] is given by x 7→ t2, y 7→ t3. This homomorphism is not surjective for its image misses t ∈ k[t]. So f is not an isomorphism. EXAMPLE 4.5. An affine-linear transformation of kn is of the form x ∈ kn 7→ g(x) + a, where a ∈ kn and g ∈ GL(n, k) is a linear transformation. Its inverse is y 7→ g−1(y − a) = g−1(y) − g−1(a) and so of the same type. When we regard such an affine linear transformation as a map from An to itself, then it is regu- lar: its coordinates (g1, . . . , gn) are polynomials of degree one. So an affine-linear transformation is also an isomorphism of An onto itself. When n ≥ 2, there exist automorphisms of An not of this form. For instance (x, y) 7→ (x, y + x2) defines an automorphism of A2 with inverse (x, y) 7→ (x, y − x2) (see also Exercise 18). 2 2 2 EXERCISE 17. Let C ⊂ A be the ‘circle’, defined by x + y = 1 and let p0 := (−1, 0) ∈ C. For every p = (x,√ y) ∈ C − {p0}, the line through p0 and p has slope f(p) = y/(x + 1). Denote by −1 ∈ k a root of the equation t2 + 1 = 0. 5 1 (a) Prove√ that when char(k) 6= 2, f defines an isomorphism onto A − {± −1}. √ (b) Consider the map g : C → A1, g(x, y) := x + −1y. Prove that when char(k) 6= 2, g defines an isomorphism of C onto A1 − {0}. (c) Prove that when char(k) = 2, the defining polynomial x2 + y2 − 1 for C is the square of a degree one polynomial so that C is a line. EXERCISE 18. Let f1, . . . , fn ∈ k[x1, . . . , xn] be such that f1 = x1 and fi − xi ∈ n n k[x1, . . . , xi−1] for i = 2, . . . , xn. Prove that f defines an isomorphism A → A . 4.6.Q UADRATIC HYPERSURFACES IN CASE char(k) 6= 2. Let H ⊂ An be a hyper- surface defined by a polynomial of degree two: n X X f(x1, . . . , xn) = aijxixj + aixi + a0. 1≤i≤j=n i=1 By means of a linear transformation (this involves splitting off squares, hence re- P quires the existence of 1/2 ∈ k), the quadratic form 1≤i≤j=n aijxixj can be brought in diagonal form. This means that we can make all the coefficients a √ ij with i 6= j vanish. Another diagonal transformation (which replaces xi by aiixi when aii 6= 0) takes every nonzero coefficient aii to 1 and then renumbering the coordinates (which is also a linear transformation) brings f into the form Pr 2 Pn f(x1, . . . , xn) = i=1 xi + i=1 aixi + a0 for some r ≥ 1. Splitting off squares Pr once more enables us to get rid of i=1 aixi so that we get r n X 2 X f(x1, . . . , xn) = xi + aixi + a0. i=1 i=r+1 We now have the following cases. If the nonsquare part is identically zero, then we end up with the equation Pr 2 i=1 xi = 0 for H. 5We have not really defined what is an isomorphism between two nonclosed subsets of an affine space. Interpret this here as: f ∗ maps k[x, y][1/(x + 1)]/(x2 + y2 − 1) (the algebra of regular functions 2 1 √ on C − {p0}) isomorphically onto k[t][1/(t + 1)] (the algebra of regular functions on A − {± −1}). This will be justified by Proposition 4.8. 20 1. AFFINE VARIETIES Pn If the linear part i=r+1 aixi is nonzero (so that we must have r < n), then an Pn affine-linear transformation which does not affect x1, . . . , xr and takes i=r+1 aixi+ Pr 2 a0 to −xn yields the equation xn = i=1 xi . This is the graph of the function Pr 2 n−1 n−1 i=1 xi on A and so H is then isomorphic to A . If the linear part Pn a x is zero, but the constant term a is nonzero, then i=r+1 i i 0 √ we can make another diagonal transformation which replaces xi by −a0xi) and Pr 2 divide f by a0: then H gets the equation i=1 xi = 1. In particular, there are only a finite number of quadratic hypersurfaces up to isomorphism. (This is also true in characteristic two, but the discussion is a bit more delicate.) The previous discussion (and in particular Proposition 4.3) leads us to asso- ciate to any ring R a space with a topology so that for R = k[X] we get a space homeomorphic to X. Since the points of X correspond to maximal ideals of k[X], we choose the underlying set of this space (which we shall denote by Specm(R)) to be the collection of maximal ideals of R. In order to avoid confusion about whether a maximal ideal is to be viewed as a subset of R or as a point of Specm(R) we agree that if m is a maximal ideal of R, then the corresponding element of Specm(R) is denoted xm and if x ∈ Specm(R), then the corresponding maximal ideal of R is denoted mx. Any a ∈ R defines a ‘zero set’ Z(a) ⊂ Specm(R), being the set of x ∈ R with a ∈ mx. We denote the complement of Specm(R) − Z(a) by U(a). We 0 0 have U(aa ) = U(a) ∩ U(a ) and so the collection of {U(a)}a∈R is the basis of a topology on Specm(R). Note that a subset Specm(R) is closed precisely if it is an intersection of subsets of the form Z(a); this is equal to the common zero set of the set of functions defined by an ideal of R. The space Specm(R) is called the maximal ideal spectrum6 of R (but our notation for it is less standard). We observe for later reference: LEMMA 4.7. The maximal ideal spectrum Specm(R) is quasi-compact: every open covering of Specm(R) admits a finite subcovering. PROOF. It suffices to verify this for an open covering by basic open subsets. So let {gi ∈ R}i∈I be such that Specm(R) = ∪i∈I U(gi). This means that the ideal generated by the gi is not contained in any maximal ideal and hence must be all of R. In particular, this ideal contains 1 so that 1 = a1gi1 + ··· + angin for certain n i1, . . . , in ∈ I and a1, . . . , an ∈ R. It follows that Specm(R) = ∪ν=1U(riν ). Let us now take for our ring a finitely generated k-algebra A. Then for every x ∈ Specm(A), the residue field A/mx is finitely generated as a k-algebra and hence by Corollary 3.7 a finite extension of k. Since we assumed k to be algebraically closed it is equal to k. We write ρx : A → k for the reduction modulo mx. Now every a ∈ A defines a ‘regular function’ a¯ : Specm(A) → k which takes in x ∈ Specm(A) the value ρx(a) ∈ k. Its zero set is indeed Z(a) ⊂ Specm(A). Notice that if A = k[X], then the above discussion shows that Specm(A) can be identified with X as a topological space and that under this identification, k[X] becomes the ring of regular functions on Specm(A). 6I. Gelfand was presumably the first to consider this, albeit in the context of functional analysis: he characterized the Banach algebras that appear as the algebras of continuous C-valued functions on compact Hausdorff spaces and so it might be appropriate to call this the Gelfand spectrum. 4. THE AFFINE CATEGORY 21 A homomorphism φ : A → B of finitely generated k-algebras gives rise to a map Specm(φ) : Specm(B) → Specm(A): if x ∈ Specm(B), then the composite −1 homomorphism ρxφ : A → k is the identity map when restricted to k so that φ mx is a maximal ideal of A with residue field k. We thus get a map Specm(φ) : Specm(B) → Specm(A), xm 7→ xφ−1m. For a ∈ A, the preimage of U(a) under Specm(φ) is U(φ(a)). This shows that Specm(φ) is continuous. We call the resulting pair (Specm(φ), φ) a morphism. PROPOSITION 4.8. Let A be a finitely generated k-algebra and put X := Specm(A). Then for every s ∈ A, A[1/s] is a finitely generated k-algebra (which is reduced when A is) and the natural k-algebra homomorphism A → A[1/s] induces a homeomor- 0 phism of Specm(A[1/s]) onto Xs := X − Z(s). Moreover, for s, s ∈ A the following are equivalent: (i) Xs ⊂ Xs0 , (ii) s0 divides some positive power of s, (iii) we have an A-homomorphism A[1/s0] → A[1/s]. PROOF. It is clear that A[1/s] is a k-algebra and is as such finitely generated (just add to a generating set for A the generator 1/s). If A is reduced, then so is A[1/s]. For if s is not nilpotent (so that A[1/s] 6= 0) then for any nilpotent a/sr ∈ A[1/s], the numerator a must be nilpotent in R. The preimage of a maximal ideal of A[1/s] in A is a maximal ideal of A which does not contain s. Conversely, if m is a maximal ideal of A, then mA[1/s] is a maximal ideal of A[1/s] unless s ∈ m. So the map A → A[1/s] induces an injection of Specm(A[1/s]) in Specm(A) with image Xs. This map is continuous. To see that it is also open, note that a basic open subset of Specm(A[1/s]) is given by some b ∈ A[1/s]. If we write b = a/sn, then this open subset is also given by a and its image in Specm(A) is U(as), hence open. 0 Assume that we have an inclusion Xs ⊂ Xs0 . Then Z(s ) ⊂ Z(s) and so by the Nullstellensatz, s ∈ p(s0). This implies that sn ∈ (s0) for some positive integer n. If we write sn = as0, then we have an A-homomorphism A[1/s0] → A[1/(as0)] = A[1/(sn] = A[1/s]. Conversely, the A-homomorphism A → A[1/s] factors through A[1/s0] if s0 becomes invertible in A[1/s] and its easily follows that s0 divides some positive power of s and Xs ⊂ Xs0 . From now on we identify a basic open subset Xs with the maximal ideal spec- trum Specm(A[1/s]). EXERCISE 19. Give an example of ring homomorphism φ : S → R and a maxi- mal ideal m ⊂ R, such that φ−1m is not a maximal ideal of S. (Hint: take a look at Exercise 12.) EXERCISE 20. Prove that if A is a finitely generated k-algebra, then the map a ∈ A 7→ a¯ is a k-algebra homomorphism from A to the algebra of k-valued functions on Specm(A) with kernel p(0). Show that for every subset X ⊂ Specm(A), the set I(X) of r ∈ A with a¯|X = 0 is a radical ideal of A. 7 DEFINITION 4.9 (Preliminary definition). An affine k-variety is a topological space X endowed with a finitely generated k-algebra k[X] of regular functions on 7Since we fixed k, we will often drop k and speak of an affine variety. 22 1. AFFINE VARIETIES X which identifies X with the maximal ideal spectrum of k[X]. A map f : X → Y between affine varieties is a morphism if composition with f takes regular functions on Y to regular functions on X, so that we have a k-algebra homomorphism f ∗ : k[Y ] → k[X]. Since k[X] appears here as a k-algebra of k-valued functions X → k, k[X] must be reduced. Indeed, there will exist a homeomorphism h of X onto a closed n ∗ subset of some A such that k[X] = h k[x1, . . . , xn]. We shall often denote an affine variety simply by the underlying topological space, e.g., by X, where it is then understood that k[X] is part of the data. It follows from Proposition 4.8 that any basic open subset U of X has the structure of an affine variety. With this terminology Proposition 4.3 takes now a rather trivial form: 4.10. The functor which assigns to an affine k-variety its k-algebra of regular functions identifies the category of affine k-varieties with the dual of the category of reduced finitely generated k-algebras, the inverse being given by the functor Specm. From our previous discussion it is clear that if X is an affine variety, then every closed subset X0 ⊂ X determines an affine variety X0 with k[X0] := k[X]/I(X0). Here I(X0) is of the course the ideal of regular functions on X that vanish on X0. Let f : X → Y be a morphism of affine varieties. Since f is continuous, a fiber f −1(y), or more generally, the preimage f −1Y 0 of a closed subset Y 0 ⊂ Y , will be closed in X. It is the zero set of the ideal in k[X] generated by f ∗I(Y 0). But this ideal need not be a radical ideal. Here is a simple example: EXAMPLE 4.11. Let f : X = A1 → A1 = Y be defined by f(x) = x2. Then f ∗ : k[y] → k[x] is given by f ∗y = x2. If we assume k not to be of characteristic 2, and we take a ∈ Y − {0}, then the fiber f −1(a) consists of two distinct points and is define by the ideal generated by f ∗(y − a) = x2 − a. This a radical ideal and the fiber has coordinate ring k[x]/(x2 − a). However, the fiber over 0 ∈ Y = A1 is the singleton {0} ⊂ X = A1 and the ideal generated by f ∗y = x2 is not a radical ideal. This example indicates that there might good reason to accept nilpotent elements in the coordinate ring of f −1(0) by endowing f −1(0) with the ring of functions k[f −1(0)] := k[x]/(x2). Since this is a k-vector space of dimension 2, we thus retain the information that two points have come together and the fiber should be thought of as a point with multiplicity 2. Example 4.4 shows that a homeomorphism of affine varieties need not be an isomorphism. Here is another class of examples. EXAMPLE 4.12 (THE FROBENIUSMORPHISM). Assume that k has positive char- acteristic p and consider the morphism Φ: A1 → A1, x 7→ xp. If we remember that A1 can be identified with k, then we observe that under this identification, Φ is a field automorphism: Φ(a − b) = (a − b)p = ap − bp = Φ(a) − Φ(b) and of course Φ(ab) = (ab)p = Φ(a)Φ(b) and Φ is surjective (every element of k has a pth root since k is algebraically closed) and injective. But the endomorphism Φ∗ of k[x] induced by Φ sends x to xp and has therefore image k[xp]. Clearly, Φ∗ is not surjective. The fixed point set of Φ (so the set of a ∈ A1 with ap = a) is via the identifi- 1 cation of A with k just the prime subfield Fp ⊂ k and we therefore denote it by 1 1 1 r A (Fp) ⊂ A . Likewise, the fixed point set A (Fpr ) of Φ is the subfield of k with r ¯ p elements. Since the algebraic closure Fp of Fp in k is the union of the finite 5. THE SHEAF OF REGULAR FUNCTIONS 23 1 subfields of k, the affine line over Fp equals ∪r≥1A (Fpr ). This generalizes in a straightforward manner to higher dimensions: by letting Φ act coordinatewise on An, we get a morphism An → An (which we still denote by Φ) which is also a r n n ¯ n bijection. The fixed point of Φ is A (Fpr ) and A (Fp) = ∪r≥1A (Fpr ). EXERCISE 21. Assume that k has positive characteristic p. Let q = pr be a power q of p with r > 0 and denote by Fq ⊂ k the subfield of a ∈ k satisfying a = a. We write F for Φr : An → An. Suppose Y ⊂ An is the common zero set of polynomials with coefficients in Fq ⊂ k. ∗ (a) Prove that f ∈ k[x1, . . . , xn] has its coefficients in Fq if and only if F f = f q. n (b) Prove that an affine-linear transformation of A with coefficients in Fq commutes with F . (c) Prove that F restricts to a bijection FY : Y → Y and that the fixed point m n set of FY is Y (Fqm ) := Y ∩ A (Fqm ). (d) Suppose that k is an algebraic closure of Fp. Prove that every closed subset of An is defined over a finite subfield of k. REMARK 4.13. After this exercise we cannot resist to mention the Weil zeta function. This function and its relatives—among them the Riemann zeta function— codify arithmetic properties of algebro-geometric objects in a very intricate manner. In the situation of Exercise 21, we can use the numbers |Y (Fqm )| (= the number of m P m fixed points of F in Y ) to define a generating series m≥1 |Y (Fqm )|t . It appears to be more convenient to work with the Weil zeta function: ∞ X tm Z (t) := exp |Y ( m )| , Y Fq m m=1 d which has the property that t dt log ZY yields the generating series above. This series has remarkable properties. For instance, a deep theorem due to Bernard Dwork (1960) asserts that it represents a rational function of t. Another deep theorem, due to Pierre Deligne (1974), states that the roots of the numerator and denominator have for absolute value a nonpositive half-integral power of q and that moreover, these powers have an interpretation in terms of an ‘algebraic topology for algebraic geometry’. All of this was predicted by Andr´e Weil in 1949. (This can be put in a broader context by making the change of variable t = q−s. Indeed, now numerator and denominator have their zeroes when the real part of s is a nonnegative half-integer and this makes Deligne’s result reminiscent of the famous conjectured property of the Riemann zeta function.) EXERCISE 22. Compute the Weil zeta function of affine n-space relative to the field of q elements. 5. The sheaf of regular functions In any topology or analysis course you learn that the notion of continuity is local: there exists a notion of continuity at a point so that a function is continuous if it is so at every point of its domain. We shall see that in algebraic geometry the property for a function to be regular is also local in nature. Suppose X is an affine variety and x ∈ X. We write A for k[X] and m for mx. A basic neighborhood of x is of the form Xg with g∈ / m and a regular function 24 1. AFFINE VARIETIES f on that neighborhood is an element of A[1/g]. Let us say that two such pairs 0 (Xg, f) and (Xg0 , f ) have the same germ at x if there exists a neighborhood U of x 0 in Xg ∩ Xg0 such that f|U = f |U. This is an equivalence relation; an equivalence class is called a germ of a regular function on X at x. The germs of regular functions on X at x form an k-algebra, which we shall denote by OX,x. Evaluation in x defines a homomorphism ρx : OX,x → k whose kernel (denoted mX,x) is the unique maximal ideal of OX,x. So OX,x is a local ring. We claim that OX,x = Am. Clearly, any φ ∈ Am is given as a fraction a/g with a ∈ A and g ∈ A − m, hence comes from a regular function on the basic 0 0 0 neighborhood Xg of x. Moreover, if φ is also given as a /g , then we have (ag − a0g)g00 for some g00 ∈ A − m. This just means that a/g and a0/g0 define the same element of A[1/(gg0g00)], where we note that gg0g00 ∈/ m so that U(gg0g00) is a basic neighborhood of x in X. Now let φ be a k-valued function defined on an open subset U of X and suppose x ∈ U. We say that φ is regular at x if it defines an element of OX,x, in other words, if it defines a regular function on some basic neighborhood of x in X. So there should exist g ∈ A − m and some f ∈ k[X][1/g] which coincides with φ on some neighborhood of x (we then may take g in such a manner that φ is in fact equal to f on Xg). We denote by O(U) the set of k-valued functions U → k that are regular at every point of U. This is clearly a k-algebra. PROPOSITION 5.1. A k-valued function φ on X which is regular at every point of X is regular, in other words, the natural k-algebra homomorphism k[X] → O(X) is an isomorphism. A map φ : X → Y between affine varieties is a morphism if and only if φ is continuous and for any f ∈ O(V ) (with V ⊂ Y open) we have f ∗φ = φf ∈ O(f −1V ). PROOF. It is clear that the map is injective: if f ∈ k[X] is in the kernel, then f must be identically zero as a function and hence f = 0. For surjectivity, let φ : X → k be regular in every point of X. We must show that φ is representable by some f ∈ k[X]. By assumption there exists for every x ∈ X, gx ∈ k[X] − mx and fx ∈ k[X] such that φ|U(gx) is representable as the fraction fx/gx. Since X is quasicompact (Lemma 4.7), the covering {U(gx)}x∈X of X pos- N sesses a finite subcovering {U(gxi )}i=1. Let us now write fi for fxi and gi for gxi . Then fi/gi and fj/gj define the same regular function on U(gi) ∩ U(gj) = U(gigj) mij and so gifj − gjfi is annihilated by (gigj) for some mij ≥ 0. Let m := max{mij} m+1 m m m+1 so that gi gj fj = gi gj fi for all i, j. Since φ|U(gi) is also represented by m m+1 (figi )/gi we may then without loss of generality assume that m = 0, so that gifj = gjfi for all i, j. Since the U(gi) cover X, the ideal generated by g1, . . . , gN must be all of k[X] PN and hence contain 1. So 1 = i=1 aigi for certain ai ∈ k[X]. Now consider PN f := i=1 aifi ∈ k[X]. We have for every j, N N X X fgj = aifigj = aigifj = fj i=1 i=1 and so f represents φ on Uj. It follows that φ is represented by f. The last statement is left as an exercise. 6. THE PRODUCT 25 Let us denote by OX the collection of the k-algebras O(U), where U runs over all open subsets of X. The preceding proposition says that OX is a sheaf of k-valued functions on X, by which we mean the following: DEFINITION 5.2. Let X be a topological space and K a ring. A sheaf O of K- valued functions8 on X assigns to every open subset U of X a K-subalgebra O(U) of the K-algebra of K-valued functions on U with the property that (i) for every inclusion U ⊂ U 0 of open subsets of X, ‘restriction to U’ maps O(U 0) in O(U) and (ii) given a collection (Ui)i∈I of open subsets of X and f : ∪i∈I Ui → K, then f ∈ O(∪iUi) if and only if f|Ui ∈ O(Ui) for all i. If (X, OX ) and (Y, OY ) are topological spaces endowed with a sheaf of K- valued functions, then a continuous map f : X → Y is called a morphism if for −1 every open V ⊂ Y , composition with f takes OY (V ) to OX (f V ). The definition above is a way of expressing that we are dealing with a local property—just as we have a sheaf of continuous R-valued functions on a topological space, a sheaf of differentiable R-valued functions on a manifold and a sheaf of holomorphic C-valued functions on a complex manifold. In fact for the following definition, which includes our final definition of an affine variety, we take our cue from the definition of a manifold. With the notion of a morphism, we have a category of topological spaces en- dowed with a sheaf of K-valued functions. In particular, we have the notion of isomorphism: this is a homeomorphism f : X → Y which for every open V ⊂ Y −1 maps OY (V ) onto OX (f V ). Note that a sheaf O of K-valued functions on X restricts to a sheaf O|U for every open U ⊂ X. Here is our final definition of an affine variety. DEFINITION 5.3. A topological space X endowed with a sheaf OX of k-valued functions is called an affine k-variety resp. a quasi-affine k-variety it is isomorphic to a pair as above. Note that X can then be identified with the spectrum of OX (X) resp. an open subset thereof. 6. The product Let m and n be nonnegative integers. If f ∈ k[x1, . . . , xm] and g ∈ k[y1, . . . , yn], then we have a regular function f ∗ g on Am+n defined by f ∗ g(x1, . . . , xm, y1, . . . , yn) := f(x1, . . . , xm)g(y1, . . . , yn). m+n m n It is clear that A (f ∗ g) = Af × Ag , which shows that the Zariski topology on Am+n refines the product topology on Am × An. Equivalently, if X ⊂ Am and Y ⊂ An are closed, then X × Y is closed in Am+n. We give X × Y the topology it inherits from Am+n (which is usually finer than the product topology). For the coordinate rings we have defined a map: k[X] × k[Y ] → k[X × Y ], (f, g) 7→ f ∗ g 8We give the general definition of a sheaf later. This will do for now. A defect of this definition is that a sheaf of K-valued functions on X need not restrict to one on an arbitrary subspace of X. 26 1. AFFINE VARIETIES which is evidently k-bilinear (i.e., k-linear in either variable). We want to prove that the ideal I(X × Y ) defining X × Y in Am+n is generated by I(X) and I(Y ) (viewed as subsets of k[x1, . . . , xm, y1, . . . yn]) and that X × Y is irreducible when X and Y are. This requires that we translate the formation of the product into al- gebra. This centers around the notion of the tensor product, the definition of which we recall. (Although we here only need tensor products over k, we shall define this notion for modules over a ring, as this is its natural habitat. This is the setting that is needed later anyhow.) If R is a ring and M and N are R-modules, then we can form their tensor product over R, M ⊗R N: as an abelian group M ⊗R N is generated by the expressions a ⊗R b, a ∈ M, 0 0 b ∈ N and subject to the conditions (ra) ⊗R b = a ⊗R (rb), (a + a ) ⊗R b = a ⊗R b + a ⊗R b 0 0 and a ⊗R (b + b ) = a ⊗R b + a ⊗R b . So a general element of M ⊗R N can be written PN like this: i=1 ai ⊗R bi, with ai ∈ M and bi ∈ N. We make M ⊗R N an R-module if we stipulate that r(a ⊗R b) := (ra) ⊗R b (which is then also equal to r ⊗R (rb)). Notice that the map ⊗R : M × N → M ⊗R N, (a, b) 7→ a ⊗R b, is R-bilinear (if we fix one of the variables, then it becomes an R-linear map in the other variable). In case R = k we shall often omit the suffix k in ⊗k. EXERCISE 23. Prove that ⊗R is universal for this property in the sense that every R- bilinear map M×N → P of R-modules is the composite of ⊗R and a unique R-homomorphism M ⊗R N → P . In other words, the map HomR(M ⊗R N,P ) → BilR(M × N,P )), f 7→ f ◦ ⊗R is an isomorphism of R-modules. EXERCISE 24. Let m and n be nonnegative integers. Prove that Z/(n) ⊗Z Z/(m) can be identified with Z/(m, n). If A is an R-algebra and N is an R-module, then A ⊗R N acquires the structure of an A-module which is characterized by 0 0 a.(a ⊗R b) := (aa ) ⊗R b. For instance, if N is an R-vector space, then C ⊗R N is a complex vector space, the complexi- fication of N. If A and B are R-algebras, then A⊗R B acquires the structure of an R-algebra characterized by 0 0 0 0 (a ⊗R b).(a ⊗R b ) := (aa ) ⊗R (bb ). Notice that A → A ⊗R B, a 7→ a ⊗R 1 and B → A ⊗R B, b 7→ 1 ⊗R b are R-algebra homo- morphisms. For example, A ⊗R R[x] = A[x] as A-algebras (and hence A ⊗R R[x1, . . . , xn] = A[x1, . . . , xn] with induction). EXERCISE 25. Prove that C ⊗R C is as a C-algebra isomorphic to C ⊕ C with componen- twise multiplication. PROPOSITION 6.1. For closed subsets X ⊂ Am and Y ⊂ An the bilinear map k[X] × k[Y ] → k[X × Y ], (f, g) 7→ f ∗ g induces an isomorphism µ : k[X] ⊗ k[Y ] → k[X × Y ] of k-algebras (so that in particular k[X] ⊗ k[Y ] is reduced). If X and Y are irreducible, then so is X × Y , or equivalently, if k[X] and k[Y ] are domains, then so is k[X] ⊗ k[Y ]. PROOF. Since the obvious map k[x1, . . . , xm] ⊗ k[y1, . . . , yn] → k[x1, . . . , xm, y1, . . . , yn] 7. FUNCTION FIELDS AND RATIONAL MAPS 27 is an isomorphism, it follows that µ is onto. In order to prove that µ is injective, PN let let first observe that every u ∈ k[X] ⊗ k[Y ] can be written u = i=1 fi ⊗ gi with g1, . . . , gN are k-linearly independent. Given p ∈ X, then the restriction of PN ∼ PN µ(u) = i=1 fi∗gi to {p}×Y = Y is the regular function up := i=1 fi(p)gi ∈ k[Y ]. Since the gi’s are linearly independent, we have up = 0 if and only if fi(p) = 0 for all i. In particular, the subset X(u) ⊂ X of p ∈ X for which up = 0, is equal to N ∩i=1Z(fi) and hence closed. If µ(u) = 0, then up = 0 for all p ∈ X and hence fi = 0 for all i. So u = 0. This proves that µ is injective. Suppose now X and Y irreducible and suppose that u is zero divisor: uv = 0 for some v ∈ k[X × Y ] with u, v 6= 0. Since the restriction of uv = 0 to {p} × Y =∼ Y is upvp and k[Y ] is a domain, it follows that up = 0 or vp = 0. So X = X(u) ∪ X(v). Since X is irreducible we have X = X(u) or X(v). This means that u = 0 or v = 0, which contradicts our assumption. EXERCISE 26. Let X and Y be closed subsets of affine spaces. Prove that each irreducible component of X × Y is the product of an irreducible component of X and one of Y . It is clear that the projections πX : X × Y → X and πY : X × Y → Y are regular. We have observed that X × Y has not a topological product in general. Still it is the ‘right’ product in the sense of category theory: it has the following universal property, almost seems too obvious to mention: if Z is a closed subset of some affine space, then any pair of regular maps f : Z → X, g : Z → Y defines a regular map Z → X × Y characterized by the property that its composite with πX resp. πY yields f resp. g (this is of course (f, g)). 7. Function fields and rational maps In this section we interpret the fraction ring of an algebra of regular functions. Let Y be an affine variety. Recall that for f, g ∈ k[Y ], a fraction f/g ∈ Frac(k[Y ]) is defined if g not a zerodivisor. We need the following lemma. LEMMA 7.1. Let Y be an affine variety and let g ∈ k[Y ]. Then g is not a zero divisor if and only if g is nonzero on every irreducible component of Y . This is also equivalent to Yg being dense in Y . PROOF. Suppose g is zero on some irreducible component C of Y and suppose C 6= Y . If Y 0 denotes the union of the irreducible components of Y distinct form C, then either Y 0 = ∅ (and so g = 0 in k[Y ]) or Y 0 6= ∅ and so there exists a nonzero 0 0 0 0 0 g ∈ IY (Y ). Then gg vanishes on C ∪ Y = Y and so gg = 0. It follows that g is then a zero divisor. Conversely, assume that g is a zerodivisor of k[Y ]. So there exists a nonzero g0 ∈ k[Y ] with gg0 = 0. Let C be an irreducible component of Y on which g0 is nonzero. Then we must have g|C = 0. The proof of the last clause is left to you. This lemma tells us that this means that the affine variety Yg is open-dense in 0 0 Y . Clearly, f/g defines a regular function Yg → k. Another such fraction f /g yields a regular function on an open-dense U(g0) and their product f/g.f 0/g0 = ff 0/gg0 resp. difference f/g−f 0/g0 = (fg0 −f 0g)/gg0 in Frac(k[Y ]) defines a regular function on U(gg0) which is there just the product resp. difference of the associated 28 1. AFFINE VARIETIES regular functions. In particular, if f 0/g0 = f/g in k[Y ], then the two associated regular functions coincide on U(gg0). There is a priori no best way to represent a given element of Frac(k[Y ]) as a fraction (as there is in a UFD), and so we must be content with the observation that an element of Frac(k[Y ]) defines a regular function on some (unspecified) open dense subset of Y , with the understanding that two such functions are regarded as equal if they coincide on an open-dense subset contained in a common domain of definition. When an element of Frac(k[Y ]) is thus understood, then we call it a rational function on Y . This is the algebro- geometric analogue of a meromorphic function in complex function theory. When Y is irreducible, then Frac(k[Y ]) is a field, called the function field of Y , and will be denoted k(Y ). We shall now give a geometric interpretation of finitely generated field exten- sions of k and the k-linear field homomorphisms between them. We begin with a notion that generalizes that of a rational function. DEFINITION 7.2. Let X and Y be affine varieties. A rational map from X to Y is given by a pair (U, F ), where U is an affine open-dense subset of X and F : U → Y is a morphism, with the understanding that a pair (V,G) defines the same rational map if F and G coincide on U ∩ V . We denote a rational map like this f : X 99K Y . We say that the rational map is dominant if for a representative pair (U, F ), F (U) is dense in Y . (This is then also so for any other representative pair. Why?) PROPOSITION 7.3. Any finitely generated field extension of k is k-isomorphic to the function field of an irreducible affine variety. If X and Y are irreducible and affine, then a dominant rational map f : X 99K Y determines a k-linear field embedding f ∗ : k(Y ) ,→ k(X) and every k-linear field embedding k(Y ) → k(X) is of this form (for a unique f). PROOF. Let K/k be a finitely generated field extension of k: there exist ele- ments a1, . . . , an ∈ K such that every element of K can be written as a fraction of polynomials in a1, . . . , an. So if R denotes the k-subalgebra R of K generated by a1, . . . , an, (a domain since K is a field), then K is the field of fractions of R. Since R is the coordinate ring of a closed irreducible subset X ⊂ An (defined by the kernel of the obvious ring homomorphism k[x1, . . . , xn] → R), it follows that K can be identified with k(X). Suppose we are given an open-dense affine subvariety U ⊂ X and a morphism F : U → Y with F (U) dense in Y . Now F ∗ : k[Y ] → k[U] will be injective, for if g ∈ k[Y ] is in the kernel: F ∗(g) = 0, then F (U) ⊂ Z(g0). Since F (U) is dense in Y , this implies that Z(g) = Y , in other words, that g = 0. Hence the composite map k[Y ] → k[U] → k(U) = k(X) is an injective homomorphism from a domain to a field and therefore extends to a field embedding k(Y ) → k(X). It remains to show that every k-linear field homomorphism Φ: k(Y ) → k(X) is so obtained. For this, choose generators b1, . . . , bm of k[Y ]. Then Φ(b1),..., Φ(bm) are rational functions on X and so are regular on an open-dense affine subset U ⊂ X. Since b1, . . . , bm generate k[Y ] as a k-algebra, it follows that Φ maps k[Y ] to k[U] ⊂ k(X). This is a k-algebra homomorphism and so defines a morphism F : U → Y such that F ∗ = Φ|k[Y ]. The image of F will be dense by the argument above: if the image of F is contained in a closed subset of Y distinct from Y , then its maps to some Z(g) with g ∈ k[Y ] − {0}. But this implies that Φ(g) = F ∗(g) = 0, 7. FUNCTION FIELDS AND RATIONAL MAPS 29 which cannot happen, since Φ is injective. It is clear that Φ is the extension of F ∗ to the function fields. As to the uniqueness: if (V,G) is another solution, then choose a nonempty affine open-dense subset W ⊂ U ∩ V so that F and G both define morphisms W → Y . The maps k[Y ] → k[W ] they define are equal, for are given by Φ. Hence F and G coincide on W . Since W is dense in U ∩V , they also coincide on U ∩V . EXERCISE 27. Let f ∈ k[x1, . . . , xn+1] be irreducible of positive degree. Its zero n+1 set X ⊂ A is then closed and irreducible. Denote by d the degree of f in xn+1. (a) Prove that the projection π : X → An induces an injective k-algebra ∗ homomorphism π : k[x1, . . . , xn] → k[X] = k[x1, . . . , xn]/(f) if and only if d > 0. (b) Prove that for d > 0, π is dominant and that the resulting field homomor- phism k(x1, . . . , xn) → k(X) is a finite extension of degree d. COROLLARY 7.4. We have a category with the irreducible affine varieties as objects and the rational dominant maps as morphisms. Assigning to an irreducible affine variety its function field makes this category anti-equivalent to the category of finitely generated field extensions of the base field k. PROPOSITION-DEFINITION 7.5. A rational map f : X 99K Y is an isomorphism in the above category (that is, induces a k-linear isomorphism of function fields) if and only if there exists a representative pair (U, F ) of f such that F maps U isomorphically onto an open subset of Y . If these two equivalent conditions are satisfied, then f is called a birational map. If a birational map X 99K Y merely exists (in other words, if there exists a k-linear field isomorphism between k(X) and k(Y )), then we say that X and Y are birationally equivalent. PROOF. If f identifies a nonempty open subset of X with one of Y , then f ∗ : k(Y ) → k(X) is clearly a k-algebra isomorphism. Suppose now we have a k-linear isomorphism k(Y ) =∼ k(X). Represent this isomorphism and its inverse by (U, F ) and (V,G) respectively. Then F −1V is affine and open-dense in U and GF : F −1V → X is defined. Since GF induces the identity on k(X), GF is the identity on a nonempty open subset of F −1V and hence on all of F −1V . This implies that F maps F −1V injectively to G−1U. For the same reason, G maps G−1U injectively to F −1V . So F defines an isomorphism −1 −1 between the open subsets F V ⊂ X and G U ⊂ Y . EXERCISE 28. We here assume k not of characteristic 2. Let for X ⊂ A2 be 2 2 1 defined by x1 + x2 = 1. Prove that X is birationally equivalent to the affine line A . (Hint: take a look at Exercise 17.) More generally, prove that the quadric in An+1 2 2 2 n defined by x1 + x2 + ··· xn+1 = 1 is birationally equivalent to A . What about the n+1 2 2 2 quadric cone in A defined by x1 + x2 + ··· xn+1 = 0 (n ≥ 2)? In case k(X)/k(Y ) is a finite extension, one may wonder what the geometric meaning is of the degree d of that extension, perhaps hoping that this is just the number of elements of a general fiber of the associated rational map X 99K Y . We will see that this is often true (namely when the characteristic of k is zero, or more generally, when this characteristic does not divide d), but not always, witness the following example. 30 1. AFFINE VARIETIES EXAMPLE 7.6. Suppose k has characteristic p > 0. We take X = A1 = Y and let f be the Frobenius map: f(a) = ap. Then f is homeomorphism, but f ∗ : k[Y ] → k[X] is given by y 7→ xp and so induces the field extension k(y) = k(xp) ⊂ k(x), which is of degree p. From the perspective of Y , we have enlarged its algebra of regular functions by introducing a formal pth root of its coordinate (which yields another copy of A1, namely X). From the perspective of X, k[Y ] is just f ∗k[X]. This is in fact the basic example of a purely inseparable field extension, i.e., a field extension L/K with the property that every element of L has a minimal poly- nomial in K[T ] that has at precisely one root in L. Such a polynomial must be of r the form T p − c, with c ∈ K not a p-th power of an element of K when r > 0, where p is the characteristic of K (for p = 0 we necessarily have L = K). So if L 6= K, then the Frobenius map Φ: a ∈ K 7→ ap ∈ K is not surjective. Purely in- separable extensions are not detected by Galois theory, for they have trivial Galois group: after all, there is only one root to move around. For any algebraic extension L/K, the elements of L that are separable over K make up an intermediate extension Lsep/K that is (of course) separable and is such that L/Lsep is purely inseparable. When L is an algebraic closure of K, then Lsep is called a separable algebraic closure of K: it is a separable algebraic extension of K which is maximal for that property. Then L is as an extension of Lsep obtained by successive adjunction of p-power roots of elements of Lsep: it is an extension minimal for the property of making the Frobenius map Φ: a ∈ L 7→ ap ∈ L surjective. In case L/K is a finite normal extension (i.e., a finite extension with the property that every f ∈ K[x] that is the minimal polynomial of some element of L has all roots in L), then the fixed point set of the Galois group of L/K is a subextension Linsep/L that is purely inseparable, whereas L/Linsep separable. In that case sep insep the natural map L ⊗K L → L is an isomorphism of K-algebras. EXERCISE 29. Let f : X → Y be a dominant morphism of irreducible affine varieties which induces a purely inseparable field extension k(Y )/k(X). Prove that there is an open-dense subset U ⊂ Y such that f restricts to a homeomorphism f −1U → U. (Hint: show first that it suffices to treat the case when k(X) is obtained from k(Y ) by adjoining the pth root of an element f ∈ k(Y ). Then observe that if f is regular on the affine open-dense U ⊂ Y , then Y contains as an open dense subset a copy of the locus of (x, t) ∈ U × A1 satisfying tp = f.) If K/k is a finitely generated field extension and generated by b1, . . . , bm, say, then we may after renumbering assume that for some r ≤ m, b1, . . . , br are alge- braically independent (so that the k-linear field homomorphism k(x1, . . . , xr) → K which sends xi to bi is injective) and br+1, . . . , bm are algebraic over k(b1, . . . , br). This number r is invariant of K/k, called its transcendence degree. So if K = k(X), r then this defines a dominant rational map X 99K A . We now recall the theorem of the primitive element: THEOREM 7.7. Every separable field extension L/K of finite degree is generated by a single element (which we then call a primitive element for L/K). It follows that if br+1, . . . , bm are separable over k(b1, . . . , br), then we can find a b ∈ K that is algebraic over k(b1, . . . , br) such that K is obtained from k(b1, . . . , br) by adjoining b. This b is a root of an irreducible polynomial F ∈ k(b1, . . . , br)[T ]. If we clear denominators, we may assume that the coefficients of 8. FINITE MORPHISMS 31 F lie in k[b1, . . . , br][T ] = k[b1, . . . , br,T ]. Then we can take for X the hypersurface r+1 in A defined by F , when viewed as an element of k[x1, . . . , xr, xr+1]. In particular, every irreducible affine variety Y is birationally equivalent to a hypersurface in Ar+1, where r is the transcendence degree of k(Y ). This suggests that this transcendence degree might be a good way to define the dimension of Y . We shall return to this. Much of the algebraic geometry in the 19th century and early 20th century was of a birational nature: birationally equivalent varieties were regarded as not really different. This sounds rather drastic, but it turns out that many interesting properties of varieties are an invariant of their birational equivalence class. Here is an observation which not only illustrates how affine varieties over al- gebraically nonclosed fields can arise when dealing with affine k-varieties, but one that is also a tell-tale sign that we ought to enlarge the maximal ideal spectrum. Let f : X → Y be a dominant morphism of irreducible affine varieties. This implies that f ∗ : k[Y ] → k[X] is injective and that f(X) contains an open-dense subset of Y . Then we may ask whether there exists something like a general fiber: is there an open-dense subset V ⊂ Y such that the fibers f −1(y), y ∈ V all “look the same”? The question is too vague for a clear answer and for most naive ways of making this precise, the answer will be no. But there is a cheap way out by simply refusing to specify one such V and allowing it to be arbitrarily small: we would like to take a limit over all open-dense subsets of Y . This we cannot do (yet) topologically, but we can do this algebraically by making all the nonzero elements of k[Y ] in k[X] invertible: we take −1 (k[Y ] − {0}) k[X] = k(Y ) ⊗k[Y ] k[X] (this equality follows from the fact that k[X] = k[Y ] ⊗k[Y ] k[X]). This is a reduced finitely generated k(Y )-algebra and we may regard its maximal ideal spectrum Specm(k(Y )⊗k[Y ] k[X]) as an affine variety over the (algebraically nonclosed) field k(Y ): now every regular function on X which comes from Y is treated as a scalar (and will be invertible when nonzero). If we decide to add to Y =∼ Specm(k(Y )) a point η with residue field k(Y ), then we can simply say that the generic fiber of f is 9 the fiber Xη over η(Y ) . But once we decide to do this for Y , we should of course do this for every irreducible affine variety. In particular, we must then add for every irreducible subset Z of Y (or equivalently, for every prime ideal p ⊂ k[Y ]) a point to Specm(k(Y )) with residue field k(Z) (= Frac(k[Y ]/p)). We will later do this in the same kind of generality as the maximal ideal spectrum, namely for an arbitrary ring. 8. Finite morphisms Let A be a ring and B an A-algebra. PROPOSITION-DEFINITION 8.1. We say that b ∈ B is integrally dependent on A if one the following equivalent properties is satisfied. 9It is often preferable to work over an algebraically closed field. We can remedy this simply by choosing an algebraic closure L of k(Y ). The maximal ideal spectrum of L ⊗k[Y ] k[X] is then an affine L-variety, and yields a notion of a general fiber that is even closer to our geometric intuition. 32 1. AFFINE VARIETIES n n−1 (i) b is a root of a monic polynomial x + a1x + ··· + an ∈ A[x], (ii) A[b] is as an A-module finitely generated, (iii) A[b] is contained in a A-subalgebra C ⊂ B which is finitely generated as an A-module. In particular, if B is finite over A (which is short for: B is a finitely generated A-module), then B is integral over A (which is short for: every element of B is integral over A). n n−1 PROOF. (i) ⇒ (ii). If b is a root of x + a1x + ··· + an ∈ A[x], then clearly A[b] is generated as a A-module by 1, b, b2, . . . , bn−1. (ii) ⇒ (iii) is obvious. (iii) ⇒ (i). Suppose that C is as in (iii). Choose an epimorphism π : An → C n of A-modules and denote the standard basis of A by (e1, . . . , en). We may (and Pn will) assume that π(e1) = 1. By assumption, bπ(ei) = j=1 xijπ(ej) for certain n xij ∈ A. Put σ := (bδij − xij)i,j ∈ End(A[b] ). Then πσ = 0. 0 n 0 If σ ∈ End(A[b] ) is the matrix of cofactors of σ, then σσ = det(σ)1n by 0 Cramer’s rule. We find that det(σ) = det(σ)π(e1) = π(det(σ)e1) = π(σσ )(e1) = 0 (πσ)σ (e1) = 0. Clearly, det(σ) is a monic polynomial in b. The integral closure of A in B is the set elements of B that are integrally depen- B dent on A and will be denoted A . COROLLARY 8.2. The integral closure of A in B is an A-subalgebra of B. PROOF. It is enough to prove the following: if b, b0 ∈ B are integrally depen- dent over A, then every element of A[b, b0] is algebraically dependent over A. Since b0 is integrally dependent over A, it is so over A[b]. Hence A[b][b0] = A[b, b0] is a finitely generated A[b]-module. Since A[b] is a finitely generated A-module, it fol- lows that A[b, b0] is a finitely generated A-module. Hence by (iii) every element of 0 A[b, b ] is integrally dependent over A. EXERCISE 30. Prove that ‘being integral over’ is transitive: if B is an A-algebra integral over A, then any B-algebra that is integral over B is as an A-algebra inte- gral over A. We say that the ring homomorphism A → B is an integral extension if it is injective (so that A may be regarded as a subring of B) and B is integral over A. PROPOSITION 8.3. Let A ⊂ B be an integral extension and let p ⊂ A be a prime ideal of A. Then the going up property holds: p is of the form q ∩ A, where q is a prime ideal of B. If also is given is a prime ideal q0 of B with the property that q0 ∩ A ⊂ p, then we can take q ⊃ q0. Moreover the incomparability property holds: two distinct prime ideals of B having the same intersection with A cannot obey an inclusion relation (equivalently, if q0 ∩ A = p, then q = q0 is the only solution). If B is a domain, then Frac(B) is an algebraic field extension of Frac(A), which is finite whenever B is finite over A. PROOF. Consider the localizations Ap resp. ApB obtained by making the ele- ments of A − p invertible. Then Ap is a local ring with maximal ideal p˜ := pAp and ApB is an integral extension of Ap. If we find a prime ideal ˜q of ApB with ˜q∩Ap = p˜, then q := ˜q ∩ B is a prime ideal of B the property that q ∩ A = ˜q ∩ A = p˜ ∩ A = p. Hence there is no loss in generality in assuming that A is a local ring and p is its unique maximal ideal mA. 8. FINITE MORPHISMS 33 We claim that mAB 6= B. Otherwise 1 ∈ B can be written as an mA-linear combination of a finite set elements of B. Denote by B0 the A-algebra generated by this finite set. Then B0 is an integral extension of A and finitely generated as an 0 0 A-algebra. This implies that B is a finitely generated A-module. Since 1 ∈ mAB 0 0 we have B = mAB , and it then follows from Nakayama’s lemma (recalled below) that B0 = 0. This is clearly a contradiction. Now that we know that mAB 6= B, we can take for q any maximal ideal of B which contains the ideal mAB: then qB ∩ A is a maximal ideal of A, hence equals mA. For the refinement and the incomparability property we can, simply by passing to the integral extension A/(q0 ∩ A) ⊂ B/q0, assume that q0 = 0. This reduces the refinement to the case already treated, while the incomparability property then amounts showing that q 6= (0) implies q ∩ A 6= (0). If b is a nonzero element of q, n n−1 then it is a root of a monic polynomial: b +a1b +···+an−1b+an = 0 (ai ∈ A), where we can of course assume that an 6= 0 (otherwise divide by b). We then find that 0 6= an ∈ Bb ∩ A ⊂ q ∩ A. Since B is a union of finitely generated A-modules, the last clause follows if we prove that Frac(B) = Frac(A)B. Let b ∈ B. It is a root of a monic polynomial n n−1 n−1 x + a1x + ··· + an = 0 (ai ∈ A) with an 6= 0 and so 1/b = −1/an.(b + n−2 a1b + ··· + an−1) ∈ Frac(A)B. REMARK 8.4. Note that the going-up property implies that for any strictly as- cending sequence p0 ( p1 ( ··· ( pn of prime ideals in A is the intersection with A of a (necessarily strictly) ascending sequence of prime ideals q0 ⊂ q1 ⊂ · · · ⊂ qn in B. Conversely, the incomparability property implies that if we intersect a strictly ascending sequence of prime ideals q0 ( q1 ( ··· ( qn in B with A, we get a strictly ascending sequence of prime ideals of A. For completeness we state and prove: LEMMA 8.5 (Nakayama’s lemma). Let R be a local ring with maximal ideal m and M a finitely generated R-module. Then a finite subset S ⊂ M generates M as a R-module if (and only if) the image of S in M/mM generates the latter as a R/m-vector space. In particular (take S = ∅), mM = M implies M = 0. PROOF. In fact, we first reduce to the case when S = ∅ by passing to M/RS. Our assumptions then say that M = mM. We must show that M = 0. Let π : Rn → M be an epimorphism of R-modules and denote the standard basis of Rn by Ps (e1, . . . , en). By assumption there exist xij ∈ m such that π(ei) = j=1 xijπ(ej). n So if σ := (δij − xij)i,j ∈ End(R ), then πσ = 0. Notice that det(σ) ∈ 1 + m. Since 1 + m consists of invertible elements, Cramer’s rule shows that σ is invertible. So −1 −1 π = π(σσ ) = (πσ)σ = 0 and hence M = 0. EXERCISE 31. Prove that an integral extension A ⊂ B defines surjective mor- phisms Spec(B) → Spec(A) and Specm(B) → Specm(A). DEFINITION 8.6. We say that a morphism of affine varieties f : X → Y is finite if the k-algebra homomorphism f ∗ : k[Y ] → k[X] is finite, i.e., makes k[X] a finitely generated k[Y ]-module. EXERCISE 32. Prove that if Y is an affine variety, then the disjoint union of its irreducible components is finite over Y . 34 1. AFFINE VARIETIES COROLLARY 8.7. Let f : X → Y be a finite morphism of affine varieties such that f ∗ : k[Y ] → k[X] is injective. Then f is surjective and every closed irreducible subset Y 0 ⊂ Y is the image of a closed irreducible subset X0 ⊂ X. If furthermore is given a closed irreducible subset X00 ⊂ X with f(X00) ⊃ Y 0, then we may choose X0 ⊂ X00. If in addition X is irreducible, then so is Y and f ∗ : k(Y ) → k(X) is a finite algebraic extension. ∗ PROOF. Apply Proposition 8.3 to f . EXERCISE 33. Prove that a finite morphism f : X → Y of affine varieties is closed and that every fiber f −1(y) is finite (possibly empty). THEOREM 8.8 (Noether normalization). Let K be a field. Every finitely generated K-algebra B is a finite extension of a polynomial algebra over K: there exist an integer r ≥ 0 and an injection K[x1, . . . , xr] ,→ B such that B is finite over K[x1, . . . , xr]. In fact, if we are given an epimorphism φ : K[x1, . . . , xm] → B of K-algebras, then φ is an isomorphism or we can take in this statement r < m. The proof will be with induction on m and the induction step is worth recording separately. LEMMA 8.9. Let φ : K[x1, . . . , xm] → B be an epimorphism of K-algebras, which is not an isomorphism. Then there exists a K-algebra homomorphism φ0 : 0 K[x1, . . . , xm−1] → B such that φ(xm) is integral over the image of φ . PROOF. We define for any positive integer s a K-algebra automorphism σs of sm−i K[x1, . . . , xm] by σs(xm) = xm and σs(xi) := xi + xm for i < m (its inverse sm−i m fixes xm and sends xi to xi − xm for i < m). So if I = (i1, . . . , im) ∈ Z≥0, and I i1 im x := x1 ··· xm , then m−1 I s i1 s im−1 im σs(x ) = (x1 + xm ) ··· (xm−1 + xm) xm . When viewed as an element of K[x1, . . . , xm−1][xm], this is a monic polynomial in m−1 m−2 m xm of degree pI (s) := i1s + i2s + ··· + im. Now give Z≥0 the lexicographic ordering. Then I > J implies pI (s) > pJ (s) for s large enough. Choose a nonzero −1 m f ∈ ker(φ). Then σs(f) ∈ ker(φσs ). If I ∈ Z≥0 is the largest multi-exponent of a monomial that appears in f with nonzero coefficient, then for s large enough, σs(f) is a constant times a monic polynomial in xm of degree pI (s) with coeffi- −1 cients in K[x1, . . . , xm−1] and hence φσs (xm) = φ(xm) becomes integral over −1 0 −1 φσs K[x1, . . . , xm−1]. So φ := φσs |K[x1, . . . , xm−1] is then as desired. PROOFOF NOETHER NORMALIZATION. Let φ : K[x1, . . . , xm] → B be an epi- morphism of K-algebras which is not an isomorphism (otherwise we are done). We prove the refined version of the theorem with induction on m. According to 0 Lemma 8.9, there exists a K-algebra homomorphism φ : K[x1, . . . , xm−1] → B 0 0 0 such that φ(xm) is integral over B := φ (K[x1, . . . , xm−1]). By induction, B is a finite extension of some polynomial algebra K[x1, . . . , xr] with r ≤ m − 1. Hence so is B. REMARK 8.10. If A is a domain, then according to Proposition 8.3, Frac(A) will be a finite extension of K(x1, . . . , xr) and so r must be the transcendence degree of Frac(A)/K. In particular, r is an invariant of A. 8. FINITE MORPHISMS 35 COROLLARY 8.11. For every affine variety Y there exists an integer r ≥ 0 and a finite surjective morphism f : Y → Ar. This corollary gives us a better grasp on the geometry of Y , especially when Y is irreducible, for it shows that Y can be spread over Ar in a finite-to-one manner over affine r-space. The last assertion of Proposition 8.3 has a converse, also due to Noether, which we state without proof. *THEOREM 8.12 (Emmy Noether). Let K be a field and A be a finitely generated L K-domain. Then for any finite field extension L/ Frac(A), the integral closure A of A in L is finite over A. L If we take L = Frac(A), then A is called the normalization of A. If A equals its normalization, then we say that A is normal. We carry this terminology to the geometric setting by saying that an irreducible affine variety X is normal when k[X] is. Any nonempty open subset of An is normal. More generally: LEMMA 8.13. Any unique factorization domain A is normal. d d PROOF. Suppose b ∈ Frac(A) is integral over A: b +a1b +···+ad with ai ∈ A. d d−1 d Write b = r/s with r, s relatively prime. Then r + a1rs + ··· + ads and we see d that r is divisible by s. This can only happen when s is a unit, so that b ∈ A. Proposition 8.12 has a remarkable geometric interpretation: let be given an irreducible affine variety Y and a finite field extension L/k(Y ). Then Proposi- L tion 8.12 asserts that A is a finitely generated k[Y ]-module. It is also an do- main (because it is contained in a field) and so it defines an irreducible variety L L YL := Spec(A ). Since A ⊂ A is an integral extension, we have a finite surjective morphism YL → Y . Notice that this morphism induces the given field extension L/k(Y ). This shows that every finite field extension of k(Y ) is canonically realized by a finite morphism of irreducible affine varieties! If L is a separable algebraic closure of k(Y ), then this does not apply, for L/k(Y ) will not be finite, unless Y is a singleton. But L can be written as a mono- ∞ tone union of finite (separable) field extensions: L = ∪i=1Li with Li ⊂ Li+1 and Li+1/Li finite. This yields a sequence of finite surjective morphisms Y ← YL1 ← YL2 ← YL3 ← · · · of which the projective limit can be understood as a “pro-affine variety” (a point of ∞ this limit is given by a sequence (yi ∈ YLi )i=1 such that yi+1 maps to yi for all i). Its ∞ Li L algebra of regular functions is ∪i=1k[Y ] = k[Y ] (which is usually not a finitely generated k-algebra) and its function field is L. Of special interest is the case of a finite Galois extension. We begin with the cor- responding result from commutative algebra which has also important applications in algebraic number theory: PROPOSITION 8.14. Let A be a normal domain and let L/ Frac(A) be a finite normal extension with Galois group G. Then G leaves invariant the integral closure L A of A in L, and for every prime ideal p ⊂ A, G acts transitively on the set of prime L ideals q ⊂ A that lie over p (i.e., with q ∩ A = p). 36 1. AFFINE VARIETIES L PROOF. That any g ∈ G leaves invariant A is clear, for g fixes the coefficients of an equation of integral dependence over A. 0 0 0 Let q and q lie over p. We first show that q ⊂ ∪g∈Ggq. If b ∈ q , then the norm Q of b over Frac(A), which is defined as the product g∈G gb, lies in Frac(A), and L L hence in A ∩Frac(A). Since A is assumed to be normal, we have A ∩Frac(A) = A. Q 0 Hence g∈G gb ∈ A ∩ q = p ⊂ q. So we must have gb ∈ q for some g ∈ G, or equivalently, b ∈ g−1q. 0 From q ⊂ ∪g∈Ggq and the fact that each gq is a prime ideal, it follows from the prime avoidance lemma below that q0 ⊂ gq for some g ∈ G. Since both q0 and gq 0 lie over p, the incomparability property implies that q = gq. LEMMA 8.15 (The prime avoidance lemma). Let R be a ring, q1,..., qn prime n ideals in R and I ⊂ R an ideal contained in ∪i=1qi. Then I ⊂ qi for some i. PROOF. We prove this induction on n. The case n = 1 being trivial, we may assume that n > 1 and that for every i = 1, . . . , n, I is not contained in ∪j6=iqj. Choose ai ∈ I \ ∪j6=iqj. So then ai ∈ qi. Consider a := a1a2 ··· an−1 + an. Then a ∈ I and hence a ∈ qi for some i. If i < n, then an = a − a1a2 ··· an−1 ∈ qi and we get a contradiction. If i = n, then a1a2 ··· an−1 = a − an ∈ qn and hence aj ∈ qn for some j ≤ n − 1. This is also a contradiction. We transcribe this to algebraic geometry: COROLLARY 8.16. Let Y be a normal variety and L/k(Y ) be a normal Galois extension with Galois group G. Then G acts on YL in such a manner that it commutes 0 with f : YL → Y : for all g ∈ G we have fg = f. If Y ⊂ Y is closed an irreducible, then G acts transitively on the irreducible components of f −1Y 0. This also leads for normal domains to a supplement of the going up property: COROLLARY 8.17 (Going down property). Suppose A is a normal domain and B ⊃ A an integral extension without zero divisors. Given prime ideals p in A and q0 in B such that q0 ∩ A ⊃ p, then there exists a prime ideal q in B with q ∩ A ⊂ p. PROOF. Choose an algebraic closure Frac(B) of Frac(B) and let L be the sub- field of Frac(B) generated by all the Frac(A)-embeddings of Frac(B) in Frac(B) (so this is obtained by adjoining the roots of the irreducible polynomials in Frac(A)[x] having a root in Frac(B)). Galois theory tells us that this is a finite normal ex- tension of Frac(B) (with Galois group G, say), which brings us in the situation of L Proposition 8.14 above. Since L contains Frac(B) , A contains B. Observe that L A is finite over B and (hence) over A. L Put p0 := q0 ∩ A so that p0 ⊃ p. According to Proposition 8.3 we can find in A nested prime ideals p˜0 ⊃ p˜ which meet A in p0 ⊃ p. Similarly, there exists a prime L ideal ˜q0 in A which meets B in q0. Since ˜q0 and p0 both meet A in p0, there exists according to Proposition 8.14a g ∈ G which takes p˜0 to ˜q0. Then upon replacing p˜0 by g0p˜0, we may assume that p˜0 = ˜q0. Then q := p˜ ∩ B is as desired: it meets A in p 0 0 and is contained in p˜ ∩ B = ˜q ∩ B = q. 9. Dimension One way to define the dimension of a topological space X is as follows: the empty set has dimension −1 and X has dimension ≤ n if it admits a basis of open 9. DIMENSION 37 subsets such that the boundary of every basis element has dimension ≤ n − 1. This is close in spirit to the definition that we shall use here (which is however adapted to the Zariski topology; as you will find in Exercise 34, it is useless for Hausdorff spaces). DEFINITION 9.1. Let X be a nonempty topological space. We say that the Krull dimension of X is at least d if there exists an irreducible chain of length d in X, that is, a strictly descending chain of closed irreducible subsets Y 0 ) Y 1 ) ··· ) Y d of X. The Krull dimension of X is the supremum of the d for which an irreducible chain of length d exists and we then write dim X = d. We stipulate that the Krull dimension of the empty set is −1. LEMMA 9.2. For a subset Z of a topological space X we have dim Z ≤ dim X. PROOF. If Y ⊂ Z is irreducible, then so is its closure Y¯ in X. So if we have an irreducible chain of length d in Z, then the closures of the members of this chain yield an irreducible chain of length d in X. This proves that dim Z ≤ dim X. EXERCISE 34. What is the Krull dimension of a nonempty Hausdorff space? EXERCISE 35. Let U be an open subset of the space X. Prove that for an irreducible chain Y • in X of length d with U ∩ Y d 6= ∅, U ∩ Y • is an irreducible chain of length d in U. Conclude that if U is an open covering of X, then dim X = supU∈U dim U. EXERCISE 36. Suppose that X is a noetherian space. Prove that the dimension of X is the maximum of the dimensions of its irreducible components. Prove also that if all the singletons (= one element subsets) in X are closed, then dim(X) = 0 if and only if X is finite. It is straightforward to transcribe this notion of dimension into algebra: DEFINITION 9.3. Let R be a ring. We say that the Krull dimension of R is at least d if there exists an prime chain of length d in R, that is, a strictly ascending chain p• of prime ideals p0 ( p1 ( ··· ( pd in R. The Krull dimension of R is the supremum of the d for which this is the case and we then write dim R = d. We stipulate that the trivial ring R = {0} (which has no prime ideals) has Krull dimension −1. The prime ideals of a ring R are the preimages of the prime ideals of Rred := R/p(o) and so these rings have the same dimension. It is clear that for a closed subset X ⊂ An, dim k[X] = dim X. Similarly, we have that for a finitely generated k-algebra A, dim A = dim Spec(A). Remark 8.4 shows immediately: LEMMA 9.4. The Krull dimension is invariant under integral extension: if B is integral over A, then A and B have the same Krull dimension. The notion of Krull dimension was easily defined, but can be difficult to use in concrete cases. How can we be certain that given prime chain has maximal possible length? It is not even clear how we can tell whether the Krull dimension of a given ring is finite. We will settle this for a finitely generated k-domain (i.e., of the form k[Y ] with Y an irreducible affine variety), by showing that this is just the transcendence degree over k of its field of fractions. 38 1. AFFINE VARIETIES THEOREM 9.5. Let K be a field. For a finitely generated K-domain A, the Krull dimension of A equals the transcendence degree of Frac(A)/K. Moreover, every max- imal prime chain in A (i.e., one that cannot be extended to a longer prime chain) has length dim A. PROOF. We prove both assertions with induction on the transcendence degree of Frac(A). By Noether normalization there exists an integer r ≥ 0 and a K-linear embedding of K[x1, . . . , xr] in A such that A is finite over K[x1, . . . , xr]. Then Frac(A) is a finite extension of K(x1, . . . , xr) and so the transcendence degree of Frac(A)/K is r. Let (0) = q0 ( q1 ( ··· ( qm be a prime chain of A of length m. By the incomparability property, p• := q• ∩ Sr is then a prime chain in Sr, also of length m. Choose an irreducible f ∈ p1. After renumbering the coordinates, we may PN i assume that f does not lie in K[x1, . . . , xr−1] and so f = i=0 aN−ixr with ai ∈ K[x1, . . . , xr−1], a0 6= 0 and N > 1. Then the image of xr in Frac(Sr/(f)) is a N PN−1 i root of the monic polynomial xr + i=0 (aN−i/a0)xr ∈ K(x1, . . . , xr−1)[xr] so that Frac(Sr/(f)) is a finite extension of K(x1, . . . , xr−1). Hence Frac(Sr/(f) has transcendence degree r − 1 over K. By our induction induction hypothesis, the Krull dimension of Sr/(f) equals r − 1. The image of the subchain p1 ( ··· ( pm in Sd/(f) will still be strictly ascending (for it will be so in Sr/p1) and so m−1 ≤ r−1. Hence m ≤ r. On the other hand, r is attained as the length of a prime chain in A, for the prime chain of length r in Sr given by (0) ( (x1) ( (x1, x2) ( ··· ( (x1, x2, . . . , xr), lifts to one in A, by Remark 8.4. The last assertion relies on the going down theorem. If we assume that q• is a maximal prime chain in A, then the going down property Corollary 8.17 (which applies here since Sr is normal) implies that p• is one in Sr. Since our irreducible f generates a prime ideal, it must then generate p1. The image of p1 ( ··· ( pm in Sr/p1 = Sr/(f) is then a maximal prime chain in Sr/(f) and hence of length r − 1 by induction hypothesis. This shows that m = r. If S ⊂ R is a multiplicative system, then the preimage of a prime ideal ˜q of S−1R under the ring homomorphism R → S−1R is a prime ideal q of R which does not meet S and we have ˜q = S−1q. This sets up a bijection between the prime ideals of S−1R and those of R not meeting S. If we take S = R − p, with p a prime ideal, then this implies that dim Rp is the supremum of the prime chains in R which end with p. Since the prime chains in R which start with p correspond to prime chains in R/p, it follows that dim Rp +dim R/p is the supremum of the prime chains in R which contain p. According to Theorem 9.5, the latter is for a domain R finitely generated over a field always equal to the dimension of R. COROLLARY 9.6. Let X be an irreducible affine variety. Then every irreducible chain in X is a subchain of one of length dim X. Moreover, every nonempty open affine U ⊂ X has the same dimension as X. EXERCISE 37. Prove that a hypersurface in An has dimension n − 1. EXERCISE 38. Let X be an irreducible affine variety and Y ⊂ X an irreducible closed subvariety. Prove that dim X − dim Y is equal to the Krull dimension of k[X]I(Y ). 10. NONSINGULAR POINTS 39 EXERCISE 39. Prove that when X and Y are irreducible affine varieties, then dim(X ×Y ) = dim X +dim Y . (Hint: Embed each factor as a closed subset of some affine space. You may also want to use the fact that the equality to be proven holds in case X = Am and Y = An.) 10. Nonsingular points In this section we focus on the local properties of an affine variety X = Spec(A) (so A = k[X] is here a reduced finitely generated k-algebra) at a point o and so a −1 central role will be played by the local algebra OX,o = mo A whose maximal ideal −1 is mX,o = (A − mo) mo. If k = C and X ⊂ Cn is a closed subset of dimension d, then we hope that there is a nonempty open subset of X where X is ‘smooth’, i.e., where X looks like a complex submanifold of complex dimension d. Our goal is to define smoothness in algebraic terms (so that it make sense for our field k) and then to show that the set of smooth points of a variety is open and dense in that variety. Our point of departure is the implicit function theorem. One version states that n n if U ⊂ R is an open neighborhood of o ∈ R and fi : U → R, i = 1, . . . n − d are differentiable functions such that fi(o) = 0 and the total differentials at o, df1(o), . . . , dfn−d(o) are linearly independent in o (this is equivalent to: the Jaco- bian matrix of (f1, . . . , fn−d) at o is of rank n − d), then the common zero set of f1, . . . , fn−d is a submanifold of dimension d at o whose tangent space there is the common zero set of df1(o), . . . , dfn−d(o). In fact, one shows that this solution set is there the graph of a map: we can express n − d of the coordinates as differentiable functions in the d remaining ones. Conversely, any submanifold of Rn at o of di- mension d is locally thus obtained. We begin with the observation that for any ring R partial differentiation of a polynomial f ∈ R[x1, . . . , xn] is well-defined and produces another polynomial. −1 The same goes for a fraction φ = f/g in R[x1, . . . , xn][g ]: a partial derivative of φ is a rational function (in this case with denominator g2). We then define the total differential of a rational function φ ∈ R[x1, . . . , xn] as usual: n X ∂φ dφ := (x)dx , ∂x i i=1 i where for now, we do not worry about interpreting the symbols dxi: we think of dφ simply as a regular map from an open subset of An to a k-vector space of dimension n with basis dx1, . . . , dxn, leaving its intrinsic characterization for later. However, caution is called for when R is a field of positive characteristic: EXERCISE 40. Prove that f ∈ k[x] has zero derivative, if and only if f is constant or (when char(k) = p > 0) a pth power of some g ∈ k[x]. Generalize this to: given f ∈ k[x1, . . . , xn], then df = 0 if and only if f is constant or (when char(k) = p > 0) a pth power of some g ∈ k[x1, . . . , xn]. We should also be aware of the failure of the inverse function theorem: EXAMPLE 10.1. Let C ⊂ A2 be the curve defined by y2 = x3 + x. By any reasonable definition of smoothness we should view the origin (0, 0) as a smooth point of C. Indeed, when k = C, the projection f : C → A1, (x, y) 7→ y, would 40 1. AFFINE VARIETIES be a local-analytic isomorphism. But the map is not locally invertible within our category: the inverse requires us to find a rational function x = u(y) which solves the equation y2 = x3 + x and it is easy to verify that none exists. (We can solve for x formally: x = u(y) = y2 − y6 + 3y10 + ··· , where it is important to note that the coefficients are all integers so that this works for every characteristic.) In fact, the situation is worse: no affine neighborhood U of (0, 0) in C is isomorphic to an open subset V of A1. The reason is that this would imply that k(C) = k(U) is isomorphic to k(V ) = k(x) and one can show that this is not so. Somewhat related to this is an issue illustrated by the following example. EXAMPLE 10.2. Consider the curve C0 ⊂ A2 defined by xy = x3 + y3 and let o = (0, 0). The polynomial x3 + y3 − xy is irreducible in k[x, y], so that k[C0] is 0 0 without zero divisors. Hence OC0,o ⊂ k(C ) is also without zero divisors. But C seems to have two branches at o which apparently can only be recognized formally: one such branch is given by y = u(x) = x2 + x5 + 3x8 + ··· and the other by interchanging the roles of x and y: x = v(y) = y2 + y5 + 3y8 + ··· . If we use ξ := x − v(y) and η := y − u(x) as new formal coordinates, then C0 is simply given at 0 by the reducible equation ξη = 0. These examples make it clear that for a local understanding of a variety X at o, the local ring OX,o still carries too much global information, and that one way to get rid of this overload might be by passing formal power series. This is accomplished by passage to what is known as formal completion. 10.3.F ORMALCOMPLETION. Let R be a ring and I ( R a proper ideal. We endow every R-module M with a topology, the I-adic topology of which a basis is the collection additive translates of the submodules InM, i.e., the collection of subsets a + InM, a ∈ M, n ≥ 0. This is a topology indeed: given two basic subsets 0 a + InM, a0 + In M, then for any element c in their intersection, the basic subset 0 c + Imax{n,n }M is also in their intersection. The fact that our basis is translation invariant implies that with this topology, M is a topological abelian group ((a, b) ∈ M × M 7→ a − b ∈ M is continuous). If we endow R also with the I-adic topology and R × M with the product topology, then the map (r, a) ∈ R × M → ra ∈ M which gives the action by R is also continuous. For the topology on M to be Hausdorff, it suffices that the intersection of all n neighborhoods of 0 is reduced to {0}: ∩n≥0I M = {0}. It is then even metriz- able: if φ : Z+ → (0, ∞) is any strictly monotonously decreasing function with −n limn→∞ φ(n) = 0 (one often takes u , where u > 1), then a metric ρ is defined by ρ(a, a0) := inf{φ(n): a − a0 ∈ InM}. This metric is nonarchimedean in the sense that ρ(a, a00) ≤ max{ρ(a, a0), ρ(a0, a00)}. ∞ and a sequence (an ∈ M)n=0 is then a Cauchy sequence if and only if for every integer n ≥ 0 all but finitely many terms lie in the same coset of InM in M; n in other words, there exists an index in ≥ 0 such that aj − ain ∈ I M for all j ≥ in. This makes it clear that the notion of Cauchy sequence is independent of the choice of φ. We also note that the Cauchy sequence defines a sequence of n cosets (αn ∈ M/I M)n≥0 with the property that αn is the reduction of αn+1 ∈ M/In+1. Recall that a metric space is said to be complete if every Cauchy sequence in that space converges. A standard construction produces a completion of every metric space M: its points are represented by Cauchy sequences in M, with the 10. NONSINGULAR POINTS 41 understanding that two such sequences represent the same point if the distance between the two nth terms goes to zero as n → ∞. In the present situation this yields what is called the I-adic completion, Mˆ I . From the above discussion we see n that an element of Mˆ I is uniquely given by a sequence (αn ∈ M/I M)n≥0 whose terms are compatible in the sense that αn is the reduction of αn+1 for all n. It is an R-module for componentwise addition and multiplication so that the obvious n map M → Mˆ I , a 7→ (a + I )n≥0 is a R-module homomorphism. (This makes Mˆ I a Q∞ n ˆ submodule of the R-module n=0 M/I M.) It is easy to verify that MI is complete for the Iˆ-adic topology and that the homomorphism M → Mˆ I is continuous and has dense image. ˆ Q∞ n ˆ Notice that RI is in fact a subring of n=0 R/I in a way that makes R → RI n n a ring homomorphism. Moreover, Mˆ I is a RˆI -module (use that M/I M is a R/I - module for every n) and an R-homomorphism φ : M → N of R-modules extends ˆ n n uniquely to an RˆI -homomorphism φI : Mˆ I → NˆI (use that φ sends I M to I N and hence induces a homomorphism M/In → N/In). EXAMPLE 10.4. Take the ring k[x1, . . . , xn]. Its completion relative the maximal ideal (x1, . . . , xn) is just the ring of formal power series k[[x1, . . . , xn]]. We get the n same result if we do this for the localization OA ,o of k[x1, . . . , xn] at (x1, . . . , xn). EXAMPLE 10.5. Take the ring Z. Its completion with respect to the ideal (p), ˆ ˆ p prime, yields the ring of p-adic integers Z(p): an element of Z(p) is given by a i ∞ sequence (ai ∈ Z/(p ))i=1 with the property that ai is the image of ai+1 under the reduction Z/(pi+1) → Z/(pi). We get the same result if we do this for the localization Z(p) = {a/b : a ∈ Z, b ∈ Z − (p)}. If n is an integer ≥ 2, then it follows ˆ Q ˆ from the Chinese remainder theorem that Z(n) = p|n Z(p). EXERCISE 41. Let I and J be ideals of a ring R and m a positive integer with J m ⊂ I. Prove that the J-adic topology is finer than the I-adic topology and that there is a natural continuous ring homomorphism RˆJ → RˆI . Conclude that when ˆ ˆ√ R is noetherian, RI can be identified with R I . *LEMMA 10.6 (Artin-Rees). Let R be a noetherian ring, I ⊂ R an ideal, M a finitely generated R-module and M 0 ⊂ M a R-submodule. Then there exists an integer n0 ≥ 0 such that for all n ≥ 0: M 0 ∩ In+n0 M = In(M 0 ∩ In0 M). The proof (which is ingeneous, but not difficult) can be found in any book on commutative algebra (e.g., [1]). The Artin-Rees lemma implies that for every n ≥ 0 0 0 n0 n 0 0 there exists a n ≥ 0 such that M ∩ I M ⊂ I M (we can take n = n + n0). This implies that the inclusion M 0 ⊂ M is not merely continuous, but that the I-adic topology on M 0 is induced by that of M. We will use the Artin-Rees lemma via this property only. ˆ 0 ˆ COROLLARY 10.7. In the situation of Lemma 10.6, the homomorphism MI → MI induced by the inclusion M 0 ⊂ M is a closed embedding with the property that the 0 preimage of M is M : if the image of a ∈ M under M → Mˆ I lies in the image ˆ 0 ˆ 0 ˆ ˆ 0 of MI → MI , then a ∈ M . Moreover, MI /MI can be identified with the I-adic completion of M/M 0. 42 1. AFFINE VARIETIES One might phrase this by saying that when R is noetherian, I-adic completion is an exact functor on the category of finitely generated R-modules. PROOFOF COROLLARY 10.7. We first part of the corollary amounts to the fol- 0 ∞ 0 lowing property: if a sequence (ai ∈ M )i=0 in M is a Cauchy sequence in M, then it is already one in M 0 and when its limit lies in the image of M, then this limit 0 ˆ 0 ˆ lies in fact in the image of M (an element of the kernel of MI → MI is given by 0 ∞ 0 a sequence (ai ∈ M )i=0 in M which converges in M to 0 and so this implies that ˆ 0 ˆ MI → MI is injective). 0 ∞ Suppose therefore given a sequence (ai ∈ M )i=0 which is a Cauchy sequence n in M. Then for every n there exists an in such that ai − ain ∈ I M for i ≥ in. So if 0 n+n0 n 0 n0 n 0 we take i ≥ in+n0 , then ai − ain ∈ M ∩ I M = I (M ∩ I M) ⊂ I M . Hence 0 ∞ 0 0 ∞ (ai ∈ M )i=0 is a Cauchy sequence in M . If (ai ∈ M )i=0 converges to the image 0 n 0 of a ∈ M, then for every n there exists an in such that ai − a ∈ I M for i ≥ in. i ≥ i0 a − a ∈ InM 0 So if we take n+n0 , then by the preceding argument, i . Hence n 0 0 0 ∞ 0 a ∈ ai + I M ⊂ M and (ai ∈ M )i=0 converges to the image of a in M . ˆ ˆ 0 As to the last statement, we observe that RI -homomorphism MI → M/M\ I 0 ˆ 0 induced by the surjection M → M/M is also surjective: any b ∈ M/M\ I is rep- 0 ∞ n 0 resentable by a Cauchy sequence (bi ∈ M/M )i=0 with bm − bn ∈ I (M/M ) for n 0 ∼ n n 0 m ≥ n. Since I (M/M ) = I M/(I M ∩ M ), we can represent bn+1 − bn by some n dn+1 ∈ I M. Let d0 ∈ M lift b0. Then the series d0 + d1 + ··· + dn + ··· converges in Mˆ and maps to ˆb. 0 0 The kernel of MI → M/M\ I clearly contains M and since a kernel is closed, ˆ 0 ˆ it will also contain its closure MI in MI . In order to see that it is not bigger, 0 let aˆ ∈ ker(MI → M/M\ I ) be arbitrary. If we represent aˆ by a Cauchy sequence ∞ 0 n (ai ∈ M)i=0, then for every n there exists an in such that for i ≥ in, ai ∈ M +I M. 0 0 0 n ∞ Write ai accordingly: ai = ai + di with ai ∈ M and di ∈ I M. Since (di ∈ M)i=0 0 0 ∞ converges to 0 ∈ M, (ai ∈ M )i=0 is also a Cauchy sequence converging to aˆ. As we saw above, this is then a Cauchy sequence in M 0 with the same limit and so ˆ 0 aˆ ∈ MI . Let R be a noetherian local ring with maximal ideal m and residue field κ. We use simply b for completion with respect to m. Then m is a finitely generated R-module. Since the ring R acts on m/m2 via R/m = κ, m/m2 is a finite dimensional vector space over κ. DEFINITION 10.8. The Zariski cotangent space T ∗(R) of R is the κ-vector space 2 2 m/m and its κ-dual, T (R) := Homκ(m/m , κ) (which is also equal to HomR(m, κ)), is called the Zariski tangent space T (R) of R. The embedding dimension embdim(R) is the dimension of m/m2 as a vector space over κ. ∗ If X is an affine variety and o ∈ X, then the Zariski cotangent space To X resp. the Zariski tangent space ToX resp. the embedding dimension embdimo X of X at o are by definition that of OX,o. For instance, the embedding dimension of An at any point o ∈ An is n. This n n follows from the fact that the map do : f ∈ mA ,o 7→ df(o) ∈ k defines an isomor- 2 ∼ n phism of k-vector spaces m n /m n = k . We note in passing that we here have A ,o A ,o a way of understanding the total differential at o in more intrinsic terms as the map 2 d : O n → m n /m n which assigns to f ∈ O n the image of f −f(o) ∈ m n o A ,o A ,o A ,o A ,o A ,o 10. NONSINGULAR POINTS 43 2 in m n /m n . Thus, a differential of f at o can be understood a k-linear function A ,o A ,o n n 2 df(o): T → k and (d (x ) = dx (o)) is a basis of m n /m n . oA o i i i=1 A ,o A ,o Notice that the embedding dimension and the Zariski tangent space of a local ring only depend on its formal completion. EXERCISE 42. Let (R0, m) and (R, m0) be local rings with the same residue field10 κ. Prove that a nonzero ring homomorphism φ : R0 → R induces an κ- linear map of Zariski tangent spaces T (φ): T (R) → T (R0) An application of Nakayama’s lemma to M = m yields: COROLLARY 10.9. The embedding dimension of a noetherian local ring R is the smallest number of generators of its maximal ideal. The embedding dimension is zero if and only if R is a field. DEFINITION 10.10. A local ring R with maximal ideal m is said to be regular if it is noetherian and its Krull dimension equals its embedding dimension. A point o of an affine variety X is called regular if its local ring OX,o is and singular when it is not. (We write Xreg resp. Xsing for the subsets of X defined by this property.) An affine variety without singular points is said to be nonsingular. We will show that for a point o ∈ X this is indeed an intrinsic characterization of ‘being like a submanifold’. We begin with a formal version of the implicit function theorem. n n LEMMA 10.11. Let o ∈ A and let f1, . . . , fn−d ∈ mA ,o be such that their images 2 in m n /m n are linearly independent. Then f , . . . , f generate a prime ideal A ,o A ,o 1 n−d n n p in the local ring OA ,o and the formal completion of OA ,o/p with respect to its maximal ideal is isomorphic to k[[x1, . . . , xd]] as a complete local k-algebra. Moreover, n there exists an affine neighborhood U of o in A on which f1, . . . , fn−d are regular and generate a prime ideal in k[U] with the property that the corresponding irreducible affine variety X ⊂ U has o as a regular point. n PROOF. Let us abbreviate OA ,o by O and its maximal ideal by m. Extend f1, . . . , fn−d to a system of regular functions f1, . . . , fn ∈ m such that their images in m/m2 are linearly independent. (After an affine linear transformation, we may n 2 then just as well assume that o is the origin of A and that fi ≡ xi (mod m ).) N N+1 Hence the monomials of degree N in f1, ··· , fn map to a k-basis of m /m . This implies that the monomials of degree ≤ N in f1, ··· , fn make up a k-basis of O/mN+1. It follows that the map yi ∈ k[[y1, . . . , yn]] 7→ fi ∈ Oˆ ∼ determines an isomorphism k[[y1, . . . , yn]] = Oˆ of complete local rings (a ring iso- morphism that is also a homeomorphism). Its inverse defines a topological embed- ding of O in k[[y1, . . . , yn]] (sending fi to yi). The ideal generated by (y1, . . . , yn−d) in k[[y1, . . . , yn]] is the closure of the image of p. It is clearly a prime ideal. Accord- ing to Corollary 10.7 the preimage of that prime ideal in O is p (hence p is a prime ideal) and the embedding O/p ,→ k[[y1, . . . , yn]]/(y1, . . . , yn−d) = k[[yn−d+1, . . . , yn]] 10By this we mean that φ induces an isomorphism R0/m0 =∼ R/m and that we identify the two residue fields via this isomorphism. 44 1. AFFINE VARIETIES realizes the m-adic completion of O/p. n Let U ⊂ A be an affine neighborhood of o on which the f1, . . . , fn−d are regular and denote by P ⊂ k[U] the preimage of p under the localization homor- morphism k[U] → O. This is a prime ideal which contains f1, . . . , fn−d and has the property that it generates the same ideal in O as (f1, . . . , fn−d). Choose a finite set Pn−d of generators φ1, . . . , φr of P and write φi = j=1 uijfj with uij ∈ O. By passing to a smaller U, we may then assume that each uij is regular on U so that P is in fact equal to the ideal (f1, . . . , fn−d) in k[U] generated by f1, . . . , fn−d. Hence (f1, . . . , fn−d) defines an irreducible affine variety X ⊂ U which has o is a regular point of dimension d. n THEOREM 10.12. Let X ⊂ A be closed and let o ∈ X. Then the local ring OX,o is regular of dimension d if and only there exist regular functions f1, . . . , fn−d on an n affine neighborhood U of o in A such that these functions generate IU (X ∩ U) and df1, . . . , dfn−d are linearly independent in every point of U. In that case, X ∩ U is regular and for every q ∈ X ∩ U the Zariski tangent space TqX is the kernel of the n n−d linear map (df1(q), . . . , dfn−d(q)) : TqA → k . n PROOF. Suppose that OX,o is regular of dimension d. Let IX,o ⊂ OA ,o be the ideal of regular functions at o vanishing on a neighborhood of o in X, in other n words, the kernel of OA ,o → OX,o. The latter is a surjective homomorphism of 2 n n local rings and so the preimage of mX,o resp. mX,o is IX,o + mA ,o = mA ,o resp. 2 2 ∼ 2 I + m n . It follows that the quotient m n /(I + m n ) = m /m has X,o A ,o A ,o X,o A ,o X,o X,o 2 2 dimension d. So the image (I +m n )/m n of I in the n-dimensional vector X,o A ,o A ,o X,o 2 space m n /m n must have dimension n − d. Choose f , . . . , f ∈ I such A ,o A ,o 1 n−d X,o that df1(o), . . . , dfn−d(o) are linearly independent. We show among other things that these functions generate IX,o (this will in fact be the key step). n According to Lemma 10.11, the ideal pi ⊂ OA ,o generated by f1, . . . , fi is prime and so we have a prime chain (0) ⊆ p0 ( p1 ( ··· ( pn−d ⊆ IX,o. As OX,o is of dimension d, there also exists a prime chain of length d containing IX,o: n IX,o ⊆ q0 ( q1 ( ··· ( qd ⊆ OA ,o. n Since OA ,o has dimension n, these two prime chains cannot make up a prime sequence of length n + 1 and so pn−d = IX,o = q0. 0 In particular, f1, . . . , fn−d generate IX,o. Let U be an affine neighborhood n 0 of o in A on which the fi’s are regular and has the property that X ∩ U is the common zero set of f1, . . . , fn−d. Since the differentials df1(o), . . . , dfn−d(o) are linearly independent, there exist n − d indices 1 ≤ ν1 < ν2 < ··· < νn−d ≤ n such 0 0 that δ := det((∂fi/∂xνj )i,j) ∈ k[U ] is nonzero in o. Then U := U(δ) ⊂ U has all the asserted properties (with the last property following from Lemma 10.11). The converse says that if U, o and f1, . . . , fn−d are as in the theorem, then n n the functions f1, . . . , fn−d generate a prime ideal Io in OA ,o such that OA ,o/Io is regular. This follows Lemma 10.11. The last assertion is clear from the preceding. PROPOSITION 10.13. The regular points of an affine variety X form an open- dense subset Xreg of X. 10. NONSINGULAR POINTS 45 For the proof we will assume Proposition 10.14 below, which we will leave unproved for now. (Note that it tells us only something new in case k has positive characteristic.) *PROPOSITION 10.14. Every finitely generated field extension L/k (so with k as in these notes, i.e., algebraically closed) is separably generated, by which we mean that there exists an intermediate extension k ⊂ K ⊂ L such that K/k is purely tran- scendental (i.e., of the form k(x1, . . . , xr)) and L/K is a finite separable extension. According to 7.7 and the subsequent discussion, this implies that every irre- ducible affine variety is birationally equivalent to a hypersurface. PROOFOF PROPOSITION 10.13. Without loss of generality we may assume that X is irreducible. Since we already know that Xreg is open, it remains to see that it is nonempty. It thus becomes an issue which only depends on k(X). Hence it suffices to treat the case of a hypersurface in An so that I(X) is generated by an irreducible polynomial f ∈ k[x1, . . . , n]. In view of Lemma Lemma 10.11 it then suffices to show that df is not identically zero on X. Suppose otherwise, i.e., that each partial derivative ∂f/∂xi vanishes on X. Then each ∂f/∂xi must be multiple of f and since the degree of ∂f/∂xi is less than that of f, this implies that it is identically zero. But then we know from Exercise 40 that the characteristic p of k is positive p and that f is of the form g . This contradicts the fact that f is irreducible. EXERCISE 43. Let X be a nonsingular variety. Prove that X is connected if and only if it is irreducible. REMARK 10.15. This enables us to find for an affine variety X of dimension d a descending chain of closed subsets X = Xd ⊃ Xd−1 ⊃ · · · ⊃ X0 such that dim Xi ≤ i and all the (finitely many) connected components of Xi − Xi−1 are nonsingular subvarieties of dimension i (such a chain is called a stratification of X). d−1 We let X be the union of Xsing and the irreducible components of dimension < d and if (with downward induction) Xi has been defined, then we take for Xi−1 i the union of the singular locus Xreg and the irreducible components of dimension ≤ i − 1. Then dim Xi−1 ≤ i − 1 and every connected component of Xi − Xi−1 i is an nonempty open subset of some Xreg and hence a nonsingular subvariety of dimension i. 10.16.D IFFERENTIALS AND DERIVATIONS. The differential that we defined earlier has an intrinsic, coordinate free description that turns out to be quite useful. Let us begin with the observation that the formation of the total differential of a polynomial, φ ∈ k[x1, . . . , xn] 7→ Pn ∂φ dφ := (x)dxi is a k-linear map which satisfies the Leibniz rule: d(φψ) = φdψ+ψdφ. i=1 ∂xi This property is formalized with the following definition. Let us fix a ring R (the base ring) and an R-algebra A. DEFINITION 10.17. Let M be a A-module. An R-derivation of A with values in M is a map D : A → M is an R-module homomorphism which satisfies the Leibniz rule: D(a1a2) = a1D(a2) + a2D(a2) for all a1, a2 ∈ A. The last condition usually prevents D from being an A-module homomorphism. Let us note that (by taking a1 = a2 = 1) that we must have D(1) = 0. Then it also follows that for every r ∈ R, D(r) = rD(1) = 0. Note also that if a happens to be invertible in A, then 0 = D(1) = D(a/a) = 1/aD(a) + aD(1/a) so that D(1/a) = −D(a)/a2. 46 1. AFFINE VARIETIES Given a1, . . . , an ∈ A, then the values of D on a1, . . . , an determine its values on 0 the subalgebra A by the ai’s, for if φ : R[x1, . . . , xn] → A denotes the corresponding R- homomorphism and f ∈ R[x1, . . . , xn], then n X ∂f Dφ(f) = φ Dai. ∂xi i=1 If we combine this with the formula for D(1/a), we see that this even determines D on 0 0 the largest subalgebra A of A generated by the ai’s and the invertible elements in A . In particular, if we are given a field extension L/K, then a K-derivation of L with values in some L-vector space is determined by its values on a set of generators of L as a field extension of K. Observe that the set of R-derivations of A in M form an R-module: if D1 and D2 are R-derivations of A with values in M, and a1, a2 ∈ A, then a1D1 + a2D2 is also one. We denote this module by DerR(A, M). EXERCISE 44. Prove that if D1,D2 ∈ DerR(A, A), then [D1,D2] := D1D2 − D2D1 ∈ DerR(A, A). What do we get for R = k and A = k[x1, . . . , xn]? It is immediate from the definition that for every homomorphism of A-modules φ : M → N, the composition of a D as above with φ is an R-derivation of A with values in N. We can now construct a universal R-derivation of A, d : A → ΩA/R (where ΩA/R must of course be an R-module) with the property that every D as above is obtained by composing ¯ d with a unique homomorphism of A-modules D :ΩA/R → N. The construction that is forced upon us starts with the free A-module A(A) which has A itself as a generating set— let us denote the generator associated to a ∈ A by d˜(a)—which we then divide out by the (A) A-submodule of A generated by the expressions d˜(ra) − rd˜(a), d˜(a1 + a2) − d˜(a1) − d˜(a2) and d˜(a1a2) − a1d˜(a2) − a2d˜(a2), with r ∈ R and a, a1, a2 ∈ A. The quotient A-module ˜ is denoted ΩA/R and the composite of d with the quotient map by d : A → ΩA/R. The latter is an R-derivation of A by construction. Given an R-derivation D : A → M, then the map which assigns to d˜(a) the value Da extends (obviously) as a A-module homomorphism A(A) → M. It has the above submodule in its kernel and hence determines an A-module ¯ ¯ homomorphism of D :ΩA/R → M. This has clearly the property that D = Dd. We call ΩA/R the module of Kahler¨ differentials. We will see that the map d : A → ΩA/R can be thought of as an algebraic version of the formation of the (total) differential. The universal derivation of a finitely generated R-algebra is obtained as follows. First we do the case of a polynomial algebra P := R[x1, . . . , xn]. Assigning to f ∈ P the row n vector (∂f/∂x1, . . . , ∂f/∂xn) is an R-derivation on P with values in P . By the universal n property of ΩP/R this must be given by a unique P -homomorphism ΩP/R → P . For any Pn R-derivation D : P → M we have Df = i=1(∂f/∂xi)Dxi and the Dxi can be prescribed n n arbitrarily. So if we take M = P as above, we see that ΩP/R → P is an isomorphism. Thus the universal R-derivation d : P → ΩP/R may be regarded as the intrinsic way of forming the total differential. Next consider a quotient A := P/I of P , where I ⊂ P is an ideal. If M is an A-module and D0 : A → M is an R-derivation, then its composite with the projection π : P → A, D = D0π : P → M, is an R-derivation of P with the property that Df = 0 for every f ∈ I. Conversely, every R-derivation D : P → M in an A-module M with this property factors through an an R-derivation D0 : A → M. We observe here that the map f ∈ I 7→ D(f) factors through I/I2, for if f, g ∈ I, then D(fg) = fDg + gDf ∈ IM = {0}. Since I/I2 is an A-module, we thus obtain a short exact sequence of A-modules 2 I/I → ΩP/R/IΩP/R → ΩA/R → 0. If we are given a generating set f1, . . . , fm for I, then ΩA/R can be identified with the n quotient of A by the A-submodule generated by the row vectors (∂fi/∂x1, . . . , ∂fi/∂xn), 11. THE NOTION OF A VARIETY 47 i = 1, . . . , m. Note that if R is a noetherian ring, then, as we know, so is A and it also follows that ΩA/R is a noetherian R-module. This applies to the case when R = k and A = k[X] for some affine variety X. We then write Ω(X) for Ωk[X]/k. EXERCISE 45. Prove that ΩA/R behaves well with localization: if S ⊂ A is a multi- plicative subset, then every R-derivation with values is some A-module M extends natu- rally to R-derivation of S−1A-with values in S−1M. Prove that we have a natural map −1 S ΩA/R → ΩS−1A/R and that this map is a A-homomorphism. For an affine variety X and x ∈ X, we write ΩX,x for ΩOX,x/k. The preceding exercise −1 implies that ΩX,x is the localization of Ω(X) at x: ΩX,x = (k[X] − px) Ω(X). EXERCISE 46. Let X be an affine variety and let o ∈ X. Show that the Zariski tangent space of X at o can be understood (and indeed, be defined) as the space of k-derivations of OX,o with values in k, where we regard k as a OX,o-module via OX,o/mX,o =∼ k. Prove that this identifies its dual, the Zariski cotangent space, with ΩX,o/mX,oΩX,o. n EXERCISE 47. Show (perhaps with the help of the preceding exercises) that if o ∈ A , n then ΩO n,o/k is the free OA ,o-module generated by dx1, . . . , dxn and that if X is an affine A n variety in A that has o a regular point, then ΩX,o is a free OX,o-module of rank dim OX,x. We must be careful with this construction when dealing with formal power series rings. k O 1 For instance, as we have seen, the completion of the local -algebra A ,0 with respect to its k[[x]] O 1 ,→ k[[x]] maximal ideal is and the embedding of A ,0 is given by Taylor expansion. A k-derivation D of k[[x]] with values in some k[[x]]-module is not determined by Dx. All k O 1 it determines are the values on the -subalgebra A ,0. This issue disappears however if we require that D commutes with infinite summation when the series converges relative to the P∞ r P∞ r−1 (x)-adic metric, for then D( r=0 crx ) will be equal to r=0 crrx Dx. ˆ EXERCISE 48. Prove that the composite d : A → ΩA/R → (ΩA/R)I factors through a ˆ ˆ ˆ derivation dI : AI → (ΩA/R)I and prove that it is universal among all the R-derivations of A in R-modules that are complete for the I-adic topology (i.e., for which M = Mˆ I ). So if o is a regular point of an affine variety X, then Ωˆ X,o is a free OˆX,o-module of rank dim OX,x. 11. The notion of a variety We begin with a definition. DEFINITION 11.1.A k-prevariety is a topological space X endowed with a sheaf OX of k-valued functions such that X can be covered by finitely many open sub- sets U such that (U, OX |U) is an affine k-variety. Given k-prevarieties (X, OX ) and (Y, OY ), then a morphism of prevarieties f :(X, OX ) → (Y, OY ) is simply a mor- phism in the category of spaces endowed with a sheaf OX of k-valued functions: f is continuous and for every open V ⊂ Y , composition with f takes OY (V ) to −1 OX (f V ). We often designate a prevariety and its underlying topological space by the same symbol, a habit which rarely leads to confusion. The composite of two mor- phisms is evidently a morphism so that we are dealing here with a category. The prefix ‘pre’ in prevariety refers to the fact that we have not imposed a separation requirement which takes the place of the Hausdorff property that one normally imposes on a manifold (see Example 11.4 below). 48 1. AFFINE VARIETIES Let X be a prevariety. By assumption X is covered by finitely many affine open subvarieties U1,...,UN . Suppose κi is an isomorphism of Ui onto an affine variety Xi that sits as a closed subset in some affine space. Then Xi,j := κi(Ui ∩ Uj) is −1 an open subset of Xi and κi,j := κjκi is an isomorphism of Xi,j onto Xj,i ⊂ Xj. We can recover X from the disjoint union of the X1,...,XN by means of a gluing process: if we use κi,j to identify Xi,j with Xj,i for all i, j we get back X. The N collection {(Ui, κi)}i=1 is called an affine atlas for X. EXERCISE 49. Let (X, OX ) be a prevariety. (i) Prove that X is a noetherian space. (ii) Prove that X contains an open-dense subset which is affine. (iii) Let Y ⊂ X be locally closed (i.e., the intersection of a closed subset with an open subset). Prove that Y is in natural manner a prevariety in such a manner that the inclusion Y ⊂ X is a morphism of prevarieties. Much of what we did for affine varieties extends in a straightforward manner to this more general context. For instance, a rational function f : X 99K k is defined as before: it is represented by a regular function on a subset of X that is open-dense in its set of closed points and two such represent the same rational function if they coincide on a nonempty open-dense subset in their common domain of definition. If X is irreducible, then the rational functions on X form a field k(X), the function field of X. If U ⊂ X is an open nonempty affine subset, then k(X) = k(U) = Frac(O(U)) (but we will see that it is not true in general that k(X) = Frac(O(X))). In particular, U1,...,UN all have the same dimension trdegk k(X). According to Exercise 35, this is then also the (Krull) dimension of X. Similarly, if X and Y are prevarieties, then a rational map f : X 99K Y is represented by morphism from a nonempty open-dense subset of X to Y with the understanding that two such defined the same map if they coincide on a nonempty open-dense subset. If some representative morphism has dense image in Y , then f is said to be dominant and then f induces a field extension f ∗ : k(Y ) ,→ k(X). Conversely, an embedding of fields k(Y ) ,→ k(X) determines a dominant rational map X 99K Y . If U ⊂ X is open and nonempty, then k(U) = k(X) and the inclusion is a birational equivalence. The notion of a regular point is local and so automatically carries over to this general setting. It is then also clear that the regular points of a prevariety X define an open-dense subset Xreg of X. 11.2.T HEPRODUCTOFTWOPREVARIETIES. Our discussion of the product of closed subsets of affine spaces dictates how we should define the product of two prevarieties X and Y : if (p, q) ∈ X × Y , then let p ∈ U ⊂ X and q ∈ V ⊂ Y be affine open neighborhoods of the components. We require that the topology on U × V be the Zariski topology so that a basis of neighborhoods of (p, q) consists of the loci (U × V )(h) where a h ∈ O(U) ⊗ O(V ) with h(p, q) 6= 0 is nonzero. We also require that ring of sections of OX×Y over such a basic neighborhood (U × V )(h) be O(U) ⊗ O(V )[1/h]. EXERCISE 50. Prove that this product has the usual categorical characteriza- tion: the two projections X × Y → X and X × Y → Y are morphisms and if Z is a prevariety, then a pair of maps (f : Z → X, g : Z → Y ) defines a morphism (f, g): Z → X × Y if and only both f and g are morphisms. 11. THE NOTION OF A VARIETY 49 The Hausdorff property is not of a local nature: a non-Hausdorff space can very well be locally Hausdorff. The standard example is the space X obtained from two copies of R by identifying the complement of {0} in either copy by means of the identity map. Then X is locally like R, but the images of the two origins cannot be separated. A topological space X is Hausdorff precisely when the diagonal of X×X is a closed subset relative to the product topology. As we know, the Zariski topology is almost never Hausdorff. But on the other hand, the selfproduct of the underlying space has not the product topology either and so requiring that the diagonal is closed is not totally unreasonable a priori. In fact, imposing this condition turns out to be the appropriate way of avoiding the pathologies that can result from an unfortunate choice of gluing data. DEFINITION 11.3.A k-prevariety X is called a k-variety if the diagonal is closed in X × X (where the latter has the Zariski topology as defined above). The diagonal in An ⊂ An × An is closed, so An is a variety. This implies that the same is true for any quasi-affine subset of An. Hence a quasi-affine prevariety is in fact a variety. EXAMPLE 11.4. The simplest example of a prevariety that is not a variety is the obvious generalization of the space described above: let X be obtained from two 1 1 1 1 1 copies A+ and A− of A by identifying A+ − {0} with A− − {0} by means of the 1 identity map. If o± ∈ X denotes the image of origin of A±, then (o+, o−) ∈ X × X lies in the closure of the diagonal, but is not contained in the diagonal. A subset of a variety X is called a subvariety if it is open in a closed subset of X. The proof of the following assertion is left as an exercise (see also Exercise 49). PROPOSITION 11.5. A subvariety is in a natural manner a variety such that the inclusion becomes a morphism. The product of two varieties is a variety. EXAMPLE 11.6. We can take this further: if U ⊂ Am is open and f : U → An is a morphism, then consider the graph of f, U˜ := {(x, y) ∈ Am+n : x ∈ U, y = f(x)}. It is easy to see that U˜ is a subvariety of Am+n. The map x ∈ U 7→ (x, f(x)) ∈ U˜ and the projection U˜ → U are regular and each others inverse. So they define an isomorphism U˜ → U. Notice that via this isomorphism f appears as a projection mapping: (x, y) ∈ U˜ 7→ y ∈ V . EXERCISE 51. The goal of this exercise is to show that U := A2 − {(0, 0)} is not affine. Let Ux ⊂ U resp. Uy ⊂ U be the complement of the x-axis resp. y-axis so that U = Ux ∪ Uy. (a) Prove that Ux is affine and that O(Ux) = k[x, y][1/x]. (b) Prove that every regular function on U extends to A2 so that O(U) = k[x, y]. (c) Show that U is not affine. CHAPTER 2 Projective varieties 1. Projective spaces Two distinct lines in the plane intersect in a single point or are parallel. In the last case one would like to say that the lines intersect at infinity so that the statement becomes simply: two distinct lines in a plane meet in a single point. There are many more examples of geometric configurations for which the special cases disappear by the simple remedy of adding points at infinity. A satisfactory approach to this which makes no a priori distinction between ordinary points and points at infinity involves the notion of a projective space. The following notion, perhaps a bit abstract, is quite useful. It will soon become more concrete. DEFINITION 1.1.A projective space of dimension n over k is a set P endowed with an extra structure that can be given by a pair (V, `), where V is k-vector space of dimension n + 1 and ` is a bijection between P and the collection of 1- dimensional linear subspaces of V : p ∈ P 7→ `p ⊂ V . It is here understood that another such pair (V 0, `0) defines the same structure if and only if there exists a 0 0 k-linear isomorphism φ : V → V such that for every p ∈ P , φ sends `p to `p. (We are in fact saying that thus is defined an equivalence relation on the collection of such pairs and that a projective structure is given by an equivalence class.) In particular, if V is a finite dimensional k-vector space, then the collection of its 1-dimensional linear subspaces is in a natural manner a projective space; we n n n+1 denote it by P(V ). We often write P or Pk for P(k ) and call it simply projective n-space (over k). EXERCISE 52. Prove that the linear isomorphism φ in Definition 1.1 is unique up to scalar multiplication. DEFINITION 1.2. Given a projective space P of dimension n over k, then a subset L of P is said to be linear subspace of dimension d if, for some (or any) pair (V, `) as above, there exists a linear subspace VL ⊂ V of dimension d + 1 such that `(L) is the collection of 1-dimensional linear subspaces of VL. A map f : P → P 0 between two projective spaces over k is said to be linear morphism if for corresponding structural data (V, `) and (V 0, `0) for P resp. P 0 there 0 0 exists a linear injection F : V → V such that F sends `p to `f(p) for all p ∈ P . So a linear subspace has itself the structure of a projective space and its inclu- sion in the ambient projective space is a linear morphism. Conversely, the image of a linear morphism is linear subspace. A linear subspace of dimension one resp. two is often called a line resp. a plane. A linear subspace of codimension one (of dimension one less than the ambient projective space) is called a hyperplane. It is now clear that two distinct lines in a 51 52 2. PROJECTIVE VARIETIES plane intersect in a single point: this simply translates the fact that the intersection of two distinct linear subspaces of dimension two in a three dimensional vector space is of dimension one. Let P be a projective space of dimension n. We can of course describe its structure by a pair (V, `) for which V = kn+1. This gives rise to a ‘coordinate system n+1 on P ’ as follows: if we denote the coordinates of k by (T0,...,Tn), then every point p ∈ P(V ) is representable as a ratio [p0 : ··· : pn] of n + 1 elements of k that are not all zero: choose a generator p˜ ∈ `p and let pi = Ti(˜p). Any other generator is of the form λp˜ with λ ∈ k − {0} and indeed, [λp0 : ··· : λpn] = [p0 : ··· : pn]. This is why [T0 : ··· : Tn] is called a homogeneous coordinate system on P(V ) even though an individual Ti is not a function on P(V ) (but the ratios Ti/Tj are, albeit that for i 6= j they are not everywhere defined). 1.3.L INEAR CHARTS AND STANDARD ATLAS. Let U ⊂ P be a hyperplane comple- ment in P , i.e., the complement of a hyperplane H ⊂ P . We show that U is in a natural manner an affine space. To see this, let the structure on P be given by the pair (V, `). Then the hyperplane H corresponds to a hyperplane VH ⊂ V . A coset A of VH in V is an affine hyperplane in the classical sense of the word. Given a coset A of VH distinct from VH , then assigning to v ∈ A the 1-dimensional linear subspace spanned by v defines a bijection A =∼ P − H whose inverse were denote ∼ by κA : P − H = A. This puts on U a structure of an affine space. This is indepen- dent of the choice of A: any another choice A0 is obtained from A by multiplication by some nonzero scalar u ∈ k and hence κA0 is the composite of κA and the map ∼ 0 u· : A = A ; since the latter is an isomorphism of affine spaces, κA0 defines the same affine structure on P − H as κA. For a linear subspace L ⊂ P the bijection ∼ U = A restricts to one between L ∩ U and the affine-linear subspace VL ∩ A. So if L ∩ U 6= ∅, then L ∩ U is an affine subspace of U. Concretely, if we choose a homogeneous coordinate system (T0,...,Tn) such that H is given by T0 = 0, then n (x1 := T1/T0, . . . , xn := Tn/T0) maps A bijectively onto A and any other choice will differ from this one by an affine-linear transformation of An. We call such an isomorphism from a hyperplane complement in P onto An a linear chart for P . Thus the homogeneous coordinate system [T0,...,Tn] defines a chart for every i = 0, . . . , n: if Ui ⊂ P is the hyperplane complement defined by Ti 6= 0, then ∼ n κi : Ui = A , [T0 : ··· : Tn] 7→ (T0/Ti,..., T\i/Ti,...,Tn/Ti), is a chart with inverse (x1, . . . , xn) 7→ [x1 : ··· : xi : 1 : xi+1 : ··· : xn]. Notice n that the Ui’s cover P . We call a collection of linear charts (Ui, κi)i=0 thus obtained from a homogeneous coordinate system a standard atlas for P . Let us determine −1 the coordinate change for a pair of charts, say for κnκ0 . The image of U0 ∩ Un n under κ0 resp. κn is the open subset U(xn) resp. U(x1) of A and −1 κnκ0 : U(xn) → U(x1), (x1, x2, . . . , xn) 7→ (1/xn, x1/xn, . . . , xn−1/xn). −1 Notice that this defines an isomorphism of affine varieties (the inverse is κ0κn ). Thus P becomes a prevariety. In particular, P acquires a (Zariski) topology for which U ⊂ P is open if and only if κ(U ∩ Ui) is open in the set of closed points of n A . Then we have a prevariety (P, OP ) with the property that U ⊂ P is open if and n −1 only if κ(U ∩ Ui) is open in A and f ∈ OP (U) if and only if fκi ∈ Oκi(U) for all i. We shall see shortly that this structure is independent of the coordinate system (T0,...,Tn) and that P is in fact a k-variety. 2. THE ZARISKI TOPOLOGY ON A PROJECTIVE SPACE 53 We could also proceed in the opposite direction and start with an affine space and realize it as the hyperplane complement of a projective space. Changing our point of view accordingly, we might say that a projective space arises as a ‘comple- tion with points at infinity’ of a given affine space. For instance, An embeds in Pn by (x1, . . . , xn) 7→ [1 : x1 : ··· : xn] with image the hyperplane complement defined by T0 6= 0. 2. The Zariski topology on a projective space We begin with giving a simpler, more classical characterization of the Zariski topology on a projective space. Let P be a projective space of dimension n over k and let [T0 : ··· : Tn] be a ho- mogeneous coordinate system for P . Suppose F ∈ k[X0,...,Xn] is homogeneous d of degree d so that F (tT0, . . . , tTn) = t F (T0,...,Tn) for λ ∈ k. The property of this being zero only depends on [T0 : ··· : Tn] and hence the zero set of F defines a subset Z(F ) ⊂ P (even though F is not a function on P ). We denote its nonzero set by PF := P − Z(F ). PROPOSITION 2.1. The collection UF , where F runs over the homogeneous poly- nomials in k[X0,...,Xn], is a basis for the Zariski topology on P . This topology is independent of the choice of our homogeneous coordinate system [T0,...,Tn] and ev- ery linear chart is a homeomorphism that identifies the sheaf of regular functions on n its domain with OA . PROOF. That the collection {PF }F is a basis of a topology follows from the ob- vious equality PF ∩ PF 0 = PFF 0 . The independence of the coordinate choice results from the observation that under a linear substitution a homogeneous polynomial transforms into a homogeneous polynomial (of the same degree). Let now U = P − H be a hyperplane complement. Choose a homogeneous coordinate system [T0,...,Tn] such that U is defined by T0 6= 0 and denote by n ∼ j : A = U, (y1, . . . , yn) 7→ [1 : y1 : ··· : yn] the inverse. If F ∈ k[T0,...,Tn] is −1 homogeneous, then j UF = Uf , where f(y1, . . . , yn) := F (1, y1, . . . , yn). So the inclusion j is continuous. Conversely, if f ∈ k[y1, . . . , yn] is nonzero of degree d, d then its ‘homogenization’ F (T0,...,Tn) := T0 f(T1/T0,...,Tn/T0) is homogeneous −1 of degree d and Uf = j PF . So j is also open. −1 By letting (U, j ) run over the charts (Uν , κν ), we see that have thus recovered the Zariski topology. The last statement follows from the fact that each κν j is an n isomorphism between two open subsets of A . EXERCISE 53. Let 0 6= F ∈ k[X0,...,Xn] be homogeneous of degree d. r (a) Prove that every function PF → k of the form G/F , with r ≥ 0 and G homogeneous of the same degree as F r, is regular. (b) Prove that conversely every regular function on PF is of this form. We now discuss the projective analogue of the (affine) I ↔ Z correspon- dence. Let us denote for an integer d ≥ 0 by k[X0,...,Xn]d the set of F ∈ k[X0,...,Xn] that are homogeneous of degree d. It is clear that k[X0,...,Xn] is the direct sum of the k[X0,...,Xn]d, d = 0, 1,... and that k[X0,...,Xn]d · k[X0,...,Xn]e = k[X0,...,Xn]d+e. A good way to understand the decomposi- tion of k[X0,...,Xn] into its homogeneous summands is in terms of the action n+1 in A defined by scalar multiplication: if F ∈ k[X0,...,Xn]d, then for every 54 2. PROJECTIVE VARIETIES d t ∈ k, we have F (tT0, . . . , tTn) = t F (T0,...,Tn). So this decomposition is in fact the eigenspace decomposition for this action of the multiplicative group k× on k[X0,...,Xn]. (Indeed, this simple decomposition should be understood as the algebraic expression of that action.) This implies among other things that for any n+1 n+1 F ∈ k[X0,...,Xn]d the zero set Z(F ) ⊂ A = k is invariant under scalar multiplication. This is still true for an intersection of such zero sets, in other words, if I ⊂ k[X0,...,Xn] is an ideal generated by homogeneous polynomials of pos- itive degree, then Z(I) ⊂ An+1 is invariant under scalar multiplication. Such a closed subset is called an affine cone; the origin is called the vertex of that cone. Observe that since we insisted that the degrees of the homogeneous generators of I be positive, we have I ⊂ (T0,...,Tn), and so Z(I) will always contain the vertex 0. n If Y ⊂ P is closed, Y can be written as an intersection ∩αZ(Fα), where each n+1 Fα is homogeneous. The common zero set of the Fα in A , ∩αZ(Fα), is the cone that as a set is just the union of the 1-dimensional linear subspaces of kn+1 parameterized by Y . We denote this affine cone by Cone(Y ). n Given a subset Y ⊂ P , then for d ≥ 1, we denote by IY,d the set of F ∈ k[X0,...,Xn]d with Y ⊂ Z(F ) (we put IY,0 = 0). This is a vector space and we have IY,d · k[X0,...,Xn]e ⊂ IY,d+e. So the direct sum of the IY,d, d = 1, 2,... , denoted by IY ⊂ k[X0,...,Xn], is an ideal of k[X0,...,Xn]. Notice that with this definition, I∅ is the ideal generated by T0,...,Tn. The k-algebra k[X0,...,Xn] is graded and the ideal IY is homogeneous in the sense below. DEFINITION 2.2.A (nonnegatively) graded ring is a ring R whose underlying ∞ additive group comes with a direct sum decomposition R• = ⊕k=0Rd such that the P∞ d ring product maps Rd × Re in Rd+e, or equivalently, is such that d=0 Rdt is a subring of R[t]. An ideal I of such a ring is said to be homogeneous if it is the direct sum of its homogeneous parts Id := I ∩ Rd. If R• is a graded ring, then clearly R0 is a subring of R. If I• is a homogeneous ∞ ideal, then R/I = ⊕d=0Rd/Id is again a graded ring. Of special interest is the ∞ homogeneous ideal R+ := ⊕k=1Rd, for which we have R/R+ = R0. We will be mostly concerned with the case when R0 = k, so that R+ is then a maximal ideal (and the only one that is homogeneous). LEMMA 2.3.√ If I,J are homogeneous ideals of a graded ring R•, then so are I ∩J, IJ, I + J and I. Moreover, a minimal prime ideal of R• is a graded ideal. PROOF. The proofs of the first statement are not difficult. We omit them. As to the last, it suffices to show that if p ⊂ R• is a prime ideal, then the direct sum of its homogeneous parts, p• := ⊕n(p ∩ Rn) is also a prime ideal. Indeed, suppose r, s ∈ R are such that rs ∈ p• and neither r nor s lies in p•. Write r and s as a sum of its homogenous parts and let rk ∈ Rk resp. sl ∈ Rl be the lowest degree part or r resp. s that is not in p. Then it is easily seen that rksl ∈ p and so rk ∈ p or sl ∈ p and we get a contradiction. LEMMA 2.4. If X ⊂ An+1 is an affine cone (with vertex 0), then I(X) is a homogeneous ideal. PROOF. Let F ∈ I(X). We must show that each homogeneous component of F lies in I(X). As X is invariant under scalar multiplication, the polynomial 2. THE ZARISKI TOPOLOGY ON A PROJECTIVE SPACE 55 P d F (tT0, . . . , tTn) = d≥1 t Fd(T0,...,Tn) (as an element of k[x1, . . . , xn][t]) van- ishes on X ×A1 in An+1 ×A1. Clearly the zero set of I(X)[t] is X ×A1 ⊂ An+1 ×A1 and since the quotient is k[x1, . . . , xn][t]/I(X)[t] = k[X][t] is reduced, we have 1 I(X × A ) = I(X)[t]. It follows that Fd ∈ I(X) for all d. The ideal I(X) is of course a radical ideal and so we find: COROLLARY 2.5. The maps Y 7→ Cone(Y ) and Y 7→ IY set up bijections between (i) the collection of closed subsets of Pn, (ii) the collection of affine cones in An+1 and (iii) the collection of homogeneous radical ideals of k[X0,...,Xn] contained in (T0,...,Tn). n+1 Note that here Y = ∅ ↔ {0} ⊂ A ↔ the homogeneous ideal (T0,...,Tn). The bijections above restrict to bijections between (i) the collection of irreducible subsets of Pn, (ii) the collection of irreducible affine cones in An+1 strictly contain- ing the vertex 0 and (iii) the collection of homogeneous prime ideals of k[X0,...,Xn] strictly contained in (T0,...,Tn). DEFINITION 2.6. The homogeneous coordinate ring of a closed subset Y of Pn is the graded ring S(Y )• := k[X0,...,Xn]/IY , S(Y )d = k[X0,...,Xn]d/IY,d. Notice that if we ignore the grading of S(Y )•, then we just get the coordinate ring of the affine cone over Y , k[Cone(Y )]. EXERCISE 54. Let R• be a graded ring. (b) Prove that if I is a prime ideal in the homogeneous sense: if rs ∈ I for some r ∈ Rk, s ∈ Rl implies r ∈ I or s ∈ I, then I is a prime ideal. (c) Prove that the intersection of all homogeneous prime ideals of R• is its ideal of nilpotents. n EXERCISE 55. Prove that a closed subset Y ⊂ P is irreducible if and only if IY is a prime ideal. Since k[X0,...,Xn] is a noetherian ring, any ascending chain of homogeneous ideals in this ring stabilizes. This implies that any projective space is noetherian (and according to Exercise 49 Pb, as any prevariety, is noetherian). In particular, every subset of a projective space has a finite number of irreducible components whose union is all of that subset. EXERCISE 56. Let S• be a graded k-algebra that is reduced, finitely generated and has S0 = k. (a) Prove that S• is as a graded k-algebra isomorphic to the homogeneous coordinate ring of a closed subset Y . (b) Prove that under such an isomorphism, the homogeneous radical ideals contained in the maximal ideal S+ := ⊕d≥1Sd correspond to closed sub- sets of Y under an inclusion reversing bijection: homogeneous ideals strictly contained in S+ and maximal for that property correspond to points of Y . (c) Suppose S• a domain. Show that a fraction F/G that is homogeneous of degree zero (F ∈ Sd and 0 6= G ∈ Sd for some d) defines a function on Ug. EXERCISE 57. Let Y be an affine variety. 56 2. PROJECTIVE VARIETIES (a) Show that a homogeneous element of the graded ring k[Y ][T0,...,Tn] defines a closed subset of Y × Pn as its zero set. (b) Prove that every closed subset of Y ×Pn is an intersection of finitely many zero set of homogeneous elements of k[Y ][T0,...,Tn]. (c) Prove that we have a bijective correspondence between closed subsets of n Y × P and homogeneous radical ideals in k[Y ][T0,...,Tn]. 3. The Segre embeddings First we show how a product of projective spaces can be realized as a closed subset of a projective space. This will imply among other things that a projective space is a variety. Consider the projective spaces Pm and Pn with their homoge- neous coordinate systems [T0 : ··· : Tm] and [W0 : ··· : Wn]. We also consider a projective space whose homogeneous coordinate system is the set of matrix coeffi- mn+m+n cients of an (m + 1) × (n + 1)-matrix [Z00 : ··· Zij : ··· : Zmn]; this is just P with an unusual indexing of its homogeneous coordinates. PROPOSITION 3.1 (The Segre embedding). The map f : Pm × Pn → Pmn+m+n defined by Zij = TiWj, i = 0, . . . , m; j = 0, . . . , n is an isomorphism onto a closed subset of Pmn+m+n. If m = n, then the diagonal of Pm × Pm is the preimage of the m2+2m m m linear subspace of P defined by Zij = Zji and hence is closed in P × P . PROOF. In order to prove that f defines an isomorphism onto a closed subset mn+m+n of P , it is enough to show that for every chart domain Uij of the standard mn+m+n −1 m n atlas of P , f Uij is open in P × P and is mapped by f isomorphically onto a closed subset of Uij. For this purpose we may (simply by renumbering) mn+m+n assume that i = j = 0. So then U00 ⊂ P is defined by Z00 6= 0 and is parametrized by the coordinates zij := Zij/Z00, (i, j) 6= (0, 0). It is clear that −1 f U00 is defined by T0W0 6= 0. This is just UT0 × UW0 and hence parametrized by x1 := T1/T0, . . . , xm := Tm/T0 and y1 := W1/W0, . . . , yn := Wn/W0. In terms of −1 these coordinates, f : f U00 → U00 is given by zij = xiyj, where (i, j) 6= (0, 0) and where we should read 1 for x0 and y0 (so that zi0 = xi and z0j = yj). It is now clear that the image is in fact the graph of the morphism Am × An → Amn with m n mn coordinates xiyj, 1 ≤ i ≤ m, 1 ≤ j ≤ n. This graph is closed in A × A × A and m n −1 isomorphic to A × A . So f indeed restricts to an isomorphism of f U00 onto a closed subset of U00. In case m = n, we must also show that the condition TiWj = TjWi for 0 ≤ i < j ≤ m implies that [T0 : ··· : Tm] = [W0 : ··· : Wm], assuming that not all Ti resp. × Wj are zero. Suppose Ti 6= 0 and put t := Wi/Ti ∈ k . Then TiWj = TjWi implies Wj = tTj for all j. So t 6= 0 and [W0 : ··· : Wm] = [T0 : ··· : Tm]. COROLLARY 3.2. A projective space over k is a variety. m m PROOF. Proposition 3.1 shows that the diagonal of P × P is closed. DEFINITION 3.3. A variety is said to be projective if it is isomorphic to a closed irreducible subset of some projective space. A variety is called quasi-projective if is isomorphic to an open subset of some projective variety. COROLLARY 3.4. Every irreducible closed (resp. locally closed) subset of Pn is a projective (resp. quasi-projective) variety. The collection of projective (resp. quasi- projective) varieties is closed under a product. 4. PROJECTIONS 57 PROOF. The first statement follows from 11.5 and the second from Proposition 3.1. EXERCISE 58. (a) Prove that the image of the Segre embedding is the common zero set of the homogenenous polynomials ZijZkl − ZilZkj. (b) Show that for every (p, q) ∈ Pm × Pn the image of {p} × Pn and Pm × {q} in Pmn+m+n is a linear subspace. n (n2+3n)/2 (c) Prove that the map P → P defined by Zij = TiTj, 0 ≤ i ≤ j ≤ n is an isomorphism on a closed subset defined by quadratic equations. Find these equations for n = 2. (d) As a special case we find that the quadric hypersurface in P3 defined by 1 1 Z0Z1 − Z2Z3 = 0 is isomorphic to P × P . Identify in this case the two systems of lines on this quadric. EXERCISE 59 (Intrinsic Segre embedding). Let V and W be finite dimensional k-vector spaces. Describe the Segre embedding for P(V ) × P(W ) intrinsically as a morphism P(V ) × P(W ) → P(V ⊗ W ). 4. Projections Let V be a finite dimensional k-vector space. If W ⊂ V is a linear subspace, then we can form the quotient vector space V/W and we have linear surjection π : V → V/W . We cannot projectivize this as a morphism from P(V ) to P(V/W ), but we shall see that it defines at least a morphism Pπ : P(V ) − P(W ) → P(V/W ) (which then determines a rational map P(V ) 99K P(V/W ) in case W 6= V ). As a map, Pπ is defined as follows: for p ∈ P(V ) − P(W ), `p is a one dimensional subspace of V not contained in W so that π(`p) is a one dimensional subspace of V/W ; we then put Pπ(p) := [π(`p)]. To see that this is morphism, let n := dim P(V ), m := dim P(V/W ) (so that dim V = n + 1 and dim W = n − m) and choose a coordinate system (T0,...,Tn) for V such that W is given by T0 = ··· = Tm = 0. Then (T0,...,Tm) serves as a coordinate system for V/W and π : V → V/W is simply given by (T0,...,Tn) → (T0,...,Tm). In projective coordinates this is of course given by [T0 : ··· : Tn] → [T0 : ··· : Tm] and so is indeed a morphism where it makes sense, namely where T0,...,Tm not all vanish, that is, on the complement of P(W ). EFINITION EMMA D -L 4.1. The blowup of P(V ) along P(W ), denoted BlP(W ) P(V ), is the closure of the graph of Pπ in P(V ) × P(V/W ) (hence is a projective variety). It enjoys the following properties: The projection on the first factor, p1 : BlP(W ) P(V ) → P(V ) is an isomorphism over P(V ) − P(W ) whereas the preimage over P(W ) is P(W ) × P(V/W ). The projection to the second factor p2 : BlP(W ) P(V ) → P(V/W ) is a locally trivial bundle of projective spaces of dimension dim W , to be precise, if U ⊂ P(V/W ) is a hyperplane complement (hence an affine space), then there exists an isomorphism −1 ∼ n−m p2 U = P × U whose second component is given by p2. PROOF. We use a homogeneous coordinate system [T0 : ··· : Tn] for P(V ) as above (so that P(W ) is given by T0 = ··· = Tm = 0). If we denote the cor- responding coordinate system for P(V/W ) by [S0 : ··· : Sm], then the graph of Pπ in P(V ) × P(V/W ) is given by the pairs ([T0 : ··· : Tn], [S0 : ··· : Sm]) with [T0 : ··· : Tm] proportional to [S0 : ··· : Sm] and (T0,...,Tm) 6= (0,..., 0). The 58 2. PROJECTIVE VARIETIES proportionality property is equivalent to: TiSj = TjSi for all 0 ≤ i < j ≤ m. Let Z be the subset of ⊂ P(V ) × P(V/W ) defined by these equations: TiSj = TjSi, 0 ≤ i < j ≤ m. In terms of the Segre embedding of Pn × Pm ,→ Pnm+n+m this amounts to Zij = Zji in this range and so Z is a closed subset of P(V ) × P(V/W ). The equations defining Z are trivially satisfied when (T0,...,Tm) = (0,..., 0) and so Z contains P(W ) × P(V/W ). We have just shown that the graph of Pπ equals the complement of P(W ) × P(V/W ) in Z. Let us now see what the projection Z → P(V/W ) is like over the open subset m US0 of P(V/W ) defined by S0 6= 0. We parametrize US0 by A via (y1, . . . , ym) ∈ m 7→ [1 : y : ··· : y ]. Then its preimage Z := Z ∩ ( (V ) × U ) in Z is A 1 m US0 P S0 n−m parametrized by P × US0 by means of the morphism n−m ([T0 : Tm+1 : ··· : Tn], [1 : y1 : ··· : yn]) ∈ P × US0 7→ ([T : T y : ··· : T y : T : ··· : T ], [1 : y : ··· : y ]) ∈ Z . 0 0 1 0 m m+1 n 1 n US0 This is an isomorphism (the inverse is obvious) which commutes with the projection on U . Notice that it makes Z ∩( (V )× (V/W )) correspond to the locus where S0 US0 P P n−m T0 = 0, which is the product of a hyperplane in P times US0 . In particular, Z ∩ ( (V ) × (V/W )) lies in the closure of the graph of π. This remains true, US0 P P P of course, if we replace US0 by USi , i = 1, . . . , m, or by any other hyperplane complement in P(V/W ). It follows that Z is the closure of the graph of Pπ and that Z enjoys the stated properties. It is worthwhile to describe this map in purely projective terms. This starts with the observation that the π-preimage of a one-dimensional linear subspace of V/W is a linear subspace of V which contains W as a hyperplane and that every such subspace so arises. So we can think of P(V/W ) as parameterizing the projective linear subspaces of P(V ) that contain P(W ) as a projective hyperplane. EXERCISE 60. Prove that the projective structure on this collection of linear subspaces is intrinsic: prove that if P is a projective space and Q ⊂ P is a projective linear subspace of codimension m > 0, then the collection of projective linear subspaces Q˜ ⊂ P which contain Q as a projective hyperplane is in a natural manner a projective space (denoted here by PQ) of dimension m−1. Show that the blowup BlQ P of P along Q can be identified with the set of pairs (p, [Q˜]) ∈ P × PQ with p ∈ Q˜. 5. Elimination theory and closed projections Within a category of reasonable topological spaces (say, the locally compact Hausdorff spaces), the compact ones can be characterized as follows: K is compact if and only if the projection K × X → X is closed for every space X in that cate- gory. In this sense the following theorem states a kind of compactness property for projective varieties. n THEOREM 5.1. If U is a variety, then the projection πU : P × U → U is closed. Moreover, if Z ⊂ Pn × U is closed and irreducible, then there exists an open-dense 0 0 subset Y ⊂ πU (Z) such that for every y ∈ Y we have dim Z = dim πU (Z) + dim Zy, n where Zy ⊂ P denotes the fiber πU |Z over y. We derive this theorem from the main theorem of elimination theory, which we will state and prove first. 5. ELIMINATION THEORY AND CLOSED PROJECTIONS 59 Given an integer d ≥ 0, let us write Vd for k[T0,T1]d, the k-vector space of d−i i d homogeneous polynomials in k[T0,T1] of degree d. The monomials (T0 T1)i=0 form a basis, in particular, dim Vd = d + 1. Given F ∈ Vm and G ∈ Vn, then uF,G : Vn−1 ⊕ Vm−1 → Vn+m−1, (A, B) 7→ AF + BG is a linear map between two k-vector spaces of the same dimension m + n. The resultant R(F,G) of F and G is defined as the determinant of this linear map with respect to the monomial bases of the summands of Vn−1 ⊕ Vm−1 and of Vn+m−1. So R(F,G) = 0 if and only if uF,G fails to be injective. Notice that R(F,G) is a polynomial in the coefficients of F and G: Pm m−i i Pn n−i i if F = i=0 aiT0 T1 and G = j=0 biT0 T1, then a0 a1 ··· am 0 ······ 0 0 a0 a1 ··· am ······ 0 0 0 a0 ··· 0 ··· 0 0 ··· 0 a0 a1 ··· am R(F,G) = det b0 b1 ······ bn 0 ··· 0 0 b0 b1 ······ bn 0 ··· 0 0 0 b0 ··· 0 ··· 0 ··· 0 b0 b1 ······ bn So the resultant defines an element of k[Vm × Vn] = k[Vm] ⊗ k[Vn]. LEMMA 5.2. R(F,G) = 0 if and only if F and G have a common zero in P1. PROOF. If R(F,G) = 0, then uF,G is not injective, so that there exist a nonzero (A, B) ∈ Vn−1 ⊕ Vm−1 with AF + BG = 0. Suppose that B 6= 0. It is clear that F divides BG. Since deg(B) = m − 1 < m = deg F , it follows that F and G must have a common factor. If conversely F and G have a common zero in P1, then they must have a common linear factor L, say: F = LF1, G = LG1 and we see that (G1, −F1) ∈ Vn−1 ⊕ Vm−1 is nonzero and in the kernel of uF,G. n PROOFOF THEOREM 5.1. Let Z ⊂ P × U. It is clear that πU (Z) is closed 0 0 0 in U if for every open affine subset U ⊂ U, πU (Z) ∩ U is closed in U . Since 0 n 0 πU (Z) ∩ U = πU 0 (Z ∩ (P × U )) we may (and will) assume that is U affine. We first treat the case n = 1 and then proceed with induction on n. 1 Let Z ⊂ P × U be closed. Denote by IZ,• the homogeneous ideal in the graded algebra k[U][T0,T1] of functions vanishing on Z. Then Z is the common zero set of the members of IZ,• (see Exercise 57). For every homogeneous pair F,G ∈ ∪mk[U][T0,T1]m, we can form the resultant R(F,G) ∈ k[U]. We claim that πU (Z) is the common zero set Z(R) ⊂ U of the set of resultants R(F,G) of pairs of homogeneous forms F,G taken in ∪mIZ,m, hence is closed in U. 1 Suppose that y ∈ πU (Z) so that (y, p) ∈ Z for some p ∈ P . Then p is a common zero of each pair Fy,Gy, where F,G ∈ ∪mIZ,m and the subscript y refers to substituting y for the first argument. So R(F,G) vanishes in y. It follows that y ∈ Z(R). 1 Next we show that if y∈ / πU (Z), then y∈ / Z(R). Since {y}×P is not contained in Z, there exists an integer m > 0 and a F ∈ IZ,m with Fy 6= 0. Denote by 60 2. PROJECTIVE VARIETIES 1 p1, . . . , pr ∈ P the distinct zeroes of Fy. We show that there exists a G ∈ IZ,n for some n such that Gy does not vanish in any pi. This suffices, for this means that R(Fy,Gy) 6= 0 and so y∈ / Z(R). To this end, we note that for any given 1 ≤ i ≤ r, S 1 (i) Z ∪j6=i{(y, pj)} is closed in U × P , so that there will exist a G ∈ ∪mIZ,m with (i) (i) Gy zero in all the pj with j 6= i, but nonzero in pi. Upon replacing each G by some positive power of it , we may assume that G(1),...,G(r) all have the same (1) (r) (i) degree n, say. Then G := G + ··· + G ∈ IZ,n and Gy(pi) = G (pi) 6= 0. In order to prove the dimension assertion, we assume Z irreducible so that 1 Y := πU (Z) is irreducible and closed. We show that either Z = P × Y (so that 0 dim Z = dim Y +1) or that there exists an open-dense Y ⊂ Y such that ZY 0 := Z ∩ 1 0 0 0 (P × Y ) → Y is a finite morphism (so that dim Y = dim Y = dim ZY 0 = dim Z). Consider the ideal in k[U] generated by the coefficients of all the members of IZ ⊂ k[U][T0,T1] and denote by Y1 ⊂ U its zero set. This is closed subset of Y 1 consisting of the y ∈ U with the property that P × {y} ⊂ Z. If Z1 = Y , then 1 Z = P × Y . Assume therefore that Z1 is a proper closed subset of Y . Choose p ∈ P1 such that {p} × Y is not contained in Z. By adapting the coordinates in P1, we may assume that p = ∞. Let Y2 := πU ({∞}×U ∩Z). This is also a proper closed 0 subset of Y and so Y contains an an affine open-dense subset Y ⊂ Y − Y1 − Y2 1 0 0 such that ZY 0 is contained in A × Y and has finite fibers over Y . Then ZY 0 can be given as a subset of A1 × Y 0 by a polynomial in k[Y 0][t]. After possible shrinking Y 0, we can assume that the top coefficient of this polynomial is a unit in k[Y 0]. Then 0 ZY 0 will be given by a monic polynomial and hence ZY 0 → Y is a finite morphism. Now assume n ≥ 2. Let q = [0 : ··· : 0 : 1] and consider the blowup P˜n := n n ˜n Bl{q} P → P . Recall that an element of P is the set of pairs in ([T0 : ··· : n n−1 Tn], [S0 : ··· : Sn−1]) in P × P with [T0 : ··· : Tn−1] = [S0 : ··· : Sn−1]. We n−1 have seen that over the open subset USi ⊂ P defined by Si 6= 0, the projection ˜n n−1 1 P → P is isomorphic to the projection P × USi → USi . Hence the projection ˜n n−1 1 π1 : P × U → P × U is over USi × U like P × USi × U → USi × U. So this projection is closed over USi × U. It follows that the projection π1 is closed. The preimage Z˜ of Z under the projection P˜n × U → Pn × U is closed and by what ˜ n−1 we just proved, π1(Z) is closed in P × U. By induction, the image of the latter n−1 under the projection π2 : P × U → U is closed. But this is just πU (Z). For the dimension assertion, we may without loss of generality assume that Z is not contained in {q} × U (otherwise choose a different q) and that πU (Z) = U (otherwise replace U by π(Z)). We then replace Z˜ by the closure of Z ∩ (Pn − {q}) × U in P˜n × U so that Z˜ is irreducible, is of the same dimension as Z and maps also onto U. It therefore suffices to prove the dimension assertion for the projection ˜ ˜ n−1 π˜ : Z → U. We put W := π1(Z) ⊂ P × U. ˜n n−1 The case n = 1 treated above shows that for the projection π1 : P → P × U ˜ −1 ˜ 1 ˜ either Z = π1 W (so that Z is a P -bundle over W and dim Z = dim W + 1), or dim Z˜ = dim W and Z˜ is finite over W − V ⊂ W where V ⊂ W is a proper closed subset. In that last case, we note that for every closed irreducible subset C ⊂ W −1 ˜ which is not contained in V , we have dim(π1 C ∩ Z) = dim C: this is clear for −1 ˜ −1 an irreducible component of π1 C ∩ Z which meets π (W − V ) and any other irreducible component will be contained in a P1-bundle over C ∩ V and hence of dimension 1 + dim(C ∩ V ) ≤ dim C. 5. ELIMINATION THEORY AND CLOSED PROJECTIONS 61 n−1 If π2 : P × U → U is the projection, then (by induction) there exists an open-dense subset U 0 ⊂ U such that the fibers of W → U resp. V → U all have dimension dim W − dim U resp. have dimension smaller than dim W − dim U. So ˜ 0 ˜ all the fibers of Z → U over U have dimension dim Z − dim U. We mention a few corollaries. COROLLARY 5.3. Let X be a projective variety. Then any morphism from X to a variety is closed (and hence has closed image). PROOF. Assume that X is closed in Pn. A morphism f : X → Y to a variety Y can be factored as the obvious isomorphism of X onto the graph Γf of f, the inclusion of this graph in X × Y (which is evidently closed), the inclusion of X × Y in Pn ×Y (which is closed since X ⊂ Pn is closed) and the projection onto Y (which is closed by Theorem 5.1). So f is closed. It is an elementary result from complex function theory (based on Liouville’s thoerem) that a holomorphic function on the Riemann sphere is constant. This implies the corresponding assertion for holomorphic functions on complex projec- tive n-space n (to see that a holomorphic function on n takes the same value on PC PC any two distinct points, simply apply the previous remark to its restriction to the complex projective line passing through them, viewed as a copy of the Riemann sphere). The following corollary is an algebraic version of this fact. COROLLARY 5.4. Let X be a projective variety. Then any morphism from X to a quasi-affine variety is constant. In particular, any regular function on X is constant. PROOF. If f : X → Y is a morphism to a quasi-affine variety Y , then its com- posite with an embedding of Y in some affine space An is given by n regular func- tions on X. So it suffices to prove the special case when Y = A1. By the previous corollary this image is closed in A1. But if we think of f as taking its values in P1 (via the embedding y ∈ A1 7→ [1 : y] ∈ P1), then we see that f(X) is also closed in P1. So f(X) cannot be all of A1. Since X is irreducible, so is the image and it follows that f(X) is a singleton. In other words, f is constant. PROPOSITION 5.5. Let Z ⊂ Pn an irreducible and closed subset. If Q ⊂ Pn is a linear subspace with the property that Q ∩ Z = ∅, then dim Z + dim Q < n and the n n projection π : P → PQ maps Z onto an irreducible and closed subset π(Z) with the property that every fiber of Z → π(Z) is finite. PROOF. Every fiber of Z → π(Z) is the intersection of Z with a linear subspace Q˜ ⊂ Pn which contains Q a hyperplane. It is closed in Q˜ and hence this fiber is projective (or strictly speaking, every irreducible component of that fiber is). On the other hand it is contained in the hyperplane complement Q˜ − Q, which is affine. So the fiber is also affine. Then Corollary 5.4 implies that each irreducible component of this fiber is a singleton. Hence the fiber is finite. n It follows from Theorem 5.1 that dim Z = dim π(Z) ≤ dim PQ − dim Q − 1. EXERCISE 61. Let P be a projective space of dimension n. (a) The dual Pˇ of P is by definition the collection of hyperplanes in P . Prove that Pˇ has a natural structure of a projective space. (b) Identify the double dual of P with P itself. 62 2. PROJECTIVE VARIETIES (c) The incidence locus I ⊂ P × Pˇ is the set of pairs (p, q) ∈ P × Pˇ with the property that p lies in the hyperplane Hq defined by q. Prove that I is a nonsingular variety of dimension 2n − 1. (d) Show that we can find homogeneous coordinates [Z0 : ··· : Zn] for P and ˇ Pn [W0 : ··· : Wn] for P such that I is given by i=0 ZiWi = 0. EXERCISE 62. Let F ∈ k[X0,...,Xn]d define a nonsingular hypersurface H in Pn. Prove that the map H → Pˇn which assigns to p ∈ H the projectived tangent space of H at p is given by [ ∂F : ··· : ∂F ]. Prove that the image of this map is ∂Z0 ∂Zn closed in Pˇn (this image is called the dual of H). What can you say in case d = 2? 6. Constructible sets We now ask about the image of an arbitrary morphism of varieties. This need not be a variety as the following simple example shows. 2 2 EXAMPLE 6.1. Consider the morphism f : A → A , (x1, x2) 7→ (x1, x1x2).A 2 point (y1, y2) ∈ A is in the image of f if and only if the equation y2 = y1x2 has a solution in x2. This is the case precisely when y1 6= 0 or when y1 = y2 = 0. So the image of f is the union of the open subset y1 6= 0 and the singleton {(0, 0)}. This is not a locally closed subset, but the union of two such. DEFINITION 6.2. A subset of variety is called constructible if it can be written as the union of finitely many (locally closed) subvarieties. THEOREM 6.3. Let f : X → Y be a morphism of varieties. Then f takes constructible subsets of X to constructible subsets of Y . In particular, f(X) is con- structible. We first show that the theorem follows from PROPOSITION 6.4. Let X be an irreducible variety and let f : X → Y be a morphism of varieties. Then f(X) contains a nonempty open subset of its closure (in other words, f(X) contains a locally closed subvariety of Y which is dense in f(X)). PROOF THAT PROPOSITION 6.4 IMPLIES THEOREM 6.3. A constructible subset is a finite union of irreducible subvarieties and so it is clearly enough to prove that the image of each of these is constructible. In other words, it suffices to show that the image of a morphism f : X → Y of varieties with X irreducible is constructible. We prove this with induction on the dimension of X. According to Proposition 6.4 the closure of f(X) contains a nonempty open subset U such that f(X) ⊃ U. It is clear that f −1U is a nonempty open subset of X. If Z := X − f −1U, then f(X) = U ∪ f(Z). The irreducible components of Z have smaller dimension than X and so f(Z) is constructible by induction. PROOFOF PROPOSITION 6.4. Since X is covered by finitely many of its open affine subsets, we may without loss of generality assume that X is affine. So we can assume that X is closed in some An. Then the graph of f identifies X with a closed subset of An × Y and so we see that we only need to prove that for every n closed subset X ⊂ A × Y , πY (X) contains a nonempty open subset of πY (X). Since we can factor πY in an obvious manner into successive line projections (by forgetting the last coordinate) n n−1 1 A × Y → A × Y → · · · → A × Y → Y, 7. THE VERONESE EMBEDDING 63 it suffices to do the case n = 1. Upon replacing Y by πY (X), we are now left to 1 show that if X ⊂ A × Y is closed and such that πY (X) is dense in Y , then πY (X) contains a nonempty open subset of Y . Since X is irreducible, so is its image πY (X) and hence also Y . Upon replacing Y be a nonempty affine open subset and X by the preimage of that open subset, we may also assume that Y is affine. When X = A1 × Y , there is nothing to show. Otherwise X is contained in a 1 PN i hypersurface Z(f) ⊂ A × Y with equation f ∈ k[Y ][x], f(x, y) = i=0 ai(y)x , with ai ∈ k[Y ] and aN not identically zero. Next, by replacing Y by the nonzero set U(aN ) and X by its preimage, we may assume that aN is nowhere zero. We prove that then πY (X) = Y . Since is aN now invertible in k[Y ], we may assume that f is a monic polynomial. Then Z(f) → Y is a finite morphism and hence closed by Exercise 33. Since X is a closed subset of Z(f), it follows that πY (X) is closed in Y . Since πY (X) is also dense in Y , it follows that πY (X) ⊃ Y . 7. The Veronese embedding Let be given a positive integer d. We index the monomials in Z0,...,Zn that are homogenous of degree d by their exponents: these are the sequences of non- n+d negative integers d = (d0, . . . , dn) with sum d. They are d in number. We use (n+d)−1 this index set to label the homogeneous coordinates Zd of P d . n (n+d)−1 PROPOSITION 7.1 (The Veronese embedding). The map fd : P → P d d0 dn defined by Zd = T0 ··· Tn is an isomorphism onto a closed subset of the target projective space. PROOF. In order to prove that fd is an isomorphism onto a closed subset, it is enough to show that for every chart domain Ud of the standard atlas of the target −1 n space, its preimage fd Ud is open in P and is mapped by fd isomorphically onto a d0 dn closed subset of Ud. This preimage is defined by T0 ··· Tn 6= 0. Let us renumber the coordinates such that d0, . . . , dr are positive and dr+1 = ··· = dn = 0. Then −1 fd Ud = U0 ∩ · · · ∩ Ur ⊂ U0. So if we use the standard coordinates (x1, . . . , xn) on −1 U0, then fd Ud is defined by x1 ··· xr 6= 0. The coordinates on Ud are the functions Ze/Zd with e 6= d. Let us write ze−d := ze0−d0,e1−d1,...,en−dn for this function. This notation is chosen as to make the expression fd in terms of these coordinates simply: −1 e1−d1 en−dn fd : U0(x1 ··· xr) = fd Ud → Ud, ze−d = x1 ··· xn (e 6= d). e1−d1 en−dn Since the Laurent monomial x1 ··· xn has degree d0 − e0, the components k1 kn of the above morphism are all the nonconstant Laurent monomials x1 ··· xn with ki ≥ −di and of total degree k1 + ··· + kn ≤ d0. In particular, each xi appears −d1 −dr as a component and so does the Laurent monomial x1 ··· xr . The graph of d1 dr the regular function x1 ··· xr on U0(x1 ··· xr) identifies U0(x1 ··· xr) with the d1 dr n+1 closed hypersurface Z = Z(yx1 ··· xr − 1) in A . All other components of −d1 −dr fd|U0(x1 ··· xr): U0(x1 ··· xr) → Ud are products of x1, . . . , xn, x1 ··· xr and (n+d)−n−2 so its image is the graph of a morphism Z → A d . So we get a closed embedding. The following proposition is remarkable for its repercussions in intersection theory. 64 2. PROJECTIVE VARIETIES PROPOSITION 7.2. Let H ⊂ Pn be a hypersurface. Then Pn − H is affine and for every closed irreducible subset Z ⊂ Pn of positive dimension, Z ∩ H is nonempty and of dimension ≥ dim(Z) − 1, with equality holding if Z is not contained in H. For the proof we need the following complement to Proposition 5.5. PROPOSITION 7.3. For every proper closed subset Z (Pn there exists a linear subspace in P of dimension n − dim(Z) − 1 which misses Z. PROOF. We may (and will) assume that Z is irreducible. Let i be a nonnegative integer < n−dim(Z). We prove with induction on i that Z misses a linear subspace of dimension i. If i = 0, then we must have dim(Z) < n and so any singleton in Pn − Z is as required. When i > 0, there exists by induction hypothesis a linear subspace Q ⊂ Pn of dimension (i − 1) which does not meet Z. Then projection n from Q defines a morphism π : Z → PQ from Z to a projective space of dimension n − i. Any fiber of this map is an intersection of Z with a linear subspace in Pn of dimension i passing through Q. We claim that at least one of them is empty. For if n not, then π is surjective and hence dominant. In particular, dim(Z) ≥ dim(PQ) = n − i, which evidently contradicts our assumption that i < n − dim(Z). PROOFOF PROPOSITION 7.2. The hypersurface H is given by a homogeneous P d0 dn polynomial of degree d, say by d cdT0 ··· Tn . This determines a hyperplane n+d ˜ ( d )−1 P ˜ H ⊂ P defined by d cdZd. It is clear that H is the preimage of H under the Veronese morphism and hence the latter identifies Pn − H with a closed subset (n+d)−1 n of the affine space P d − H˜ . So P − H is affine. For the rest of the argument we may, by passing to the Veronese embedding, assume that H is a hyperplane. If dim(Z ∩ H) ≤ dim(Z) − 2, then by Proposition 7.3 there exists a linear subspace Q ⊂ H of dimension dim(H)−(dim(Z ∩H)−1 ≤ (n − 1) − (dim(Z) − 2) − 1 = n − dim(Z) which avoids Z ∩ H. Then Q is a linear subspace of Pn which avoids Z and we thus contradict Proposition 5.5. This proves that dim(Z ∩ H) ≥ dim(Z) − 1. If Z is irreducible and not contained in H, then we have also dim(Z ∩ H) ≤ dim(Z) − 1. EXERCISE 63. Let d be a positive integer. The universal hypersurface of degree d n+d n ( d )−1 P d0 d1 dn is the hypersurface of P × P defined by F (X,Z) := d ZdT0 T1 ··· Tn . (n+d)−1 We denote it by H and let π : H → P d be the projection. (a) Prove that H is nonsingular. (b) Prove that projection π is singular at (X,Z) (in the sense that the deriv- ative of π at (X,Z) is not a surjection) if and only the partial derivatives of FZ ∈ k[X0,...,Xn] have X as a common zero. (c) Suppose from now on that d ≥ 2. Prove that the singular set of π is a n (n+d)−1 smooth subvariety of P × P d of codimension n + 1. (n+d)−1 (d) Prove that the set of Z ∈ P d over which π has a singular point is a hypersurface. This hypersurface is called the discriminant of π. (n+d)−1 (e) For d = 2 we denote the coordinates of P d simply by Zij (where it is understood that Zij = Zji). Prove that the discriminant of π is then the zero set of det(Zij). 8. GRASSMANNIANS 65 8. Grassmannians Let P be a projective space of dimension n and let d ∈ {0, . . . , n}. We want to show that the collection Grd(P ) of linear d-dimensional subspaces of P is a nonsingular projective variety. Let the projective structure on P be defined by the pair (V, `) so that V is a (n + 1)-dimensional k-vector space and P has been identified with P(V ). LEMMA 8.1. Let Q ⊂ P be a linear subspace of codimension d + 1. Then the collection of linear d-dimensional subspaces of P contained in P − Q has in a natural manner the structure of an affine space AQ of dimension (n − d)(d − 1). PROOF. Let Q correspond to the linear subspace W ⊂ V of dimension (n + 1) − (d + 1) = n − d. Then the elements of AQ correspond to linear subspaces L ⊂ V of dimension d + 1 with L ∩ W = {0}. This means that V = L ⊕ W . The affine structure is best seen by fixing some L0 with that property. Then every other such L is always the graph of a (unique) linear map L0 → W and vice versa. So this identifies AQ with Hom(L0,W ). It is easy to verify that this affine structure is independent of the choices (the translation vector space of AQ is canonically identified with Hom(V/W, W )). It can now be shown without much difficulty that Grd(P ) admits a unique structure of a variety for which a subset as in this lemma is affine open and its identification with affine space an isomorphism. We will however proceed in a more direct manner to show that Grd(P ) admits the structure of a projective variety. V• ∞ Vp We recall that the exterior algebra V = ⊕p=0 V is the quotient of the ∞ ⊗p ⊗0 tensor algebra on V , ⊕p=0V (here V = k by convention), by the two-sided ideal generated by the ‘squares’ v ⊗ v, v ∈ V . It is customary to denote the product by the symbol ∧. So we can characterize V• V as (noncommutative) associative k-algebra with unit element by saying that is generated by the k-vector space V and is subject to the relations v ∧ v = 0 for all v ∈ V . It is a graded algebra (Vp V is the image of V ⊗p) and ‘graded-commutative’ in the following sense: if Vp Vq pq α ∈ V and β ∈ V , then β ∧ α = (−1) α ∧ β. If e0, . . . , en is a basis for V , then a basis of Vp V is indexed by the p-element subsets I ⊂ {0, . . . , n}: I = {0 ≤ i1 < i2 < ··· < ip ≤ n} is associated to the basis element eI = ei1 ∧· · ·∧eip V0 Vp n+1 (where the convention is that e∅ = 1 ∈ k = V ). So dim V = p . Notice Vn+1 Vp that V is one-dimensional and spanned by e0 ∧ · · · ∧ en, whereas V = 0 for p > n + 1. Let us say that α ∈ Vp V is fully decomposable if the exist linearly independent v1, . . . , vp in V such that α = v1 ∧ · · · ∧ vp. This is equivalent to the existence of a p-dimensional subspace K ⊂ V such that α is a generator of ∧pK. p LEMMA 8.2. Let α ∈ V V be nonzero. Denote by K(α) the set of e ∈ V with e∧α = 0. Then dim K(α) ≤ p and equality holds if and only if α is fully decomposable and spans Vp K(α). 0 PROOF. Let e1, . . . , er be a basis of K(α) and let V ⊂ V be a subspace supple- mentary to K(α). Then we have a decomposition • •−|I| ^ M ^ 0 V = eI ∧ V . I⊂{1,...,r} 66 2. PROJECTIVE VARIETIES Wedging with ei kills all the summands in which ei appears and is injective on the Vp−r 0 sum of the others. So α ⊂ e1 ∧ · · · ∧ er ∧ V . In particular, r ≤ p with equality holding if and only if α is a multiple of e1 ∧ · · · ∧ ep. If L is a linear subspace of V of dimension d + 1, then Vd+1 L is of dimension 1 and will be thought of as a one dimensional subspace of Vd+1 V . We thus have Vd+1 Vd+1 defined map δ : Grd(P ) → P( V ), [L] 7→ [ L]. It is called the Plucker¨ embedding because of: Vd+1 PROPOSITION 8.3. The map δ : Grd(P ) → P( V ) maps Grd(P ) bijectively Vd+1 onto a closed subset of P( V ). d+1 PROOF. Let α ∈ V V . According to Lemma 8.2, [α] is in the image of δ if and only if K(α) is of dimension d + 1 and if that is the case, then δ−1[α] has P(K(α)) as its unique element. In particular, δ is injective. Consider the linear map d+1 d+2 ^ ^ V → Hom(V, V ), α 7→ eα := α∧ The set of linear maps V → Vd+2 V with kernel of dimension ≥ d + 1 are those of rank < s := dim(Vd+2 V ) − d. If we choose a basis for V , then this locus is given by set a homogeneous equations in Hom(V, Vd+2 V ), namely the s × s-minors of the corresponding matrices. Hence the set of α ∈ Vd+1 V for which dim K(α) ≥ d + 1 is also given by a set of homogeneous equations and thus defines a closed subset of Vd+1 P( V ). This subset is the image of δ. Proposition 8.3 gives Grd(P ) the structure of projective variety. In order to complete the construction, let Q ⊂ P be a linear subspace of codimension d. Let W ⊂ V correspond to Q and choose a generator β ∈ Vn−d W . Then we have a nonzero linear form d+1 n+1 ^ ^ ∼ eβ = β∧ : V → V = k. Vd+1 Its kernel defines a hyperplane in P( V ) and hence an affine subspace Ueβ ⊂ Vd+1 P( V ). LEMMA 8.4. The preimage of Ueβ under the Plucker¨ embedding is the affine space AQ and δ maps AQ isomorphically onto its image. d+1 PROOF. If α is the generator of V L for some (d + 1)-dimensional subspace, then L ∩ W = {0} if and only if α ∧ β 6= 0: if L ∩ W contains a nonzero vector v then both α and β are divisible by v and so α ∧ β = 0; if L ∩ W = {0}, then we have decomposition V = L ⊕ V and it is then easily seen (by picking a compatible −1 basis of V for example) that α ∧ β 6= 0. This implies that δ Ueβ = AQ. Let us now express the restriction δ : AQ → Ueβ in terms of coordinates. Choose a basis e0, . . . , en for V such that ed+1, . . . , en is a basis for W and β = Vd+1 ed+1 ∧ · · · ∧ en. Then eβ simply assigns to an element α of V the coefficient of ∼ e0 ∧ · · · ∧ ed in α. If L0 ⊂ V denotes the span of e0, . . . , ed, then AQ = Hom(L0,W ) is identified with the affine space A(d+1)×(n−d) of (d + 1) × (n − d)-matrices via n j X j d (ai )0≤i≤d COROLLARY 8.5. The Plucker¨ embedding realizes Grd(P ) as a nonsingular sub- Vd+1 variety of P( V ) of dimension (n − d)(d + 1). This structure is compatible with the affine structure that we have on each AQ. PROOF. Every two open subsets of the form AQ a have nonempty intersection and so Grd(P ) is irreducible. The rest follows from the previous corollary. REMARK 8.6. The image of Grd(P ) is a closed orbit of the natural SL(V )-action Vd+1 on P( V ). It lies in the closure of any other SL(V )-orbit. EXERCISE 64. Let P be a projective space. Given integers 0 ≤ d < e ≤ dim P , prove that the set of pairs of linear subspaces R ⊂ Q ⊂ P with dim R = d and dim Q = e is a closed subvariety of Grd(P ) × Gre(P ). The Grassmannian of hyperplanes in a projective space is itself a projective space (see Exercise 61). So the simplest example not of this type is the Grassman- nian of lines in a 3-dimensional projective space. Let V be vector space dimension 4. On the 6-dimensional space V2 V we have a homogeneous polynomial F : V2 V → k of degree two defined by 4 ^ F (α) := α ∧ α ∈ V =∼ k (the last identification is only given up to scalar and so the same is true for F ). In coordinates F is quite simple: if e1, . . . , e4 is a basis for V , then (ei ∧ ej)1≤i F (T1,2,...,T3,4) = T1,2T3,4 − T1,3T2,4 + T1,4T2,3. Notice that F is irreducible. Its partial derivatives are the coordinates themselves (up to sign and order) and so F defines a smooth quadric hypersurface of dimension 4 in a 5-dimensional projective space. V2 PROPOSITION 8.7. The image of the Plucker¨ embedding of G1(P(V )) in P( V ) is the zero set of F . PROOF. The image of the Plucker¨ embedding is of dimension 4 and so must be a hypersurface. Since the zero set of F is an irreducible hypersurface, it suffices to show that the Plucker¨ embedding maps to the zero set of F . For this, let α be a V2 generator of L for some linear subspace L ⊂ V of dimension 2. If e1, . . . , e4 is a basis of V such that α = e1 ∧ e2, then it is clear that α ∧ α = 0. This proves that the Plucker¨ embedding maps to the zero set of F . 68 2. PROJECTIVE VARIETIES d+1 REMARK 8.8. The image of the Plucker¨ embedding Grd(P ) ,→ P(∧ V ) is in fact always the common zero set of a collection of quadratic equations, called the Plucker¨ re- lations. To exhibit these, we first recall that every φ ∈ V ∗ defines a linear ‘inner contrac- • •−1 tion’ map ιφ : ∧ V → ∧ V of degree −1 characterized by the fact that for v ∈ V , 0 p • p ιφ(v) = φ(v) ∈ k = ∧ V and for α ∈ ∧ V, β ∈ ∧ V , ιφ(α∧β) = ιφ(α)∧β +(−1) α∧ιφ(β). d+1 d+1 d+2 d We then define a linear map B : ∧ V × ∧ V → ∧ V ⊗ ∧ V as follows: if (e0, . . . , en) ∗ ∗ ∗ is a basis of V and (e0, . . . , en) is the basis of V dual to (e0, . . . , en), then n X ∗ B(α, β) := (α ∧ er) ⊗ (ιer β). r=0 It is easy to check that this is independent of the choice of basis. In particular, if α is fully decomposable, then we can choose (e0, . . . , en) in such a manner that α = e0 ∧ · · · ∧ ed. We then find: X ∗ B(α, α) = (e0 ∧ · · · ∧ ed ∧ er) ⊗ (ιer e0 ∧ · · · ∧ ed ∧ er) = 0, r ∗ because e0 ∧ · · · ∧ ed ∧ er = 0 for r ≤ d and ιer e0 ∧ · · · ∧ ed ∧ er = 0 for r > d. This identity might be called the universal Plucker¨ relation. One can show that conversely, any α ∈ ∧d+1V for which B(α, α) = 0 is either zero or fully decomposable. Let us rephrase this in terms of algebraic geometry: every nonzero linear form ` on d+2 d d+1 ∧ V ⊗ ∧ V , determines a quadratic form Q` on ∧ V defined by α 7→ `(B(α, α)) whose d+1 zero set is a quadratic hypersurface in P(∧ V ). This quadric contains the Plucker¨ locus and the latter is in fact the common zero set of the Q`, with ` running over the linear forms d+2 d on ∧ V ⊗ ∧ V . It can be shown that the Q` generate the full graded ideal defined by the Plucker¨ locus. PROPOSITION-DEFINITION 8.9. Let X be a closed subvariety of the projective space P . If d is an integer between 0 and dim P , then the set of d-linear subspaces of P which are contained in X defines a closed subvariety Fd(X) of Grd(P ), called the Fano variety (of d-planes) of X. PROOF. An open affine chart of Grd(P ) is given by a decomposition V = L⊕W with dim L = d + 1 and dim W = n − d and is then parametrized by Hom(L, W ) by assigning to A ∈ Hom(L, W ) the graph of A. It suffices to prove that via this identification Fd(X) defines a closed subset of Hom(L, W ). Choose homogeneous coordinates [Z0 : ··· : Zn] such that L resp. W is given by Zd+1 = ··· = Zn = 0 resp. Z0 = ··· Zd = 0. A linear map A ∈ Hom(L, W ) ∗ P is then given by A Zd+i = j=0d aijZj, i = 1, . . . , n − d. It defines an element ∗ ∗ of Fd(X) if and only for all G ∈ ∪m≥0Im(X), G(Z0,...,Zd,A Zd+1,...A Zn) is m0 md identically zero. This means that the coefficient of every monomial Z0 ··· Zd in such an expression much vanish. Since this coefficient is a polynomial in the matrix coefficients aij of A, we find that the preimage of Fd(X) in Hom(L, W ) is the common zero set of a set of polynomials and hence is closed therein. EXAMPLE 8.10. Consider the case of a quadratic hypersurface X ⊂ P(V ) and assume for simplicity that char(k) 6= 2. So in terms of homogeneous coordinates 1 P [T0 : ··· : Tn], X can be given by an equation F (T0,...,Tn) = 2 0≤i,j≤n bijTiTj 1 2 with bij = bji (so 2 bii is the coefficient of Ti ). In more intrinsic terms: X is given by a symmetric bilinear form B : V × V → k (namely by B(v, v) = 0). Let us assume that X is nonsingular. This means that the partial derivatives of F have no common zero in P(V ). This translates into: B : V × V → k is nonsingular, that is, the linear map b : v ∈ V 7→ B( , v) ∈ V ∗ is an isomorphism of vector spaces. 8. GRASSMANNIANS 69 A subspace L ⊂ V determines an element of the Fano variety of X precisely when B(v, v) = 0 for all v ∈ L. This implies that B is identically zero on L × L. So b maps L to (V/L)∗ ⊂ V ∗. Since b is injective, this implies that dim L ≤ dim(V/L), 1 in other words that dim L ≤ 2 dim V . This condition is optimal. If for instance dim V = 2m + 2 is even, then it not difficult to show that we can find coordinates (T0, ··· ,T2m+1) such that F = Pm 0 i=0 TiTm+1+i. Let L resp. L be the linear subspace defined by Tm+1 = ··· = 0 T2m+1 = 0 resp. T0 = ··· = Tm = 0, so that V = L ⊕ L . Notice that both [L] and 0 0 [L ] are in Fm(X). The vector space Hom(L, L ) describes an affine open subset of the Grassmannian of m-planes in P(V ). An element A ∈ Hom(L, L0) is given by ∗ Pm A Tm+i = j=0 aijTj, j = 0, . . . , m. The corresponding m-plane is contained in Pm X precisely when i,j=0 aijTiTj is identically zero, i.e., if (aij) is antisymmetric. It follows that [L] ∈ Fm(X) has a neighborhood isomorphic to an affine space of 1 dimension 2 m(m + 1). EXERCISE 65. Let X be a quadratic hypersurface P(V ) of odd dimension 2m+1 and assume for simplicity that char(k) 6= 2. Prove that Fm+1(X) = ∅ and that Fm(X) 6= ∅ (Hint: use the fact that we can choose coordinates (T0,...,T2m+2) for 2 Pm+1 V such that F is given by T0 + i=1 TiTm+1+i.) Prove that Fm(X) is a smooth variety and determine its dimension. EXERCISE 66. Let X ⊂ Pn be a hypersurface of degree d and let 0 ≤ m ≤ n. n Prove that the intersection of Fm(X) with a standard affine subset of Grm(P ) is m+d given by d equations. EXERCISE 67. Consider the universal hypersurface of degree d in Pn, H ⊂ n (n+d)−1 P × P d . n (n+d)−1 (a) For every m-plane Q ⊂ P , let Yz denote the set of z ∈ P d for which the corresponding hypersurface Hz contains Q. Prove that Yz is a linear n+d ( d )−1 m+d subspace of P of codimension d . (n+d)−1 (n+d)−1 (b) Let Y ⊂ P d be the set of z ∈ P d for which Hz contains an (n+d)−1 m-plane. Prove that Y is a closed subset of P d of codimension at m+d least d − (m + 1)(n − m). (c) Prove that the family of m-planes contained in a generic hypersurface of n m+d degree d in P is of dimension (m + 1)(n − m) − d or empty. In m+d1 particular, this is a finite set when (m + 1)(n − m) = d . Let P be as before and let X be a nonsingular (not necessarily closed) subvari- ety of P of dimension d. For every p ∈ X, we have defined the tangent space TpX as a d-dimensional subspace of TpP . There is precisely one d-dimensional linear subspace Tˆ(X, p) of P which contains p and has the same tangent space at p as X. In terms of a standard affine chart: if U ⊂ P is the complement of a hyperplane which does not pass through p, then U has the structure of an affine space and Tˆ(X, p) ∩ U is then the affine subspace of U spanned by TpX. PROPOSITION-DEFINITION 8.11. The map G : X → Grd(P ), p ∈ X 7→ Tˆ(X, p) is a morphism. This morphism is called the Gauss map. 1 3 For instance, every cubic surface in P (so here n = 3, d = 3 and m = 1) contains a line. If it is nonsingular, then it contains in fact exactly 27 lines. This famous result due to Cayley and Salmon published in 1849 is still subject of research. 70 2. PROJECTIVE VARIETIES PROOF. Let p0 ∈ X. The tangent space Tˆ(X, p0) defines a linear subspace L ⊂ V of dimension d + 1. This linear subspace contains the line `p0 defined by po. Choose homogeneous coordinates Z0,...,Zn on P such that p0 lies in the standard open subset U0 ⊂ P defined by Z0 6= 0. We use the standard coordinates zi := Zi/Z0 to parametrize U0 ⊂ P , but this time we find it convenient to include z0, which is of course constant 1 on U0. In this way U0 is identified with the affine hyperplane H1 ⊂ V defined by z0 = 1. The tangent space TpU0 (p ∈ U0) is then just p+H, where H ⊂ V is defined by z0 = 0. In particular, if p ∈ X ∩U0, then TpX can be regarded as a linear subspace K(p) ⊂ H0 of dimension d displaced over p. The span of p and K(p) is then a linear subspace of V which corresponds to Tˆ(X, p). According to Theorem 10.12 there exists a neighborhood U of p0 in U0, reg- ular functions f1, . . . , fn−d on U such that X ∩ U is their common zero set and df1, . . . , dfn−d are linearly independent on all of U. Then for every p ∈ X ∩ U, K(p) is the common zero set of df1(p), . . . , dfn−d(p). We regard (df1, . . . , dfn−d) n−d as a matrix-valued function on U, denoted D : U → Hom(H0, k ), so that for p ∈ U ∩ X, K(p) is the kernel of D(p). We also assume our coordinates chosen in 0 00 such a manner that K(p0) is spanned by e1, . . . , ed and then write H = H ⊕ H 0 00 with H = K(p0) and H spanned by ed+1, . . . , en. We write D = (D0,D00), with D0 : U → Hom(H0, kn−d),D00 : U → Hom(H00, kn−d). 00 00 Since D (p0) is an isomorphism, the set of p ∈ U for which D (p) is an isomorphism defines an open neighborhood of p0 and so we may as well assume that that this is the case on all of U. Cramer’s rule shows that p ∈ U 7→ D00(p)−1 ∈ Hom(kn−d,H00) has regular entries: it is a morphism. For every p ∈ U, the kernel of D(p) is the graph of the linear map B(p) := −D00(p)−1D0(p) ∈ Hom(H0,H00). In particular, for p ∈ X ∩ U, TpX is parallel to the subspace spanned by e1 + B(p)(e1), . . . , ed + B(p)(ed). It follows that a linear subspace of V which corresponds to Tˆ(X, p) is spanned Pn by p = e0 + i=1 piei and the d vectors e1 + B(p)(e1), . . . , ed + B(p)(ed). This is 0 n−d the graph of the linear map A(p): ke0 ⊕ H → k defined by d n X X A(p)(e0) := − zi(p)B(p)(ei) + pjej,A(p)(ei) := B(p)(ei)(i = 1, . . . d). i=1 j=d+1 Since A is a morphism, this defines a morphism X ∩ U → Grd(P ). For a closed irreducible subset X ⊂ P , the Gauss map is defined on its nonsin- gular part: G : Xreg → Grd(P ). The closure of the graph of G in X × Grd(P ) is called the Nash blowup of X and denoted Xˆ. The projection morphism Xˆ → X is an isomorphism over the open-dense subset Xreg and hence birational. A remark- able property of Xˆ is that the Zariski tangent space of each point contains a distin- guished d-dimensional subspace (prescribed by the second projection to Grd(P )) in such a manner that these subspaces extend the tangent bundle in a regular manner. 9. Multiplicities of modules B´ezout’s theorem asserts that two distinct irreducible curves C,C0 in P2 of de- grees d and d0 intersect in dd0 points. Strictly speaking this is only true if C and C0 intersect as nicely as possible, but the theorem is true as stated if we count each 9. MULTIPLICITIES OF MODULES 71 point of intersection with an appropriate multiplicity. There is in fact a generaliza- tion: the common intersection of n hypersurfaces in Pn has cardinality the product of the degrees of these hypersurfaces, provided that this intersection is as nice as possible and again, this is even true more generally when the locus of intersection is finite and each point of intersection is counted with an appropriate multiplicity. One of our aims is to define these multiplicities. The commutative algebra tools that we use for this have an interest in their own right. DEFINITION 9.1. We say that an R-module has length ≥ d if there exist a d-step filtration by submodules M = M 0 ) M 1 ) ··· ) M d = {0}. The length of M is the supremum of such d (and so may be ∞). EXERCISE 68. Suppose R is a noetherian local ring with maximal ideal m and residue field K. Prove that the length of a finitely generated R-module M is finite d Pd−1 i i+1 precisely when m M = 0 for some d and is then equal to i=0 dimK (m M/m M). Prove that if R is a K-algebra, then this is also equal to dimK (M). We fix a noetherian ring R and a finitely generated R-module M. Recall that if p is a prime ideal of R, then Rp is a local ring with maximal ideal pRp whose residue field can be identified with the field of fractions of R/p. We define Mp := Rp ⊗R M. So this is a Rp-module. −1 REMARK 9.2. We can describe Mp and more generally, any localization S R ⊗R M, as follows. Consider the set S−1M of expressions m/s with m ∈ M and s ∈ S with the understanding that m/s = m0/s0 if the identity s00s0m = s00sm holds in M for some s00 ∈ S (so we are considering the quotient of S×M by an equivalence relation). Then the following rules put on S−1M the structure of a R-module: m/s − m0/s0 := (s0m − sm0)/(ss0), r · m/s := rm/s. The map S−1R × M → S−1M, (r/s, m) → (rm)/s is R-bilinear and hence factors through −1 −1 −1 an R-homomorphism S R ⊗R M → S M. On the other hand, the map S M → −1 0 0 00 0 S R ⊗R M, m/s 7→ 1/s ⊗R m is also defined: if m/s = m /s , then s (s m = sm) for some s00 ∈ S and so 0 00 0 00 0 00 00 0 1/s ⊗R m = 1/(ss s ) ⊗R s s m = 1/(ss s ) ⊗R ss m = 1/s ⊗R m. It is an R-homomorphism and it immediately verified that it is a two-sided inverse of the −1 −1 map above. So S R ⊗R M → S M is an isomorphism. This description shows in particular that if N ⊂ M is a submodule, then S−1N may be regarded as submodule of S−1M (this amounts to: S-localization is an exact functor). DEFINITION 9.3. The multiplicity of M at a prime ideal p of R, denoted µp(M), is the length of Mp as an Rp-module. (For x ∈ Spec(R), we may also write µx(M) instead of µpx (M).) REMARK 9.4.√ Let X be a variety, x ∈ X a closed point of X and I ⊂ OX,x r an ideal with I = mX,x. So I ⊃ mX,x for some positive integer r. Then r dimk(OX,x/I) is finite (since dimk(OX,x/mX,x) is) and according to Exercise 68 equal to the length of OX,x/I of as an OX,x-module. Hence it is the multiplic- ity of OX,x/I at the prime ideal mX,x: µx(OX,x/I) = dimk(OX,x/I). If X is affine and we are given an ideal I ⊂ k[X] whose image in OX,x is I, then OX,x/I is the localization of k[X]/I at x and x is an isolated point of Z(I). Then still µx(k[U]/I) = dimk(OX,x/I). 72 2. PROJECTIVE VARIETIES If X is nonsingular at x of dimension n and I has exactly n generators f1, . . . , fn, n then we will see that µp(OX,x/(f1, . . . , fn)) = dimk(OA ,p/(f1, . . . , fn)) can be in- terpreted as the multiplicity of p as a common zero of f1, . . . , fn. We wish to discuss the graded case parallel to the ungraded case. This means ∞ that if R is graded: R = ⊕i=0Ri, then we assume M to be graded as well, that is, M is endowed with a decomposition as an abelian group M = ⊕i∈ZMi such that Rj sends Mi to Mi+j (we do not assume that Mi = 0 for i < 0). For example, a homogeneous ideal in R is a graded R-module. In that case we have the notion of graded length of M, which is the same as the definition above, except that we only allow chains of graded submodules. The annihilator of an R-module M, Ann(M), is the set of r ∈ R with rM = 0. It is clearly an ideal of R. We denote by P(M) the set of prime ideals of R which contain Ann(M) and are minimal for that property. According to Exercise 14 these are finite in number and their common intersection equals pAnn(M). In the graded setting, Ann(M) is a graded ideal and then according to Lemma 2.3 the members of P(M) are all graded. We further note that the annihilator of any m ∈ M, Ann(m) := {r ∈ R : rm 6= 0} is an ideal of R, which is graded when m is homogeneous. LEMMA 9.5. Let M be a finitely generated and nonzero (graded) R-module. Then the collection of annihilators of nonzero (homogeneous) elements of M contains a maximal element and any such maximal element is a (graded) prime ideal of R. PROOF. We only do the graded case. The first assertion follows from the noe- therian property of R. Let now Ann(m) be a maximal element of the collection (so with m ∈ M homogeneous and nonzero). It suffices to show that this is a prime ideal in the graded sense (see Exercise 54), i.e., to show that if a, b ∈ R are homogeneous and ab ∈ Ann(m), but b∈ / Ann(m), then a ∈ Ann(m). So bm 6= 0 and a ∈ Ann(bm). Since Ann(bm) ⊃ Ann(m), the maximality property of the latter implies that this must be an equality: Ann(bm) = Ann(m), and so a ∈ Ann(m). If l is an integer and M is graded, then M(l) denotes the same module M, but with its grading shifted over l, meaning that M(l)i := Ml+i. Let us call a (graded) R-module elementary if it is isomorphic to R/p ((R/p)(l)), where p is a (homogeneous) prime ideal (and l ∈ Z). So the above lemma says that every nonzero (graded) R-module contains an elementary submodule. PROPOSITION 9.6. Every finitely generated (graded) R-module M can be obtained as a successive extension of elementary modules in the sense that there exists a finite filtration by (graded) R-submodules M = M 0 ) M 1 ) ··· ) M d = {0} such that each quotient M i/M i+1 is elementary. PROOF. We do the graded case only. Since M is noetherian, the collection of graded submodules of M that can be written as a successive extension of elemen- tary modules has a maximal member, M 0, say. We claim that M 0 = M. If M/M 0 were nonzero, then according to Lemma 9.5, it contains an elementary submodule. But then the preimage N of this submodule in M is a successive extension of an extension of elementary modules which strictly contains M 0. This contradicts the maximality of M. 9. MULTIPLICITIES OF MODULES 73 PROPOSITION 9.7. In the situation of the preceding proposition, let pi be the prime ideal of R such that M i−1/M i =∼ R/pi. Then P(M) is the set of minimal members i d of the collection {p }i=1 and every p ∈ P(M) occurs precisely µp(M) times in the sequence (p1,..., pd). p PROOF. We first show that Ann(M) = p1 ∩ · · · ∩ pd. If r ∈ p1 ∩ · · · ∩ pd, then r maps M i−1 to M i and so rd ∈ Ann(M). This proves that p1 ∩· · ·∩pd ⊂ pAnn(M). Conversely, if r ∈ R and l ≥ 1 are such that rl ∈ Ann(M), then rl ∈ pi for all i. This implies that r ∈ pi for all i. i d So the collection of minimal members of {p }i=1 is just the set P(M) of minimal prime ideals containing Ann(M). In particular, no pi can be strictly contained in a member of P(M). 0 d Fix p ∈ P(M). We then have a filtration Mp = Mp ⊃ · · · ⊃ Mp = {0} (for an inclusion of R-modules induces an inclusion of Rp-modules). We have i−1 i ∼ i Mp /Mp = Rp/p Rp and the latter is the residue field Rp/pRp or trivial according to whether or not pi = p (if pi 6= p, then pi contains an element not in p; this gives i in Rp an invertible element, and hence p Rp = Rp). Following our definition the first case occurs precisely µp(M) times. We can of course pass from the graded case to the nongraded case by just forgetting the grading. But more interesting and useful is the following construc- tion, where we pass from a projective setting to an affine one and vice versa. The example to keep in mind is when we associate to a homogeneous polynomial of degree d, F ∈ k[T0,...,Tn] the inhomogeneous polynomial f ∈ k[x1, . . . , xn] of degree ≤ d by f(x1, . . . , xn) := F (1, x1, . . . , xn) and go back via F (T0,...,Tn) := d T0 f(T1/T0,...,Tn/T0). Let p ⊂ R be a graded prime ideal. Then denote by Rp,0 the set of fractions r/s with r ∈ Ri and s ∈ Ri − pi for some i. This is clearly a subring of Rp; it is in fact a local ring whose maximal ideal mp is obtained by taking in the previous sentence r ∈ pi. For instance, if R = k[T0,...,Tn] and p = (T1,...,Tn), then p defines the n n point p = [1 : 0 : ··· : 0] ∈ P and Rp can be identified with OP ,p. This definition makes also sense for any graded R-module M: Mp,0 is the set of fractions m/s with m ∈ Mi and s ∈ Ri − pi for some i. Note that this is a Rp,0-module. −1 Suppose now that p1 6= R1 and choose s1 ∈ R1 − p1 so that s1 ∈ (Rp)−1. Then R(l)p,0 is the the set of fractions r/s with r ∈ Ri+l and s ∈ Ri − pi for some l l l l i. For such a fraction we have r/s = s1r/(s1s) ∈ s1Rp,0, so that R(l)p,0 = s1Rp,0, which as an Rp,0-module is of course isomorphic to Rp,0. So the graded iterated extension M = M 0 ) M 1 ) ··· ) M d = {0} of M by elementary R-modules yields an iterated extension of Mp,0 by trivial or by elementary Rp,0-modules: Mp,0 = 0 1 d Mp,0 ⊃ Mp,0 ⊃ · · · ⊃ Mp,0 = {0} and the number of times a nonzero successive quotient appears is µp(M). In other words, we have µp(M) = µmp (Mp,0). We use this observation mainly via the following example. n EXAMPLE 9.8. Let J ⊂ k[T0,...,Tn] be a homogeneous ideal and let p ∈ P be an isolated point of ZJ . We take here M := k[T0,...,Tn]/J and take for p the graded ideal Ip ⊂ k[T0,...,Tn] defining p. Then k[T0,...,Tn]Ip,0 can be identified with the local k-algebra O n . Moreover, J defines an ideal J ⊂ O n and we P ,p √ P ,p n n can identify MIp,0 with OP ,p/J. Notice that I = mP ,p. According to the above n n discussion µI (M) = µp(OP ,p/I) and by Exercise 68 this is just dimk(OP ,p/J). 74 2. PROJECTIVE VARIETIES 10. Hilbert functions and Hilbert polynomials We shall be dealing with polynomials in Q[z] which take integral values on in- tegers. Such polynomials are called numerical. An example is the binomial function of degree n ≥ 0: z z(z − 1)(z − 2) ··· (z − n + 1) := . n n! k It has the property that its value in any integer k is an integer, namely n if k ≥ n, n −k+n−1 0 if 0 ≤ k ≤ n − 1 and (−1) n if k ≤ −1. Let ∆ : Q[z] → Q[z] denote the difference operator: ∆(f)(z) := f(z +1)−f(z). It clearly sends numerical polynomials to numerical polynomials. It is also clear that z z the kernel of ∆ consists of the constants Q. Notice that ∆ n+1 = n . LEMMA 10.1. Every P ∈ Q[z] which takes integral values on sufficiently large integers is numerical and a Z-basis of the abelian group of numerical polynomials is provided by the binomial functions. If f : Z → Z is a function such that for sufficiently large integers ∆(f) is given by a polynomial, then so is f. PROOF. The first assertion is proved with induction on the degree d of P . If d = 0, then P is constant and the assertion is obvious. Suppose d > 0 and the Pd−1 z assertion known for lower values of d. So ∆(P )(z) = i=0 ci i for certain ci ∈ Z. Pd−1 z Then P (z) − i=0 ci i+1 is in the kernel of ∆ and hence is constant. As this expression takes integral values on large integers, this constant is an integer. This proves that P is an integral linear combination of binomial functions. For the second assertion, let P ∈ Q[z] be such that P (i) = ∆(f)(i) ∈ Z for P z large i. By the preceding, P (z) = i ai i for certain ai ∈ Z. So if we put P z Q(z) := i ai i+1 , then Q is a numerical polynomial with ∆(f − Q)(i) = 0 for large i. This implies that f − Q is constant for large i, say equal to a ∈ Z. So f(i) = Q(i) + a for large i and hence Q + a is as required. We shall see that examples of such functions are furnished by the Hilbert func- tions of graded noetherian modules. REMARK 10.2. A function f : Z → Z which is zero for sufficiently negative integers de- P k k termines a Laurent series Lf := f(k)u ∈ ((u)). For the function k 7→ max{0, } k∈Z Z n this gives n n n n n X k(k − 1) ··· (k − n + 1) k u d X k u d 1 u u = u = = . n! n! dun n! dun 1 − u n!(1 − u)n k≥n k≥0 So if we also know that for sufficiently large integers f is the restriction of a polynomial function, then Lemma 10.1 implies that Lf is a Z-linear combination of powers of u and the un −1 −1 rational functions n!(1−u)n . In particular, Pf ∈ Q[u][u , (1 − u) ]. In the rest of this section R stands for the graded ring k[T0,...,Tn] where each Ti has degree one. An R-module is always assumed to be graded and finitely generated. Let M be a graded noetherian R-module. We define the Hilbert function of M, φM : Z → Z, by φM (i) := dimk Mi. i+n EXAMPLE 10.3. The Hilbert function of R is given by φR(i) = n . 10. HILBERT FUNCTIONS AND HILBERT POLYNOMIALS 75 LEMMA 10.4. Let F ∈ Rd and denote by Ann(F,M) ⊂ M the set of m ∈ M with F m = 0 (this is a graded submodule of M). Then we have φM/F M (i) = φM (i) − φM (i − d) + φAnn(F,M)(i − d). PROOF. In degree i, multiplication by F defines a k-linear map between finite dimensional k-vector spaces Mi−d → Mi whose kernel is Ann(F,M)i−d and whose cokernel is (M/F M)i. The lemma easily follows from this. The zero set of the graded ideal Ann(M) in Pn will be called the support of M and denoted supp(M). THEOREM 10.5 (Hilbert-Serre). Let M be a finitely generated graded module over the graded ring R = k[T0,...,Tn] (where each Ti has degree 1). Then there exists a unique numerical polynomial PM ∈ Q[z], the Hilbert polynomial of M, such that φM (i) = PM (i) for i sufficiently large. The degree of PM is equal to dim(supp(M)) if we agree that the zero polynomial has the same degree as the dimension of the empty set (namely −1). PROOF. We proceed with induction on n ≥ −1. For n = −1, M is a finite dimensional graded k-vector space and hence Mi = 0 for i sufficiently large. So assume n ≥ 0 and the theorem verified for lower values of n. We first reduce to the case when M is of the form R/p, with p a graded prime ideal. If N is a graded submodule of M, then Ann(M) = Ann(N)∩Ann(M/N) and so supp(M) = supp(N) ∪ supp(M/N). Since dimk(Mi) = dimk Ni + dimk(Mi/Ni), we have φM = φN + φM/N . It follows that if the theorem holds for N and M/N, then it holds for M. As M is a successive extension of elementary modules, it suffices to do the case M = (R/p)(l) with p a graded prime ideal. Since φM (i) = φR/p(l + i), it is enough to do the case M = R/p. So let M = R/p. Then supp(M) = supp(R/p) is defined by the graded ideal p. In case p = (T0,...,Tn), the theorem holds trivially: we have dimk Mi = 0 for i > 0 (so that we may take PM to be identically zero) and supp(M) = ∅. Suppose therefore that p 6= (T0,...,Tn), say that Tn ∈/ p. Now multiplication by Tn sends M isomorphically onto TnM with quotient M¯ := M/TnM and so for ¯ i ≥ 0, φM¯ (i) = φM (i) − φM (i − 1) = ∆φM (i − 1). Since Ann(M) = Ann(M) + ¯ n−1 TnR, we have supp(M) = supp(M) ∩ P . According to Proposition 7.2 we then have dim supp(M¯ ) = dim supp(M) − 1. By regarding M¯ as a finitely generated module over R¯ := k[T0, ··· ,Tn−1], our induction hypothesis tells us that there ¯ exists a polynomial PM¯ of degree dim supp(M) such that φM¯ and PM¯ coincide on large integers. Since ∆φM (i − 1) = PM¯ (i) for large i, Lemma 10.1 implies that there exists a polynomial PM of degree one higher than that of PM¯ (so of degree dim(supp(M))) which coincides with φM for sufficiently large integers. It follows from Lemma 10.1 that when PM is nonzero, then its leading term has d the form cdz /d!, where d is the dimension of supp(M) and cd is a positive integer. REMARK 10.6. For M as in this theorem we may also form the Laurent series LM (u) := P i i dim(Mi)u (this is usually called the the Poincar´e series of M). It follows from Remark −1 −1 10.2 and Theorem 10.5 that LM (u) ∈ Q[u][u , (1 − u) ]. DEFINITION 10.7. If d = dim supp(M), then the degree deg(M) is d! times the leading coefficient of its Hilbert polynomial (an integer, which we stipulate to be n zero in case supp(M) = ∅). For a closed subset Y ⊂ P , the Hilbert polynomial PY resp. the degree deg(Y ) of Y are those of R/IY as a R-module. 76 2. PROJECTIVE VARIETIES Let M be a finitely generated graded R-module. Recall that P(M) denotes the set of minimal prime ideals containing Ann(M). The support of M is defined by the ideal Ann(M) and is graded. So it defines a closed subset in Pn which in our previous notation is just ZAnn(M). For every p ∈ P(M) not equal to (T0,...,Tn), n Zp ⊂ P is an irreducible component of ZAnn(M) whose degree is deg(R/p) and all irreducible components of ZAnn(M) are so obtained (for p = (T0,...,Tn) we get Zp = ∅, but deg(R/(T0,...,Tn) = 0). PROPOSITION 10.8. Let M be a finitely generated graded R-module. Then X deg(M) = µp(M) deg(R/p), p∈P(M) where we note that the right hand side can also be written as a sum over the irreducible components of ZAnn(M) of maximal dimension, where we then may replace deg(R/p) by the degree of that component. PROOF. Write M as an iterated extension by elementary modules: M = M 0 ) 1 d i−1 i ∼ Pd M ) ··· ) M = {0} with M /M = R/pi(li). Then φM (i) = i=1 φR/pi (i−li). Now φR/pi is a polynomial of degree equal to the dimension of supp(R/pi) = Zpi ⊂ n P . This degree does not change if we replace the variable i by i − li. In view of Proposition 9.7 we only get a contribution to the leading coefficient of φM when pi ∈ P(M) and for any given p ∈ P(M) this happens exactly µp(Mp) times. We only need to consider the cases p for which the degree of φR/p is maximal (i.e., equals the degree of φM ). The proposition follows. EXERCISE 69. Compute the Hilbert polynomial and the degree of n (n+d)−1 (a) the image of the d-fold Veronese embedding of P in P n , (b) the image of the Segre embedding of Pm × Pn in Pmn+m+n. EXERCISE 70. Let Y ⊂ Pm and Z ⊂ Pn be closed and consider Y ×Z as a closed subset of Pmn+m+n via the Segre embedding. Prove that the Hilbert function resp. polynomial of Y ×Z is the product of the Hilbert functions resp. polynomials of the factors. One can show that there exists a nonempty open subset of (n − d)-planes Q ⊂ Pn which meet Y in exactly deg(Y ) points (see Exercise 71). This characterization is in fact the classical way of defining the degree of Y . n Observe that if Y ⊂ P is nonempty, then deg(Y ) > 0. For then IY,d 6= Rd for every d ≥ 0 and so the Hilbert function of S(Y ) is positive on all nonnegative integers. This implies that PY is nonzero with positive leading coefficient. We also note that since deg(Y ) only depends on the dimensions of the graded pieces S(Y )i of the homogenenous coordinate ring the degree of Y will not change if we regard Y as sitting in a higher dimensional projective space Y ⊂ Pn ⊂ Pm (with m ≥ n). We verify some other properties we expect from a notion of degree. PROPOSITION 10.9. A hypersurface H ⊂ Pn defined by an irreducible polynomial of degree d has degree d. PROOF. Let F ∈ Rd be the polynomial in question and apply Lemma 10.4 to M = R. Since the Hilbert function of R takes at the positive integer i the value 10. HILBERT FUNCTIONS AND HILBERT POLYNOMIALS 77