An Elementary "Proof" of Fermat's Last Theorem

Introduction

In the three centuries since its proposal, Fermat's Last Theorem has led to the development an enormous amount of mathematics, both good and bad. On the good side, it would not be too far-fetched to assert that much of modern algebraic number theory has sprung from attempts to prove what Fermat is alleged to have written in the margin of his copy of Diophantus in 1637. However, because of the problem's simple statement it has also generated a mountain of material which as best was wrong and at worst not even classifiable as mathematics.

For the past seven years, James Harris has been a regular fixture on the Usenet newsgroup, sci.math. James, an amateur mathematician, has produced scores, if not hundreds of self-proclaimed elementary "proofs" of Fermat's Last Theorem, none of which, to date, have withstood serious scrutiny. Due to James's unfamiliarity with mathematical exposition, the work available on his main and "area one" web sites and his contributions to the newsgroup have often been difficult to understand and have been augmented and clarified by many others over the years. This page is my attempt to gather together what has been said by and to Mr. Harris about FLT and present it in a style which meets at least minimal standards for clarity.

Rick Decker


We begin by establishing four preliminary results.

Lemma 1. If p is an odd prime, p(r + s)rs | (r + s)p - rp - sp. Denote by Hp the polynomial

 

Hp = ((r + s)p - rp - sp) / (p(r + s)rs).

Then Hp(r, s) = Gp(r + s, rs), where Gp is a polynomial with integer coefficients.

Proof. We observe that each term of (r + s)p - rp - sp has as its coefficient an order-p binomial coefficient, and it is well known that if p is prime, all of these will be divisible by p. It is easy to verify that both rs and r + s divide (r + s)p - rp - sp. To establish the last statement of the Lemma, we observe that H is a symmetric polynomial and that any such polynomial may be expressed in terms of the elementary symmetric functions, which in this 2-variable case are r + s and rs. [Note 1.]

Lemma 2. Let p be an odd prime and x, y, z be integers satisfying xp + yp = zp and let v be an arbitrary integer. Then

 

(vp + 1)z2p - pv(xy)2 z2 Gp(vz2, (xy)2) - 2 (xy)p = 0 mod (x2 + y2 + vz2),

where Gp is defined in Lemma 1.

Proof. For any integer v we have, trivially,

 

x2 + y2 = -vz2 mod (x2 + y2 + vz2).

Raising both sides to the power p gives

 

(x2 + y2)p = -(vz2)p mod (x2 + y2 + vz2).

Expanding the left side and using Lemma 1 gives

 

x2p + p(xy)2(x2 + y2)Hp(x2, y2) + y2p = -(vz2)p mod (x2 + y2 + vz2).

Since we assume that x, y, and z satisfy xp + yp = zp we can use (xp + yp)2= z2p to eliminate x2p and y2p in the congruence above, yielding

 

(vp + 1)z2p - p(xy)2(x2 + y2)Hp(x2, y2) - 2(xy)p = 0 mod (x2 + y2 + vz2).

Again referring to Lemma 1, we see that Hp(x2, y2) may be expressed as a polynomial in x2 + y2 and (xy)2, so modulo x2 + y2 + vz2 may be denoted by Gp(vz2, (xy)2), yielding the desired result.

Lemma 3. Let p be an odd prime. Let x, y, z be pairwise coprime positive integers satisfying xp + yp = zp. Then

  1. One of z - y, z - x, or x + y has a prime factor qp.
  2. Without loss of generality, suppose q | z - y. Then q | x, and if x = qju, with q not dividing u, then qjp | z - y.

Proof. First, it is easy to show that no two of z - y, z - x, x + y can be 1. If, for example, z - y = z - x = 1, then we would have x = y, contradicting the relative primality of x and y (since we obviously couldn't have x = y = 1). The other two cases are handled similarly. It is also easy to show that no two of z - y, z - x, x + y can be divisible by p If, for example, z = y mod p and z = x mod p, then, applying Fermat's little theorem to xp + yp = zp, we would have x + y = z mod p, so we would conclude that x = 0 mod p and y = 0 mod p, again contradicting the relative primality of x and y. The remaining two cases are handled similarly. Combining these two results, we see that at least one of z - y, z - x, or x + y must have a prime factor q unequal to p.

Suppose that q | z - y. Then, since

 

xp = zp - yp = (z - y)(zp-1 + zp-2y + ... + zyp-2 + yp-1)

it is clear that z - y | xp, so q | x. It is an old result that for prime p, z - y and (zp - yp) / (z - y) have no factors in common except for p, so if x = qju, with q not dividing u, then by inspecting the orders of q in the terms x, xp, and z - y, we conclude that qjp | z - y.

Lemma 4. Let R be a commutative ring containing the integers. Suppose that q is a rational prime and t is an integer not divisible by q. Suppose also that q | A(t + qC) in R, for some A and C in R. Then, q | A in R.

Proof. Since q and t are relatively prime in Z, there exist integers r and s such that rq + st = 1. Since q | A(t + qC) in R, we have q | At in R and obviously q | Arq in R, so q | Ats + Arq = A(ts + rq) in R and hence q | A in R, as required.


With the preliminaries out of the way, we establish the two congruences that constitute what James calls "Area One."

1.1. Divisibility results

Let p be an odd prime and let x, y, z be pairwise coprime nonzero integers satisfying xp + yp = zp. From Lemma 3, without loss of generality we may suppose that qp is a prime factor of x and z - y. [Note 2.] Then q must be a factor of x2 + y2 - z2 as well. Let x = qju, where q does not divide u.

Theorem 1. With the assumptions above, for any positive integer n there exists an integer m for which

  u2 + (y2 - z2) / q2j + mz2 = 0 mod qn. [1]

Proof. Observe that from Lemma 3, qpj | z - y, so we have that q2j divides y2 - z2. Since q does not divide z (otherwise we would contradict the relative primality of x and z), we can solve the congruence [1] for m, as required.

From this, we have an immediate consequence,

Corollary 1. For any integer k > 2j, there exists m such that if v = -1 + m2j then

  qk | x2 + y2 + vz2

 

1.2. The polynomial Fp, v(z2, xy)

Denote the left hand side of the congruence of Lemma 1 by F = Fp, v(z2, xy), so that

  Fp, v(z2, xy) = (vp + 1)z2p - pvz2(xy)2Gp(vz2, (xy)2) - 2(xy)p.

For any fixed values of p and v, with v ≠ -1 we may consider F as a degree p polynomial in the indeterminants z2 and xy, so we may factor F into terms linear in z2 and xy, over the complex numbers:

  Fp, v(z2, xy) = (a1z2 + b1xy) ... (apz2 + bpxy). [2]

Equating coefficients, we observe that a1 ... ap = (vp + 1). By appropriate multiplication and division we may write [2] as

 
Fp, v(z2, xy) = ((v + 1)1/2 z2 + (b1(v + 1)1/2) / a1) xy)
  ((v + 1)1/2 z2 + (b2(v + 1)1/2) / a2) xy)
  (((vp + 1) / (v + 1)) z2 + (b3(vp + 1) / ((v + 1)a3)) xy)
  . . .
  (z2 + (bp / ap) xy).

In what follows, we will write the factorization of F as

  Fp, v(z2, xy) = (c1z2 + d1xy) ... (cpz2 + dpxy), [3]

where c1 = c2 = (v + 1)1/2, c3 = (vp + 1) / (v + 1), and ci = 1 for 3 < ip, and the d values are defined as above.

Now from the polynomial identity [3], we return to regarding Fp, v as an integer, using the original values for x, y, and z, namely a Fermat counterexample for a specific p.

 

1.3. Divisibility in the overring

Let k > 2j be an arbitrary integer and fix v to be -1 + mq2j where m is as in Corollary 1. Define the ring R = Z[m1/2, d1, d2, ... , dp] for the ds in [3] resulting from this choice of v. [Note 3.] So, since qk | x2 + y2 + vz2 from Corollary 1 and x2 + y2 + vz2 | Fp, v from Lemma 2 we have (considering all subsequent congruences in the context of R, which we shall denote by =R for emphasis),

  Fp, v = (c1z2 + d1qjuy) (c2z2 + d2qjuy) ... (cpz2 + dpqjuy) =R 0 mod qk.

We then have

Theorem 2. Suppose Fp, v = (c1z2 + d1xy)(c2z2 + d2xy) ... (cpz2 + d1xy) as in [3]. Then, for any integer n > 0 there exists an integer m such that with v = -1 + m2j,

  (m1/2z2 + d1uy)(m1/2z2 + d2uy) =R 0 mod qn

Proof. Using the values of ci established in [3], we have

  (m1/2qjz2 + d1qjuy)(m1/2qjz2 + d2qjuy) ... (z2 + dpqjuy) =R 0 mod qk

and dividing out qj from each of the first two factors yields

  (m1/2z2 + d1uy)(m1/2z2 + d2uy) ... (z2 + dpqjuy) =R 0 mod qk - 2j. [4]

Denote by A the product of the first 2 factors in [4] and by B the product of the remaining p - 2 factors, we have, after expanding B,

  A = (m1/2z2 + d1uy)(m1/2z2 + d2uy)

and

  B = ((vp + 1) / (v + 1))z2(p - 2) + qjC.

Letting t = ((vp + 1) / (v + 1))z2(p - 2) (and observing that in Z, q does not divide the integer t) we can rewrite [4] as

  A(t + qjC) =R 0 mod qk - 2j,

and by repeated applications of Lemma 4 we conclude that

  A = (m1/2z2 + d1uy)(m1/2z2 + d2uy) =R 0 mod qk - 2j. [5]

Writing k - 2j as n establishes our result.

 

1.4. Consequences

For convenience, we recall our terminology.

  • p is an odd prime.
  • x, y, z are pairwise coprime positive integers satisfying xp + yp = zp.
  • qp is a prime which, without loss of generality, we may suppose divides x and y - z.
  • x = qju, with q not dividing u.
  • For n > 0, m = m(n) is a solution to mz2 = -(u2 + (y2 - z2) / q2j) mod qn.
  • v = -1 + mq2j. Observe that v ≠ -1.
  • Fp, v is factored as (c1z2 + d1xy) . . . (cpz2 + dpxy), with c1 = c2 = (v + 1)1/2, c3 = (vp + 1) / (v + 1), and ci = 1 for 3 < ip.
  • R = R(m) = Z[m1/2, d1, ... , dp].
  • S(m) = u2 + (y2 - z2) / q2j + mz2.
  • T(m) = mz4 + m1/2(d1 + d2)z2uy + d1d2u2y2.

Finally, recall that by Theorems 1 and 2, for any n > 0 we can find an m = m(n) so that

  S(m) = 0 mod qn

and in R = R(m)

  T(m) =R 0 mod qn.

Note that changing the exponent r in the congruence modulo qr that defines m will in general change m, which will change v, which will change Fp, v, which will change the c and d values, which will give rise to a different ring R. However, if we have defined m(r), then in any congruence modulo qs for sr, m(s) will serve equally well as a solution for m(r) with no change in the ring R.

The next result and its corollaries establishes some further (and somewhat surprising) relations among the d values and the nature of q in the ring R.

Theorem 3. Let m = m(j). Then, in the ring R = R(m) we will have d1 + d2 =R 0 mod qj.

Proof. Let Fp, v be the polynomial defined in [3] for v = -1 + mq2j. Observe that in Fp, v = (vp + 1)z2j - pvz2(xy)2Gp(z2, xy) - 2(xy)p, Gp is a degree-(p - 3) polynomial in z2, so in Fp, v the (z2)p-1 term will have coefficient 0. If we expand the right side of the factorization of Fp, v and equate coefficients we will have

  d1c2...cp + c1d2c3...cp + . . . + c1...cp-1dp = 0.

Using the values for the cs gives us

  (d1 + d2)(v + 1)1/2((vp + 1) / (v + 1)) + d3(v + 1) + (vp + 1)(d4 + . . . + dp) = 0.

So, since v ≠ -1, we have

  ((vp + 1) / (v + 1))(d1 + d2) = -(v + 1)1/2(d3 + ((vp + 1) / (v + 1))(d4 + . . . + dp)).

Denoting the factor on the right by D, we may write this as

  ((vp + 1) / (v + 1))(d1 + d2) = -(v + 1)1/2D.

Thus, in the ring R we have

  ((vp + 1) / (v + 1))(d1 + d2) =R -(v + 1)1/2D mod qj.

Since (vp + 1) / (v + 1) is an integer not divisible by q it has a multiplicative inverse in Z modulo qj, so we may write

  d1 + d2 =R -(v + 1)1/2((vp + 1) / (v + 1))-1D mod qj.

Finally, recall that v = -1 + mq2j so the factor (v + 1)1/2 on the right will be divisible (in R) by qj, and thus

  d1 + d2 =R 0 mod qj.

 

Corollary 2. With R defined as above, d1d2 =R 1 mod qj.

Proof. Since we have just established d1 + d2 =R 0 mod qj and we know from Theorem 2 that

  mz4 + z2m1/2(d1 + d2)uy + d1d2u2y2 =R 0 mod qj

we find that

  mz4 + d1d2u2y2 =R 0 mod qj.

Now since y = z mod qpj by Lemma 3, we may set y = z above and since neither is zero modulo q we may multiply both sides by (z2)-1 modulo q in Z, giving us

  mz2 + d1d2u2 =R 0 mod qj. [8]

Recall that m is defined by m = -(u2 + (y2 - z2) / q2j) / z2 mod qj and observe that qj divides y2 - z2) / q2j so we have m = -u2 / z2 mod qj. Substituting this into [8] yields

  -u2 + d1d2u2 =R 0 mod qj

and since u is not divisible by q in Z we may multiply both sides of the above congruence by a multiplicative inverse of u mod qj to obtain the desired result.

 

In his so-called "Area 2," James now proceeds to use S(m) and T(m) to get an expression not apparently involving m. Recall that for any n > 0 we may find a rational integer m = m(n) so that

  S(m) = u2 + (y2 - z2) / q2j + mz2 = 0 mod qn [9]

and in R = R(m),

  T(m) = mz4 + m1/2(d1 + d2)z2uy + d1d2u2y2 =R 0 mod qn. [10]

For simplicity, let A = u2 + (y2 - z2) / q2j. Multiplying [9] by z2 then gives us

  mz4 = -Az2 mod qn. [11]

Substituting into [10] and rearranging yields

  m1/2(d1 + d2)z2uy =R Az2 - d1d2u2y2 mod qn.

Squaring both sides of the congruence above, we have

  (mz4)(d1 + d2)2u2y2 =R (Az2 - d1d2u2y2)2 mod qn.

Expanding the right hand side and using [11] again, we have

  -Az2(d1 + d2)2u2y2 =R A2z4 - 2Ad1d2z2u2y2 + d12d22u4y4 mod qn.

So, finally, we have, after simplifying,

  A2z4 + A(d12 + d22)z2u2y2 + d12d22u4y4 =R 0 mod qn. [12]

Which, we note in passing, may be written

  (Az2 + d12u2y2)(Az2 + d22u2y2) =R 0 mod qn.

It appears that we have eliminated m from our expressions, but nothing could be further from the truth. While it is certainly true that q, u, y, and z (and hence A) do not depend on m, certainly the d values and the underlying ring R all depend on m(n). What we have, then, are a family of congruences that posit some mildly interesting divisibility results in different rings. What we certainly do not have at this stage is anything that establishes a contradiction.


Notes and Commentary

Credits

This explication of James Harris's most recent attempt at a proof of Fermat's Last Theorem using elementary methods is the distillation of the work of many people on the sci.math newsgroup. Since this discussion has taken place over the span of seven years, it is highly likely that I have forgotten to credit some of the contributors, but much of what appears here is due to comments by (in no particular order) David Libert, John Rickard, Arturo Magidin, Nora Baron, "The Scarlet Manuka," Dik T. Winter, John Roberts-Jones, Keith Ramsay, and I'm certain, many others. If I've left off your name and you'd like credit, let me know and I'll be happy to include you and, of course, I'll be happy to entertain any suggestions for modifications to this page.
Back to the paper

Notes

  1. For example, if we let p1 = r + s and p2 = rs, then
    H3(r, s) = G3(p1, p2) = 1
    H5(r, s) = G5(p1, p2) = (p1)2 - p2
    H7(r, s) = G7(p1, p2) = (p1)4 - 2(p1)2p2 + (p2)2. Back to the paper

  2. For the benefit of those people following the discussions with James, I should note that what James has been calling f I have renamed q for stylistic reasons. In addition, in the factorization [2] I have used di for what James has been calling bi. Back to the paper

  3. In the original argument, R was taken to be a ring containing the integers, the square roots of the integers, and the ds. I have reduced the ring by excluding all the square roots except that of m (and any arising from the ds). Note here that there is a dependency among the ds from [3], since the product d1 ... dp = -2. [In the two years since this page was produced, James has admitted to problems with his approach and attempted to recast his arguments using the ring of algebraic integers for R.] Back to the paper

Last modified 2/24/04 Valid HTML 4.0!