Geometry of Cubic Polynomials (Xavier Boesken)

hbghlyj · 2022-8-14 15:12

Last edited by hbghlyj 2023-1-10 13:18Geometry of Cubic Polynomials - Geometry of Cubic Polynomials - Undergrad Mathematics - Part of Geometry and Topology Commons

Figure 1 An equilateral triangle and its inscribed sphere

Abstract
Imagine a sphere floating in 3-space. By inscribing one of its great circles within an equilateral triangle, we can use the linear projection in map to the $(x,y)$-plane, viewed here as the complex plane $z=x+iy$, to project the vertices of the equilateral triangle onto the roots of a given cubic polynomial $p(z)$.
This discovery allows us to prove Marden’s Theorem: the roots of the derivative $p'(z)$ are the foci of the inscribed ellipse tangent to the midpoints of the triangle in the complex place determined by the roots of the polynomial. It also sheds light on Cardano’s formula for finding the roots of the cubic $p(z)$.
1 Introduction
In order to fully understand and prove Marden’s Theorem, we need to identify several different aspects of the geometry of the floating sphere in Figure 1. What do we know about this geometric figure? Marden’s Theorem states this figure projects to the roots of the polynomial, viewed as points in the complex plane.
Begining with the fundamentals, we can build a strong proof of Marden’s Theorem. The fundamental theorem of algebra gives us a good starting point. Given an arbitrary polynomial:

Fundamental Theorem of Algebra. Any polynomial of degree $n$ has exactly $n$ roots. The polynomial is assumed to have complex coefficients and the roots are complex as well.

Generally, these roots are distinct, but not necessarily. For our cubic polynomial, the fundamental theorem of algebra states we have precisely three (possibly complex) roots. Let $r, s, t$ be the roots of our cubic. Therefore, the arbitrary cubic polynomial can be written in the form$$p(x) = a(x-r)(x-s)(x-t)$$where $a$ is a constant.
Therefore, for the arbitrary $p(z)$, the derivative of the cubic polynomial can be written as
$$p'(x) = a[(x - s)(x - t)] + a[(x - r)(x - t)] + a[(x - r)(x - s)]$$
where $a$ is a constant.
If we were to plot these roots in the complex plane, these distinct roots would form a triangle.

drawing-2.svg (1.78 KB, Downloads: 174)

When working with such a triangle, two mathematicians came up with a theorem which says that the derivative of the cubic polynomial has roots lying within the triangle. Carl Friedrich Gauss, a German, and Felix Lucas, a Frenchman, created a theorem giving a geometric relation between a polynomial’s roots and its derivative’s roots.

The Gauss-Lucas Theorem. If $P$ is a (nonconstant) polynomial with complex coefficients, all zeros of $P'$ belong to the convex hull of the set of zeros of $P$.

So, the Gauss-Lucas theorem implies that the two roots of the derivative of the cubic polynomial must lie within the triangle in the complex plane. We will prove the theorem for the cubic case by using proof by contradiction:
Proof. Let $r, s, t$ be the complex roots of the cubic polynomial $p(x)$. Suppose $p'(u) = 0$ where $u$ is NOT in the complex hull of $r, s, t$.
As seen in the image below, $u$ is not in the complex hull, meaning it is not contained within triangle formed by the roots of the polynomial. As seen below, $u$ is divided from one of the vertices ($s$ in our case), by one of the sides of the triangle ($rt$ in this case).

drawing-2.svg (2.75 KB, Downloads: 165)

This complex plane can be rotated by some angle such that the dividing side ($rt$) is vertical, with $u$ on the right side and the third vertex ($s$) is on the left. We call this angle $θ$. Note that the vectors, drawn green
in the diagram, from the roots of the polynomial to our root of the derivative are now all pointing towards the right, meaning they have positive real part.

drawing-2.svg (14.1 KB, Downloads: 182)

If we look at this algebraicly:\begin{aligned} \frac{p^{\prime}(u)}{p(u)} &=\frac{a[(u-s)(u-t)]+a[(u-r)(u-t)]+a[(u-r)(u-s)]}{a(u-r)(u-s)(u-t)} \\ &=\frac{1}{u-r}+\frac{1}{u-s}+\frac{1}{u-t} \end{aligned}If we take the conjugate of both sides of the equation, we need to remember two important rules regarding conjugates:
• The conjugate of the sums is the sum of the conjugates.
• The conjugate of the reciprocal is the reciprocal of the conjugate.
So we can simplify our conjugation we get the sum of the reciprocal of the conjugate:\begin{aligned} \overline{\frac{p^{\prime}(u)}{p(u)}} &=\overline{\frac{1}{u-r}+\frac{1}{u-s}+\frac{1}{u-t}} \\ &=\frac{1}{\overline{u-r}}+\frac{1}{\overline{u-s}}+\frac{1}{\overline{u-t}} \end{aligned}If we are to multiply each summand by 1, in the form of their respective $u$−root, and remember that a value multiplied by its conjugate equals the magnitude squared: e.g. $x · \bar{x}=|x|^{2}$:
$$\left[\frac{1}{\overline{u-r}} \cdot \frac{u-r}{u-r}\right]+\left[\frac{1}{\overline{u-s}} \cdot \frac{u-s}{u-s}\right]+\left[\frac{1}{\overline{u-t}} \cdot \frac{u-t}{u-t}\right]=\frac{u-r}{|u-r|^{2}}+\frac{u-s}{|u-s|^{2}}+\frac{u-t}{|u-t|^{2}}$$If we multiply each summand by $e^{iθ}$, we get our vectors, which we stated earlier have positive real part.
The vectors are depicted in the diagram above, point to the right. Meaning, these vectors have positive real part. The contradiction comes when we recall that $p'(u)=0$. So our result must equal zero. BUT, we stated that each summand has positive real part, and so must their sum. This contradicts the fact the result must be zero.$$0=e^{i \theta} \frac{\overline{p^{\prime}(u)}}{p(u)}=e^{i \theta} \frac{u-r}{|u-r|^{2}}+e^{i \theta} \frac{u-s}{|u-s|^{2}}+e^{i \theta} \frac{u-t}{|u-t|^{2}}$$This proof can be extended for polynomials of higher orders. $□$
2 Marden’s Projection
The sphere with the inscribed triangle is floating above a copy of the complex plane. The vertices of the triangle project onto the roots of the cubic on the $(x, y)$-plane, viewed as a copy of the complex numbers. The triangle, which is the image of the projection, represents the convex hull of the three roots of $p(x)$. The Gauss-Lucas theorem states the two roots of the derivative of the cubic lies within that complex hull. Knowing this, we can begin to make connections for Marden’s theorem. Marden’s theorem states if we have any three real numbers, not all equal, then they are the projections of the vertices of some equilateral triangle in the complex plane. For a cubic polynomial $p(x)$ with three real roots (not all equal), the inscribed circle of the equilateral triangle that projects onto those roots itself projects to an interval with endpoints equal to the roots of $p'(x)$. There is a special case where the roots are real they lie along the real axis. The equilateral triangle projects onto them must lie in the $(x, z)$-plane. But this special case of Marden’s Theorem, which we will address later.
The theorem is saying that the roots of $p'(z)$, where $z$ is a root of the derivative of the cubic, are the foci of the ellipse inscribed in that triangle tangent to the midpoints of the sides.

drawing-2.svg (780 Bytes, Downloads: 156)
Figure 2 Roots of $p'$ are foci of "midpoint" ellipse.

Figure 2 illustrates Marden’s Theorem. As depicted in the image, the roots of $p'$ are the foci of the midpoint of two roots of the quadratic polynomial $p(x)$.
Figure 2 shows us why it is important for our roots to be distinct and not collinear. If this were the case, our triangle would “collapse” into a line, since the roots would be collinear. This removal of the third vertex into a two vertices figure (a line) makes the ellipse within the Figure 2 collapse into a line segment contained in the same segment into which the triangle has collapsed. There would no longer be an interior space within the triangle, since the triangle would no longer exist. To better understand what this means, we need to imagine the projection of the sphere to this plane.

drawing-2.svg (2.29 KB, Downloads: 151)

Consider an equilateral triangle in a copy of the complex plane. Let the vertices of the triangle be $1, ω,$ and $\bar ω$. As in Figure 1, we have an inscribed sphere within the equilateral triangle. If we take the projection
of the triangle at the equator of the sphere to the triangle on the complex plane. the resulting projection of that circle (that represented the equator of the sphere) is an ellipse. This illustrates a technical fact: the projection of a circle is an ellipse. We will need to prove that with the following lemma.
3 Projection of a Circle is an Ellipse
Recall that a linear map from $\mathbb{R}^{2}$ to itself is of the form $(x, y) \mapsto(\alpha x+\beta y, \gamma x+\delta y)$ for some $\alpha, \beta, \gamma$, and $\delta \in \mathbb{R}$. If we imagine this mapping as $\mathbb{C}$ to itself $(z:=x+i y)$, and if we let $a:=\frac{1}{2}[(\alpha+\delta)+i(\gamma-\beta)]$ and $b:=\frac{1}{2}[(\alpha-\delta)+i(\gamma+\beta)]$:\begin{aligned}
a z+b \bar{z} &=\frac{1}{2}[(\alpha+\delta)+i(\gamma-\beta)](x+i y)+\frac{1}{2}[(\alpha-\delta)+i(\gamma+\beta)](x-i y) \\
&=\frac{1}{2}[(\alpha+\delta) x+i(\gamma-\beta) i y+(\alpha+\delta) i y+(\gamma-\beta) i x+(\alpha-\delta) x-i(\gamma+\beta) i y-(\alpha-\delta) i y+(\gamma+\beta) i x] \\
&=\frac{1}{2}[(\alpha+\delta) x-(\gamma-\beta) y+i(\alpha+\delta) y+i(\gamma-\beta) x+(\alpha-\delta) x+(\gamma+\beta) y-i(\alpha-\delta) y+i(\gamma+\beta) x] \\
&=\frac{1}{2}[2(\alpha x+\beta y)+2 i(\gamma x+\delta y)] \\
&=\alpha x+\beta y+i(\gamma x+\delta y)
\end{aligned}for some complex $a, b$
Now, we want to show that every linear map from $\mathbb{C}$ to itself takes a unit circle to an ellipse.
Lemma. Every one-to-one linear map $z \mapsto a z+b \bar{z}$ takes the unit circle to an ellipse with foci $\pm 2 \sqrt{a b}$.
Proof. The unit circle can be parameterized by
$$
C:=\left\{e^{i \theta}: 0 \leq \theta<2 \pi\right\}
$$And let $\mathrm{E}$ be the image of the circle under the linear map $z \rightarrow a z+b \bar{z}$ such that:
$$
E:=\left\{x: x=a e^{i \theta}+b e^{-i \theta}\right\}
$$
In other words, we are letting $x$ be a point on this projected circle. Recall the definition of an ellipse and its foci:
An ellipse is defined by the set of all points
$$
\{x \in \mathbb{C}:|x-u|+|x-v|=L\}
$$where $u, v \in \mathbb{C}$ and $L$ is a constant
So, to prove that $\pm 2 \sqrt{a b}$ are the foci of this ellipse, we shall plug them in for $u, v$ from our definition:
Let $z:=\sqrt{a} e^{\frac{i \theta}{2}}$ and $w:=\sqrt{b} e^{\frac{-i \theta}{2}}$\begin{aligned}
&|x-2 \sqrt{a b}|+|x+2 \sqrt{a b}|=\left|ae^{i \theta}+b e^{-i \theta}-2 \sqrt{a b}\right|+\left|a e^{i \theta}+b e^{-i \theta}+2 \sqrt{a b}\right| \\
&=\left|a e^{i \theta}-2 \sqrt{a b}+b e^{-i \theta}\right|+\left|a e^{i \theta}+2 \sqrt{a b}+b e^{-i \theta}\right| \\
&=\left|\left(\sqrt{a} e^{\frac{i \theta}{2}}-\sqrt{b} e^{\frac{-i \theta}{2}}\right)\left(\sqrt{a} e^{\frac{i \theta}{2}}-\sqrt{b} e^{\frac{-i \theta}{2}}\right)\right|+\left|\left(\sqrt{a} e^{\frac{i \theta}{2}}+\sqrt{b} e^{\frac{-i \theta}{2}}\right)\left(\sqrt{a} e^{\frac{i \theta}{2}}+\sqrt{b} e^{\frac{-i \theta}{2}}\right)\right| \\
&=|(z-w)(z-w)|+|(z+w)(z+w)|\\
&=\left|(z-w)^2\right|+\left|(z+w)^2\right| \\
&=(z-w) \overline{(z-w)}+(z+w) \overline{(z+w)} \\
&=(z-w)(\bar{z}-\bar{w})+(z+w)(\bar{z}+\bar{w}) \\
&=(z \bar{z}-z \bar{w}-\bar{z} w+w \bar{w})+(z \bar{z}+z \bar{w}+\bar{z} w+w \bar{w}) \\
&=2|z|^2+2|w|^2 \\
&=2a+2b
\end{aligned}Because the result is a constant, since $a, b$ are both constants, then $\pm 2 \sqrt{a b}$ must be the foci of the ellipse.
And thus, $E$ is an ellipse with foci $\pm 2 \sqrt{a b}$ $□$
One aspect of Figure 1 is that we can rotate it within 3-space. We can do such to a point where we no longer see all three vertices of the triangle. Meaning, we only see the line formed by two of the vertices.

The ellipse is flattened into a vertical ellipse, with the two outer vertices projecting to endpoints of an interval (line). In other words, the complex plane where this sphere exists is now vertical and directly above the real axis below in the other copy of the complex plane. If we were to stand on that real axis and look up, we would see only two of the vertices. To us, this triangle would appear as a line. We know these vertices project down to the roots of our polynomial. We can then deduce the following:

1.svg (1.33 KB, Downloads: 138)
Figure 3 A cubic with corresponding triangle and circle

Theorem. Any three real numbers, not all equal, are the projections of the vertices of some equilateral triangle in the plane. For a cubic polynomial $p(x)$ with three real roots (not all equal), the inscribed circle of the equilateral triangle that projects onto those roots itself projects to an interval with endpoints equal to the roots of $p'(x)$.
Proof. Suppose we have a polynomial $p(x)$ with three real roots $r, s, t$. That is,$$p(x)=(x-r)(x-s)(x-t)=x^{3}-(r+s+t) x^{2}+(r s+r t+s t) x-r s t$$and$$p'(x)=3 x^{2}-2(r+s+t) x+(r s+r t+s t)$$It is verifiable that the coordinates of the vertices of the triangle are: $\left(r, \frac{s-t}{\sqrt{3}}\right),\left(s, \frac{t-r}{\sqrt{3}}\right)$, and $\left(t, \frac{r-s}{\sqrt{3}}\right)$. This is found by a form of brute force, meaning a little bit of trial and error. Using the distance formula for any two vertices:
$$
\sqrt{(r-s)^2+\frac{(r+s-2 t)^2}{3}}=2 \sqrt{\frac{r^{2}+s^{2}+t^{2}-r s-r t-s t}{3}}
$$
This gives us the distance from any vertice to either of the other two. Also note that the inscribed circle has center $\left(\frac{r+s+t}{3}, 0\right)$ and the radius is $\frac{1}{2\sqrt3}$ the distance between any two of the vertices.
$$\frac{1}{2\sqrt3} · 2 \sqrt{\frac{r^{2}+s^{2}+t^{2}-r s-r t-s t}{3}}=\frac{1}{3}\sqrt{r^{2}+s^{2}+t^{2}-r s-r t-s t}$$
Therefore, the closed interval with endpoints being the two vertices, those that project to the real axis and containing the projections of the circle, is found by adding or subtracting the value of the radius from the origin $\left(\frac{r+s+t}{3}\right)$ :
$$
x =\frac{1}{3}\left[r+s+t±\sqrt{r^{2}+s^{2}+t^{2}-r s-r t-s t}\right]
$$
Note if we plug the values of $x$ into $p^{\prime}(x)=3 x^{2}-2(r+s+t) x+(r s+r t+s t)$, we get 0 . These $x$ values are the endpoints of the interval formed by two of the vertices of the triangle formed by $r, s, t$ that project to the real axis and contain the projections of the circle. Therefore, the endpoints of the interval are the roots of the dervative $p'(x)$. $□$
Figure 3 shows the cubic polynomial and its relation to the triangle with its inscribed circle.
We can see the vertices project down to the roots of the polynomial. The outer roots create an interval on the real axis. Also seen, are the edges of the circle projecting down to the extremas. If we recall that our extremas of our polynomials are the zeros of the derivative, then we can say the roots of the derivative are contained within the interval created by the outer roots (those visible from below).
4 Linear Maps
Given how we typically draw an ellipse (using a piece of string connecting two points), we can represent the image with the set of all points:
$$
x \in \mathbb{C}:|x-u|+|x-v|=L
$$
where $u, v \in \mathbb{C}$ and $L \in(|u-v|, \infty)$ is an ellipse.
This will be our definition of an ellipse, being that every ellipse is created this way. Since all ellipses have unique foci $u, v$ and maximum length $L$, we can safely say such a method yields an ellipse. Now that we have defined the ellipse, we can comment on the remark of the projection of a circle is, in fact, an ellipse. In other words, the image of a circle under a linear map is an ellipse.
Recall that linear map from $\mathbb{R}^{2}$ to itself is of the form:
$$
(x, y) \rightarrow(\alpha x+\beta y, \gamma x+\delta y)
$$
for some $\alpha, \beta, \gamma, \delta \in \mathbb{R}$. If we were to do the same for a map from $\mathbb{C} \rightarrow \mathbb{C}$ :
That is to say $z:=x+i y$ maps to $\alpha x+\beta y+i(\gamma x+\delta y)$
$$\tag1\begin{aligned} \alpha x+\beta y+i(\gamma x+\delta y) &=\frac122[( \alpha x+ \beta y)+i( \gamma x+ \delta y)] \\
&=\frac{1}{2}\big[(\alpha +\delta ) x-(\gamma -\beta ) y+i\big((\gamma -\beta ) x+(\alpha +\delta ) y\big)\big]\\
&+\frac{1}{2}\big[(\alpha -\delta ) x+(\gamma +\beta ) y+i\big((\gamma +\beta ) x-(\alpha -\delta ) y\big)
\big]\\
&=\frac12[(\alpha+\delta)+i(\gamma-\beta)](x+i y)+\frac12[(\alpha-\delta)+i(\gamma+\beta)](x-i y) \end{aligned}$$If we let:\begin{aligned} a &=\frac{1}{2}[(\alpha+\delta)+i(\gamma-\beta)] \\ z &=x+i y \\ b &=\frac{1}{2}[(\alpha-\delta)+i(\gamma+\beta)] \\ \bar{z} &=x-i y \end{aligned}Then we can say that every linear map is of the form $z → az + b\bar z$ for some complex $a, b$.
This leads us into an observation of a connection between the linear map to itself and the projection of a circle.
We can also see an alternative way of looking at $p'(z)$ by the use of matices.
$$
\left(\begin{array}{ccc}
1 & 1 & 1 \\
1 & \omega & \bar{\omega} \\
1 & \bar{\omega} & \omega
\end{array}\right) \underbrace{\left(\begin{array}{lll}
r & 0 & 0 \\
0 & s & 0 \\
0 & 0 & t
\end{array}\right)}_{D}=\underbrace{\left(\begin{array}{lll}
0 & a & b \\
b & 0 & a \\
a & b & 0
\end{array}\right)}_{M} \left(\begin{array}{ccc}
1 & 1 & 1 \\
1 & \omega & \bar{\omega} \\
1 & \bar{\omega} & \omega
\end{array}\right)
$$
We recognize that $p(z)$ is the characteristic polynomial of the matrix $D$, which by the matrix equation, means that $p(z)$ is also the characteristic polynomial of the matrix $M$. If we call $W$ the matrix with the omegas, then $D=W^{-1}MW$\begin{aligned} p(z) &=\det(z I-D) \\ &=\det\left(z I-W^{-1}MW\right) \\ &=\det\left(W^{-1}(z I-M) W\right) \\ &=\det\left(W^{-1}\right)\det(z I-M)\det(W) \\ &=\det(z I-M) \\ &=\det\left(z\left(\begin{array}{ccc}1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1\end{array}\right)-\left(\begin{array}{ccc}0 & a & b \\ b & 0 & a \\ a & b & 0\end{array}\right)\right) \\ &=\det\left(\left(\begin{array}{ccc}z & 0 & 0 \\ 0 & z & 0 \\ 0 & 0 & z\end{array}\right)-\left(\begin{array}{ccc}0 & a & b \\ b & 0 & a \\ a & b & 0\end{array}\right)\right)\\ &=\det\left(\left(\begin{array}{ccc}z & -a & -b \\ -b & z & -a \\ -a & -b & z\end{array}\right)\right)\\ &=z\left(z^{2}-a b\right)-(-a)\left(-b z-a^{2}\right)+(-b)\left(b^{2}+a z\right) \\ &=z^{3}-z a b-z a b-a^{3}-b^{3}-z a b \\ &=z^{3}-\left(a^{3}+b^{3}\right)-3 z a b \end{aligned}5 Marden’s Theorem
We are getting closer to fully understanding Marden’s Theorem. However, we still need to complete the connection of the linear mapping from the equilateral figure to the projected image. Up to this point, we have only used real roots when solving for the zeros of the cubic. Now, we will suppose $p(z)$ is a cubic with distinct complex roots $r, s, t$. Assume that $r+s+t=0$, for convenience.
$$p(z) = z^3 + (rs + rt + st)z − rst$$
These roots, like before, are the projections of the vertices of an equilateral triangle. Considering Figure 1, there exists a linear map from the plane to itself. This mapping takes the unit circle to an ellipse.
Figure 5 illustrates how such a map would appear.

drawing-2.svg (1.2 KB, Downloads: 134)
Figure 5 $f(z)=az+b\bar z$, where $f(1)=r$ and $f(ω)=s$

Because the linear map of the plane to itself preserves certain characteristics of the projected image, and we know the inscribed unit circle is tangent at the midpoints of the sides of the equilateral triangle, we can say that the projection of that circle (the ellipse) is also tangent to the midpoints of the sides of the projected triangle. We can deduce that the outer radius is twice the inner radius, for either image (because of the linear map). This deduction comes from observing the geometric behavior of the figure.

drawing-2.svg.2022_08_15_04_53_37.0.svg (17.61 KB, Downloads: 94)

The root 1 has argument 0 and its polar form is $e^{0}$.
The root $\omega$ is one-third of the way around the unit circle. The argument of $\omega$ is $120^{\circ}$ or $\frac{2 \pi}{3}$. We can write $\omega$ in polar form: $\omega=\cos \frac{2 \pi}{3}+i \sin \frac{2 \pi}{3}=\frac{-1}{2}+\frac{\sqrt{3}}{2} i$
Similarly, the root $\bar{\omega}$ is two-thirds around the unit circle, or $240^{\circ}$ or $\frac{4 \pi}{3}$. We can write $\bar{\omega}$ in polar form: $\bar{\omega}=\cos \frac{4 \pi}{3}+i \sin \frac{4 \pi}{3}=\frac{-1}{2}-\frac{\sqrt{3}}{2} i$
$\therefore$ the midpoints of the sides of the equilateral triangle are as follows:\begin{array}l
\frac{\omega+1}{2}=\frac{1}{4}+\frac{\sqrt{3}}{4} i \\
\frac{\omega+\bar{\omega}}{2}=\frac{-1}{2} \\
\frac{\bar{\omega}+1}{2}=\frac{1}{4}-\frac{\sqrt{3}}{4} i
\end{array}Because the midpoint of $\frac{\omega+\bar{\omega}}{2}=\frac{-1}{2}$, we can see that the radius of the inner circle is $\frac{1}{2}$.

We can also prove this by finding the values of $a, b$ explicitly. We know $r+s+t=0$ and $f(z)=a z+b \bar{z}$ where $f(1)=r, f(\omega)=s$ and $f(\bar{\omega})=t$.

Because $r, s, t$ are roots of the cubic, we can say $r+s+t=0$. The linear mapping allows the same reasoning to follow for the projections. Meaning, $1, \omega, \bar{\omega}$ are the projections of the roots in 3-space. So, we can write them as $1+\omega+\bar{\omega}=0$. This can be easily verified by plugging in our evaluated values found previously.

It follows from $f(1)=r$ and $f(\omega)=s$ that $a+b=r$ and $a \omega+b \bar{\omega}=s$. We can easily verify that $1+\omega+\bar{\omega}=0$ :\begin{aligned}
1+\omega+\bar{\omega} &=1+\left(\frac{-1}{2}+\frac{\sqrt{3}}{2} i\right)+\left(\frac{-1}{2}-\frac{\sqrt{3}}{2} i\right) \\
&=1+\frac{-1}{2}+\frac{-1}{2}+\frac{\sqrt{3}}{2} i-\frac{\sqrt{3}}{2} i \\
&=0
\end{aligned}Since $r=f(1)=a(1)+b(\overline{1})=a+b$ and $s=f(\omega)=a\omega+b\bar\omega$, it follows from $r+s+t=0$ that $t=a\bar\omega+b\omega$. Knowing these projected values allows us to rewrite $p(z)$ in terms of $a$ and $b$.
\begin{aligned}
r s+r t+s t &=(a+b)(a \omega+b \bar{\omega})+(a+b)(a \bar{\omega}+b\omega)+(a \omega+b \bar{\omega})(a \bar{\omega}+b \omega) \\
&=\left(a^{2} \omega+a b \bar{\omega}+a b \omega+b^{2} \bar{\omega}\right)+\left(a^{2} \bar{\omega}+a b \bar{\omega}+a b \omega+b^{2} \omega\right)+\left(a^{2} \omega \bar{\omega}+a b \omega^{2}+a b \bar{\omega}^{2}+b^{2} \omega \bar{\omega}\right) \\
&=a^{2}(\omega+\bar{\omega}+\omega \bar{\omega})+a b\left(\bar{\omega}+\omega+\bar{\omega}+\omega+\omega^{2}+\bar{\omega}^{2}\right)+b^{2}(\bar{\omega}+\omega+\omega \bar{\omega}) \\
&=a^{2}(\omega+\bar{\omega}+1)+a b(3 \bar{\omega}+3 \omega)+b^{2}(\bar{\omega}+\omega+1) \\
&=a^{2}(0)+3 a b(\bar{\omega}+\omega)+b^{2}(0) \\
&=3 a b(-1) \\
&=-3 a b
\end{aligned}We can also rewrite $r s t$ in terms of $a$ and $b$:\begin{aligned}
r s t &=(a+b)(a \omega+b \bar{\omega})(a \bar{\omega}+b \omega) \\
&=a^{3} \omega \bar{\omega}+a^{2} b \omega^{2}+a^{2} b \bar{\omega}^{2}+a^{2} b \omega \bar{\omega}+a b^{2} \bar{\omega} \omega+a b^{2} \omega^{2}+a b^{2} \bar{\omega}^{2}+b^{3} \bar{\omega} \omega \\
&=a^{3}1+a^{2} b\bar{\omega}+a^{2} b\omega+a^{2} b1+a b^{2}1+a b^{2}\bar{\omega}+a b^{2}\omega+b^{3}1\\
&=a^{3}+a^{2} b(\bar{\omega}+\omega+1)+a b^{2}(1+\omega+\bar{\omega})+b^{3} \\
&=a^{3}+a^{2} b0+a b^{2}0+b^{3} \\
&=a^{3}+b^{3}
\end{aligned}Knowing $r=a+b, s=a \omega+b \bar{\omega}$, and $t=a \bar{\omega}+b \omega$, are the roots of $p(z)$:\begin{aligned}
p(z) &=z^{3}+(r s+r t+s t) z-r s t \\
&=z^{3}-3 a b z-\left(a^{3}+b^{3}\right)
\end{aligned}Alternatively, if $z:=f(e^{i \theta})=a e^{i \theta}+b e^{-i \theta}$
$$
\begin{aligned}
z^{3}-3 a b z &=\left(a e^{i \theta}+b e^{-i \theta}\right)^{3}-3 a b\left(a e^{i \theta}+b e^{-i \theta}\right) \\
&=\left(a^{3} e^{i 3 \theta}+3 a^{2} b e^{i \theta}+3 a b^{2} e^{-i \theta}+b^{3} e^{-i 3 \theta}\right)-3 a b\left(a e^{i \theta}+b^{-i \theta}\right) \\
&=a^{3} e^{i 3 \theta}+b^{3} e^{-i 3 \theta}
\end{aligned}
$$
Then the equation will reduce to $a^{3}+b^{3}$ when $\theta=\frac{k \pi}{3}$ with $k=0,2,4$
Thus, we can say the roots of $z^{3}-3 a b z-\left(a^{3}+b^{3}\right)$ are: $\left\{a e^{i \frac{k \pi}{3}}+b e^{-i \frac{k \pi}{3}}: k=0,2,4\right\}$
Marden's Theorem. If $p(z)$ is a cubic polynomial with three complex roots $r, s,t$, that form a triangle in $\mathbb{C}$, then the roots of $p^{\prime}(z)$ are the foci of the unique ellipse tangent to the midpoints of each side.
Proof. By the Lemma, the foci of the outer ellipse are $\pm 2 \sqrt{a b}$, and so the inner ellipse has foci $\pm \sqrt{a b}$. Since $p(z):=z^{3}-3 a b z-\left(a^{3}+b^{3}\right)$, and $p^{\prime}(z)=3 z^{2}-3 a b$. Then, $\pm \sqrt{a b}$ are the roots of $p'(z)$. $□$
6 Cardano's Formula
Cardano's Formula. The solutions of the equation
$$
z^3-3 A-2 B=0
$$
are $a+b, a \omega+b \bar{\omega}$, and $a \bar{\omega}+b \omega$, where
$$
a, b:=\sqrt[3]{B \pm \sqrt{B^{2}-A^{3}}}
$$
Proof. From what we did before, we know that $a+b, a \omega+b \bar{\omega}$, and $a \bar{\omega}+b \omega$ are roots of $z^{3}-3 a b z-\left(a^{3}+b^{3}\right)$. We want to find $A, B$ that satisfies $a b=A$ and $\frac{a^{3}+b^{3}}{2}=B$
Provided that $a, b$ satisfy $A$ and $B$ :
$$
\begin{aligned}
0 &=\left(z-a^{3}\right)\left(z-b^{3}\right) \\
&=z^{2}-\left(a^{3}+b^{3}\right) z+a^{3} b^{3} \\
&=z^{2}-2 B z+A^{3}
\end{aligned}
$$
Using the quadratic formula for $z^{2}-2 B z+A^{3}$ :
$$
a^{3}, b^{3}=B \pm \sqrt{B^{2}-A^{3}}
$$
And thus:
$$
a, b:=\sqrt[3]{B \pm \sqrt{B^{2}-A^{3}}}
$$$□$
7 Example
We can apply what we have learned from Cardano equation and Marden’s Theorem to solve an example involving a cubic equation.

Suppose $f(x)=x^{3}-3 x^{2}-12 x+18$
Let $p(x)=f(x+1)$
This will suppress the cubic and remove the $x^{2}$ from the equation. This will make our calculations easier.
$$
\begin{aligned}
p(x) &=f(x+1) \\
&=(x+1)^{3}-3(x+1)^{2}-12(x+1)+18 \\
&=x^{3}-15 x+4
\end{aligned}
$$
Note this takes the form $x^3-3 A x-2 B=0,B=-2, A=5$.
Cardano's Formula allows us to find $a, b=\sqrt[3]{B \pm \sqrt{B^{2}-A^{3}}}$. By cubing $a, b$, we can remove the cube root. Thus, we get the following:
$$a^{3}, b^{3}=B \pm \sqrt{B^{2}-A^{3}}=-2±11i$$Take the cube root, $a =-2 + i, b =-2-i$.
As stated before, the roots of the $p(z)$ are
\begin{array}l
a+b=-4\\
a \omega+b \bar{\omega}=2-\sqrt3\\
a \bar{\omega}+b \omega=2+\sqrt3
\end{array}We can solve for $a+b$, which will suffice in allowing us to solve for the other two roots, just by multiplying $\omega$ and $\bar{\omega}$. We performed similar steps earlier in this paper for the general $\omega$ and $\bar{\omega}$. Remember that $\omega=\frac{-1+i \sqrt{3}}{2}$ and $\bar{\omega}=\frac{-1-i \sqrt{3}}{2}$.
The roots of $p(x)$ are: $-4, 2 + \sqrt3$ and $2 - \sqrt3$.
∴ the roots of $f(x)$ are: $-5, 1 + \sqrt3$ and $1 - \sqrt3$.
8 Higher Dimensions
We can apply simliar logic we have used for our cubic to any regular tetrahedron. We can rotate and scale the image to match desired coordinates. It can be shown what a quartic would appear like and how the logic works similarly for higher dimension cases. Proof of these other cases are left to be revealed with future work.

Theorem. Given a quartic $p(x)$ with four real roots (at least two distinct), those roots are the first coordinate projections of a regular tetrahedron in $\mathbb{R}^{3}$. That tetrahedron has a unique inscribed sphere, which projects onto an interval whose endpoints are the two roots of $p^{\prime \prime}(x)$.

The figure 6 above shows the quartic polynomial $p(z)$ with a regular tetrahedron, the inscribed sphere, and an equilateral triangle circumscribing that sphere. In fact, the regular tetrahedron is formed by the roots of the quartic, the equilateral triangle is formed by the roots of the derivative, and the edges of the circle project to the roots of the second derivative. We do not understand how the roots of $p(z), p^{\prime}(z)$, and $p^{\prime \prime}(z)$ are related geometrically, as we solve for the cubic case, but further research could allow us to solve these issues rather easily. In fact, seeing the geometric relationship of all the roots could allow us to approach open problem, such as:

Conjecture. There does not exist a quartic polynomial $p$ with four distinct rational roots such that $p^{\prime}, p^{\prime \prime}$, and $p^{\prime \prime \prime}$ all have rational roots.

This conjecture comes from the similar process we used to prove the theorem using real roots earlier. The proof for this is much more complex, and the proof good for future research and studying. Many issues like this arise from the use of higher dimensions. This would be the next step for research for this subject.

9 References
1. Sam Northshield, Geometry of Cubic Polynomials Math. Mag 86 (April 2013) 136-143 doi: 10.4169/math.mag86.2.136
2. Pahio (planetmath.org), Gauss-Lucas Theorem, planetmath.org 2008-08-25, planetmath.org/gausslucastheorem
3. J. Buddenhagen, C. Ford, Nice cubic polynomials, Pythagorean triples, and law of cosines, Math. Mag 65 (1992) 244-249. doi: 10.2307/2691448
4. D. Kalman, An elementary proof of Marden’s Theorem, Amer. Math. Monthly 115 (2008) 330-338
5. Burnside W. S. and Panton A. W. (1886), The theory of equations: with an introduction to the theory of binary algebraic forms, Longmans, Green and Co. 2nd edition (1960)

hbghlyj · 2022-8-14 15:33

Last edited by hbghlyj 2022-8-15 18:11本楼将附件转换为inline SVG

hbghlyj · 2022-8-16 02:38

第8节 higher dimension有点水啊...投影到quartic equation的root的tetrahedron和那个sphere是什么关系都没有说...我估计也没有什么关系...那个figure 6比较玄幻...我就没有抄过来...

Account		Remember me	Forgot password
Password			Register account

Geometry of Cubic Polynomials (Xavier Boesken)

Quick Reply

Viewed boards