1. Introduction
In [
1], I derived and applied some new Feynman diagram combinatorics, in order to incorporate Gauss’s law constraint in the perturbation expansion of Yang–Mills theory using an auxiliary, Lagrange multiplier field
. This led to the derivation of a Coleman–Weinberg effective potential for
that appeared inverted in the resulting effective action and admitted soliton solutions. However, the vacuum structure was not correctly identified in [
1], and the relation to actual properties of Yang–Mills theory was not clear.
Eventually, in [
2], the vacuum structure and stability were demonstrated and two distinct vacua emerged, which led to the proof of confinement, as well as various other expected properties of the strong interaction, and to further confirmation of the often stressed importance of the vacuum state in deciphering a physical theory.
Here, I will use some well-known results of axiomatic, algebraic quantum field theory (AQFT) [
3,
4,
5], in order to further justify the previous work. The discussion around Theorem 4.6 of [
5], which I will mostly follow, is especially useful. The relation of the uniqueness of the vacuum state to the clustering decomposition property (which was observed as early as in [
6,
7]) will be described, and it will be shown that the presence of two separate vacua is related to the central decomposition of the GNS (Gelfand, Naimark, Segal) construction for the vacuum state, to eigenstates of the auxiliary field. The general vacuum state, then, is a mixed quantum state, described by a density matrix, and not a pure, unique vector, vacuum state, as it is often assumed in quantum field theory.
The formalism of AQFT that will be reviewed and applied here is especially constructed to ensure relativistic invariance of a quantum theory. However, since the two vacua that emerge have different energy densities, there is no unique, Lorentz-invariant Lagrangian that can fully capture the physics of the strong interactions (if there was, one would be able to locally perturb one vacuum and lower the energy). Transitions between the two vacua happen only in a background of finite temperature or particle density, and the effective action derived in [
1] can help elucidate many physical properties of the strong interactions, as demonstrated in [
2].
In
Section 2, I start with the formalism of AQFT, and in
Section 3, I review some basic results of the solution of Yang–Mills theory and apply the previous formalism. In
Section 4, I continue the description of the vacuum as a mixed state and show the relation with the cluster decomposition. In
Section 5, I discuss possible relations with other works related to confinement, and in
Section 6, I conclude with some more comments, a criticism of the statement of a related mathematical problem, and some tentative remarks regarding the spin-2 interaction.
2. Axiomatic, Algebraic Quantum Field Theory
The formalism starts with an abstract algebra (that includes the physical observables), a physical state (a functional that also assigns expectation values to observables), and, via the GNS construction, gives them a representation on a concrete algebra of operators on a Hilbert space.
Here, by an abstract algebra, , I will denote a normed vector space over the complex numbers, , that is also an algebra (admits multiplication of its elements), admits conjugation (denoted here by *), and has a unity element with respect to multiplication. The norm and the conjugation operation have expected and compatible properties, for example, , , , for any Q, , in , and .
For the purposes of this work, usually, I will ignore issues of convergence and completion and denote most units (algebraic, group theoretic, etc.) by , unless these issues need specific clarification. Technically, is a -algebra with unity.
A state,
, over
, is a linear functional that assigns a complex number to each element,
Q, of
and is positive (
) and normalized:
Realistic (quantum and probabilistic) physical theories have physical observables that are self-conjugate elements of the abstract algebra, , and the physical system under consideration is described by a state, , that bestows the various expectation values.
The transition to the usual formalism of quantum mechanics is enabled by the GNS construction, which gives a representation of the aforementioned abstract algebra to a concrete algebra, that is, a subalgebra of , the set of bounded linear operators on a Hilbert space, H, with the operation of the adjoint as conjugation (also denoted by *) and obviously also including the unity operator (technically, a von Neumann algebra).
Namely, for any state,
, over an abstract algebra,
, there exist a Hilbert space
, a unit vector
in
, and a representation
of
on
, that is,
, such that
where
denotes the inner product of the GNS Hilbert space, and
is dense in
, that is,
is a cyclic vector (any vector of
can be approximated by
).
Note that the construction is specific to the state and applies to any state and physical theory. Most crucially, however, note that the GNS representation is not necessarily irreducible:
For any subset S of , denote by the commutant of S, that is, the set of all elements of that commute with every element of S. The center of S is , the set of all elements of S that commute with every element of S.
Then, the GNS representation is irreducible if the center of is trivial, consisting of multiples of the identity operator. Otherwise, it is decomposed as a direct sum of eigenspaces of the operators of the center, and the state is mixed (described by a density matrix). So, irreducibility or reducibility of the GNS construction is equivalent to whether the original state is pure or mixed.
The notation of (
2) may lead to the confusion that this is a pure, vector state, but this is not the case, since one, obviously, could have started with a mixed state in the first place.
In the case of a non-trivial center, the resulting Hilbert space is decomposed as a direct sum , the representation as , the unit, cyclic vector as , with complex coefficients , and the original state is a mixed state, given by a density matrix .
This will be used and explained in more detail in later sections, where it will be shown that the Yang–Mills vacuum state is a mixed state.
Like most fundamental results, the proof of the GNS construction is rather simple: the idea is to define the set
to use the Cauchy–Schwarz inequality (which holds generally)
to show that it is a left ideal (
); and eventually, with the necessary completion procedures, to construct the Hilbert space as a quotient space,
, with inner product of the equivalence classes of
A and
B given by
.
Symmetries can also be naturally implemented in the algebraic formalism: to each element, g, of a symmetry group G, there corresponds an automorphism of , and if the state is invariant under g, that is, if for all , then there exists a unitary operator on the cyclic representation space associated with , such that and .
In particular, we will consider the relativistic symmetry of the four-dimensional Minkowski spacetime , with metric and the four vectors , . As usual, x will be called timelike, lightlike, and spacelike depending on whether is positive, zero, or negative, respectively.
The causal complement of a subset D of M is the set of all points that have no causal relation with points of D,
The open future cone, ; the closed future cone, ; and the similarly defined open past and closed past cones are also important in describing the causal structure of the theory. The latter is preserved by the group of Poincare transformations, with elements , where denotes a Lorentz transformation and denotes a spacetime translation. Usually, one considers the group , where is in the connected component of the identity of the Lorentz group (proper, orthochronous Lorentz transformations).
A local, relativistic, quantum field theory involves the abstract algebra of operators that is the union of the abstract algebras for every bounded spacetime domain . The observables of are the physical quantities measured by an observer within the space and time limits of D. They obey the axioms:
- (i)
If , then (monotonicity: if an observable is measured in it can be measured in ).
- (ii)
If , then (causality: the observables of two regions that are spacelike separated commute).
- (iii)
(relativistic covariance: for any , where ).
Now, all one has to do in order to apply the GNS construction and arrive at a quantum field theory is to pick a state, and usually, this is chosen as the vacuum state, which, because of the cyclicity property, encodes a vast amount of information for the physical system.
3. The Yang–Mills Theory
The self-interacting, spin-1, massless particle is described by the non-Abelian, Yang–Mills gauge theory, with coupling and gauge group , with generators , structure constants , gauge field , and . The theory is gauge invariant, with , under the local gauge transformation .
After imposing Gauss’s law with an auxiliary field (Lagrange multiplier)
, transforming covariantly as
, and a set of appropriately modified Feynman rules, the gauge-invariant (although not manifestly Lorentz-invariant) effective action
was derived (written in terms of the chromo-electric and -magnetic fields
) with
(
,
,
) a generated Coleman–Weinberg effective potential term that appears inverted in the effective action. Essentially, what happens is that there are some missing terms involving the Coulomb interaction that can be resummed to this term. The equations of motion are
The inverted effective potential
has a local minimum at
and a global maximum at
. However, because of the presence of gauge and kinetic terms, the analysis of [
2] showed that they are both stable, classically and quantum mechanically, vacua.
, with
, is the perturbative vacuum, with the usual Coulomb interaction, and
, with
, is the confining vacuum (in the
gauge, the vacuum
consists of time-independent, covariantly constant configurations,
, with
a fixed adjoint vector
,
, and
).
Both vacua admit instanton solutions (), but since at most physical situations, there are transitions between the two vacua, it was argued that the strong-CP violating parameter of Yang–Mills is zero.
The equations of motion also admit a soliton solution of the equation
with
. Classical stability was demonstrated for all solutions of the equations of motion, including the soliton. It has a finite total energy (mass), and it connects the two vacua; in physical backgrounds with finite temperature and/or fermion density, the solitons mediate the transitions between the vacua.
The vacuum energy density of
is zero, and the vacuum energy density of
is positive (equal to
). The canonical formalism described in [
2] shows that the energy (Hamiltonian) is given by
In general, the effective action is not Lorentz-invariant, and there is no well-defined energy-momentum tensor. However, the two vacua do not decay, and at the vacua, there is a well-defined energy–momentum tensor,
where
is the usual energy–momentum tensor for perturbative Yang–Mills (
.
Finally, because of the presence of a non-zero condensate in the confining vacuum, the confining interaction was demonstrated, with a linearly rising interaction energy between two color sources at a distance r, proportional to .
In order to apply the axiomatic formalism of the previous section in the Yang–Mills case, one first notices that the auxiliary field commutes with all other fields of the theory (the canonical formalism, in the gauge, implies the usual commutation relations between the canonical variables and ).
The center of the algebra of the observables, therefore, is not trivial; it contains, besides unity, the gauge-invariant . According to the discussion of the previous section, the GNS vacuum representation , where is the vacuum state, splits into eigenspaces with and , that is, and .
The Hilbert space of the theory is
, and the vacuum state is mixed
with positive constant coefficients,
. Accordingly, vacuum expectation values of the observables,
Q, are given by
Observables are the gauge- and Lorentz-invariant operators of the theory, and they obviously have completely different effects on the two vacua. The notation was simplified in (
15); formally, one has
, where the parentheses denote the components of the direct sum. Cyclicity of the vacuum implies that
, or
with less notation, is dense in
.
The symmetries of the theory are also split in the two Hilbert spaces. Here, I will only consider translation symmetry, that is the subgroup of the Poincare group with , and denote the corresponding unitary operator on the GNS Hilbert space as . This is also split in the two Hilbert spaces, and there is an additional energy “gap”, but for all states in the confining vacuum. This is similar to old phenomenological, “bag model”, pictures of confinement.
In the next Section, I will elaborate more on the translation symmetry, and the cluster decomposition, but before proceeding there, I would like to clarify some possible confusion regarding the superselection sectors that may also appear in reducible GNS constructions. These are not decompositions of the vacuum representation; they appear whenever there is a conserved quantity (charge) of continuous discrete symmetry. They also appear in QED and in QCD, wherever one can properly define a conserved, gauge-invariant charge. They are decompositions of a general reducible GNS construction into sectors with different charge (there are some lower-dimensional cases with different vacua, with the same energy density, connected to discrete symmetry, but these are also irrelevant to the present work).
The derivation of confinement and the associated splitting of the vacuum state shown here, are purely due to the dynamics, the self-interaction of the spin-1 field. Other properties of the strong interaction, asymptotic freedom, symmetries, anomalies, etc., have no relation to confinement and the vacuum state (no other than being derived from the same theory).
4. Vacuum State and Clustering Property
Here, I will describe some more properties of a general vacuum state, following [
5], which should be consulted for the proofs that are not given here, as well as the issues of convergence. Then, I will note the differences between a pure and mixed vacuum state with respect to clustering properties and make contact with the solution of Yang–Mills.
The expression of the symmetry of the proper, orthochronous, Poincare group,
, on a representation,
, of an algebra,
, is generally given by
for any
, where
is a general Poincare transformation (consisting of a translation
a and a Lorentz transformation
),
is the action of the symmetry on the algebra
, and the unitary
U is the symmetry operator on the Hilbert space of the representation. The action of translations, where
, will be denoted by
and the unitary
. The spectral decomposition of the unitary operator of a translation is
for four momenta
, where
E is a projection-valued measure on momentum space. Then, for any vector
of the Hilbert space, the support of the measure
is the energy–momentum spectrum of
. The following definitions and theorems can be found in [
5].
(i) If
is the four-dimensional momentum space,
is a compact subset of
, and
Q is an element of the algebra
, then
is an operator that shifts the momentum by
if
satisfies
(the support of a function being the set of its non-zero points).
(ii) If the energy–momentum spectrum of a vector of the Hilbert space is in , the energy–momentum spectrum of is in .
(iii) If is any positive timelike unit vector of , and , an operator will be called an energy-decreasing operator if .
(iv) The vacuum is defined as a state
, such that
for any energy-decreasing operator. Here, as in (
4),
, and the vacuum is defined as a state that is stable under any local perturbation.
(v) The vacuum state is translationally invariant, that is for any , and any x in M.
(vi) On the GNS representation space of the vacuum state
and
where the spectral measure
has support on the positive light-cone of
(
).
From now on, we focus on the GNS representation space of the vacuum state, for which the same observation holds for the irreducibility that was described in
Section 2. Namely, irreducibility is equivalent to triviality of the center
. As was described in
Section 3, in the case of Yang–Mills theory, the center is non-trivial, since it contains the gauge- and Lorentz-invariant operator
.
Irreducibility of the vacuum state is also equivalent to the condition that there is a unique translation invariant vector, that is, any translation invariant vector is proportional to
[
5]. This was also shown not to hold in the Yang–Mills case, since there are two distinct translationally invariant vectors,
and
.
The GNS vacuum representation space splits in the direct sum of two components, the eigenspaces of
or
, the perturbative and the confining vacuum, as described before, and so do the various operators and symmetries, for example,
where the
and the
quantities that correspond to the
and
vectors are calculated with (
12) or (
13) and are different in each vauum; in the confining vacuum, there is a “mass gap” but, for all states at the corresponding Hilbert space,
.
Finally, irreducibility is also equivalent to a general clustering property:
where
x is any spacelike vector in
M, and
and
are elements of
.
Indeed, first assume that (
23) holds: then, if we denote the projection operator to
by
, and the projection operator to all translationally invariant vectors by
, (
23) can be written as
Since the
are arbitrary and
are dense in the Hilbert space, it follows that
, and if we multiply this relation by
, we obtain
; thus, (
23), clustering, implies uniqueness of a translationally invariant vacuum.
Conversely, uniqueness of a translation invariant vacuum (irreducibility of the vacuum state) implies clustering. Indeed, first notice that, as
,
and
, with spacelike
x, eventually are in spacelike separated regions and commute by the axiom of causality. Since the center is trivial, the limit of
exists and is also trivial, say
, with a complex constant
c. By translational invariance
, we obtain the clustering property (
23).
On the other hand, for a reducible vacuum state, clustering does not necessarily hold: let us consider the case of Yang–Mills, described before, where one has the simplest, perhaps, case of reducibility, with a mixed vacuum state consisting of two distinct translationally invariant vectors, and .
As was explained, the Hilbert space of the theory is
, and the vacuum state is mixed
,
. The vacuum expectation values of the observables,
Q, are given by
Since the center now is not trivial but spanned by
and
, the previous limit of
is not well defined. In fact, different sequences
, with
, may have different accumulation points of the form
, with complex constants
. If we pick one such point, then the limit of
is
. On the other hand,
and
, and it is easy to see that the clustering property, in general, does not hold.
Obviously, what happens physically, is that, when two sources that are initially in one vacuum vector are pulled apart, they may remain in the same vacuum or jump in the other vacuum vector or in a mixed vacuum state. Hence, the clustering property does not hold, and the restriction of the attenuation of the interaction energy is also lifted.
Generally, in axiomatic field theory, under the assumption of a unique, pure vacuum state, one goes on to prove stronger clustering properties than (
23), namely, that the limit to a product state is reached at large spacelike separations bounded by a Coulomb or Yukawa interaction. It should be stressed that the results of axiomatic field theory are non-perturbative; if the clustering property holds, it is not possible to obtain a linearly rising, confining, potential energy.
So, the existence of a mixed vacuum state is necessary for the theory of the strong interaction. On the other hand, between the two vacua described here, there is also an energy density difference. This implies that a Lagrangian description of the system is not possible. Indeed, if there was such a Lorentz-invariant Lagrangian (in place of (
6)), it would be possible to build a local configuration using the gauge- and Lorentz-invariant operators of the Lagrangian (not necessarily satisfying the equations of motion) that perturbs one vacuum, creates a small bubble of the other vacuum, and thus lowers the energy, which is not possible based on the discussion and definitions in the beginning of this section. In our case, (
11) creates a small bubble of the confining vacuum in the perturbative vacuum and increases the energy, but the operators involved are not Lorentz-invariant. So, the solitons described and their combinations do not belong to any of the Hilbert spaces,
or
. They are mediators of the phase transitions between the two spaces (phases) in physical situations, with a background temperature and matter density.
6. Comments
The postulate of a unique, translation invariant vacuum (a pure vacuum state) is unfitting of a theory of strong interactions. It is equivalent to the cluster decomposition property and was originally used in axiomatic field theory with the aim of describing scattering and asymptotic states for accelerator setups. However, the strong interaction in the confining phase does not admit asymptotic states. Colored sources are dynamically confined by a linearly rising interaction at this scale, and the entire spectrum has a “mass gap”.
As far as a related mathematical problem is concerned [
14], it should be clear from these results that the statement of the problem itself is wrong, or at least incomplete, in that it assumes a pure, unique vector, vacuum state. Thus stated, the problem is a red herring; it cannot have a solution with a physical theory that corresponds to a strong interaction.
Once the picture of the spin-1 interaction is better understood, one would like to apply a similar approach to the spin-2 case, which was, in fact, the initial motivation of [
1]. One can immediately notice that the possibility of another vacuum with positive energy density can be related to an inflationary, cosmological, de Sitter phase. Thus, an inflationary period may arise, purely based on the dynamics of the spin-2 interaction, without the need for an inflaton field. Some more also naive observations regarding the existence of a mixed vacuum state and the breakdown of the clustering property are the presence of a de Sitter horizon and the presence of a polynomial interaction, besides the Coulomb term, for a static observer in a vacuum with a cosmological constant (∼
, for a distance
r, where
H is the Hubble constant).
Provided that a renormalisable interaction (“vertex”) can be identified, besides the quadratic Fierz–Pauli term, and the constraints are treated, one would like to arrive at an expression like (
6), with different, stable vacua. Then, resummations of the effective equations of motion in each vacuum [
15] would lead to different phases: some would correspond to ordinary gravity (general relativity with a zero cosmological constant), others would correspond to an inflationary phase, and others may even be non-interacting or “modified” gravity vacua. It should be clear that the picture that emerges is fundamentally, conceptually different than any inflationary, eternal inflation, or multiverse scenario. Apart from the absence of additional inflaton fields, what is described here are different phases, different vacua of the same spin-2 field, that correspond to different effective theories and may exist in the same cosmic fluid in other cosmological regions or scales.
However, the problem of finding a renormalizable “vertex”, a dimensionless coupling constant, in order to apply the techniques of ordinary quantum field theory is still not solved. It is possible that, much like the full Yang–Mills case not admitting a Lagrangian description, for the spin-2 interaction, even the “vertex” term may not admit a Lagrangian description. Hence, the formalism of algebraic quantum field theory seems necessary.