Article

John von Neumann’s Time-Frequency Orthogonal Transforms

Faculty of Automatic Control and Computers, “Politehnica” University of Bucharest, 313 Splaiul Independentei, 060042 Bucharest, Romania
* Author to whom correspondence should be addressed.
Mathematics 2023, 11(12), 2607; https://doi.org/10.3390/math11122607
Submission received: 30 April 2023 / Revised: 1 June 2023 / Accepted: 2 June 2023 / Published: 7 June 2023
(This article belongs to the Special Issue Automatic Control and Soft Computing in Engineering)

Abstract
John von Neumann (JvN) was one of the greatest scientists and minds of the 20th century. His research encompassed a large variety of topics (especially from mathematics), and the results he obtained contributed essentially to the progress of science and technology. Within this article, a function that JvN defined long ago, namely the cardinal sinus (sinc), is employed to define transforms applicable to 1D signals, either in continuous or discrete time. The main characteristics of JvN Transforms (JvNTs) are founded on a theory described at length in the article. Two properties are of particular interest: orthogonality and invertibility. Both are important in the context of data compression. After building the theoretical foundation of JvNTs, the corresponding numerical algorithms were designed, implemented and tested on artificial and real signals. The last part of the article is devoted to simulations with such algorithms by using 1D signals. An extensive analysis of the JvNTs' effectiveness is performed as well, based on simulation results. In conclusion, JvNTs prove to be useful tools in signal processing.

1. Introduction

Orthogonality is an important and interesting, yet challenging, property in pure mathematics, as well as in some branches of applied mathematics (such as system identification [1] or signal processing (SP) [2]). One of the oldest results regarding orthogonality is the ancient Pythagorean theorem, which suggested that some mathematical entity (e.g., the hypotenuse vector) can be expressed by means of other entities that are orthogonal to each other (e.g., the catheti unit vectors), as a linear combination of them. Moreover, thanks to orthogonality, the coefficients of the linear combination are easily computed by simply projecting the mathematical entity on each orthogonal entity. One refers to this operation as analysis or decomposition. Thus, the mathematical entity can be replaced by the coefficients, provided that the orthogonal entities are known. From the coefficients, the mathematical entity can be either partially or fully recovered by using the linear combination. This operation stands for synthesis or recomposition of the mathematical entity. The orthogonal entities put together constitute a so-called transform. (Often, the transform is expressed by means of an orthogonal/Hermitian matrix). They can also play the role of bases in a vectorial space. It is very likely that Pythagoras was not using this terminology, which started to be known much later. However, his idea generated an intense quest for orthogonal transforms or bases and has had a huge impact on modern technology. Since Pythagoras' times, myriads of orthogonality results have been derived. To reveal the framework of this article, only a very few of them are cited next.
The vectorial spaces of interest for this article are the ones defined by Lebesgue, either in continuous or discrete time, for 1D functions, also referred to as signals. (Recall that any signal from a Lebesgue space is $p$-integrable or $p$-summable, which automatically involves the signal being bounded). It has been proven that all Lebesgue spaces are of Banach type, which means that their norms yield distances between signals. Even more interesting are the spaces for $p \in \{1,2\}$, since the discussion on orthogonality can be extended from the time domain to the frequency domain ($p = 1$), and a scalar product can serve as an important tool for testing orthogonality between signals ($p = 2$). (Actually, for $p = 2$, the Lebesgue space is also a Hilbert space).
In pure mathematics, polynomials are comfortable mathematical entities selected to build orthogonal bases. Among them, Chebyshev polynomials [3] are remarkable in this respect. Nowadays, squared Krawtchouk–Chebyshev polynomials are successfully employed in applications [4]. Another interesting development is reported in the context of the Euclidean and Weyl–Heisenberg (uncertainty) groups [5], where Hermite functions are employed. The Weyl–Heisenberg group belongs to the larger class of Lie groups and is particularly interesting, as the analysis is approached not only in the time domain, but also in the frequency domain, by using the two linear operators that are defined within this article as well. Some unconventional orthogonal bases have been proposed as well. For example, in [6], the basis is founded on discrete spherical Bessel functions.
One of the most notorious orthogonal bases was introduced by J. Fourier more than 200 years ago and is harmonic in nature. This was an important breakthrough which led not only to the concepts of Fourier series and the Fourier transform (FT), but also to the combined time-frequency approach. The Dirichlet–Fourier punctual convergence theorem is a fundamental result [7] for harmonic analysis and synthesis. Nowadays, numerical algorithms of the fast Fourier transform class [2] are employed in various fields. They implement the discrete Fourier transform (DFT) formula in an efficient manner. Modern implementations benefit from parallel computing, which has allowed reaching a milestone in terms of speed [8,9]. From this class of procedures, the discrete cosine transform (DCT) algorithm is integrated in many standards of signal compression and coding [10].
Although the FT dominated the research for a long time, some other orthogonal transforms were defined, especially in recent decades. In applied mathematics, orthogonality is associated with redundancy reduction, entropy minimization, decorrelation, denoising, and principal components extraction. All of these are key concepts for data/signal compression and coding, which, furthermore, are fundamental operations in modern telecommunications. Thus, the ideal orthogonal transform that can completely de-correlate a signal was introduced in the works of Hotelling [11], Karhunen [12] and Loève [13], in the context of the principal components analysis developed by Pearson [14] and the minimum description length principle stated by Rissanen [15]. The Hotelling or Karhunen–Loève transform (KLT) builds a basis of orthogonal eigenvectors starting from the autocorrelation matrix of a centered signal (obtained after subtracting its mean). (Later, the KLT was generalized by Hua and Liu [16] with the help of the least squares method). Thus, the orthogonal basis adapts itself to each signal. Because the ideal KLT cannot be implemented by a numerical procedure, only approximations of it can be considered. This puts the KLT in competition with many other orthogonal transforms, including the FT (which also exhibits decorrelating properties).
Nowadays, the orthogonal transforms can roughly be grouped into four classes.
The first class includes harmonic transforms such as: FT, DCT, and the Hartley transform [17] (or CAS (cosine and sine) Transform).
The second class is based on time-frequency representations [18,19] (in the framework of the Weyl–Heisenberg group). The basic idea of such a representation is to build the orthogonal basis starting from a mother-signal (or window), by applying two operators: time-shifting and frequency modulation. Typical transforms of this class are: the windowed FT [2], the Weierstrass–Gauss transform [20], the Gabor–Gauss transform [21], the Wigner–Ville transform [19], the Gaussian Wigner–Ville transform [22], and the Morlet–Gabor transform [23]. Note that not all the transforms of this class can be enforced to verify the orthogonality property. For example, since the frequency representation of a Gauss bell is also a Gauss bell, the Weierstrass–Gauss transform can only be nearly orthogonal. Nevertheless, the John von Neumann transforms (JvNTs) described within this article are orthogonal and belong to this class.
The transforms based on time-scale representation and multiresolution theory constitute the third class. In fact, such a representation can be performed in the framework of another group, namely the affine one, which also belongs to the Lie class of groups. This time, the mother-signal gives birth to the basis by using time-shifting and scaling operators. The most prominent members of this class are the wavelet transforms (WT). Thereby, the class is quite large and can be split into three groups: one operating with smooth signals, another one comprising fractal empirical signals, and a third one including fractal generic signals.
Smooth WTs were introduced, for example, by Meyer [24] and especially by Mallat [25] (who proposed a pyramidal algorithm for signal and image processing that was rapidly adopted by both the scientific and the technical communities).
Because many real-life signals are not only nonstationary but also fractal, the interest in building bases of fractal signals started very early, in the 19th century, as proven by the works of Hadamard [26] and then by the orthogonal fractal functions of Haar [27] or Walsh [28]. The fractal effect was empirically induced starting from the rectangular window, by inserting ruptures at some instants. It has been proven that Hadamard and Walsh actually defined the same transform, although their fractal bases are ordered differently. Another representative of this group is referred to as the slant transform [29]. This time, the fractal effect is obtained by imposing some constraints along a slanted line in the transform matrix. Nowadays, new transforms of this group are being proposed, such as: the complex Hadamard transform [30], the Gabor–Walsh–Fourier transform [31], and the generalized Walsh–Hadamard transform [32].
The most notorious basis of fractal generic signals seemingly is the one constructed by Daubechies, with orthonormal wavelets [33]. However, there is a price to pay for orthogonality: the lack of symmetry. Therefore, efforts were made to recover this property, but the new wavelets only constituted biorthogonal frames of the signals space [34]. (Recall that a frame is more than a basis, i.e., it can generate the whole space, but with vectors that are not necessarily linearly independent). After the fundamental results were proven by Meyer, Mallat and Daubechies, wavelet theory was developed for more than 15 years. There are too many contributions to be cited here. However, one can cite a recent article dealing with orthogonal and biorthogonal wavelets in the context of filter bank implementations, namely [35]. Nowadays, wavelets are employed in many fields of science and technology.
The fourth class is the most complex one. It relies on combined approaches of the classes above. More specifically, here, one can perform time–frequency–scale representations. Putting all three operators together makes the task of achieving orthogonality difficult. The transforms in this class usually are defined by time–frequency–scale dictionaries of waveforms (including wavelets), which constitute frames of the signals space. Here, the challenge is to find the minimal number of waveforms (ones suitably orthogonal to each other) from the dictionary, in order to represent the signal with sufficient accuracy. To solve this problem, Mallat and Zhang made a great contribution by introducing the matching pursuit algorithm [36], which relies on the Pythagorean theorem. Thus, the circle closes, as this short overview concludes by invoking again the Greek mathematician who originated the whole endeavor of building orthogonal transforms.
The article is structured as follows. Section 2 is devoted to the theoretical background of orthogonal JvNTs. In Section 3, two numerical algorithms to implement discrete time JvNTs are designed. Section 4 presents simulation results obtained after running the algorithms for one artificial stochastic signal and one speech signal recorded from a male. Additionally, an extensive discussion of the results is performed, especially concerning the theoretical compression capacity of JvNTs. The last section completes the article with concluding remarks. Lists of the employed acronyms and of references are appended at the end.

2. Theoretical Background

2.1. JvNTs for Continuous Time 1D Signals

John von Neumann (JvN) defined the following function [37]:
$$ \nu(t) = \operatorname{sinc}(\pi t) = \frac{\sin(\pi t)}{\pi t}, \quad t \in \mathbb{R}, \tag{1} $$
which stands for cardinal sinus and can be linked to low-pass filtering of signals. More specifically, consider the ideal low-pass filter with the frequency response:
$$ H(j\Omega) = \begin{cases} A, & \Omega \in [-\Omega_c, +\Omega_c] \\ 0, & \Omega \in \mathbb{R} \setminus [-\Omega_c, +\Omega_c] \end{cases} = A\,\chi_{[-\Omega_c, +\Omega_c]}(\Omega), \quad \Omega \in \mathbb{R}, \tag{2} $$
where $A > 0$ is the filter gain, $\Omega_c > 0$ is the cut-off (absolute) pulsation (in [rad/s]), while $\chi_{[a,b]}$ stands for the index function of the interval $[a,b]$. Then, the impulse response of the filter can be obtained by means of the inverse FT:
$$ h(t) = \frac{1}{2\pi}\int_{-\infty}^{+\infty} H(j\Omega)\,e^{j\Omega t}\,d\Omega = \frac{A}{2\pi}\int_{-\Omega_c}^{+\Omega_c} e^{j\Omega t}\,d\Omega = \frac{A\,\Omega_c}{\pi}\operatorname{sinc}(\Omega_c t) = \frac{A\,\Omega_c}{\pi}\,\nu\!\left(\frac{\Omega_c}{\pi}\,t\right), \quad t \in \mathbb{R}. \tag{3} $$
Equation (3) reveals an interesting correspondence between the JvN function (1) and the index function (2). This means the JvN function is the impulse response of a low-pass filter of unit gain ($A = 1$) and cut-off pulsation $\Omega_c = \pi$. This property can easily be proven by means of the FT (direct and inverse):
$$ \begin{cases} \displaystyle\int_{-\infty}^{+\infty}\nu(t)\,e^{-j\Omega t}\,dt = \int_{-\infty}^{+\infty}\operatorname{sinc}(\pi t)\,e^{-j\Omega t}\,dt = \chi_{[-\pi,+\pi]}(\Omega), & \Omega \in \mathbb{R}; \\[2mm] \displaystyle\int_{-\infty}^{+\infty}\chi_{[-\pi,+\pi]}(\Omega)\,e^{j\Omega t}\,d\Omega = 2\pi\operatorname{sinc}(\pi t) = 2\pi\,\nu(t), & t \in \mathbb{R}. \end{cases} \tag{4} $$
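For readers who wish to see correspondence (4) at work, the snippet below approximates the direct FT of the JvN waveform on a truncated time grid. It is a minimal numerical sketch, assuming NumPy is available; the grid and the probe pulsations are our illustrative choices, and the slow $1/t$ decay of the tails makes the check only approximate.

```python
import numpy as np

t = np.arange(-2000, 2000) * 1e-2             # time grid on [-20, 20), step 0.01
nu = np.sinc(t)                               # np.sinc(t) = sin(pi*t)/(pi*t), i.e., the JvN function (1)

for Omega in (0.5 * np.pi, 2.0 * np.pi):      # one pulsation inside [-pi, +pi], one outside
    ft = np.trapz(nu * np.exp(-1j * Omega * t), t)   # numerical approximation of the FT
    print(f"Omega = {Omega / np.pi:.1f}*pi -> FT ~ {ft.real:.3f}")  # ~1 inside the band, ~0 outside
```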
Both functions belong to the Lebesgue–Hilbert space of finite energy functions for which the FT is well defined, denoted by $L^2_{FT}$. (Recall that this space includes functions with an integrable squared module, although there are such functions, outside $L^2_{FT}$, with divergent FT). This is an important feature that allows using the scalar product $\langle\cdot,\cdot\rangle$ to test orthogonality. Before approaching the orthogonality property, it is useful to note (and easy to prove) that, if $f, g \in L^2_{FT}$, then:
$$ \langle f, g \rangle = \frac{1}{2\pi}\,\langle F, G \rangle, \tag{5} $$
where $F$ and $G$ are the FTs of $f$ and $g$, respectively. Another interesting property of $L^2_{FT}$ is its separability, i.e., the capability to admit numerable bases. This opens the way to employing such bases in real-world applications, by using computing techniques.
Given the correspondence (4), which rather belongs to the SP field [2], hereafter the JvN function will be referred to as the JvN waveform (or window). (The term 'window' will be explained later). Two elementary SP operators can be applied to the JvN waveform (1), in order to generate a family from which a numerable basis of $L^2_{FT}$ can be extracted.
Definition 1.
The time-shifting (linear) operator is defined as follows:
$$ q^{\tau}: L^2_{FT} \to L^2_{FT}, \quad f \mapsto q^{\tau}f, \quad (q^{\tau}f)(t) = f(t-\tau), \quad t \in \mathbb{R}, \tag{6} $$
where $\tau \in \mathbb{R}$ is the specific offset, as a free parameter.
The notation $q^{\tau}$ is employed on purpose. If $\tau > 0$, then the signal is delayed, whereas, if $\tau < 0$, then the signal is anticipated. Thus, the signs > and < visually point to the direction of shifting. (Obviously, if $\tau = 0$, no shifting is applied).
The Fourier operator $\mathcal{F}$ reacts to time-shifting as follows:
$$ \mathcal{F}(q^{\tau}f)(\Omega) = \int_{-\infty}^{+\infty}(q^{\tau}f)(t)\,e^{-j\Omega t}\,dt = \int_{-\infty}^{+\infty}f(t-\tau)\,e^{-j\Omega t}\,dt = e^{-j\Omega\tau}\,\mathcal{F}(f)(\Omega) = e^{-j\Omega\tau}F(j\Omega), \quad \Omega \in \mathbb{R}. \tag{7} $$
Interestingly, the FT was modulated by the elementary harmonic $e^{-j\Omega\tau}$. Property (7) suggests defining a second operator that should be able to translate the FT along the pulsation axis.
Definition 2.
The harmonic modulation (linear) operator is defined as follows:
$$ \mu^{\omega}: L^2_{FT} \to L^2_{FT}, \quad f \mapsto \mu^{\omega}f, \quad (\mu^{\omega}f)(t) = e^{j\omega t}f(t), \quad t \in \mathbb{R}, \tag{8} $$
where $\omega \in \mathbb{R}_+$ is the specific pulsation, as a free parameter.
This time:
$$ \mathcal{F}(\mu^{\omega}f)(\Omega) = \int_{-\infty}^{+\infty}(\mu^{\omega}f)(t)\,e^{-j\Omega t}\,dt = \int_{-\infty}^{+\infty}f(t)\,e^{-j(\Omega-\omega)t}\,dt = \mathcal{F}(f)(\Omega-\omega) = F(j(\Omega-\omega)) = q^{\omega}F(j\Omega), \quad \Omega \in \mathbb{R}, \tag{9} $$
which proves that the FT is translated towards $+\infty$ with the offset $\omega \ge 0$.
Both operators map the space $L^2_{FT}$ onto itself, as the energy of any input signal $f \in L^2_{FT}$ is conserved:
$$ \begin{cases} E(q^{\tau}f) = \displaystyle\int_{-\infty}^{+\infty}\left|(q^{\tau}f)(t)\right|^2 dt = \int_{-\infty}^{+\infty}\left|f(t-\tau)\right|^2 dt = E(f), & \forall\,\tau \in \mathbb{R}; \\[2mm] E(\mu^{\omega}f) = \displaystyle\int_{-\infty}^{+\infty}\left|(\mu^{\omega}f)(t)\right|^2 dt = \int_{-\infty}^{+\infty}\left|f(t)\,e^{j\omega t}\right|^2 dt = E(f), & \forall\,\omega \in \mathbb{R}_+. \end{cases} \tag{10} $$
Starting from waveform (1), a new signal of $L^2_{FT}$ can be generated with the help of operators (6) and (8), as follows: first, apply a time-shifting with offset $\tau \in \mathbb{R}$; then, apply a harmonic modulation with pulsation $\omega \in \mathbb{R}_+$. Thus, one obtains:
$$ \nu_{(\tau,\omega)}(t) = (\mu^{\omega}(q^{\tau}\nu))(t) = e^{j\omega t}\operatorname{sinc}(\pi(t-\tau)) = e^{j\omega t}\,\nu(t-\tau), \quad t \in \mathbb{R}. \tag{11} $$
From (11), one can note that the time-shifting does not affect the elementary harmonic. By varying the parameters $\tau \in \mathbb{R}$ and $\omega \in \mathbb{R}_+$, a continuously indexed family of waveforms is obtained: $\{\nu_{(\tau,\omega)}\}_{\tau\in\mathbb{R},\,\omega\in\mathbb{R}_+} \subset L^2_{FT}$. According to SP terminology, $\nu_{(0,0)} \equiv \nu$ is a mother-waveform (mw) which can give birth to any child $\nu_{(\tau,\omega)}$, often referred to as a time-frequency atom (tfa), due to the nature of the two operators. As in physics, any tfa (11) can be decomposed into a kernel ($\operatorname{sinc}(\pi(t-\tau))$) and some electron(s) ($e^{j\omega t}$), each of which performs in one domain: the kernel in time and the electron(s) in frequency. (It has been proven that the sinc kernel also serves as an interpolation kernel in SP [2]).
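As a quick illustration of Equation (11), the sketch below builds a tfa from the mother-waveform; the function name tfa and the grid are our own illustrative choices, not the article's notation.

```python
import numpy as np

def tfa(t, tau=0.0, omega=0.0):
    """Time-frequency atom nu_(tau,omega)(t) = exp(j*omega*t) * sinc(pi*(t - tau))."""
    return np.exp(1j * omega * t) * np.sinc(t - tau)   # np.sinc already embeds the factor pi

t = np.linspace(-10.0, 10.0, 2001)
mother = tfa(t)                            # nu_(0,0), the JvN mother-waveform
child = tfa(t, tau=3.0, omega=2 * np.pi)   # the atom nu_(3, 2*pi) of the dictionary (16)
```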
In this context, the main problem is how to extract a numerable orthogonal basis of $L^2_{FT}$ from the family of tfas. The following result holds true.
Theorem 1.
A numerable family $\{\nu_{(\tau,\omega)}\}_{\tau\in T,\,\omega\in P}$ is orthogonal if and only if both requirements below are met:
a. The set $T \subset \mathbb{R}$ is numerable and $\tau_1 - \tau_2 \in \mathbb{Z}$, $\forall\,\tau_1, \tau_2 \in T$;
b. The set $P \subset \mathbb{R}_+$ is numerable and $\omega_1 - \omega_2 \in 2\pi\mathbb{Z}$, $\forall\,\omega_1, \omega_2 \in P$ (where $2\pi\mathbb{Z}$ includes all integer multiples of $2\pi$).
Proof. 
Before proving the theorem, it is useful to notice that orthogonality can very easily be verified by using index functions. Thus, evidently, if $\chi_{[a,b]}$ and $\chi_{[c,d]}$ are two arbitrarily selected index functions with $a < b$ and $c < d$, they are orthogonal if and only if the two intervals are almost disjoint, i.e., $[a,b] \cap [c,d]$ is either empty or reduces to a single common endpoint ($a = d$ or $b = c$). However, the orthogonality property is not obvious when looking at tfas in the time domain. Fortunately, the correspondence (4), together with property (5), can be exploited to express the orthogonality in the frequency domain, where index functions come into play.
Clearly, the necessary preparatory step is to derive the FT of any tfa. Thus:
$$ \mathcal{F}(\nu_{(\tau,\omega)})(\Omega) = \int_{-\infty}^{+\infty}\nu_{(\tau,\omega)}(t)\,e^{-j\Omega t}\,dt = \int_{-\infty}^{+\infty}\nu(t-\tau)\,e^{-j(\Omega-\omega)t}\,dt = \int_{-\infty}^{+\infty}\nu(t)\,e^{-j(\Omega-\omega)(t+\tau)}\,dt = e^{-j\tau(\Omega-\omega)}\int_{-\infty}^{+\infty}\nu(t)\,e^{-j(\Omega-\omega)t}\,dt $$
$$ = e^{-j\tau(\Omega-\omega)}\,\chi_{[-\pi,+\pi]}(\Omega-\omega) = e^{-j\tau(\Omega-\omega)}\,\chi_{[\omega-\pi,\omega+\pi]}(\Omega), \quad \Omega \in \mathbb{R},\ \tau \in \mathbb{R},\ \omega \in \mathbb{R}_+. \tag{12} $$
Now, according to (5):
$$ \langle\nu_{(\tau_1,\omega_1)},\nu_{(\tau_2,\omega_2)}\rangle = \frac{1}{2\pi}\langle\mathcal{F}(\nu_{(\tau_1,\omega_1)}),\mathcal{F}(\nu_{(\tau_2,\omega_2)})\rangle = \frac{1}{2\pi}\int_{-\infty}^{+\infty}e^{-j\tau_1(\Omega-\omega_1)}\,e^{+j\tau_2(\Omega-\omega_2)}\,\chi_{[\omega_1-\pi,\omega_1+\pi]}(\Omega)\,\chi_{[\omega_2-\pi,\omega_2+\pi]}(\Omega)\,d\Omega $$
$$ = \frac{e^{j(\tau_1\omega_1-\tau_2\omega_2)}}{2\pi}\int_{-\infty}^{+\infty}\chi_{[\omega_1-\pi,\omega_1+\pi]}(\Omega)\,\chi_{[\omega_2-\pi,\omega_2+\pi]}(\Omega)\,e^{j\Omega(\tau_2-\tau_1)}\,d\Omega, \tag{13} $$
for any $\tau_1, \tau_2 \in \mathbb{R}$ and $\omega_1, \omega_2 \in \mathbb{R}_+$.
From (13), it follows that the integral is null if and only if the intervals $[\omega_1-\pi,\omega_1+\pi]$ and $[\omega_2-\pi,\omega_2+\pi]$ are almost disjoint. Since both intervals have the same length, equal to $2\pi$, it follows that they are almost disjoint if and only if the difference between the pulsations $\omega_1$ and $\omega_2$ is a non-null integer multiple of $2\pi$. Consequently, so far, the necessary and sufficient orthogonality condition is: $\omega_1 - \omega_2 \in 2\pi\mathbb{Z}$, $\forall\,\omega_1, \omega_2 \in P$ with $\omega_1 \ne \omega_2$, which does not cover all the indices of the set $P$.
What if $\omega_1 = \omega_2 = \omega \in P$? In this case, the scalar product (13) becomes:
$$ \langle\nu_{(\tau_1,\omega)},\nu_{(\tau_2,\omega)}\rangle = \frac{e^{j\omega(\tau_1-\tau_2)}}{2\pi}\int_{-\infty}^{+\infty}\chi^2_{[\omega-\pi,\omega+\pi]}(\Omega)\,e^{j\Omega(\tau_2-\tau_1)}\,d\Omega = \frac{e^{j\omega(\tau_1-\tau_2)}}{2\pi}\int_{\omega-\pi}^{\omega+\pi}e^{j\Omega(\tau_2-\tau_1)}\,d\Omega $$
$$ = \frac{e^{j\omega(\tau_1-\tau_2)}}{2\pi}\left.\frac{e^{j\Omega(\tau_2-\tau_1)}}{j(\tau_2-\tau_1)}\right|_{\Omega=\omega-\pi}^{\Omega=\omega+\pi} = \frac{e^{j\omega(\tau_1-\tau_2)}}{2\pi}\cdot\frac{e^{j(\omega+\pi)(\tau_2-\tau_1)}-e^{j(\omega-\pi)(\tau_2-\tau_1)}}{j(\tau_2-\tau_1)} = \frac{e^{j\pi(\tau_2-\tau_1)}-e^{-j\pi(\tau_2-\tau_1)}}{2\pi j(\tau_2-\tau_1)} = \nu(\tau_2-\tau_1). \tag{14} $$
Since the JvN function (1) is null at all non-null integers (and takes the unit value at zero), Equation (14) shows that the two tfas are orthogonal if and only if $\tau_1, \tau_2 \in T$ and $\tau_2 - \tau_1$ is a non-null integer.
What if $\tau_1 = \tau_2 = \tau \in T$? Obviously, in this case, the scalar product cannot be null, as it evaluates the energy of the corresponding tfa:
$$ \langle\nu_{(\tau,\omega)},\nu_{(\tau,\omega)}\rangle = \left\|\nu_{(\tau,\omega)}\right\|^2 = E(\nu_{(\tau,\omega)}) = \nu(0) = 1. \tag{15} $$
Thus, not only is the numerable family orthogonal, but it also includes only unit norm tfas (i.e., it is an orthonormal family). □
As a direct consequence of Theorem 1, a comfortable choice is to extract the following numerable family from the set $\{\nu_{(\tau,\omega)}\}_{\tau\in\mathbb{R},\,\omega\in\mathbb{R}_+}$:
$$ \mathbf{V} = \{\nu_{(p,2k\pi)}\}_{p\in\mathbb{Z},\,k\in\mathbb{N}}. \tag{16} $$
The reader can easily verify that the family $\mathbf{V}$ is orthonormal (according to the proof of Theorem 1). In SP terminology, $\mathbf{V}$ is also referred to as a dictionary of time-frequency waveforms (or tfas). The tfas that belong to the JvN dictionary (16) are expressed as:
$$ \nu_{(p,2k\pi)}(t) = e^{2\pi ktj}\operatorname{sinc}(\pi(t-p)), \quad t \in \mathbb{R},\ p \in \mathbb{Z},\ k \in \mathbb{N}. \tag{17} $$
The next problem is to determine the requirements to be met such that the JvN dictionary becomes a basis of the $L^2_{FT}$ space. Obviously, thanks to the orthogonality property, $\mathbf{V}$ is a linearly independent system of $L^2_{FT}$. Then, it suffices to verify that $\mathbf{V}$ is also a generator system of $L^2_{FT}$, or to determine under which conditions it can become such a system.
To solve the problem, the auxiliary result below may help.
Lemma 1.
The subset of translated tfas $\{\nu_{(p,0)}\}_{p\in\mathbb{Z}} \subset \mathbf{V}$ verifies the following remarkable property:
$$ \sum_{p\in\mathbb{Z}}\left[\nu_{(p,0)}(t)\right]^2 = 1, \quad \forall\,t \in \mathbb{R}. \tag{18} $$
Proof. 
Observe that, if $t = n \in \mathbb{Z}$, then:
$$ \nu_{(p,0)}(n) = \operatorname{sinc}(\pi(n-p)) = \delta_0[n-p], \quad \forall\,p \in \mathbb{Z}, \tag{19} $$
where $\delta_0[\cdot]$ is the unit impulse centered in the origin (i.e., the Kronecker symbol). In this case, equality (18) is evidently verified. If $t \in \mathbb{R}\setminus\mathbb{Z}$, then:
$$ \sum_{p\in\mathbb{Z}}\left[\nu_{(p,0)}(t)\right]^2 = \sum_{p\in\mathbb{Z}}\operatorname{sinc}^2(\pi(t-p)) \le \frac{1}{\pi^2}\sum_{p\in\mathbb{Z}}\frac{1}{(p-t)^2}, \quad \forall\,t \in \mathbb{R}\setminus\mathbb{Z}, \tag{20} $$
which proves that the series $\sum_{p\in\mathbb{Z}}[\nu_{(p,0)}(t)]^2$ is absolutely convergent.
According to property (12), the atom $\nu_{(p,2k\pi)}$ ($p \in \mathbb{Z}$, $k \in \mathbb{N}$) can be recovered by means of the inverse FT:
$$ \nu_{(p,2k\pi)}(t) = e^{2\pi ktj}\operatorname{sinc}(\pi(t-p)) = \frac{1}{2\pi}\int_{-\infty}^{+\infty}e^{-jp(\Omega-2k\pi)}\,\chi_{[(2k-1)\pi,(2k+1)\pi]}(\Omega)\,e^{j\Omega t}\,d\Omega = \frac{1}{2\pi}\int_{-\infty}^{+\infty}\chi_{[(2k-1)\pi,(2k+1)\pi]}(\Omega)\,e^{j\Omega(t-p)}\,d\Omega, \quad t \in \mathbb{R}. \tag{21} $$
Equation (21) involves:
$$ \nu_{(p,0)}(t) = \operatorname{sinc}(\pi(t-p)) = \frac{e^{-2\pi ktj}}{2\pi}\int_{-\infty}^{+\infty}\chi_{[(2k-1)\pi,(2k+1)\pi]}(\Omega)\,e^{j\Omega(t-p)}\,d\Omega, \quad t \in \mathbb{R}. \tag{22} $$
Note that $\nu_{(p,0)}$ is a real-valued function. Then, with the help of (22) and Fubini's theorem, one can write:
$$ \sum_{p\in\mathbb{Z}}\left[\nu_{(p,0)}(t)\right]^2 = \sum_{p\in\mathbb{Z}}\nu_{(p,0)}(t)\,\overline{\nu_{(p,0)}(t)} = \frac{1}{4\pi^2}\sum_{p\in\mathbb{Z}}\left(\int_{-\infty}^{+\infty}\chi_{[(2k-1)\pi,(2k+1)\pi]}(\Omega)\,e^{j\Omega(t-p)}\,d\Omega\right)\overline{\left(\int_{-\infty}^{+\infty}\chi_{[(2k-1)\pi,(2k+1)\pi]}(\Phi)\,e^{j\Phi(t-p)}\,d\Phi\right)} $$
$$ = \frac{1}{4\pi^2}\sum_{p\in\mathbb{Z}}\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty}\chi_{[(2k-1)\pi,(2k+1)\pi]}(\Omega)\,\chi_{[(2k-1)\pi,(2k+1)\pi]}(\Phi)\,e^{j(\Omega-\Phi)t}\,e^{j(\Phi-\Omega)p}\,d\Omega\,d\Phi, \quad t \in \mathbb{R}. \tag{23} $$
Furthermore, since both the integrals and the infinite sum are absolutely convergent, the computations can be made in any order. For example, in (23), one can first evaluate the sum and then the integrals. Before that, recall the Poisson-like identity coming from distribution theory:
$$ \sum_{p\in\mathbb{Z}}e^{jp\alpha\beta} = \frac{2\pi}{\alpha}\,\delta_0(\beta), \quad \alpha, \beta \in \mathbb{R}, \tag{24} $$
where $\delta_0(\cdot)$ is the Dirac impulse centered in the origin. Then, with the help of identity (24) (where $\alpha = 1$ and $\beta = \Omega - \Phi$), Equation (23) becomes:
$$ \sum_{p\in\mathbb{Z}}\left[\nu_{(p,0)}(t)\right]^2 = \frac{1}{4\pi^2}\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty}\chi_{[(2k-1)\pi,(2k+1)\pi]}(\Omega)\,\chi_{[(2k-1)\pi,(2k+1)\pi]}(\Phi)\,e^{j(\Omega-\Phi)t}\underbrace{\left(\sum_{p\in\mathbb{Z}}e^{j(\Phi-\Omega)p}\right)}_{2\pi\delta_0(\Omega-\Phi)}\,d\Omega\,d\Phi $$
$$ = \frac{1}{2\pi}\int_{-\infty}^{+\infty}\chi^2_{[(2k-1)\pi,(2k+1)\pi]}(\Omega)\,d\Omega = \frac{1}{2\pi}\int_{(2k-1)\pi}^{(2k+1)\pi}d\Omega = 1, \quad t \in \mathbb{R}.\ \square \tag{25} $$
Theorem 2.
The JvN dictionary $\mathbf{V} = \{\nu_{(p,2k\pi)}\}_{p\in\mathbb{Z},\,k\in\mathbb{N}}$ is an orthonormal basis of the $L^2_{FT}$ space.
Proof. 
According to Theorem 1, the dictionary is an orthonormal system of $L^2_{FT}$. Hence, the tfas of $\mathbf{V}$ are linearly independent. Let $f$ be any signal of $L^2_{FT}$. Then, the following projection coefficients can be computed by using $\mathbf{V}$:
$$ F[p,k] = \langle f, \nu_{(p,k)}\rangle = \int_{-\infty}^{+\infty}f(t)\,\overline{\nu_{(p,k)}(t)}\,dt = \int_{-\infty}^{+\infty}f(t)\operatorname{sinc}(\pi(t-p))\,e^{-2k\pi tj}\,dt, \quad p \in \mathbb{Z},\ k \in \mathbb{N}. \tag{26} $$
With the decomposition coefficients (26), one can generate the following signal:
$$ \tilde f(t) = \sum_{p\in\mathbb{Z}}\sum_{k}F[p,k]\,\nu_{(p,k)}(t) = \sum_{p\in\mathbb{Z}}\sum_{k}F[p,k]\operatorname{sinc}(\pi(t-p))\,e^{2k\pi tj}, \quad t \in \mathbb{R}. \tag{27} $$
Two cases have to be analyzed in order to compute the values of the signal $\tilde f$:
a. $t = n \in \mathbb{Z}$
In this case, since property (19) is verified, from (27) one obtains:
$$ \tilde f(n) = \sum_{k}F[n,k] = \sum_{k}\int_{-\infty}^{+\infty}f(t)\operatorname{sinc}(\pi(t-n))\,e^{-2k\pi tj}\,dt = \sum_{k}\int_{-\infty}^{+\infty}f(t)\operatorname{sinc}(\pi(t-n))\,e^{-2k\pi(t-n)j}\,dt. \tag{28} $$
The infinite sum and the integral are absolutely convergent. This allows the computation of the sum first in (28), with the help of the Poisson-like formula (24) (where $\alpha = 2\pi$ and $\beta = n - t$). Hence:
$$ \tilde f(n) = \sum_{k}F[n,k] = \int_{-\infty}^{+\infty}f(t)\operatorname{sinc}(\pi(t-n))\underbrace{\left(\sum_{k}e^{2k\pi(n-t)j}\right)}_{\delta_0(n-t)}\,dt = f(n). \tag{29} $$
b. $t \in \mathbb{R}\setminus\mathbb{Z}$
By inserting (26) in (27), one obtains:
$$ \tilde f(t) = \sum_{p\in\mathbb{Z}}\sum_{k}F[p,k]\operatorname{sinc}(\pi(t-p))\,e^{2k\pi tj} = \sum_{p\in\mathbb{Z}}\sum_{k}\left(\int_{-\infty}^{+\infty}f(\tau)\operatorname{sinc}(\pi(\tau-p))\,e^{-2k\pi\tau j}\,d\tau\right)\operatorname{sinc}(\pi(t-p))\,e^{2k\pi tj}. \tag{30} $$
Since both the infinite sums and the integral are absolutely convergent, the computations can be organized as follows:
$$ \tilde f(t) = \sum_{p\in\mathbb{Z}}\operatorname{sinc}(\pi(t-p))\int_{-\infty}^{+\infty}f(\tau)\operatorname{sinc}(\pi(\tau-p))\underbrace{\left(\sum_{k}e^{2k\pi(t-\tau)j}\right)}_{\delta_0(t-\tau)}\,d\tau = f(t)\sum_{p\in\mathbb{Z}}\operatorname{sinc}^2(\pi(t-p)) = f(t)\sum_{p\in\mathbb{Z}}\left[\nu_{(p,0)}(t)\right]^2. \tag{31} $$
In (31), the Poisson-like formula (24) was exploited (with $\alpha = 2\pi$ and $\beta = t - \tau$).
Now, Lemma 1 yields the exact recovery of the signal value $f(t)$.
In both cases, $\tilde f \equiv f$, which proves that $\mathbf{V}$ is a generator system as well. □
The two theorems above can be employed to define the invertible and orthogonal JvN Transform (JvNT) in continuous time.
Definition 3.
The continuous time JvNT is defined as follows, for any signal $f \in L^2_{FT}$:
$$ \mathcal{N}(f)[p,k] = F[p,k] = \langle f, \nu_{(p,k)}\rangle = \int_{-\infty}^{+\infty}f(t)\operatorname{sinc}(\pi(t-p))\,e^{-2k\pi tj}\,dt, \quad p \in \mathbb{Z},\ k \in \mathbb{N}, \tag{32} $$
where the notation $\mathcal{N}$ was selected in memory of JvN.
Note that the JvNT (32) is a complex-valued linear function of two integer arguments: the time-shifting index ($p$) and the harmonic modulation index ($k$). Moreover, although not clearly specified, one assumes that $L^2_{FT}$ includes real-valued signals. In this case, if Definition 3 is extended to negative harmonic modulation indices, then:
$$ \mathcal{N}(f)[p,-k] = F[p,-k] = \int_{-\infty}^{+\infty}f(t)\operatorname{sinc}(\pi(t-p))\,e^{+2k\pi tj}\,dt = \overline{F[p,k]} = \overline{\mathcal{N}(f)[p,k]}, \quad p \in \mathbb{Z},\ k \in \mathbb{N}, \tag{33} $$
which means the JvNT is conjugate symmetric (similarly to the FT). This is the reason the harmonic modulation index only takes non-negative values. However, if $L^2_{FT}$ is extended to complex-valued signals, the symmetry is lost and the harmonic modulation index should cover all integers, regardless of their signs.
In SP terminology, applying $\mathcal{N}$ to a signal $f \in L^2_{FT}$ stands for performing signal analysis (with the transform $\mathcal{N}$). Additionally, $\{F[p,k]\}_{p\in\mathbb{Z},\,k\in\mathbb{N}}$ is the set of JvN analysis coefficients.
The inverse JvNT in continuous time can straightforwardly be expressed, thanks to Theorem 2:
$$ f(t) = \mathcal{N}^{-1}(F)(t) = \sum_{p\in\mathbb{Z}}\sum_{k}F[p,k]\operatorname{sinc}(\pi(t-p))\,e^{2k\pi tj}, \quad t \in \mathbb{R}. \tag{34} $$
This time, SP practitioners say that the signal synthesis is performed (with the inverse of transform N or by using the JvN analysis coefficients).
A final remark before approaching the discrete-time case: from a mathematical point of view, the recovery Equation (34) only relies on weak (punctual) convergence of functional series. (A result similar to the well-known Dirichlet–Fourier Theorem [7] can be proven to this aim). This means that each value $f(t)$ can be recovered with its own (punctual) accuracy, which can vary from one point to another (unlike the case of strong (uniform) convergence, in which all values are computed with the same accuracy).

2.2. JvNTs for Discrete-Time 1D Signals

2.2.1. Discrete Time Signals Framework

This subsection is a natural extension of the previous one. The main operation to apply to the tfas of the orthogonal basis $\mathbf{V}$ is sampling at some rate. However, sampling is not applied at random, as the goal to be achieved is threefold: preserve the frequency information of any tfa; conserve the orthogonality; and do not affect the invertibility of the resulting transform.
To reach the first goal, the Kotelnikov–Shannon–Nyquist sampling theorems can be applied [2]. Thus, the minimum (critical) sampling rate needed to avoid aliasing is twice the cut-off frequency of the JvN mw (1). This cut-off frequency equals 0.5 Hz, according to Equation (4) (as the cut-off pulsation equals $\pi$ rad/s). It follows that the sampling rate must be at least equal to 1 Hz, which requires that the sampling period, denoted by $T_s$, only vary in the interval $(0,1]$ (measured in seconds [s]).
Consider a sampling period $T_s \in (0,1]$ [s]. Then, the sampled version of tfa (17) is:
$$ \nu_{T_s}[p,k][n] = \nu_{(p,2k\pi)}(nT_s) = e^{2\pi knT_sj}\operatorname{sinc}[\pi(nT_s-p)], \quad n \in \mathbb{Z},\ p \in \mathbb{Z},\ k \in \mathbb{N}. \tag{35} $$
The dictionary $\mathbf{V}$ of (16) becomes:
$$ \mathbf{V}_{T_s} = \{\nu_{T_s}[p,k]\}_{p\in\mathbb{Z},\,k\in\mathbb{N}}. \tag{36} $$
The framework changes as well. The space hosting all signals is now $l^2$ (instead of $L^2_{FT}$). This is a Lebesgue–Hilbert space as well. The signals of $l^2$ are discrete and have finite energy (the sum of squared modules is convergent). Since $l^2 \subset l^1$, such signals are also stable (absolutely summable) and, thus, the FT can be computed for any of them. (Recall that, in the continuous time case, $L^1$ only intersects $L^2$, and neither space is included in the other. Moreover, their intersection is not closed. These are the reasons the notation $L^2_{FT}$ was used, instead of simply $L^2$). In the context of $l^2$, two classes of signals are of interest: those with infinite support length and those with finite support length. The second class is extremely important for the SP techniques that can be applied in real-life applications. Denote this class by $l^2_N$, where $N \in \mathbb{N}^*$ is the support length. In fact, $l^2_N$ is a subspace of $l^2$. Note that all tfas of the family $\mathbf{V}_{T_s}$ (36) have infinite length.
Property (5) of the scalar product is verified in $l^2$ as well, but only if the FT operates with continuous pulsations. In the case of signals with finite length (i.e., from $l^2_N$), the FT is replaced by the DFT [2] (for which the pulsation axis is discrete) and property (5) is replaced by:
$$ \langle x, y\rangle = \frac{1}{N}\,\langle X, Y\rangle, \tag{37} $$
where $X \in l^2_N$ and $Y \in l^2_N$ are the DFTs of $x \in l^2_N$ and $y \in l^2_N$, respectively. Similarly to $L^2_{FT}$, the space $l^2$ is separable and the same property transfers to $l^2_N$. Moreover, in the case of $l^2_N$, bases with a finite number of signals can be defined, which constitutes an essential property yielding the implementation of efficient SP numerical algorithms.
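Property (37) can be checked numerically with any FFT routine. The snippet below is a minimal sketch, assuming NumPy; the signal length and the seed are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(0)
N = 32
x, y = rng.standard_normal(N), rng.standard_normal(N)
X, Y = np.fft.fft(x), np.fft.fft(y)
# <x, y> = (1/N) <X, Y>, with the scalar product conjugating the first argument
print(np.allclose(np.vdot(x, y), np.vdot(X, Y) / N))   # True
```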

2.2.2. JvNT for Discrete Time Signals with Infinite Support Length

Return to the remaining goals to be achieved by sampling. Before approaching them, it is useful to analyze how the correspondence (4) is affected by sampling the JvN mw. Recall that sampling changes the definition of the FT as well. In this case, the integral is replaced by an infinite sum and, more importantly, the pulsation axis becomes relative. This means working with the relative/normalized pulsation $\omega$ [rad], obtained from the absolute pulsation $\Omega$ [rad/s] after normalization by the sampling rate: $\omega = \Omega T_s$. This correlation suggests introducing a new notation: $\omega_c = \pi T_s$. In fact, $\omega_c$ is the relative cut-off pulsation of the discretized JvN mw, just as $\Omega_c = \pi$ [rad/s] is the absolute cut-off pulsation of the continuous time JvN mw (1). Moreover, if the Shannon–Nyquist sampling rule is applied, then $\omega_c = \pi T_s \le \pi$. (Recall that the FT is $2\pi$-periodic in the case of signals from $l^2$). One expects that sampling does not affect the essential correspondence between the sinc function and the rectangular (index) function. Thus, the low-pass filter (2) corresponds to the digital filter:
$$ H(e^{j\omega}) = \begin{cases} A, & \omega \in [-\omega_c,+\omega_c] \\ 0, & \omega \in [-\pi,+\pi]\setminus[-\omega_c,+\omega_c] \end{cases} = A\,\chi_{[-\omega_c,+\omega_c]}(\omega), \quad \omega \in [-\pi,+\pi]. \tag{38} $$
The impulse response of filter (38) can be obtained by means of the inverse FT:
$$ h[n] = \frac{1}{2\pi}\int_{-\pi}^{+\pi}H(e^{j\omega})\,e^{j\omega n}\,d\omega = \frac{A}{2\pi}\int_{-\omega_c}^{+\omega_c}e^{j\omega n}\,d\omega = \frac{A\,\omega_c}{\pi}\operatorname{sinc}(\omega_c n) = \frac{A\,\omega_c}{\pi}\,\nu\!\left(\frac{\omega_c}{\pi}\,n\right), \quad n \in \mathbb{Z}. \tag{39} $$
Equations (38) and (39) allow for the specifying of the new correspondence between the discretized JvN function (1) and the index function. In this case, $A = 1$ and $\omega_c = \pi T_s$. Thus:
$$ \begin{cases} \displaystyle\sum_{n\in\mathbb{Z}}\nu(nT_s)\,e^{-j\omega n} = \sum_{n\in\mathbb{Z}}\operatorname{sinc}(n\pi T_s)\,e^{-j\omega n} = \sum_{n\in\mathbb{Z}}\operatorname{sinc}(\omega_c n)\,e^{-j\omega n} = \frac{1}{T_s}\,\chi_{[-\omega_c,+\omega_c]}(\omega), & \omega \in [-\pi,+\pi]; \\[2mm] \displaystyle\int_{-\pi}^{+\pi}\chi_{[-\omega_c,+\omega_c]}(\omega)\,e^{j\omega n}\,d\omega = 2\pi T_s\operatorname{sinc}(n\pi T_s) = 2\pi T_s\,\nu(nT_s), & n \in \mathbb{Z}. \end{cases} \tag{40} $$
The following result shows the requirement to be met such that the dictionary $\mathbf{V}_{T_s}$ becomes orthogonal.
Theorem 3.
The dictionary $\mathbf{V}_{T_s} = \{\nu_{T_s}[p,k]\}_{p\in\mathbb{Z},\,k\in\mathbb{N}}$ is orthogonal if $T_s \in 1/\mathbb{N}^*$ (i.e., the sampling period is set to the inverse of a positive integer or, equivalently, the sampling rate is a positive integer).
Proof. 
As in the case of JvN tfas in continuous time, the orthogonality can more easily be tested by using index functions. Evaluate, then, the FT of any tfa $\nu_{T_s}[p,k]$ ($p \in \mathbb{Z}$, $k \in \mathbb{N}$) from the family $\mathbf{V}_{T_s}$, by using expression (35):
$$ \mathcal{F}(\nu_{T_s}[p,k])(\omega) = \sum_{n\in\mathbb{Z}}\nu_{T_s}[p,k][n]\,e^{-j\omega n} = \sum_{n\in\mathbb{Z}}\operatorname{sinc}[\pi(nT_s-p)]\,e^{-jn(\omega-2k\pi T_s)} = \sum_{n\in\mathbb{Z}}\operatorname{sinc}\!\left[\omega_c\!\left(n-\frac{p}{T_s}\right)\right]e^{-jn(\omega-2k\omega_c)}, \quad \omega \in [-\pi,+\pi]. \tag{41} $$
In Equation (41), the only impediment to exploiting the correspondence (40) is the fact that the argument of the sinc function takes non-integer values. It suffices, then, to require that the ratio $p/T_s$ belong to $\mathbb{Z}$. Since $p \in \mathbb{Z}$, the only possibility is that $T_s \in 1/\mathbb{N}^*$. Consequently, instead of choosing $T_s \in (0,1]$, one can set $K \in \mathbb{N}^*$ such that $T_s = 1/K$, which involves $\omega_c = \pi/K$. With these specifications, the correspondence (40) can be employed in order to continue the derivations from Equation (41):
$$ \mathcal{F}(\nu_{T_s}[p,k])(\omega) = \sum_{n\in\mathbb{Z}}\operatorname{sinc}[\omega_c(n-pK)]\,e^{-jn\left(\omega-\frac{2k\pi}{K}\right)} = \sum_{n\in\mathbb{Z}}\operatorname{sinc}(\omega_c n)\,e^{-j(n+pK)\left(\omega-\frac{2k\pi}{K}\right)} = e^{-jpK\omega}\sum_{n\in\mathbb{Z}}\operatorname{sinc}(\omega_c n)\,e^{-jn\left(\omega-\frac{2k\pi}{K}\right)} $$
$$ = K\,e^{-jpK\omega}\,\chi_{[-\omega_c,+\omega_c]}\!\left(\omega-\frac{2k\pi}{K}\right) = K\,e^{-jpK\omega}\,\chi_{\left[\frac{(2k-1)\pi}{K},\frac{(2k+1)\pi}{K}\right]}(\omega), \quad \omega \in [-\pi,+\pi]. \tag{42} $$
Note that the pulsation band of tfa $\nu_{T_s}[p,k]$ is $\left[\frac{(2k-1)\pi}{K},\frac{(2k+1)\pi}{K}\right]$. Clearly, if $k_1 \ne k_2$, the tfas $\nu_{T_s}[p,k_1]$ and $\nu_{T_s}[p,k_2]$ exhibit almost-disjoint pulsation bands. This property is verified regardless of the time-shifting indices (equal or not), as the bandwidth does not depend on them. According to property (5), it follows that they are orthogonal. The orthogonality property then needs to be verified only in the case $k_1 = k_2 = k$. Arbitrarily choose $p_1, p_2 \in \mathbb{Z}$ and $k \in \mathbb{N}$. With the help of property (5), one can write:
$$ \langle\nu_{T_s}[p_1,k],\nu_{T_s}[p_2,k]\rangle = \frac{1}{2\pi}\langle\mathcal{F}(\nu_{T_s}[p_1,k]),\mathcal{F}(\nu_{T_s}[p_2,k])\rangle = \frac{K^2}{2\pi}\int_{-\pi}^{+\pi}\chi^2_{\left[\frac{(2k-1)\pi}{K},\frac{(2k+1)\pi}{K}\right]}(\omega)\,e^{jK(p_2-p_1)\omega}\,d\omega = \frac{K^2}{2\pi}\int_{\frac{(2k-1)\pi}{K}}^{\frac{(2k+1)\pi}{K}}e^{jK(p_2-p_1)\omega}\,d\omega. \tag{43} $$
The final integral of Equation (43) can be computed in two cases.
If $p_1 = p_2 = p$, then the two tfas are identical. In this case, the scalar product returns the energy of the tfa:
$$ \langle\nu_{T_s}[p,k],\nu_{T_s}[p,k]\rangle = \left\|\nu_{T_s}[p,k]\right\|^2 = E(\nu_{T_s}[p,k]) = \frac{K^2}{2\pi}\int_{\frac{(2k-1)\pi}{K}}^{\frac{(2k+1)\pi}{K}}d\omega = K. \tag{44} $$
Thus, unlike the continuous-time tfas, the discrete-time tfas are not necessarily normalized, their energy being equal to the sampling rate.
If $p_1 \ne p_2$, then:
$$ \langle\nu_{T_s}[p_1,k],\nu_{T_s}[p_2,k]\rangle = \frac{K^2}{2\pi}\int_{\frac{(2k-1)\pi}{K}}^{\frac{(2k+1)\pi}{K}}e^{jK(p_2-p_1)\omega}\,d\omega = \frac{K}{2\pi}\left.\frac{e^{jK(p_2-p_1)\omega}}{j(p_2-p_1)}\right|_{\omega=\frac{(2k-1)\pi}{K}}^{\omega=\frac{(2k+1)\pi}{K}} = \frac{K}{2\pi}\cdot\frac{e^{j(2k+1)(p_2-p_1)\pi}-e^{j(2k-1)(p_2-p_1)\pi}}{j(p_2-p_1)} $$
$$ = \frac{K}{2\pi}\cdot\frac{e^{j(p_2-p_1)\pi}-e^{-j(p_2-p_1)\pi}}{j(p_2-p_1)} = K\,\nu(p_2-p_1) = 0.\ \square \tag{45} $$
Thanks to Theorem 3, the notation $\nu_{T_s}[p,k]$ can be replaced by $\nu_K[p,k]$, where $K \in \mathbb{N}^*$ is a parameter to be set according to further requirements (stated later in this article). Additionally, the notation of the dictionary $\mathbf{V}_{T_s}$ becomes $\mathbf{V}_K$. Note that, according to property (44), the dictionary $\mathbf{V}_K$ is orthogonal, but not orthonormal.
The last goal to be achieved is invertibility. Thanks to the specific selection of the sampling period, expression (35) of any tfa becomes:
$$ \nu_K[p,k][n] = e^{\frac{2\pi kn}{K}j}\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right], \quad n \in \mathbb{Z},\ p \in \mathbb{Z},\ k \in \mathbb{N}. \tag{46} $$
Focus on the harmonic factor in (46). Obviously, if the harmonic index $k$ is replaced by $k+lK$ (with $l \in \mathbb{Z}$), then:
$$ \nu_K[p,k+lK][n] = e^{\frac{2\pi(k+lK)n}{K}j}\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] = e^{\frac{2\pi kn}{K}j}\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] = \nu_K[p,k][n], \quad n \in \mathbb{Z},\ p \in \mathbb{Z},\ k \in \mathbb{N}. \tag{47} $$
Interestingly, thanks to property (47), the dictionary of tfas can be enumerated by means of a finite number of harmonic indices: $\mathbf{V}_K = \{\nu_K[p,k]\}_{p\in\mathbb{Z},\,k\in\overline{0,K-1}}$.
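The discrete atoms (46), together with their orthogonality (up to the energy $K$, per Equation (44)), can be probed numerically. Below is a minimal sketch, assuming NumPy; the function name, the value $K = 7$ and the truncation of the infinite time axis are illustrative choices, so the printed values are only approximately $0$ and $K$.

```python
import numpy as np

def discrete_tfa(n, K, p, k):
    """nu_K[p,k][n] = exp(2*pi*k*n/K * j) * sinc(pi*(n/K - p)), per Equation (46)."""
    return np.exp(2j * np.pi * k * n / K) * np.sinc(n / K - p)

K = 7
n = np.arange(-300, 301)              # truncated support (the true atoms have infinite length)
a1 = discrete_tfa(n, K, p=0, k=1)
a2 = discrete_tfa(n, K, p=2, k=1)
print(np.vdot(a2, a1))                # scalar product of distinct atoms ~ 0 (orthogonality)
print(np.vdot(a1, a1).real)           # energy ~ K, per Equation (44)
```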
In order to ease the proof of invertibility, the following result is helpful.
Lemma 2.
The real-valued tfas of the orthogonal dictionary $\mathbf{V}_K$ verify the following property:
$$ \sum_{p\in\mathbb{Z}}\nu_K[p+m,0][n]\,\nu_K[p,0][n] = \delta_0[m], \quad \forall\,m \in \mathbb{Z},\ n \in \mathbb{Z}. \tag{48} $$
Proof. 
One starts by proving that the series in (48) is absolutely convergent. Two cases are analyzed next.
If $n = lK \in K\mathbb{Z}$, then, obviously:
$$ \sum_{p\in\mathbb{Z}}\nu_K[p+m,0][lK]\,\nu_K[p,0][lK] = \sum_{p\in\mathbb{Z}}\operatorname{sinc}(\pi(l-p-m))\operatorname{sinc}(\pi(l-p)) = \operatorname{sinc}(m\pi) = \delta_0[m], \quad \forall\,m \in \mathbb{Z}, \tag{49} $$
which proves that the identity (48) holds true.
If $n \in \mathbb{Z}\setminus K\mathbb{Z}$, then:
$$ \left|\sum_{p\in\mathbb{Z}}\nu_K[p+m,0][n]\,\nu_K[p,0][n]\right| \le \sum_{p\in\mathbb{Z}}\left|\nu_K[p+m,0][n]\,\nu_K[p,0][n]\right| \le \frac{K^2}{\pi^2}\sum_{p\in\mathbb{Z}}\frac{1}{\left|(p+m)K-n\right|\left|pK-n\right|}, \quad \forall\,m \in \mathbb{Z}. \tag{50} $$
The upper bound in (50) is convergent.
Now, the identity (48) has to be proven for $n \in \mathbb{Z}\setminus K\mathbb{Z}$. The tfa $\nu_K[p,0]$ ($p \in \mathbb{Z}$) can be recovered by means of the inverse FT (see Equation (42)):
$$ \nu_K[p,0][n] = \frac{K}{2\pi}\int_{-\pi}^{+\pi}\chi_{\left[-\frac{\pi}{K},+\frac{\pi}{K}\right]}(\omega)\,e^{-jpK\omega}\,e^{jn\omega}\,d\omega = \frac{K}{2\pi}\int_{-\pi}^{+\pi}\chi_{\left[-\frac{\pi}{K},+\frac{\pi}{K}\right]}(\omega)\,e^{j(n-pK)\omega}\,d\omega, \quad n \in \mathbb{Z}. \tag{51} $$
Since $\nu_K[p,0]$ is real-valued for any $p \in \mathbb{Z}$, one can use (51) to write:
$$ \sum_{p\in\mathbb{Z}}\nu_K[p+m,0][n]\,\nu_K[p,0][n] = \sum_{p\in\mathbb{Z}}\nu_K[p+m,0][n]\,\overline{\nu_K[p,0][n]} = \left(\frac{K}{2\pi}\right)^2\sum_{p\in\mathbb{Z}}\int_{-\pi}^{+\pi}\int_{-\pi}^{+\pi}\chi_{\left[-\frac{\pi}{K},+\frac{\pi}{K}\right]}(\omega)\,\chi_{\left[-\frac{\pi}{K},+\frac{\pi}{K}\right]}(\phi)\,e^{j(n-pK-mK)\omega}\,e^{-j(n-pK)\phi}\,d\omega\,d\phi $$
$$ = \left(\frac{K}{2\pi}\right)^2\sum_{p\in\mathbb{Z}}\int_{-\pi}^{+\pi}\int_{-\pi}^{+\pi}\chi_{\left[-\frac{\pi}{K},+\frac{\pi}{K}\right]}(\omega)\,\chi_{\left[-\frac{\pi}{K},+\frac{\pi}{K}\right]}(\phi)\,e^{jn(\omega-\phi)}\,e^{-jmK\omega}\,e^{jpK(\phi-\omega)}\,d\omega\,d\phi, \quad n \in \mathbb{Z}. \tag{52} $$
The infinite sum and the two integrals in Equation (52) are absolutely convergent. Thus, the sum can be computed first:
$$ \sum_{p\in\mathbb{Z}}\nu_K[p+m,0][n]\,\nu_K[p,0][n] = \left(\frac{K}{2\pi}\right)^2\int_{-\pi}^{+\pi}\int_{-\pi}^{+\pi}\chi_{\left[-\frac{\pi}{K},+\frac{\pi}{K}\right]}(\omega)\,\chi_{\left[-\frac{\pi}{K},+\frac{\pi}{K}\right]}(\phi)\,e^{jn(\omega-\phi)}\,e^{-jmK\omega}\underbrace{\left(\sum_{p\in\mathbb{Z}}e^{jpK(\phi-\omega)}\right)}_{\frac{2\pi}{K}\delta_0(\phi-\omega)}\,d\omega\,d\phi $$
$$ = \frac{K}{2\pi}\int_{-\pi}^{+\pi}\chi^2_{\left[-\frac{\pi}{K},+\frac{\pi}{K}\right]}(\omega)\,e^{-jmK\omega}\,d\omega = \frac{K}{2\pi}\int_{-\frac{\pi}{K}}^{+\frac{\pi}{K}}e^{-jmK\omega}\,d\omega, \quad m \in \mathbb{Z}. \tag{53} $$
In (53), the Poisson-like identity (24) was employed (with $\alpha = K$ and $\beta = \phi - \omega$).
If $m = 0$, then the final integral of (53) equals $2\pi/K$, which means the series equals one.
If $m \ne 0$, then:
$$ \sum_{p\in\mathbb{Z}}\nu_K[p+m,0][n]\,\nu_K[p,0][n] = \frac{K}{2\pi}\int_{-\frac{\pi}{K}}^{+\frac{\pi}{K}}e^{-jmK\omega}\,d\omega = \frac{1}{2\pi}\left.\frac{e^{-jmK\omega}}{-jm}\right|_{\omega=-\frac{\pi}{K}}^{\omega=+\frac{\pi}{K}} = \frac{1}{2\pi}\cdot\frac{e^{jm\pi}-e^{-jm\pi}}{jm} = \frac{\sin(m\pi)}{m\pi} = 0.\ \square \tag{54} $$
Note that, if $m = 0$ in Equation (48), then:
$$ \sum_{p\in\mathbb{Z}}\left(\nu_K[p,0][n]\right)^2 = 1, \quad \forall\,n \in \mathbb{Z}, \tag{55} $$
which is similar to Equation (18) of Lemma 1. Thus, sampling did not affect this property. Now, the invertibility property can be proven.
Theorem 4.
The JvN dictionary $\mathbf{V}_K = \{\nu_K[p,k]\}_{p\in\mathbb{Z},\,k\in\overline{0,K-1}}$ is an orthogonal basis of the $l^2$ space.
Proof. 
Theorem 3 shows that the dictionary is an orthogonal system of $l^2$ and, thus, the various tfas are linearly independent. Arbitrarily choose a discrete signal $x \in l^2$. Then, the signal can be decomposed by using the dictionary $\mathbf{V}_K$. The projection coefficients are:
$$ X_K[p,k] = \langle x, \nu_K[p,k]\rangle = \sum_{n\in\mathbb{Z}}x[n]\,\overline{\nu_K[p,k][n]} = \sum_{n\in\mathbb{Z}}x[n]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{-\frac{2kn\pi}{K}j}, \quad p \in \mathbb{Z},\ k \in \overline{0,K-1}. \tag{56} $$
They can be employed to generate the following discrete signal:
$$ \tilde x[n] = \frac{1}{K}\sum_{p\in\mathbb{Z}}\sum_{k=0}^{K-1}X_K[p,k]\,\nu_K[p,k][n] = \frac{1}{K}\sum_{p\in\mathbb{Z}}\sum_{k=0}^{K-1}X_K[p,k]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{\frac{2kn\pi}{K}j} $$
$$ = \frac{1}{K}\sum_{p\in\mathbb{Z}}\sum_{k=0}^{K-1}\sum_{m\in\mathbb{Z}}x[m]\operatorname{sinc}\!\left[\pi\!\left(\frac{m}{K}-p\right)\right]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{-\frac{2km\pi}{K}j}\,e^{\frac{2kn\pi}{K}j}, \quad n \in \mathbb{Z}. \tag{57} $$
The inner sum of (57) can be split into two terms: one for $m \in K\mathbb{Z}$ and another one for $m \in \mathbb{Z}\setminus K\mathbb{Z}$. This is a first computational trick. The second one is to permute the sums, such that the finite one can be computed first. This is possible, as all sums are absolutely convergent. Hence:
$$ \tilde x[n] = \frac{1}{K}\sum_{p\in\mathbb{Z}}\sum_{l\in\mathbb{Z}}x[lK]\underbrace{\operatorname{sinc}[\pi(l-p)]}_{\delta_0[l-p]}\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]\sum_{k=0}^{K-1}e^{\frac{2kn\pi}{K}j} + \frac{1}{K}\sum_{p\in\mathbb{Z}}\sum_{m\in\mathbb{Z}\setminus K\mathbb{Z}}x[m]\operatorname{sinc}\!\left[\pi\!\left(\frac{m}{K}-p\right)\right]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]\sum_{k=0}^{K-1}e^{\frac{2k(n-m)\pi}{K}j} $$
$$ = \frac{1}{K}\sum_{p\in\mathbb{Z}}x[pK]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]\sum_{k=0}^{K-1}e^{\frac{2kn\pi}{K}j} + \frac{1}{K}\sum_{p\in\mathbb{Z}}\sum_{m\in\mathbb{Z}\setminus K\mathbb{Z}}x[m]\operatorname{sinc}\!\left[\pi\!\left(\frac{m}{K}-p\right)\right]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]\sum_{k=0}^{K-1}e^{\frac{2k(n-m)\pi}{K}j}, \quad n \in \mathbb{Z}. \tag{58} $$
The finite sums in (58) can be computed by means of the Poisson formula:
$$ \sum_{k=0}^{K-1}e^{\frac{2kn\pi}{K}j} = K\,\delta_K[n], \quad n \in \mathbb{Z}, \tag{59} $$
where $\delta_K$ is the periodical unit impulse (with period equal to $K$). (No distributions are involved in (59) since, unlike in (24), here one deals with finite length geometric series). Assume first that $n = lK \in K\mathbb{Z}$. Then, the second term in (58) is null, according to property (59), because $\sum_{k=0}^{K-1}e^{\frac{2k(n-m)\pi}{K}j} = \sum_{k=0}^{K-1}e^{\frac{2k(lK-m)\pi}{K}j} = \sum_{k=0}^{K-1}e^{-\frac{2km\pi}{K}j}$ and $m \in \mathbb{Z}\setminus K\mathbb{Z}$. In turn, the first term becomes:
$$ \tilde x[lK] = \frac{1}{K}\sum_{p\in\mathbb{Z}}x[pK]\operatorname{sinc}[\pi(l-p)]\underbrace{\sum_{k=0}^{K-1}e^{\frac{2klK\pi}{K}j}}_{K} = \sum_{p\in\mathbb{Z}}x[pK]\underbrace{\operatorname{sinc}[\pi(l-p)]}_{\delta_0[l-p]} = x[lK], \quad \forall\,l \in \mathbb{Z}. \tag{60} $$
Second, arbitrarily set $n \in \mathbb{Z}\setminus K\mathbb{Z}$. This time, the first term in (58) is null, thanks to property (59). For the second term, $\sum_{k=0}^{K-1}e^{\frac{2k(n-m)\pi}{K}j}$ is null every time $n-m \in \mathbb{Z}\setminus K\mathbb{Z}$. The only possibility is to have $n-m \in K\mathbb{Z}$. In this case, $\sum_{k=0}^{K-1}e^{\frac{2k(n-m)\pi}{K}j} = K$ and:
$$ \tilde x[n] = \sum_{p\in\mathbb{Z}}\sum_{m\in\mathbb{Z}\setminus K\mathbb{Z}}x[m]\operatorname{sinc}\!\left[\pi\!\left(\frac{m}{K}-p\right)\right]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]\delta_K[n-m], \quad n \in \mathbb{Z}\setminus K\mathbb{Z}. \tag{61} $$
Overall, from (60) and (61), one can write:
$$ \tilde x[n] = x[n]\,\delta_K[n] + \sum_{p\in\mathbb{Z}}\sum_{m\in\mathbb{Z}\setminus K\mathbb{Z}}x[m]\operatorname{sinc}\!\left[\pi\!\left(\frac{m}{K}-p\right)\right]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]\delta_K[n-m], \quad n \in \mathbb{Z}. \tag{62} $$
To compute the second term in (62), the following index change is made: $m = n - lK$, with $l \in \mathbb{Z}$. Note that, if $n \in K\mathbb{Z}$, then $\delta_K[n-m] = 0$ in the second term, as $m \in \mathbb{Z}\setminus K\mathbb{Z}$. Therefore, the second term has to be activated only if $n \notin K\mathbb{Z}$ and, thus, $m = n - lK \in \mathbb{Z}\setminus K\mathbb{Z}$, as required. This results in:
$$ \tilde x[n] = x[n]\,\delta_K[n] + (1-\delta_K[n])\sum_{p\in\mathbb{Z}}\sum_{l\in\mathbb{Z}}x[n-lK]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p-l\right)\right]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right], \quad n \in \mathbb{Z}. \tag{63} $$
As both infinite sums of (63) are absolutely convergent, they can be switched. This algebraic manipulation allows for the exploitation of the result of Lemma 2:
$$ \tilde x[n] = x[n]\,\delta_K[n] + (1-\delta_K[n])\sum_{l\in\mathbb{Z}}x[n-lK]\underbrace{\sum_{p\in\mathbb{Z}}\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p-l\right)\right]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]}_{\delta_0[l]} = x[n]\,\delta_K[n] + (1-\delta_K[n])\,x[n] = x[n], \quad n \in \mathbb{Z}. \tag{64} $$
Consequently, the generated signal is identical to the initial signal.
This proves that the dictionary V K is an orthogonal basis of l 2 . □
Thanks to the results above, the first invertible JvN orthogonal transform in discrete time can be defined.
Definition 4.
The discrete time JvNT is defined as follows, for any signal $x \in l^2$:
$$ \mathcal{N}_K(x)[p,k] = X_K[p,k] = \langle x, \nu_K[p,k]\rangle = \sum_{n\in\mathbb{Z}}x[n]\,\overline{\nu_K[p,k][n]} = \sum_{n\in\mathbb{Z}}x[n]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{-\frac{2kn\pi}{K}j}, \quad p \in \mathbb{Z},\ k \in \overline{0,K-1}. \tag{65} $$
Like the previous JvNT, the transform (65) is a complex-valued linear function having the same integer arguments. Unlike the JvNT (32), it works the same for real- or complex-valued discrete signals. However, if the signal to analyze is real-valued, then the conjugate symmetry is expressed inside the set $\overline{0,K-1}$:
$$ \mathcal{N}_K(x)[p,K-k] = \sum_{n\in\mathbb{Z}}x[n]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{-\frac{2(K-k)n\pi}{K}j} = \sum_{n\in\mathbb{Z}}x[n]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{\frac{2kn\pi}{K}j} = \overline{\mathcal{N}_K(x)[p,k]}, \quad p \in \mathbb{Z},\ k \in \overline{0,K-1}. \tag{66} $$
Property (66) allows for the computing of only about the first half of the JvN coefficients. Set $K_2 = \lfloor K/2 \rfloor$. Then, the JvN coefficients are computed for $k \in \overline{0,K_2}$ by using definition (65). It is easy to notice that all coefficients for $k = 0$ are real-valued and do not contribute to the evaluation of other coefficients:
$$ X_K[p,0] = \sum_{n\in\mathbb{Z}}x[n]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right], \quad p \in \mathbb{Z}. \tag{67} $$
Next, the remaining JvN coefficients are determined by complex conjugating other already computed coefficients. More specifically:
  • If $K$ is even ($K \in 2\mathbb{N}$), then the coefficients for $k = K_2 = K/2$ are real-valued as well:
$$ X_K[p,K_2] = \sum_{n\in\mathbb{Z}}x[n]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{-n\pi j} = \sum_{n\in\mathbb{Z}}(-1)^n\,x[n]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right], \quad p \in \mathbb{Z}, \tag{68} $$
    and have no other contribution in further evaluations; however:
$$ X_K[p,k] = \overline{X_K[p,K-k]}, \quad p \in \mathbb{Z},\ k \in \overline{K_2+1,K-1}; \tag{69} $$
  • If $K$ is odd ($K \in 2\mathbb{N}+1$), then $K_2 = (K-1)/2$ and no coefficients such as the ones in (68) exist; in this case:
$$ X_K[p,k] = \overline{X_K[p,K-k]}, \quad p \in \mathbb{Z},\ k \in \overline{\tfrac{K+1}{2},K-1}. \tag{70} $$
If the signal to analyze is complex-valued, then all JvN coefficients have to be computed.
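The conjugate symmetry (66) is exact and easy to verify numerically, e.g., by reusing the discrete_tfa sketch above on a truncated real-valued signal; the test values below are arbitrary choices.

```python
import numpy as np

n = np.arange(64)
x = np.cos(0.3 * n)                                    # a real-valued test signal
K, p, k = 6, 3, 2
Xk  = np.sum(x * np.conj(discrete_tfa(n, K, p, k)))    # X_K[p, k] restricted to n in 0..63
XKk = np.sum(x * np.conj(discrete_tfa(n, K, p, K - k)))
print(np.allclose(XKk, np.conj(Xk)))                   # True: X_K[p, K-k] = conj(X_K[p, k])
```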
Thanks to Theorem 4, the inverse JvNT in discrete time is:
$$ x[n] = \mathcal{N}_K^{-1}(X)[n] = \frac{1}{K}\sum_{p\in\mathbb{Z}}\sum_{k=0}^{K-1}X_K[p,k]\,\nu_K[p,k][n] = \frac{1}{K}\sum_{p\in\mathbb{Z}}\sum_{k=0}^{K-1}X_K[p,k]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{\frac{2kn\pi}{K}j}, \quad n \in \mathbb{Z}. \tag{71} $$
In case $x$ is real-valued, the conjugate symmetry property (66) can be exploited to reduce the computational burden of the synthesis Equation (71):
a. If $K$ is even ($K \in 2\mathbb{N}$):
$$ x[n] = \frac{1}{K}\sum_{p\in\mathbb{Z}}X_K[p,0]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] + \frac{1}{K}\sum_{p\in\mathbb{Z}}(-1)^n X_K[p,K_2]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] + \frac{1}{K}\sum_{p\in\mathbb{Z}}\Bigg(\sum_{k=1}^{K_2-1}X_K[p,k]\,e^{\frac{2kn\pi}{K}j} + \sum_{k=K_2+1}^{K-1}\underbrace{X_K[p,k]}_{\overline{X_K[p,K-k]}}\,e^{\frac{2kn\pi}{K}j}\Bigg)\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] $$
$$ = \frac{1}{K}\sum_{p\in\mathbb{Z}}\left(X_K[p,0]+(-1)^n X_K[p,K_2]\right)\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] + \frac{1}{K}\sum_{p\in\mathbb{Z}}\sum_{k=1}^{K_2-1}\left(X_K[p,k]\,e^{\frac{2kn\pi}{K}j} + \overline{X_K[p,k]\,e^{\frac{2kn\pi}{K}j}}\right)\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] $$
$$ = \frac{1}{K}\sum_{p\in\mathbb{Z}}\left(X_K[p,0]+(-1)^n X_K[p,K_2]\right)\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] + \frac{2}{K}\sum_{p\in\mathbb{Z}}\sum_{k=1}^{K_2-1}\operatorname{Re}\!\left(X_K[p,k]\,e^{\frac{2kn\pi}{K}j}\right)\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right], \quad n \in \mathbb{Z}. \tag{72} $$
b. If $K$ is odd ($K \in 2\mathbb{N}+1$):
$$ x[n] = \frac{1}{K}\sum_{p\in\mathbb{Z}}X_K[p,0]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] + \frac{1}{K}\sum_{p\in\mathbb{Z}}\Bigg(\sum_{k=1}^{\frac{K-1}{2}}X_K[p,k]\,e^{\frac{2kn\pi}{K}j} + \sum_{k=\frac{K+1}{2}}^{K-1}\underbrace{X_K[p,k]}_{\overline{X_K[p,K-k]}}\,e^{\frac{2kn\pi}{K}j}\Bigg)\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] $$
$$ = \frac{1}{K}\sum_{p\in\mathbb{Z}}X_K[p,0]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right] + \frac{2}{K}\sum_{p\in\mathbb{Z}}\sum_{k=1}^{\frac{K-1}{2}}\operatorname{Re}\!\left(X_K[p,k]\,e^{\frac{2kn\pi}{K}j}\right)\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right], \quad n \in \mathbb{Z}. \tag{73} $$

2.2.3. JvNT for Discrete Time Signals with Finite Support Length

Since $l^2_N$ is a subspace of $l^2$, the dictionary $\mathbf{V}_K$ can be employed to perform the analysis and synthesis of finite length signals as well. Theoretically, the JvNT for signals of $l^2_N$ is the same as for the signals in $l^2$. Nevertheless, in order to exploit the finite length of the signal support and to make possible the design of numerical algorithms that yield effective computation of JvN coefficients with finite, but controlled, accuracy, a slightly different JvNT will be defined. For the new transform, the orthogonality is preserved, but the exact invertibility is replaced by nearly exact (approximate) invertibility.
To better understand how the new JvNT is defined, an engineering point of view is adopted next. One starts by illustrating in Figure 1 three tfas of dictionary V K (to the left), together with the frequency representations of each tfa, i.e., their spectra and phases, as derived from their FT (to the right).
At the top of the figure, the JvN mw is depicted, here for $K = 7$ Hz. However, the time variation was truncated. Since the sinc envelope is a hyperbola, function (1) can be neglected as soon as the hyperbola becomes smaller than a truncation threshold, say $\varepsilon > 0$. (In Figure 1, the threshold was set to 1%). This allows for the restriction of the JvN mw to a practical compact support $\overline{-N_\varepsilon, N_\varepsilon}$, where the bound $N_\varepsilon$ is determined as follows:
$$ \frac{K}{\pi N_\varepsilon} < \varepsilon \;\Longleftrightarrow\; N_\varepsilon > \frac{K}{\pi\varepsilon} \;\Longrightarrow\; N_\varepsilon = \left\lceil\frac{K}{\pi\varepsilon}\right\rceil. \tag{74} $$
In Figure 1, $N_\varepsilon = \lceil 700/\pi \rceil = 223$, which corresponds to approximately 31.86 s on the real time axis. The truncation introduces distortions in the frequency characteristic. Both the spectrum and the phase are affected, as the figure reveals. Nevertheless, the characteristic is nearly ideal (with rectangular spectrum and linear phase on the pass band). The smaller the $\varepsilon$, the smaller the distortions. The normalized cut-off pulsation is $\omega_c = \pi/7 \simeq 0.1429\pi$ (as seen in the figure).
In the middle of Figure 1, the variations for a tfa with $p = 15$ and $k = 2$ are drawn. Since $p > 0$, the tfa is delayed with respect to the position of the mw. Only the real part of the tfa is shown to the left, as the imaginary part is similarly located. Additionally, the harmonic index has shifted the spectrum to the right side, over the band $[(2k-1)\omega_c, (2k+1)\omega_c] = [3\omega_c, 5\omega_c] \simeq [0.4286\pi, 0.7143\pi]$ (while the phase is completely linear now), since the previous characteristic only displayed the half-band of the mw spectrum.
At the bottom of Figure 1, the variations of the tfa with $p = -10$ and $k = 1$ can be seen. This time, the tfa is anticipated, as $p < 0$. The imaginary part is drawn to the left. On the right side, one can notice that the spectrum has migrated over the band $[\omega_c, 3\omega_c] \simeq [0.1429\pi, 0.4286\pi]$ (while the phase remains linear on the pass band).
As the figure clearly reveals, according to expression (46), the maximum point of the real part of the tfa $\nu_K[p,k]$ is $pK$. So, for any increment/decrement of the time-shifting index, the tfa jumps from one normalized time $pK$ to another. The distance between the maximum points of $\nu_K[p,k]$ and $\nu_K[p+1,k]$ is equal to $K$.
Also, the spectrum of the tfa jumps from one $2\omega_c$-wide normalized pulsation band to another, for any increment/decrement of the harmonic index. Geometrically, one can associate any tfa with some window in time, as well as in frequency. The windows slide along the time-frequency axes as the couple $\{p,k\}$ varies. The basic window is the JvN mw, which sometimes is referred to as the 'mother-window'.
By convention, $\mathrm{Supp}(x) = \overline{0,N-1}$ for any $x \in l^2_N$. Since $\mathrm{Supp}_\varepsilon(\nu_K[0,0]) = \overline{-N_\varepsilon, N_\varepsilon}$ (the practical support of the JvN mw), by sliding the mw along the signal, the two supports intersect each other only for a finite number of time-shifting indices. Obviously, $\mathrm{Supp}_\varepsilon(\nu_K[p,0]) = \overline{pK-N_\varepsilon, pK+N_\varepsilon}$. One can consider that, if $p \notin \overline{P_{\min},P_{\max}}$, then $\mathrm{Supp}(x) \cap \mathrm{Supp}_\varepsilon(\nu_K[p,0]) = \varnothing$. In this case, all JvN coefficients computed with the direct transform (65) are nearly null and can be neglected. To determine the two bounds, the following restrictions are enforced:
$$ \begin{cases} (P_{\min}-1)K+N_\varepsilon < 0 \le P_{\min}K+N_\varepsilon \\ P_{\max}K-N_\varepsilon \le N-1 < (P_{\max}+1)K-N_\varepsilon \end{cases} \Longleftrightarrow \begin{cases} -\dfrac{N_\varepsilon}{K} \le P_{\min} < 1-\dfrac{N_\varepsilon}{K} \\[2mm] \dfrac{N+N_\varepsilon-1}{K}-1 < P_{\max} \le \dfrac{N+N_\varepsilon-1}{K} \end{cases} \Longrightarrow \begin{cases} P_{\min} = \left\lceil-\dfrac{N_\varepsilon}{K}\right\rceil \le 0 \\[2mm] P_{\max} = \left\lfloor\dfrac{N+N_\varepsilon-1}{K}\right\rfloor > 0. \end{cases} \tag{75} $$
Now, the practical JvNT for finite length discrete signals can be defined.
Definition 5.
Assuming the parameters $K \in \mathbb{N}^*$ and $\varepsilon > 0$ are set, the bounds (75) can be computed and the practical JvNT is defined as follows, for any signal $x \in l^2_N$:
$$ \mathcal{N}_K(x)[p,k] = X_K[p,k] = \langle x, \nu_K[p,k]\rangle = \sum_{n=0}^{N-1}x[n]\,\overline{\nu_K[p,k][n]} = \sum_{n=0}^{N-1}x[n]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{-\frac{2kn\pi}{K}j}, \quad p \in \overline{P_{\min},P_{\max}},\ k \in \overline{0,K-1}. \tag{76} $$
Unlike in Definition 4, in the definition above the sum is finite and, moreover, the time-shifting index varies along a finite set. This allows for the design of a numerical algorithm to implement the JvNT, without losing the orthogonality property. The algorithm can fully exploit the symmetry Equations (68)–(70). A sketch of such an implementation is given below.
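The sketch below is one possible NumPy implementation of the analysis Equation (76). The function name jvnt_analysis, the vectorized layout, and the ceil/floor roundings used for (74) and (75) are our assumptions, not the article's reference algorithm.

```python
import numpy as np

def jvnt_analysis(x, K, eps=0.01):
    """Practical JvNT (76): returns the coefficients X_K[p, k] and the shift grid P."""
    N = len(x)
    N_eps = int(np.ceil(K / (np.pi * eps)))            # practical support bound (74)
    p_min = int(np.ceil(-N_eps / K))                   # bounds (75)
    p_max = int(np.floor((N + N_eps - 1) / K))
    n = np.arange(N)
    P = np.arange(p_min, p_max + 1)
    S = np.sinc(n[None, :] / K - P[:, None])           # sinc kernels, shape (len(P), N)
    E = np.exp(-2j * np.pi * np.outer(np.arange(K), n) / K)   # harmonics, shape (K, N)
    X = np.einsum('pn,kn,n->pk', S, E, x)              # X[p,k] = sum_n x[n]*sinc*exp(-2j*pi*k*n/K)
    return X, P
```

For real-valued signals, roughly half of the columns of X could be deduced by conjugation per Equations (68)–(70); the sketch computes all of them for simplicity.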
Although the inverse JvNT relies on Equation (71), the original signal can never be exactly recovered, because only a finite number of tfas from the basis $\mathbf{V}_K$ are employed. This drawback enforces the definition of an approximate inverse, by assuming a synthesis error, as in the following result.
Theorem 5.
In the context of the analysis Equation (76), the following signal can be synthesized:
$$ \tilde x[n] = \frac{\delta_K[n]\,\chi_{\overline{0,N-1}}[n]}{K}\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1}X_K[p,k]\,\nu_K[p,k][n] + \frac{1-\delta_K[n]\,\chi_{\overline{0,N-1}}[n]}{K}\cdot\frac{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1}X_K[p,k]\,\nu_K[p,k][n]}{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\left(\nu_K[p,0][n]\right)^2}, \quad n \in \overline{0,N-1}. \tag{77} $$
Then, the signal $\tilde x$ exactly recovers the original signal $x$ only at normalized instants expressed as integer multiples of $K$. For the remaining normalized instants, $\tilde x$ can only approximate $x$, and the synthesis error is:
$$ \Delta x[n] = x[n]-\tilde x[n] = -\sum_{\substack{m=-\left\lfloor\frac{N-n-1}{K}\right\rfloor \\ m\ne 0}}^{\left\lfloor\frac{n}{K}\right\rfloor}x[n-mK]\,\frac{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\nu_K[p+m,0][n]\,\nu_K[p,0][n]}{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\left(\nu_K[p,0][n]\right)^2}, \quad n \in \overline{0,N-1}\setminus K\mathbb{Z}. \tag{78} $$
Proof. 
The rationale employed to prove Theorem 4 can help to complete the proof of this theorem. For convenience, define the following signal:
$$ y[n] = \frac{1}{K}\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1}X_K[p,k]\,\nu_K[p,k][n] = \frac{1}{K}\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1}X_K[p,k]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{\frac{2kn\pi}{K}j}, \quad n \in \overline{0,N-1}. \tag{79} $$
Notice how signal (77) depends on signal (79):
$$ \tilde x[n] = \delta_K[n]\,\chi_{\overline{0,N-1}}[n]\,y[n] + \frac{1-\delta_K[n]\,\chi_{\overline{0,N-1}}[n]}{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\left(\nu_K[p,0][n]\right)^2}\,y[n] = \left(\delta_K[n]\,\chi_{\overline{0,N-1}}[n] + \frac{1-\delta_K[n]\,\chi_{\overline{0,N-1}}[n]}{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\left(\nu_K[p,0][n]\right)^2}\right)y[n], \quad n \in \overline{0,N-1}. \tag{80} $$
Then, according to the analysis Equation (76), the signal (79) becomes:
$$ y[n] = \frac{1}{K}\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1}\sum_{m=0}^{N-1}x[m]\operatorname{sinc}\!\left[\pi\!\left(\frac{m}{K}-p\right)\right]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]e^{-\frac{2km\pi}{K}j}\,e^{\frac{2kn\pi}{K}j}, \quad n \in \overline{0,N-1}. \tag{81} $$
The values of y can be computed in two cases.
  • Assume that $n = lK \in \overline{0,N-1} \cap K\mathbb{Z}$. In this case:
$$ y[lK] = \frac{1}{K}\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1}\sum_{m=0}^{N-1}x[m]\operatorname{sinc}\!\left[\pi\!\left(\frac{m}{K}-p\right)\right]\underbrace{\operatorname{sinc}[\pi(l-p)]}_{\delta_0[l-p]}e^{-\frac{2km\pi}{K}j} = \frac{1}{K}\sum_{m=0}^{N-1}x[m]\operatorname{sinc}\!\left[\pi\!\left(\frac{m}{K}-l\right)\right]\underbrace{\sum_{k=0}^{K-1}e^{-\frac{2km\pi}{K}j}}_{K\delta_K[m]} $$
$$ = \sum_{r=0}^{\left\lfloor\frac{N-1}{K}\right\rfloor}x[rK]\underbrace{\operatorname{sinc}[\pi(r-l)]}_{\delta_0[r-l]} = x[lK], \quad l \in \overline{0,\left\lfloor\tfrac{N-1}{K}\right\rfloor}. \tag{82} $$
    (The Poisson formula (59) was used in the manipulations above). Property (82) actually proves the first assertion of the theorem, as Equation (80) involves $\tilde x[lK] = y[lK]$.
  • Assume that $n \in \overline{0,N-1}\setminus K\mathbb{Z}$. In this case, the finite sums can be switched, such that the harmonic part is computed first, with the help of the Poisson formula (59). More specifically:
$$ y[n] = \frac{1}{K}\sum_{p=P_{\min}}^{P_{\max}}\sum_{m=0}^{N-1}x[m]\operatorname{sinc}\!\left[\pi\!\left(\frac{m}{K}-p\right)\right]\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]\underbrace{\sum_{k=0}^{K-1}e^{\frac{2k(n-m)\pi}{K}j}}_{K\delta_K[n-m]}. \tag{83} $$
    The Poisson formula enforces the index $m$ to take values of the form $m = n - lK$ which, moreover, must belong to the set $\overline{0,N-1}$. Consequently, the new index, $l$, varies in the set $\overline{-\left\lfloor\frac{N-n-1}{K}\right\rfloor,\left\lfloor\frac{n}{K}\right\rfloor}$. Thus, Equation (83) becomes:
$$ y[n] = \sum_{l=-\left\lfloor\frac{N-n-1}{K}\right\rfloor}^{\left\lfloor\frac{n}{K}\right\rfloor}x[n-lK]\sum_{p=P_{\min}}^{P_{\max}}\underbrace{\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p-l\right)\right]}_{\nu_K[p+l,0][n]}\underbrace{\operatorname{sinc}\!\left[\pi\!\left(\frac{n}{K}-p\right)\right]}_{\nu_K[p,0][n]}, \tag{84} $$
    after switching the two sums. Obviously, $l = 0$ is always included in the variation range of the outer sum and can be extracted as a separate term. For the remaining terms, $l$ can be re-noted by $m$. Hence:
$$ y[n] = x[n]\sum_{p=P_{\min}}^{P_{\max}}\left(\nu_K[p,0][n]\right)^2 + \sum_{\substack{m=-\left\lfloor\frac{N-n-1}{K}\right\rfloor \\ m\ne 0}}^{\left\lfloor\frac{n}{K}\right\rfloor}x[n-mK]\sum_{p=P_{\min}}^{P_{\max}}\nu_K[p+m,0][n]\,\nu_K[p,0][n]. \tag{85} $$
It is easy to notice that Equation (80) involves:
$$ \tilde x[n] = \frac{y[n]}{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\left(\nu_K[p,0][n]\right)^2}, \quad n \in \overline{0,N-1}\setminus K\mathbb{Z}. \tag{86} $$
Combining (85) and (86), one obtains:
$$ \tilde x[n] = x[n] + \sum_{\substack{m=-\left\lfloor\frac{N-n-1}{K}\right\rfloor \\ m\ne 0}}^{\left\lfloor\frac{n}{K}\right\rfloor}x[n-mK]\,\frac{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\nu_K[p+m,0][n]\,\nu_K[p,0][n]}{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\left(\nu_K[p,0][n]\right)^2}, \quad n \in \overline{0,N-1}\setminus K\mathbb{Z}, \tag{87} $$
which proves the last assertion of the theorem. □
Theorem 5 gives insight into how the synthesis can be performed in the case of finite length discrete signals. Thus, the approximate inverse of the JvNT is:
$$ \tilde x[n] = \frac{1}{K}\begin{cases} \displaystyle\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1}X_K[p,k]\,\nu_K[p,k][mK], & n = mK \in \overline{0,N-1}; \\[4mm] \dfrac{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1}X_K[p,k]\,\nu_K[p,k][n]}{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\left(\nu_K[p,0][n]\right)^2}, & n \in \overline{0,N-1}\setminus K\mathbb{Z}. \end{cases} \tag{88} $$
The first branch of Equation (88) can furthermore be simplified, as:
$$ \nu_K[p,k][mK] = e^{\frac{2\pi kmK}{K}j}\operatorname{sinc}[\pi(m-p)] = \delta_0[m-p], \quad m \in \mathbb{Z},\ p \in \mathbb{Z},\ k \in \mathbb{N}, \tag{89} $$
according to Equation (46). Hence:
$$ \tilde x[mK] = \frac{1}{K}\sum_{k=0}^{K-1}X_K[m,k] = x[mK], \quad m \in \overline{0,\left\lfloor\tfrac{N-1}{K}\right\rfloor}. \tag{90} $$
Equations (88) and (90) can be employed to implement the practical inverse of the JvNT (76). For an efficient implementation, the symmetry properties (72) and (73) can be considered. A sketch of the approximate inverse is given below.
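This is a minimal companion to jvnt_analysis above, implementing the two branches of Equation (88) in vectorized form; the layout and names are our assumptions. For a real-valued x, calling X, P = jvnt_analysis(x, K) followed by jvnt_synthesis(X, P, len(x), K) recovers x up to the error (78).

```python
import numpy as np

def jvnt_synthesis(X, P, N, K):
    """Approximate inverse JvNT (88); exact on multiples of K, per Equation (90)."""
    n = np.arange(N)
    S = np.sinc(n[None, :] / K - P[:, None])                  # sinc kernels, shape (len(P), N)
    E = np.exp(2j * np.pi * np.outer(np.arange(K), n) / K)    # synthesis harmonics, shape (K, N)
    y = np.einsum('pk,pn,kn->n', X, S, E) / K                 # first branch of (88)
    norm = np.sum(S ** 2, axis=0)                              # sum_p (nu_K[p,0][n])^2
    off_grid = (n % K != 0)
    y[off_grid] /= norm[off_grid]                              # second branch of (88)
    return y.real                                              # real-valued signals assumed
```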
The only remaining problem to solve is how to set the parameters K and ε. The solution is not as simple as it might seem at first sight. Apparently, they are independent parameters but, in fact, correlations between them exist. One such bond is given by the imperfection of the inverse JvNT (88). Is there a couple {K, ε} that minimizes the standard deviation (std) of the error $\Delta x \triangleq x - \tilde{x}$ (between the genuine signal and the recovered signal)? To soundly answer this question, denote by $\sigma_y$ the std of a signal $y \in l_N^2$. Then the following cost-function can be defined:
$$A(K,\varepsilon) = \frac{100}{1 + 10\,\dfrac{\sigma_{\Delta x}(K,\varepsilon)}{\sigma_x}}\ [\%], \quad \forall\, K\in\mathbb{N}^*,\ \forall\, \varepsilon>0, \tag{91}$$
which comes from the hyperbola 1/(1 + x), which maps the infinite-length interval [0, +∞) onto the normalized interval (0, 1]. In definition (91), the std of the error Δx (i.e., $\sigma_{\Delta x}$) was normalized by the std of the original signal (i.e., $\sigma_x$) to obtain the relative std of the error. Since the relative std of the error needs to be minimized, the cost-function (91) has to be maximized while varying the two parameters. One can associate the cost-function with the normalized synthesis accuracy (hence the notation A).
In the case of $x \in l_N^2$, it is reasonable to limit the variation of the sampling rate K to the upper bound of N/2. Additionally, the threshold ε should be set at least equal to $10^{-3}$ (as this limit is small enough to neglect the JvN mw tails) and at most equal to 0.05. Although the cost-function is nonlinear, its optimization can be realized through various techniques, depending on the signal length, N. However, if N is not excessively large (for example, N = 1000), then the optimization can be realized by exhaustive search in the rectangle $\overline{2, N/2} \times [10^{-3}, 0.05]$, where the thresholds axis can be made discrete with step $10^{-4}$, as sketched below.
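To fix ideas, a brute-force sketch of this search is given below (in Python/NumPy, for illustration; the authors' implementation is in MATLAB™). The routines jvnt_analysis and jvnt_synthesis are hypothetical names for the analysis/synthesis procedures sketched in Section 3 below, and the full grid scan is shown only conceptually; as written, it is far too slow for routine use.

```python
import numpy as np

def accuracy(x, K, eps):
    # Cost-function (91): A(K, eps) = 100 / (1 + 10 * std(x - x_tilde) / std(x)).
    X, p_range = jvnt_analysis(x, K, eps)            # routines sketched in Section 3
    x_tilde = jvnt_synthesis(X, p_range, K, len(x))
    return 100.0 / (1.0 + 10.0 * np.std(x - x_tilde) / np.std(x))

def search_K_eps(x, eps_step=1e-4):
    # Exhaustive search over the rectangle [2, N/2] x [1e-3, 0.05], as in the text.
    N = len(x)
    K_opt, eps_opt, A_max = None, None, -np.inf
    for K in range(2, N // 2 + 1):
        for eps in np.arange(1e-3, 0.05 + eps_step, eps_step):
            A = accuracy(x, K, eps)
            if A > A_max:
                K_opt, eps_opt, A_max = K, eps, A
    return K_opt, eps_opt, A_max
```

In practice, one would first coarsen the ε grid (or fix ε at a few representative values) and, for long signals, replace the double loop by a metaheuristic, as suggested below.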
In Figure 2, a pseudo-random discrete signal with Gaussian distribution is depicted (on top, in blue), together with the synthesized signal (in the middle, in red) and the reconstruction error (at the bottom, in magenta).
The signal length is N = 1000. For this signal, the surface of the cost-function is illustrated at the bottom of the figure, on the left side. On the right side, the variation of the cost-function along the K axis is drawn for ε = 0.01. The cost-function surface is quite irregular. This fractal aspect is mainly due to the stochastic nature of the signal to be analyzed. Since the JvN atoms are smooth, while the signal to analyze is pseudo-random, the analysis coefficients, as well as the reconstruction error, inherit the stochastic behavior, which leads to ruptures in the accuracy criterion. The same characteristic is exhibited by the cost-function variation for ε = 0.01. If the signal length is quite large, a metaheuristic [38] should be employed to solve the optimization problem. The optimal point of the cost-function is {K_opt = 457, ε_opt = 0.001}, for which A_max = 99.64%. The synthesized signal and the reconstruction error in Figure 2 were evaluated at this optimal point. Thus, the relative std of the reconstruction error is approximately 0.03584%, which means the synthesized signal is almost identical to the original one. As one can see, the bigger the K and the smaller the ε, the more accurate the synthesized signal. Nevertheless, the optimal cost-function for ε = 0.01, i.e., A_max = 99.62% (for K_opt = 456), is only slightly smaller than the one for ε = 0.001. In terms of computational burden, however, the difference between the two optimal points is quite large. For ε = 0.001, one obtains N_ε = 1000K/π, whereas, for ε = 0.01, N_ε = 100K/π (i.e., about 10 times smaller). Since the gain in terms of accuracy is very small (99.64% versus 99.62%), the wise selection is ε = 0.01.
Focus now on K_opt. There is a correlation between this parameter and the time-frequency representation of the JvNT, i.e., of the JvN coefficients. Since X_K is a complex-valued function of two integer variables, one can represent its magnitude and phase over the time-frequency plane generated by the two indices: time shifting (p) and harmonic modulation (k). As in the case of the FT, |X_K| is the spectrum. Unlike the case of the FT, the JvN spectrum varies in time (for each index p, a different spectrum can be obtained). The time variation of the spectrum is a characteristic of non-stationary signals (which constitute the overwhelming majority of real-world signals). Thus, the FT spectrum only reveals the average behavior in frequency, while the JvN spectrum is closer to reality. In SP terminology, the surface representation of a time-varying spectrum is referred to as a spectrogram.
Return to the previous example. The JvN coefficients are represented as displayed on the left side of Figure 3 (spectrogram up and phase surface down). On the right side, one can see the classical representation of FT applied to the Gaussian signal (spectrum up and phase down). In both cases, the spectral power is represented in logarithmic scale or decibels (dB), whereas the phase is measured in degrees (deg).
Since the generated signal is close to a white noise, its average spectrum exhibits no dominant frequencies (see the right side of the figure). This property is verified by the JvN spectrogram as well (see the left side of the figure). However, when looking closely at the spectrogram, one can see that the spectral variation in time shows strong decays towards the bounds of the time-frequency plane, which is inaccurate. Practically, the main part of the spatial spectrum and phase on the left side of the figure is almost the same as the planar ones forming the right side of the figure. This effect is produced by the tails of the tfas (with small magnitudes) and not by their central variation. The real cause, however, is that the sampling rate was set to K_opt. This value is close to half of the signal length, so the maximum point of the JvN mw makes big jumps and falls into the signal support only three times, for n ∈ {0, K_opt, 2K_opt} = {0, 457, 914}. In fact, the useful part of the JvN spectrogram is quite poor and includes only the three instantaneous spectra corresponding to these positions. If a finer time variation of the spectrum is wanted, then the sampling rate K should be set to smaller values.
In SP terminology, the sampling rate is associated to the resolution of the time-frequency representation. The essential question is this: where to locate the coefficient X K [ p , k ] over the time-frequency plane? Since both the signal and the set of analysis coefficients are discrete, a grid covers this plane, as illustrated in Figure 4.
The grid density is determined by the value of K along the time axis and the value of 2/K along the frequency axis. Thus, the elementary grid mesh is a rectangle with sides of length K and 2/K. Therefore, its area is constant (and equal to 2), regardless of the value of K. Although the coefficient X_K[p,k] can be located at the coordinates $\left(pK, \frac{2k+1}{K}\right)$, in reality, it could lie anywhere on the mesh centered at these coordinates. In fact, this is a manifestation of the Gabor–Heisenberg uncertainty principle [2]. According to this principle, the product of the representation resolutions in time and in frequency is constant. In the case of the grid from Figure 4, the time resolution can conventionally be set to 1/K and the frequency resolution to K. Increasing the resolution in one domain automatically decreases the resolution in the dual domain, as their product is unitary. The localization of a JvN coefficient is uncertain inside the mesh. If one tries to increase the localization accuracy along one time-frequency axis (i.e., to increase the corresponding resolution), the localization along the other axis becomes more uncertain. Thus, a trade-off should be found.
In the example above, the resolutions game is strongly unbalanced, as the cost-function (91) focused too much on the frequency axis, to the detriment of the time axis. The JvNT leads to excellent localization in frequency, but poor (uncertain) localization in time. A good trade-off should keep both resolutions in balance. For example, if $K = \lfloor \sqrt{2N} + 0.5 \rfloor$ (the nearest integer to $\sqrt{2N}$), then the maximum point of the JvN mw jumps through about K/2 normalized instants inside the signal support. This means the grid has about K/2 important meshes (where the signal spectrum is accurately determined) along the time axis. In turn, along the frequency axis, since the discretization step is 2/K, about K/2 meshes exist as well. Thus, approximately the same number of important meshes is obtained along both axes.
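As a small illustration, the balanced choice of the sampling rate can be coded in one line (the helper name is ours, not the authors'):

```python
import math

def balanced_sampling_rate(N):
    # K = floor(sqrt(2N) + 0.5), the nearest integer to sqrt(2N), which yields
    # roughly K/2 significant meshes along both the time and the frequency axes.
    return int(math.sqrt(2 * N) + 0.5)

# balanced_sampling_rate(1000) -> 45, the value adopted in the example below.
```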
For the Gaussian signal in Figure 2, the following trade-off was adopted: $K = \lfloor \sqrt{2000} + 0.5 \rfloor = 45$ and ε = 0.01. The results obtained after applying the JvNT to the signal are displayed in Figure 5. This time, the cost-function decreased to A(45, 0.01) = 87.24% and the relative std of the error increased to 1.46% (i.e., it was about 41 times bigger than in the case of the optimal point). However, the synthesized signal is still accurate, as the top of the figure exhibits. The time-frequency analysis led to the spectrogram at the bottom of the figure, on the left side, and the phase surface on the right side.
When making comparisons to the JvN spectrogram for the optimal point of the cost-function (see Figure 3 again), one can easily observe the time variation of the signal spectrum, although its resolution in frequency is poorer. Instead of three significant instantaneous spectra, one can now see about 45 such significant spectra. The same effect can be noticed for the phase surface. Figure 3 and Figure 5 reveal that, in fact, the Gaussian signal is almost stationary (its spectrum is almost constant in time), due to the number of generated samples, N, which is rather small (1000). In Section 4, a 10 times longer Gaussian signal is tested, such that the non-stationary characteristic can be noticed.
To conclude this subsection, one can say that the JvNT of discrete signals with finite length support is an engineering tool working under the constraint of the uncertainty principle. Its performance depends on the balance between the two representation resolutions, both in time and in frequency.

3. Numerical Algorithms to Implement JvNTs

Two algorithms were designed and implemented, based on the previous section. The direct JvNT for 1D signals can be computed in two cases: when the signal is real-valued (and, thus, the symmetry property can be considered) and when the signal is complex-valued. Similarly, the inverse JvNT for 1D signals requires two approaches, depending on the nature of signals. Although the algorithms described next are easier to implement within the MATLAB™ programming environment, any other environment can be selected as well.
Note that, in the MATLAB™ programming language, the conjugate symmetric block of matrix C can be added after completing the main loop, by concatenation (with no need to access every element of the block). Additionally, it is easy to see that the first column of matrix C only takes real values, since it was computed for a null harmonic index. Moreover, if K is even, the column K_sym is real-valued as well. Although the numerical procedure did not consider such properties, they can help to increase the efficiency of Algorithm 1. Efficiency refers here to the computational burden and, ultimately, concerns the running time. For example, one procedure is more efficient than another if the result is provided by making a smaller number of arithmetic operations and/or obtained more quickly. In the case of Algorithm 1, there are two key parameters for efficiency: the signal length N and the accuracy threshold ε. The configuring parameters K, N_ε, P_min and P_max all depend on N and ε. The efficiency of Algorithm 1 (as well as that of Algorithm 2, which follows) depends on the number of atoms in the dictionary to operate with. The larger the N and the smaller the ε, the larger the number of such atoms and, thus, the slower the procedure. The symmetry properties allow for increased efficiency by decreasing the computational burden, as identical arithmetic operations are prevented from being performed twice.
Algorithm 1 Direct JvNT for 1D signals
[Algorithm 1 is provided as a figure in the published version of the article.]
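Since Algorithm 1 appears only as a figure in the published article, a minimal NumPy sketch of the direct transform is given below, built from the analysis formula used in the proof of Theorem 5 (Equation (81)). The truncation of the time-shift range through N_ε ≈ K/(πε) is our reading of the tail-threshold discussion in Section 2, and all names are ours; treat this as a sketch, not as the authors' Algorithm 1.

```python
import numpy as np

def jvnt_analysis(x, K, eps=0.01):
    """Direct JvNT (sketch): X_K[p, k] = sum_n x[n] sinc[pi(n/K - p)] e^{-2j pi k n / K}."""
    x = np.asarray(x)
    N = x.size
    n = np.arange(N)
    # Tail length below which the JvN mw is neglected: N_eps ~ K / (pi * eps)
    # (assumed truncation rule, inferred from the N_eps values quoted in the text).
    n_eps = int(np.ceil(K / (np.pi * eps)))
    # Time shifts whose truncated atoms still intersect the signal support.
    p_min = -int(np.ceil(n_eps / K))
    p_max = int(np.ceil((N - 1 + n_eps) / K))
    p = np.arange(p_min, p_max + 1)
    # np.sinc(t) = sin(pi t)/(pi t), hence sinc[pi(n/K - p)] = np.sinc(n/K - p).
    S = np.sinc(n[None, :] / K - p[:, None])                   # (P, N) sinc kernels
    E = np.exp(-2j * np.pi * np.outer(np.arange(K), n) / K)    # (K, N) conjugated harmonics
    X = (S * x[None, :]) @ E.T                                 # (P, K) coefficient matrix
    return X, (p_min, p_max)
```

For real-valued signals, the conjugate symmetry noted above means only about half of the K columns actually need to be computed.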
For the following algorithm, the inverse JvNT (88) was expressed in matrix form. As already proven, all samples $\tilde{x}[mK]$ ($m \in \overline{0,\lfloor (N-1)/K\rfloor}$) can simply be computed by averaging several rows of the coefficients matrix, regardless of whether they are real-valued or complex-valued (see Equation (90)). For the remaining samples, focus on the main term in Equation (88), namely:
$$\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1} X_K[p,k]\,\nu_K^{[p,k]}[n] = \sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1} X_K[p,k]\,\operatorname{sinc}\!\left[\pi\!\left(\tfrac{n}{K}-p\right)\right]e^{\frac{2kn\pi}{K}j}, \quad \forall\, n\in\overline{0,N-1}\setminus K\mathbb{Z}. \tag{92}$$
Since the sinc kernel and the exponentials of the harmonic part use independent indices, they can be packed into two different vectors:
$$\begin{cases} \mathbf{s}[n] = \left[\operatorname{sinc}\!\left[\pi\!\left(\tfrac{n}{K}-P_{\min}\right)\right]\ \ \operatorname{sinc}\!\left[\pi\!\left(\tfrac{n}{K}-P_{\min}-1\right)\right]\ \cdots\ \operatorname{sinc}\!\left[\pi\!\left(\tfrac{n}{K}-P_{\max}\right)\right]\right]^T \in \mathbb{R}^P;\\[1.5ex] \mathbf{e}[n] = \left[1\ \ e^{\frac{2n\pi}{K}j}\ \cdots\ e^{\frac{2(K-1)n\pi}{K}j}\right]^T \in \mathbb{C}^K. \end{cases} \tag{93}$$
(In the first definition of (93), $P = P_{\max} - P_{\min} + 1$.) Then, if $\mathbf{X}_K \in \mathbb{C}^{P\times K}$ denotes the matrix of all analysis coefficients, Equation (92) becomes:
$$\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1} X_K[p,k]\,\nu_K^{[p,k]}[n] = \mathbf{s}^T[n]\,\mathbf{X}_K\,\mathbf{e}[n], \quad \forall\, n\in\overline{0,N-1}\setminus K\mathbb{Z}. \tag{94}$$
Consequently, the second branch of transform (88) is straightforwardly expressed in compact form below:
$$\tilde{x}[n] = \frac{\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\sum_{k=0}^{K-1} X_K[p,k]\,\nu_K^{[p,k]}[n]}{K\displaystyle\sum_{p=P_{\min}}^{P_{\max}}\left(\nu_K^{[p,0]}[n]\right)^2} = \frac{\mathbf{s}^T[n]\,\mathbf{X}_K\,\mathbf{e}[n]}{K\,\mathbf{s}^T[n]\,\mathbf{s}[n]} = \frac{\mathbf{s}^T[n]\,\mathbf{X}_K\,\mathbf{e}[n]}{K\,\|\mathbf{s}[n]\|^2}, \quad \forall\, n\in\overline{0,N-1}\setminus K\mathbb{Z}. \tag{95}$$
Algorithm 2 Inverse JvNT for 1D signals
[Algorithm 2 is provided as a figure in the published version of the article.]
The symmetry property (included in Algorithm 2) can considerably reduce the runtime in the case of real-valued signals.
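A matching NumPy sketch of the synthesis, implementing the two branches of (88) through (90) and the vectorized form (95), is given below, with the same caveats: the names are ours and a real-valued original signal is assumed.

```python
import numpy as np

def jvnt_synthesis(X, p_range, K, N):
    """Inverse JvNT (a sketch of Equations (88), (90) and (95))."""
    p_min, p_max = p_range
    p = np.arange(p_min, p_max + 1)
    k = np.arange(K)
    x_tilde = np.zeros(N)
    for n in range(N):
        if n % K == 0:
            # First branch, Equation (90): average the row p = n/K of X.
            x_tilde[n] = X[n // K - p_min, :].sum().real / K
        else:
            # Second branch, Equation (95): s^T X e / (K ||s||^2).
            s = np.sinc(n / K - p)                     # vector s[n] of (93)
            e = np.exp(2j * np.pi * k * n / K)         # vector e[n] of (93)
            x_tilde[n] = (s @ X @ e).real / (K * (s @ s))
    return x_tilde
```

Taking the real part assumes a real-valued original signal; for complex-valued signals, the .real casts would simply be dropped.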

4. Simulation Results and Discussion

After implementing the algorithms from Section 3 within the MATLAB™ programming environment, several tests were performed upon one artificial and one real-life signal. Both of them were real-valued. The JvN dictionaries were configured with the accuracy threshold ε = 0.01. Two types of sampling rates are employed for comparison: $K_1 = \lfloor \sqrt{N} + 0.5 \rfloor$ and $K_2 = \lfloor \sqrt{2N} + 0.5 \rfloor$, where N is the signal length.

4.1. Gaussian Pseudo-Randomly Generated 1D Signal

The signal was generated by means of the function randn from the MATLAB™ library. Its length is N = 10,000. Consequently, the two JvNTs are defined by K₁ = 100 and K₂ = 141, respectively. In Figure 6, the generated signal is displayed on top, while its frequency representation is shown below. The spectrum (in the middle) is drawn in dB (as usual) and the phase (at the bottom) in deg.
The signal looks like a realization of white noise (which is non-autocorrelated and unpredictable), its spectrum having an almost constant envelope. Real-world signals of this kind are, for instance, the seismic ones. Such signals are difficult to compress or predict. Therefore, as the FT spectrum reveals, the coefficients of the JvNT are expected to have an almost flat envelope. The phase is linear, as the signal is similar to the impulse response of an all-pass filter. Why Gaussian? Most stochastic signals in nature are Gaussian, as a direct consequence of the central limit theorem. Thus, the artificial signal is intended to be as close as possible to real-life stochastic white noises.
In the next figures, the results of the JvNT are displayed in two columns: for K₁ = 100 in the first column and for K₂ = 141 in the second. In Figure 7, the JvN analysis coefficients are represented as a spectrogram and a phase surface. The spectrogram is shown in two representations: linear (on the top row) and in dB (in the middle row). The phase is measured in deg, as for the FT.
Unlike the signals of Figure 3 and Figure 5, the signal of Figure 6 is non-stationary, as revealed by the spectrograms in Figure 7 (especially by the ones drawn on the top row). The non-stationary behavior is slightly better revealed in the case of the smaller sampling rate (see the linear spectrograms). Nevertheless, the initial energy of the signal seems to be almost equally divided between the JvN coefficients in both cases, because no dominant frequencies exist in the signal spectrum (see the logarithmic spectrograms in the middle).
After applying the inverse JvNT, the synthesized signals on the top row of Figure 8 were obtained. The difference from the original signal cannot be visually detected at the usual graphical resolution. However, the reconstruction error is non-null, as displayed on the bottom row of Figure 8. The variations of reconstruction error are particularly interesting, as, against expectations, the original signal is more accurately recovered for the smaller sampling rate.
Both variations were drawn at the same scale on purpose, so that the difference between their amplitudes becomes observable. Recall that, according to the discussion in Section 2, the accuracy of reconstruction should be higher when the sampling rate is bigger. However, the cost-function surface (see Figure 2 again) is irregular, which opens the possibility of obtaining more accurate synthesized signals for some smaller sampling rates. Thus, it seems that, in the case of the Gaussian signal, $K_1 = \lfloor \sqrt{N} + 0.5 \rfloor$ is the winner.
In Table 1, the performance of JvNTs is summarized, for the Gaussian signal above.
The relative std of the error is computed as in the definition of accuracy (91). Both the relative std of the error and the accuracy confirm that the first JvNT performs better. The analysis-synthesis algorithms were implemented and run within the MATLAB™ environment, on a regular octa-core computer. The runtimes in the table show that the synthesis procedure performs at least six times faster than the analysis procedure.

4.2. Speech Signal

A male speaker was recorded saying the following sentence: “The Fourier transform of a real-valued signal is conjugate symmetric.” (which is a true assertion). The speech signal is represented in the left-side window of Figure 9 and counts 110,033 samples acquired at a 22.05 kHz sampling rate (CD quality). The sentence took approximately 5 s to utter. The FT of the speech signal is represented to the right in the figure.
Unlike the previous Gaussian signal, the speech signal under consideration is not only 11 times longer, but also highly autocorrelated. On the spectral envelope, four formants can be seen. (Speech signals can exhibit up to five formants.) This means the FT spectrum exhibits dominant frequencies, although they are not clearly revealed. Basically, each central frequency of a formant is a dominant frequency. Yet, the localization of such a frequency with the help of the FT spectrum is very uncertain, as it can lie anywhere in the formants' sub-bands, which are quite large. Beneath the spectrum, the phase is nonlinear, which is an indication of non-stationary behavior. The two selected sampling rates are: K₁ = 332 and K₂ = 469.
The JvN analysis led to the results in Figure 10 (with a structure similar to that of Figure 7). The linear spectrograms on the top row of the figure reveal that the information the speech signal encodes is allocated to a reduced number of JvN coefficients, especially at low frequencies. This is a characteristic of many real-life signals. In the middle of the figure, where the spectrograms in dB are drawn, the spectrum variation in time proves that speech is a non-stationary signal.
One can see how the formants change their shape in time. In fact, only a few of the 332 (to the left) or 235 (to the right) spectra have more than two formants, while all of them exhibit the main formant at low frequency. If a frequency index is selected, one can see how the corresponding frequency varies in time. This is the reason such a frequency is called instantaneous for a given time moment, in SP terminology. The phase depicted at the bottom of the figure varies in time as well. The variation is more dynamic in the case of the first (smaller) sampling rate than in the case of the second one, due to the fact that the time resolution is higher.
Besides the time variation of the JvN coefficients, the spectrograms in Figure 10 suggest that the speech signal can be compressed by selecting only the JvN coefficients with the highest spectral power values (at the dominant instantaneous frequencies) to be sent for signal reconstruction. Note that the total number of coefficients is 131,140 for K₁ = 332 and 139,762 for K₂ = 469. Both numbers are comparable to the signal length. To assess the theoretical compression capacity of the JvNT, the coefficient matrices were linearized and the coefficients were sorted in descending order of magnitude. In Figure 11, two variations are drawn, one for each sampling rate. Each variation shows how many coefficients would be necessary to reach a given relative energy threshold.
More specifically, assume that the energy of all coefficients is E(C) and that the linearized array of coefficients, after being sorted in descending order of magnitude, is the vector c. Set the threshold at $\eta \in [0,1]$ of relative energy.
The number of necessary coefficients from c to accumulate at most the energy η E ( C ) can be denoted by N η . Then, the variations in Figure 11 correspond to inequality:
$$E\!\left(\mathbf{c}_{N_\eta}\right) = \sum_{n=1}^{N_\eta} |c_n|^2 \le \eta\, E(\mathbf{C}), \tag{96}$$
where the partial vector $\mathbf{c}_{N_\eta}$ includes the first $N_\eta$ coefficients of vector c. Thus, the relative energy thresholds 100η (in percent) are put into correspondence with the numbers $N_\eta$. Intuitively, one assumes that most of the energy (and signal information) is concentrated in a reduced number of coefficients, especially ones located at low frequencies. The variations above prove that the relative energy increases rapidly when a very small number of coefficients are selected. For example, if η = 0.95, then $N_{0.95} \in \{3888, 3994\}$. This means that most of the signal energy (i.e., approximately 95%) is concentrated in a small number of coefficients (i.e., no more than about 3% of the coefficients' total number). Even compared to the total number of samples in the speech (i.e., 110,033), the number $N_{0.95}$ is quite small. Nevertheless, to be fair, it should be outlined that the JvN coefficients are complex-valued, which means that $N_{0.95}$ must be multiplied by two. Therefore, the number $2N_{0.95}$ is no more than about 7.3% of the total number of samples in the speech (in both cases).
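The counting of $N_\eta$ per inequality (96) amounts to a sort and a cumulative sum; a sketch follows (the function name is ours, and the exact boundary convention is our reading of the text):

```python
import numpy as np

def n_eta(X, eta):
    # N_eta of (96): the largest number of strongest coefficients whose
    # cumulative energy does not exceed eta * E(C).
    energies = np.sort(np.abs(X).ravel() ** 2)[::-1]   # descending energies
    cum = np.cumsum(energies)
    return int(np.searchsorted(cum, eta * energies.sum(), side="right"))
```

Applied to the coefficient matrices above, η = 0.95 should recover counts close to the reported 3888 and 3994.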
To better assess the compression capacity of the JvNT, one can compute the average angle of the first derivative at the origin for the mapping $N_\eta \mapsto \eta$. Obviously, for any couple $\{N_\eta, \eta\}$, this angle can be approximated by:
$$\alpha_\eta = \arctan\frac{100\,\eta}{N_\eta}. \tag{97}$$
The bigger the η, the rougher the approximation and the higher the slope of the derivative estimation. Nevertheless, one can notice in the variations of Figure 11 that most approximations are obtained from the left side, where they are more accurate than the few ones from the right side. In Figure 11, the interval [0, 1] was sampled with the step 0.01, such that 101 thresholds of relative energy were considered (including both the null and the unit ones). By averaging all estimations (97) over the last 100 points (the first one being evidently removed), one obtains the desired average angle, which can serve as a theoretical measure of compression capacity. The total number of JvN coefficients contributes to the average angle as well, thanks to the last point of the considered 100. Thus, the bigger the average angle, the better the compression capacity. The average angles and corresponding derivatives are depicted in both panels of Figure 11. One can see that the right-side approximations (although very poor) affected the derivatives very little, as they were very close to the real derivative at the origin. This averaging is sketched below.
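A sketch continuing the previous one (the guard against a zero count for very small η is our addition):

```python
import numpy as np

def average_angle_deg(X, steps=101):
    # Average the angle estimates (97) over eta = 0.01 ... 1.00 (the null
    # threshold removed), as a theoretical measure of compression capacity.
    angles = []
    for eta in np.linspace(0.0, 1.0, steps)[1:]:
        n = max(n_eta(X, eta), 1)          # guard: at least one coefficient
        angles.append(np.degrees(np.arctan(100.0 * eta / n)))
    return float(np.mean(angles))
```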
In Table 2, the two angles, along with some samples of the variations in Figure 11, are listed.
Clearly, the second JvNT (adjusted for higher sampling rate) has better compression capacity than the first one, although the difference between them is not statistically significant.
Figure 12 displays the JvN synthesis results.
All analysis coefficients were employed, regardless of their spectral powers. The synthesized signals on the top row of the figure seem identical to each other and to the original signal in Figure 9. However, reconstruction errors exist, as proven by the bottom row of Figure 12. As in the case of the previous signal, the speech is better recovered for the smaller sampling rate than for the larger one. In turn, given the spectrogram variations in Figure 10 and according to the previous discussion, a better compression factor could be obtained for the second (higher) sampling rate, as the formants' variation points to fewer dominant instantaneous frequencies and the phase has large, almost flat zones.
The performance of JvNTs applied to speech is summarized in Table 3.
As one can easily notice, in both cases, the synthesis runtime is approximately six times smaller than the analysis runtime. Compared to the Gaussian signal, the analysis-synthesis algorithms were more than 85 times slower, because the speech signal is 11 times longer. This suggests the advantage of segmenting the speech signal into 11 frames that can be processed separately, as sketched below. Thus, the runtime would be only about 11 times larger, but at the expense of higher reconstruction error (as each synthesized frame would have its own errors).
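The segmentation idea can be sketched as a hypothetical frame-wise wrapper around the earlier analysis/synthesis sketches; the per-frame K₁-style rate is our choice for illustration, not a prescription from the article.

```python
import numpy as np

def jvnt_by_frames(x, n_frames, eps=0.01):
    # Analyze and resynthesize each frame separately; cheaper than one long
    # transform, at the price of independent per-frame reconstruction errors.
    out = []
    for frame in np.array_split(np.asarray(x), n_frames):
        K = int(np.sqrt(frame.size) + 0.5)     # K1-style per-frame sampling rate
        X, p_range = jvnt_analysis(frame, K, eps)
        out.append(jvnt_synthesis(X, p_range, K, frame.size))
    return np.concatenate(out)
```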
The previous remarks and the performance parameters listed in the table above suggest that there is a balance between the two JvNTs. On one hand, the first JvNT (for the smaller sampling rate) seems more accurate and faster. On the other hand, the second JvNT seemingly has better compression capacity. At the bottom of Figure 13, one can see the fitness variation depending on the number of strongest JvN coefficients selected to perform the synthesis (the other coefficients being set to zero). The first JvNT (for the smaller sampling rate) has a slightly better performance, as proven by the curve in blue lying above the curve in red (higher fitness at a given number $N_\eta$ for the first JvNT than for the second one).
At the right-side end of the fitness variations, the fitness was computed for all JvN coefficients, which led to the signals in Figure 12. Two other points were selected on each fitness characteristic, to illustrate to what extent the synthesized signal can be degraded by the removal of some of the JvN coefficients.
On the top left side of Figure 13, the synthesized signals correspond to the relative energy threshold η = 0.95. Thus, according to Table 2, the number of strongest coefficients (nsc) is 3888 for K₁ = 332 and 3994 for K₂ = 469. Although the cumulative energy of those coefficients is quite high, their number is very small, which led to low values of fitness: 30.76% for K₁ = 332 and 30.66% for K₂ = 469. This time, the two signals and the recovering errors are not so different. Moreover, the synthesized signals do not seem so different from the original signal. Nevertheless, compared to the signals in Figure 12, the recovering errors are more than 22 times bigger, as the relative std of the error is 22.56% for K₁ = 332 and 22.62% for K₂ = 469.
Increase the nsc to approximately 28% of the JvN coefficients' total number, i.e., to 36,879 for K₁ = 332 and 39,644 for K₂ = 469. This choice corresponds to the energy threshold η = 0.999. The synthesized signals and the recovering errors are shown on the top right side of Figure 13. The fitness increased to 74.27% for K₁ = 332 and 73.33% for K₂ = 469, while the relative std of the error decreased to 3.47% for K₁ = 332 and 3.64% for K₂ = 469. In this case, too, there is not much difference between the two JvNTs in terms of accuracy and compression capacity.
Despite appearances, the errors on the right side of the figure are drawn at a different scale from the ones on the left side. In fact, they are approximately 6.5 times smaller.

5. Concluding Remarks

In the manner of some of the scientists who studied John von Neumann's work, one can say that his function, as simple as it is, has its own magic. Translating this function by an integer offset results in a new function, which is orthogonal to the basic one. By simply looking at the variations of the two functions, the orthogonality property is undetectable. It is only clearly revealed when working in the frequency domain instead of the time domain.
Orthogonality and the compression capacity of a transform are strongly correlated, as fully proven by the JvNTs (and, in fact, by most orthogonal transforms). This is one reason orthogonality is a very important feature in SP, especially in modern telecommunications, where signal compression plays the leading role. Both material and financial resources can be saved by wisely selecting the orthogonal transform to integrate into signal compression technology. From this perspective, the JvNTs seemingly are useful tools with good compression capacity. This characteristic is mainly due to the property of JvN's function of approximating the impulse response of ideal low-pass filters, either in continuous or in discrete time, which makes orthogonality easier to achieve.
As for future research and development, the definitions of JvNTs will be extended to the case of 2D signals (i.e., images in real-life), where achieving the orthogonality remains the main challenge.

Author Contributions

D.S. and J.C. equally contributed to conceptualization, methodology, software, validation, formal analysis, investigation, resources, data curation, writing—original draft preparation, writing—review and editing, visualization, supervision, project administration, and funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data employed while testing the numerical algorithms can be sent to readers on request.

Conflicts of Interest

The authors declare no conflict of interest.

Acronyms

1D/2D: One/two dimension(s)
DCT: Discrete cosine transform
DFT(s): Discrete Fourier transform(s)
FT: Fourier transform(s)
JvN: John von Neumann
JvNT(s): John von Neumann transform(s)
KLT: Karhunen-Loeve transform
RGB: Red-Green-Blue (image digital system)
SP: Signal processing
TDR: Theorem of division with remainder
dB: Decibels (logarithmic scale)
deg: Degrees (for angles)
fp: Floating point (representation)
mw: Mother waveform/window
nsc: Number of strongest coefficients
std: Standard deviation
tfa(s): Time-frequency atom(s)

References

  1. Söderström, T.; Stoica, P. System Identification; Prentice Hall: London, UK, 1989; ISBN 978-0138812362.
  2. Proakis, J.G.; Manolakis, D.G. Digital Signal Processing. Principles, Algorithms and Applications; Prentice Hall Inc.: Hoboken, NJ, USA, 1996; ISBN 0-13-394338-9.
  3. Mason, J.C.; Handscomb, D.C. Chebyshev Polynomials; Chapman and Hall/CRC: Boca Raton, FL, USA, 2002; ISBN 978-1-4200-3611-4.
  4. Abdulhussain, S.H.; Ramli, A.; Jassim, W.A. Shot Boundary Detection Based on Orthogonal Polynomial. Multimed. Tools Appl. 2019, 78, 20361–20382.
  5. Celeghini, E.; Gadella, M.; del Olmo, M.A. Symmetry Groups, Quantum Mechanics and Generalized Hermite Functions. Mathematics 2022, 10, 1448.
  6. Serov, V.V. Orthogonal Fast Spherical Bessel Transform on Uniform Grid. Comput. Phys. Commun. 2017, 216, 63–76.
  7. Dirichlet, P.G.L. On the Convergence of Trigonometric Series which Serve to Represent an Arbitrary Function Between Two Given Limits. J. Für Die Reine Und Angew. Math. 1829, 4, 157–169. (In French)
  8. Pavez, E.; Girault, B.; Chou, P.A. Spectral Folding and Two-Channel Filter-Banks on Arbitrary Graphs. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, ON, Canada, 6–11 June 2021.
  9. Zou, T.T.; Xu, W.J.; Ding, Z.G. Low-Complexity Linear Equalization for OTFS Systems with Rectangular Waveforms. In Proceedings of the IEEE International Conference on Communications, Montreal, QC, Canada, 14–23 June 2021.
  10. Okade, M.; Mukherjee, J. Discrete Cosine Transform: A Revolutionary Transform that Transformed Human Lives. IEEE Circuits Syst. Mag. 2022, 22, 58–61.
  11. Hotelling, H. Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 1933, 24, 417–441, 498–520.
  12. Karhunen, K. On the Linear Methods in Probability Theory. Annals of Academy of Sciences Fennicae. Series A, I. Math.-Phys. 1947, 37, 1–79. (In German)
  13. Loève, M. Probability Theory. Vol. II—Graduate Texts in Mathematics; Springer: Berlin/Heidelberg, Germany, 1978; Volume 46, ISBN 978-0-387-90262-3.
  14. Pearson, K. On Lines and Planes of Closest Fit to Systems of Points in Space. Philos. Mag. 1901, 2, 559–572.
  15. Rissanen, J. Modeling by Shortest Data Description. Automatica 1978, 14, 465–471.
  16. Hua, Y.; Liu, W. Generalized Karhunen–Loeve Transform. IEEE Signal Process. Lett. 1998, 5, 141–142.
  17. Hartley, R.V.L. Transmission of Information. Bell Syst. Technol. J. 1928, 7, 535–563.
  18. Flandrin, P. Time-Frequency Representations of Nonstationary Signals. Trait. Du Signal 1989, 6, 89–101. (In French)
  19. Cohen, L. Time-Frequency Analysis; Prentice Hall: Hoboken, NJ, USA, 1995; ISBN 978-0135945322.
  20. Zayed, A.I. Handbook of Function and Generalized Function Transformations; CRC Press: Boca Raton, FL, USA, 1996; ISBN 978-0849378515.
  21. Bardenet, R.; Hardy, A. Time-Frequency Transforms of White Noises and Gaussian Analytic Functions. Appl. Comput. Harmon. Anal. 2021, 50, 73–104.
  22. Wang, Y.H. Time-Frequency Domain Local Spectral Analysis of Seismic Signals with Multiple Windows. Proc. R. Soc. Ser. A—Math. Phys. Eng. Sci. 2022, 478, 2265.
  23. Goupillaud, P.; Grossmann, A.; Morlet, J. Cycle-Octave and Related Transforms in Seismic Signal Analysis. Geoexploration 1984, 23, 85–102.
  24. Meyer, Y. Orthonormal Wavelets. In Inverse Problems Theoretical Imaging; Springer: Berlin, Germany, 1989; pp. 21–37, ISSN 0938-5509.
  25. Mallat, S. A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Trans. Pattern Anal. Mach. Intell. 1989, 11, 674–693.
  26. Hedayat, A.; Wallis, W.D. Hadamard matrices and their applications. Ann. Stat. 1978, 6, 1184–1238.
  27. Haar, A. On the Theory of Orthogonal System Functions. Math. Ann. 1910, 69, 331–371. (In German)
  28. Walsh, J.L. A Closed Set of Normal Orthogonal Functions. Am. J. Math. 1923, 45, 5–24.
  29. Pratt, W.K.; Chen, W.H.; Welch, L.R. Slant Transform Image Coding. IEEE Trans. Commun. 1974, 22, 1075–1093.
  30. Kountchev, R.K.; Mironov, R.P.; Kountcheva, R.A. Hierarchical Cubical Tensor Decomposition through Low Complexity Orthogonal Transforms. Symmetry 2020, 12, 864.
  31. Ahmad, O.; Sheikh, N.A. Gabor Systems on Positive Half Line via Walsh-Fourier Transform. Carpathian Math. Publ. 2020, 12, 468–482.
  32. Dziech, A. New Orthogonal Transforms for Signal and Image Processing. Appl. Sci. 2021, 11, 7433.
  33. Daubechies, I. Orthonormal Bases of Compactly Supported Wavelets. Commun. Pure Appl. Math. 1988, 41, 909–996.
  34. Cohen, A.; Daubechies, I.; Feauveau, J.C. Biorthogonal Bases of Compactly Supported Wavelets. Commun. Pure Appl. Math. 1992, 45, 485–560.
  35. Gnutti, A.; Guerrini, F.; Leonardi, R. A Wavelet Filter Comparison on Multiple Datasets for Signal Compression and Denoising. Multidimens. Syst. Signal Process. 2021, 32, 791–820.
  36. Mallat, S.; Zhong, S. Matching Pursuits with Time-Frequency Dictionaries. IEEE Trans. Signal Process. 1993, 41, 3397–3415.
  37. Taub, A.H. Operators, Ergodic Theory and Almost Periodic Functions in a Group. In John Von Neumann Collected Works; Pergamon Press Ltd.: Oxford, UK, 1961; Volume II, ISBN 978-0080095660.
  38. Stefanoiu, D.; Borne, P.; Popescu, D.; Filip, F.G.; El Kamel, A. Optimization in Engineering Sciences—Metaheuristics, Stochastic Methods and Decision Support; John Wiley & Sons: London, UK, 2014; ISBN 978-1-84821-498-9.
Figure 1. Three time-frequency atoms of JvN orthogonal dictionary: the mw (top), the real part of a delayed and frequency modulated atom (middle), and the imaginary part of an anticipated and frequency modulated atom (bottom).
Figure 2. Cost-function (relative accuracy) evaluation for a pseudo-randomly generated signal with Gaussian distribution. Top: original signal (top), synthesized signal (middle), and reconstruction error (bottom). Left-side bottom: cost-function surface. Right-side bottom: cost-function variation for ε = 0.01 (depending on sampling rate, K).
Figure 3. Time-frequency representation of a signal. Left side: JvN spectrogram (up, in dB) and phase surface (down). Right side: FT spectrum (up, in dB) and phase (down).
Figure 4. Time-frequency localization of JvN analysis coefficients.
Figure 5. Analysis-synthesis of a pseudo-generated signal with Gaussian distribution, by using the JvNT for K = 45 and ε = 0.01. Top: original signal (up), synthesized signal (middle), and reconstruction error (down). Left side, bottom: JvN spectrogram. Right side, bottom: JvN phase surface.
Figure 6. A Gaussian pseudo-random signal (on top), together with its FT spectrum (in the middle) and phase (at bottom).
Figure 7. Results of JvN analysis on a Gaussian pseudo-random signal. Top row: spectrograms in linear scale. Middle row: spectrograms in dB. Bottom row: phase surfaces in deg.
Figure 8. Results of JvN synthesis from the analysis coefficients of Gaussian pseudo-random signal. Top row: synthesized signals. Bottom row: reconstruction errors.
Figure 9. A speech signal (left side) together with its FT spectrum (right side up) and phase (right side down).
Figure 10. Results of JvN analysis on a speech signal. Top row: spectrograms in linear scale. Middle row: spectrograms in dB. Bottom row: phase surfaces in deg.
Figure 11. Theoretical compression capacity of JvNT applied on a speech signal, for K₁ = 332 (left side) and K₂ = 469 (right side).
Figure 12. Results of JvN synthesis from the analysis coefficients of speech signal. Top row: synthesized signals. Bottom row: reconstruction errors.
Figure 13. Results on lossy synthesis of speech signal. Fitness variations at bottom. Synthesized signals and recovering errors for η = 0.95 on top left side and for η = 0.999 on top right side. On top both sides: for K₁ = 332 up and for K₂ = 469 down.
Table 1. Performance parameters of JvNTs in the case of a Gaussian pseudo-random signal.

Signal Length N | Sampling Rate K | Relative std of Error [%] | Accuracy [%] | Analysis Runtime [s] | Synthesis Runtime [s]
10,000 | 100 | 1.37 | 87.96 | 3.93 | 0.61
10,000 | 141 | 1.83 | 84.55 | 4.32 | 0.62
Table 2. Relative energy η [%] versus JvN coefficients number N_η for a speech signal.

η [%]         | 20 | 25 | 30 | 35  | 40  | 45  | 50  | 55  | 60  | 65  | 70  | 75  | 80   | 85   | 90   | 95   | Angle
N_η (K = 332) | 30 | 48 | 74 | 107 | 149 | 204 | 268 | 340 | 424 | 521 | 642 | 799 | 1028 | 1426 | 2169 | 3888 | 17.60°
N_η (K = 469) | 26 | 43 | 65 | 92  | 124 | 162 | 210 | 268 | 336 | 422 | 531 | 686 | 926  | 1339 | 2110 | 3994 | 20.37°
Table 3. Performance parameters of JvNTs in the case of a speech signal.

Signal Length N | Sampling Rate K | Relative std of Error [%] | Accuracy [%] | Analysis Runtime [s] | Synthesis Runtime [s]
110,033 | 332 | 1.31 | 88.45 | 342.68 | 56.27
110,033 | 469 | 1.75 | 85.13 | 370.27 | 57.24
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
