Isometry Invariant Shape Recognition of Projectively Perturbed Point Clouds by the Mergegram Extending 0D Persistence

Elkin, Yury; Kurlin, Vitaliy

doi:10.3390/math9172121


Isometry Invariant Shape Recognition of Projectively Perturbed Point Clouds by the Mergegram Extending 0D Persistence^†

by

Yury Elkin

and

Vitaliy Kurlin

^*

Materials Innovation Factory, Computer Science Department, University of Liverpool, Liverpool L69 3BX, UK

^*

Author to whom correspondence should be addressed.

^†

This paper is an extended version of the shorter paper published in the 45th International Symposium on Mathematical Foundations of Computer Science (MFCS 2020), Prague, Czech Republic, 24–28 August 2020.

Mathematics 2021, 9(17), 2121; https://doi.org/10.3390/math9172121

Submission received: 6 July 2021 / Revised: 24 August 2021 / Accepted: 25 August 2021 / Published: 1 September 2021

(This article belongs to the Special Issue Computational Algebraic Topology and Neural Networks in Computer Vision)


Abstract

Rigid shapes should be naturally compared up to rigid motion or isometry, which preserves all inter-point distances. The same rigid shape can often be represented by noisy point clouds of different sizes. Hence, the isometry shape recognition problem requires methods that are independent of the cloud size. This paper studies stable-under-noise isometry invariants for the recognition problem stated in the harder form, where the given clouds can be related by affine or projective transformations. The first contribution is the stability proof for the invariant mergegram, which completely determines a single-linkage dendrogram in general position. The second contribution is the experimental demonstration that the mergegram outperforms other invariants in recognizing isometry classes of point clouds extracted from perturbed shapes in images.

Keywords:

shape recognition; Topological Data Analysis; machine learning; computer vision

1. Introduction: Motivations, Shape Recognition Problem, and Overview of Results

Real-life objects are often represented by unstructured point clouds obtained by laser range scanning or by selecting salient or feature points in images [1]. Point clouds are easy to store and can be used as primitives for visualization [2]. The above advantages strongly motivate the problem of comparing and classifying unstructured point clouds.

Rigid objects are naturally studied up to rigid motion or isometry (including reflections), which is any map that preserves inter-point distances. The recognition of point clouds of the same number of points is practically solved by the histogram of all pairwise distances, which is a complete isometry invariant in general position [3].

Real shapes are often given in a distorted form because of noisy measurements, when points are perturbed, missed or accidentally added. One of the first approaches to recognize nearly identical point clouds

A, B

of different sizes in the same metric space, for example, in

R^{m}

, is to use the Hausdorff distance [4]

HD (A, B) = min ϵ \geq 0

such that the first cloud A is covered by ϵ-balls centered at all points of B, and vice versa.

However, we also need to take into account infinitely many potential isometries of the ambient space

R^{m}

. The exact computation of

{inf}_{f} HD (f (A), B)

minimized over isometries f of

R^{m}

has a high polynomial complexity already for dimension

m = 2

[5]. An approximate algorithm is cubic in the number of points for

m = 3

[6].

This paper extends the 12-page conference version [7], which introduced the new invariant mergegram but did not prove its continuity under perturbations. In addition to the proof of continuity, another contribution is Theorem 2 showing how to reconstruct a dendrogram of single-linkage clustering from a mergegram in general position.

The practical novelty is the harder recognition problem including perturbations of isometries within wider classes of affine and projective maps motivated by computer vision applications. Indeed, different positions of cameras produce projectively equivalent images of the same rigid shape. The new experiments in Section 6 extensively compared several approaches on 15,000 clouds obtained from real images; see examples in Figure 1.

Problem 1

(isometry shape recognition under noise). Find an isometry invariant of point clouds in

R^{m}

that is (a) independent of a cloud size, (b) provably continuous under perturbations of a cloud, (c) computable in near-linear time in the size of a cloud, and (d) more efficient for recognizing isometry classes of clouds than past invariants.

The key contributions are Theorems 2 and 3 and the experiments in Section 6 showing that the mergegram achieves a state-of-the-art recognition on substantially distorted images.

2. Related Work on Isometry Shape Recognition and Topological Data Analysis

For the isometry classification of clouds consisting of the same number of points, the easiest invariant is the distribution of all pairwise distances, whose completeness (or injectivity) was proved for all point clouds in general (non-singular) position in

R^{m}

[3].

Fixed point clouds

A, B \subset R^{m}

of different sizes can be compared pairwise by the Hausdorff distance [4]

HD (A, B) = max {sup_{p \in A} d_{B} (p), sup_{q \in B} d_{A} (q)}

, where

d_{B} (p) = inf_{q \in B} d (p, q)

is the (Euclidean or another) distance from a point

p \in A

to the cloud B.
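For finite clouds, the suprema and infima in the formula above become maxima and minima, so the Hausdorff distance can be evaluated directly. The following Python sketch is a brute-force illustration with the Euclidean distance. The function names `directed_hausdorff` and `hausdorff` and the sample clouds are illustrative assumptions, not taken from the paper.

```python
from math import dist

def directed_hausdorff(A, B):
    """Largest distance d_B(p) from a point p of A to its nearest point of B."""
    return max(min(dist(p, q) for q in B) for p in A)

def hausdorff(A, B):
    """Symmetric Hausdorff distance HD(A, B) between finite non-empty clouds."""
    return max(directed_hausdorff(A, B), directed_hausdorff(B, A))

# Hypothetical clouds of different sizes in the plane.
A = [(0.0, 0.0), (1.0, 0.0)]
B = [(0.0, 0.0), (1.0, 0.0), (1.0, 2.0)]
print(hausdorff(A, B))  # the extra point (1, 2) of B is at distance 2 from A
```

This sketch compares fixed clouds only and takes time proportional to the product of the cloud sizes; it does not minimize over isometries.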

The rigid shape recognition problem for non-fixed clouds

A, B

is harder because of infinitely many potential isometries that can match

A, B

exactly or approximately. Partial cases of this problem were studied for clouds representing surfaces [10] and when two clouds have a given isometric matching of one pair of points [11]. Shape Google by Ovsjanikov et al. practically extends these ideas to non-rigid shape recognition [12].

The most general framework for the isometry shape recognition of point cloud data was proposed by Mémoli and Sapiro [13]. They studied the Gromov-Hausdorff distance

d_{G H} (A, B) = inf_{f, g, M} HD (f (A), g (B))

minimized over all isometric embeddings

f : A \to M

and

g : B \to M

of given point clouds into a metric space M. Since the above definitions involve even more minimizations over infinitely many maps and spaces,

d_{G H}

can be only approximated. The Farthest Point Sampling (FPS) has a quadratic complexity in the number of points (Reference [13], Section 3.6) and was successfully tested on small clouds.

The proposed invariant mergegram extends the 0-dimensional persistence in the area of Topological Data Analysis (TDA), which grew from the theory of size functions [14]. TDA views a point cloud

A \subset R^{m}

not by fixing any distance threshold but across all scales s, for example, by blurring given points of A to balls of a variable radius s. The evolution of this growing union of balls is summarized by a persistence diagram, which is invariant under isometries of

R^{m}

. TDA can be combined with machine learning and statistical tools due to stability under noise, which was first proved by Cohen-Steiner et al. [15] and then extended to a very general form by Chazal et al. [16].

In dimension 0 the persistence diagram

PD (A)

for distance-based filtrations of a point cloud A consists of the pairs

(0, s) \in R^{2}

, where values of s are distance scales at which subsets of A merge by the single-linkage clustering. These scales equal half-lengths of edges in a Minimum Spanning Tree

MST (A)

. If distances between all points of A are known,

MST (A)

is a connected graph with the vertex set A and a minimal total length.
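A minimal Python sketch of this relation is given below, assuming Euclidean distances and a quadratic-time Kruskal construction of the Minimum Spanning Tree (chosen for brevity, not for speed). The function name `zero_dim_persistence` is an illustrative assumption. Each MST edge of length l contributes the pair (0, l/2), and the whole cloud contributes one pair with an infinite death.

```python
from itertools import combinations
from math import dist, inf

def zero_dim_persistence(points):
    """Pairs (0, s), where s runs over half-lengths of MST edges, plus (0, inf)."""
    n = len(points)
    parent = list(range(n))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    edges = sorted((dist(points[i], points[j]), i, j)
                   for i, j in combinations(range(n), 2))
    diagram = []
    for length, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:          # this edge belongs to the MST (Kruskal's rule)
            parent[ri] = rj
            diagram.append((0.0, length / 2))
    if n:
        diagram.append((0.0, inf))
    return diagram

cloud = [(0,), (1,), (3,), (7,), (10,)]   # points on the real line
print(zero_dim_persistence(cloud))
```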

Representing a point cloud A by

PD (A)

loses a lot of geometry of A, but gains stability under perturbations, which can be expressed in the case of point clouds as

BD (PD (A), PD (B)) \leq HD (A, B)

. Here, the bottleneck distance

BD

between diagrams is defined as a minimum

ϵ \geq 0

such that all pairs of

PD (A)

can be bijectively matched to ϵ-close points of

PD (B)

or to diagonal pairs

(s, s)

, and vice versa. Here, the ϵ-closeness of pairs

(a, c)

and

(b, d)

in

R^{2}

is measured in the distance

L_{\infty} = max {| a - b |, | c - d |}

.

The mergegram extends

PD (A)

to a stronger invariant, whose stability under perturbations in the above sense is proved in Section 5 for the first time. The idea of a mergegram is related to the Reeb graph [17] or the merge tree [18] for the sublevel set filtration of a scalar function. The mergegram

MG

is defined at a more abstract level for any clustering dendrogram, which opens a possibility to extend a Homologically Persistent Skeleton (HoPeS) visualizing most persistent cycles in point clouds [19].

Since any persistence diagram and a mergegram are unordered collections of pairs, the experiments in Section 6 use the neural network PersLay [20], whose output is invariant under permutations of input points by design. PersLay extends the neural network DeepSet [21] and introduces new layers to accept as an input any diagram of unordered points. In other related work, deep learning was recently applied to outputs of hierarchical clustering [22,23,24] and to 0-dimensional persistence [25,26].

3. Single-Linkage Clustering and the Invariant Mergegram of a Dendrogram

Example 1.

Figure 2 illustrates the key concepts before formal Definitions 1, 3 and 4 for the point cloud

A = {0, 1, 3, 7, 10}

in the real line

R

. Imagine that we gradually blur original data points by growing disks of the same radius s around the given points.

The disks of the closest points

0, 1

start overlapping at the scale

s = 0.5

when these points merge into one cluster

{0, 1}

. This merger is shown by blue arcs joined at the node at

s = 0.5

in the single-linkage dendrogram; see the left picture at the bottom of Figure 2. The persistence diagram

PD

in the middle picture at the bottom of Figure 2 represents this merger by the pair

(0, 0.5)

, meaning that a singleton cluster of (say) point 1 was born at the scale

s = 0

and then died later at

s = 0.5

by merging into another cluster of point 0.

When clusters

{0, 1, 3}

and

{7, 10}

merge at

s = 2

, this merger was earlier encoded in the persistence diagram by the single pair

(0, 2)

, meaning that one cluster inherited from (say) point 7 was born at

s = 0

and died at

s = 2

. The new mergegram in the bottom right picture of Figure 2 represents the above merger by the following two pairs. The pair

(1, 2)

means that the cluster

{0, 1, 3}

is merging at the current scale

s = 2

and was previously formed at the smaller scale

s = 1

. The pair

(1.5, 2)

means that another cluster

{7, 10}

is merging at the scale

s = 2

and was previously formed at

s = 1.5

.

The 0D persistence diagram represents the cluster of the whole cloud A by the pair

(0, + \infty)

because A was inherited from a singleton cluster starting from

s = 0

. The mergegram represents the same cluster A by the pair

(2, + \infty)

because A was formed during the last merger of

{0, 1, 3}

and

{7, 10}

at

s = 2

and continues to live as

s \to + \infty

.

In the above dendrogram, every vertical arc going up from a scale b to d contributes one pair

(b, d)

to the mergegram. So, both singleton clusters

{7}

,

{10}

merging at

s = 1.5

contribute one pair

(0, 1.5)

of multiplicity two, shown by two red circles in Figure 2.

Definition 1

(single-linkage clustering). Let A be a finite set in a metric space X with a distance

d : X \times X \to [0, + \infty)

. For a distance threshold, which can be called a scale s, any points

a, b \in A

should belong to one SL cluster if and only if there is a finite sequence

a = a_{1}, \dots, a_{m} = b \in A

such that any two successive points have a distance at most s, so

d (a_{i}, a_{i + 1}) \leq s

for

i = 1, \dots, m - 1

. Let

Δ_{S L} (A; s)

be the set of all single-linkage clusters at the scale s. For

s = 0

, any point

a \in A

forms a singleton cluster

{a}

. Representing each cluster from

Δ_{S L} (A; s)

over all

s \geq 0

by one point gives the single-linkage dendrogram

Δ_{S L} (A)

visualizing how clusters merge; see the first picture at the bottom of Figure 2.

For any

s > 0

, all SL clusters

Δ_{S L} (A; s)

can be obtained as connected components of a Minimum Spanning Tree

MST (A)

by removing all edges longer than s.

Definition 2

(partition set

P (A)

). For any set A, a partition of A is a finite set of disjoint non-empty subsets

A_{1}, \dots, A_{k} \subset A

, whose union is A. The single set A forms the single-block partition of A. The partition set

P (A)

consists of all partitions of A.

The partition set

P (A)

of the abstract set

A = {0, 1, 2}

consists of the five partitions

({0}, {1}, {2}), ({0, 1}, {2}), ({0, 2}, {1}), ({1, 2}, {0}), ({0, 1, 2}) .

For example, the collections

({0}, {1})

and

({0, 1}, {0, 2})

are not partitions of A.
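The count of five partitions can be checked by a short recursive enumeration. The Python sketch below is only an illustration; the generator name `partitions` is an assumption. Each partition of the remaining elements is extended by placing the first element either into a new block or into one of the existing blocks.

```python
def partitions(items):
    """Yield every partition of a finite list as a list of blocks (lists)."""
    items = list(items)
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in partitions(rest):
        # Option 1: the first element forms its own singleton block.
        yield [[first]] + part
        # Option 2: the first element joins one of the existing blocks.
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]

all_parts = list(partitions([0, 1, 2]))
for p in all_parts:
    print(p)
print(len(all_parts))
```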

Definition 3 below extends a dendrogram from (Reference [27], Section 3.1) to arbitrary (possibly, infinite) sets A. Every partition of A is finite by Definition 2. So, there is no need to add that an initial partition of A is finite. Hence, non-singleton sets are now allowed.

Definition 3

(dendrogram

Δ

of merge sets). A dendrogram Δ over any set A is a function

Δ : [0, + \infty) \to P (A)

of a scale

s \geq 0

satisfying the following conditions.

(a) There exists a scale

r \geq 0

such that

Δ (A; s)

is the single block partition for

s \geq r

.

(b) If

s \leq t

, then

Δ (A; s)

refines

Δ (A; t)

, so any set from

Δ (A; s)

is a subset of some set from

Δ (A; t)

. These inclusions of subsets of X induce the natural map

Δ_{s}^{t} : Δ (s) \to Δ (t)

.

(c) There are finitely many merge scales

s_{i}

such that

s_{0} = 0 and s_{i + 1} = sup {s ∣ the map Δ_{s'}^{s} is identity for s' \in [s_{i}, s)}, i = 0, \dots, m - 1 .

Since

Δ (A; s_{i}) \to Δ (A; s_{i + 1})

is not an identity map, there is a subset

B \in Δ (s_{i + 1})

, whose preimage consists of at least two subsets from

Δ (s_{i})

. This subset

B \subset X

is a merge set with the birth scale

s_{i}

. All sets of

Δ (A; 0)

are merge sets at the birth scale 0. The

life (B)

is the interval

[s_{i}, t)

from its birth scale

s_{i}

to its death scale

t = sup {s ∣ Δ_{s_{i}}^{s} (B) = B}

.

Dendrograms are often drawn as trees whose nodes represent all sets from the partitions

Δ (A; s_{i})

at merge scales. Edges of such a tree connect any set

B \in Δ (A; s_{i + 1})

with its preimages under

Δ (A; s_{i}) \to Δ (A; s_{i + 1})

. Figure 3 shows

Δ

for

A = {0, 1, 2}

.

In Figure 3, the partition

Δ (A; 1)

consists of

{0, 1}

and

{2}

. The maps

Δ_{s}^{t}

induced by inclusions respect the compositions in the sense that

Δ_{s}^{t} \circ Δ_{r}^{s} = Δ_{r}^{t}

for any

r \leq s \leq t

. For example,

Δ_{0}^{1} ({0}) = {0, 1} = Δ_{0}^{1} ({1})

and

Δ_{0}^{1} ({2}) = {2}

, so

Δ_{0}^{1}

is a well-defined map from the partition

Δ (A; 0)

of 3 singleton sets to

Δ (A; 1)

but is not an identity.

At the scale

s_{0} = 0

the merge sets

{0}, {1}

have

life = [0, 1)

, the merge set

{2}

has

life = [0, 2)

. At the scale

s_{1} = 1

the single merge set

{0, 1}

has

life = [1, 2)

. At the scale

s_{2} = 2

the single merge set

{0, 1, 2}

has

life = [2, + \infty)

. The first (Greek) letter in the word ‘dendrogram’ and a

Δ

-shape of a typical tree motivate the notation

Δ

.
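The life intervals listed above can be recomputed mechanically. The Python sketch below stores the dendrogram of Figure 3 as a hypothetical dictionary from merge scales to partitions (the names `figure3` and `lives` are illustrative assumptions). A set is born at the first merge scale where it appears and dies at the next merge scale where it is no longer a block.

```python
from math import inf

fs = frozenset
# Partitions of A = {0, 1, 2} at the merge scales 0, 1, 2, as described for Figure 3.
figure3 = {
    0.0: {fs({0}), fs({1}), fs({2})},
    1.0: {fs({0, 1}), fs({2})},
    2.0: {fs({0, 1, 2})},
}

def lives(dendrogram):
    """Map each merge set to its pair (birth, death), meaning life = [birth, death)."""
    scales = sorted(dendrogram)
    result = {}
    for idx, s in enumerate(scales):
        for block in dendrogram[s]:
            if block in result:
                continue                     # already born at a smaller scale
            death = inf
            for t in scales[idx + 1:]:
                if block not in dendrogram[t]:
                    death = t                # merged into a larger set at scale t
                    break
            result[block] = (s, death)
    return result

for block, life in lives(figure3).items():
    print(sorted(block), life)
```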

Condition (Definition 3 (a)) says that a partition of a set X is trivial for all large scales. Condition (Definition 3 (b)) means that, if the scale s is increasing, any sets from a partition

Δ (s)

can merge but cannot split into smaller sets. Condition (Definition 3 (c)) implies that there are only finitely many mergers, when two or more subsets of X merge into a larger merge set.

Lemma 1

([7], Lemma 3.3). Given a metric space

(X, d)

and a finite set

A \subset X

, the single-linkage dendrogram

Δ_{S L} (A)

from Definition 1 satisfies Definition 3.

A mergegram represents life spans of merge sets by pairs

(birth, death) \in R^{2}

.

Definition 4

(mergegram

MG (Δ)

). The mergegram of a dendrogram Δ has the pair

(birth, death) \in R^{2}

for each merge set B of Δ with

life (B) = [birth, death)

. If any life interval appears k times, the pair (birth,death) has the multiplicity k in

MG (Δ)

.

If our input is a point cloud A in a metric space, then the mergegram

MG (Δ_{S L} (A))

is an isometry invariant of A because

Δ_{S L} (A)

depends only on inter-point distances. Though

Δ_{S L} (A)

as any dendrogram is unstable under perturbations of points, the key advantage of

MG (Δ_{S L} (A))

is its stability, which will be proved in Theorem 4.
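As a rough illustration, the mergegram of a small Euclidean cloud can be computed by a Kruskal-style pass over all pairwise distances. The Python sketch below makes two assumptions: merge scales are half-distances, as in Example 1, and a simple quadratic-time loop replaces any faster MST algorithm. The function name `mergegram` is hypothetical.

```python
from itertools import combinations
from math import dist, inf

def mergegram(points):
    """Sorted list of pairs (birth, death) over all merge sets of the SL dendrogram."""
    n = len(points)
    parent = list(range(n))
    birth = [0.0] * n                  # birth scale of the cluster rooted at each index

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    edges = sorted((dist(points[i], points[j]) / 2, i, j)
                   for i, j in combinations(range(n), 2))
    pairs = []
    for s, i, j in edges:
        ri, rj = find(i), find(j)
        if ri == rj:
            continue                   # both points are already in one cluster
        for r in (ri, rj):
            if birth[r] < s:           # skip zero-length lives created by equal scales
                pairs.append((birth[r], s))
        parent[ri] = rj
        birth[rj] = s                  # the merged cluster is born at the scale s
    if n:
        pairs.append((birth[find(0)], inf))
    return sorted(pairs)

cloud = [(0,), (1,), (3,), (7,), (10,)]   # the cloud A from Example 1
print(mergegram(cloud))
```

Since only distances enter the computation, applying any isometry to `cloud` leaves the output unchanged up to floating-point rounding.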

Figure 4 shows the metric space

X = {a, b, c, p, q}

with distances defined by the shortest path metric induced by the specified edge-lengths; see the distance matrix.

The dendrogram

Δ

in the first picture of Figure 5 leads to the mergegram as follows:

each of the 1-point sets {b}, {c}, {p}, {q} has the pair (0, 1), so its multiplicity is 4;
each of the merge sets {b, c} and {p, q} has the pair (1, 2), so its multiplicity is 2;
the singleton set {a} has the pair (0, 3);
the merge set {b, c, p, q} has the pair (2, 3);
the full set {a, b, c, p, q} is formed at s = 3 and continues to live for all larger scales, hence having the pair (3, + \infty).

4. Explicit Relations between 0-Dimensional Persistence and Mergegram

This section recalls the concept of persistence and then shows how any 0D persistence and dendrogram in general position can be reconstructed from a mergegram.

Definition 5

(persistence module

V

). A persistence module

V

over the real numbers

R

is a collection of vector spaces

V_{t}

,

t \in R

with linear maps

v_{s}^{t} : V_{s} \to V_{t}

,

s \leq t

such that

v_{t}^{t}

is the identity on

V_{t}

, and the composition is respected:

v_{s}^{t} \circ v_{r}^{s} = v_{r}^{t}

for any

r \leq s \leq t

.

The set of real numbers can be considered as a category

R

in the following sense. The objects of

R

are all real numbers. Any real numbers

a \leq b

define a single morphism

a \to b

. The composition of morphisms

a \to b

and

b \to c

is the morphism

a \to c

. In the language of category theory, a persistence module is a functor from

R

to the category of vector spaces. A basic example of a persistence module

V

is an interval module. An interval J between points

p < q

in

R

can be one of the following types: closed

[p, q]

, open

(p, q)

, half-open or half-closed

[p, q)

and

(p, q]

, all encoded as follows:

[p^{-}, q^{+}] : = [p, q], [p^{+}, q^{-}] : = (p, q), [p^{+}, q^{+}] : = (p, q], [p^{-}, q^{-}] : = [p, q) .

The endpoints

p, q

can have the infinite values

\pm \infty

, but without superscripts.

Example 2

(interval module

I (J)

). Let

J \subset R

be an interval. The interval module

I (J)

is the persistence module defined by the following vector spaces

I_{s}

and linear maps

i_{s}^{t} : I_{s} \to I_{t}

I_{s} = \begin{cases} Z_{2}, & \text{for } s \in J, \\ 0, & \text{otherwise}; \end{cases} \quad i_{s}^{t} = \begin{cases} id, & \text{for } s, t \in J, \\ 0, & \text{otherwise} \end{cases} \quad \text{for any } s \leq t .

The direct sum

W = U \oplus V

of persistence modules

U, V

is defined as the persistence module with the vector spaces

W_{s} = U_{s} \oplus V_{s}

and linear maps

w_{s}^{t} = u_{s}^{t} \oplus v_{s}^{t}

.
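For a direct sum of interval modules over Z_2, dimensions and ranks reduce to counting intervals. The Python sketch below assumes half-open intervals [p, q) given as pairs (p, q); the function names `dim` and `rank` and the sample list are illustrative assumptions. The space at a scale s has one copy of Z_2 per interval containing s, and the map from s to t is the identity on exactly those summands whose interval contains both s and t.

```python
from math import inf

def dim(intervals, s):
    """Dimension of the vector space at scale s in a direct sum of modules I([p, q))."""
    return sum(1 for p, q in intervals if p <= s < q)

def rank(intervals, s, t):
    """Rank of the structure map from scale s to scale t, for s <= t."""
    if s > t:
        raise ValueError("the map is defined only for s <= t")
    # An interval contains both s and t exactly when p <= s and t < q.
    return sum(1 for p, q in intervals if p <= s and t < q)

# Hypothetical direct sum of five interval modules.
intervals = [(0, 1), (0, 1), (0, 2), (1, 2), (2, inf)]
print(dim(intervals, 0.5), dim(intervals, 1.0), rank(intervals, 0.5, 1.5))
```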

We illustrate the abstract concepts above by geometric constructions. Let

f : X \to R

be a continuous function on a topological space. The sublevel sets

X_{s}^{f} = f^{- 1} ((- \infty, s])

form nested subspaces

X_{s}^{f} \subset X_{t}^{f}

for any

s \leq t

. The inclusions of the sublevel sets respect compositions similarly to a dendrogram

Δ

in Definition 3. On a metric space X with a metric

d : X \times X \to [0, + \infty)

, a typical example of a function

f : X \to R

is the distance

d_{A}

to a finite subset

A \subset X

. For any point

p \in X

, let

d_{A} (p)

be the distance from p to a closest point of A. For any

r \geq 0

, the preimage

X_{r}^{d_{A}} = d_{A}^{- 1} ((- \infty, r]) = {p \in X ∣ d_{A} (p) \leq r}

is the union of closed balls with radius r and centers at all points

q \in A

. For example,

X_{0}^{d_{A}} = {d_{A}}^{- 1} ((- \infty, 0]) = A

and

X_{+ \infty}^{d_{A}} = {d_{A}}^{- 1} (R) = X

.

Any continuous function

f : X \to R

induces the inclusion

X_{s}^{f} \subset X_{r}^{f}

for any parameters

s \leq r

. Then, all sublevel sets

X_{s}^{f}

form a nested sequence of subspaces within X. The above construction of a filtration

{X_{s}^{f}}

can be considered as a functor from

R

to the category of topological spaces. Below, we discuss the simplest case of dimension 0.

Example 3

(persistent homology). For any topological space X, the 0-dimensional homology

H_{0} (X)

is the vector space (with coefficients

Z_{2}

) generated by all connected components of X. Let

{X_{s}}

be any filtration of nested spaces, e.g., sublevel sets

X_{s}^{f}

based on a continuous function

f : X \to R

. The inclusions

X_{s} \subset X_{r}

for

s \leq r

induce the linear maps between homology groups

H_{0} (X_{s}) \to H_{0} (X_{r})

and define the persistent homology

{H_{0} (X_{s})}

, which satisfies the conditions of a persistence module from Definition 5.

If X is a finite set of m points, then

H_{0} (X)

is the direct sum

Z_{2}^{m}

of m copies of

Z_{2}

.

The persistence modules that are decomposable as direct sums of interval modules can be described in a simple combinatorial way by persistence diagrams in

R^{2}

.

Definition 6

(persistence diagram

PD (V)

). Let a persistence module

V

be decomposed as a direct sum of interval modules:

V ≅ ⨁_{l \in L} I (p_{l}^{*}, q_{l}^{*})

, where * is + or −. The persistence diagram

PD (V)

is the multiset

PD (V) = {(p_{l}, q_{l}) ∣ l \in L} ∖ {p = q} \subset R^{2}

.

The 0-dimensional persistence diagram of a topological space X with a continuous function

f : X \to R

is denoted by

PD {H_{0} (X_{s}^{f})}

. Lemma 2 will prove that the merge module

M (Δ)

of any dendrogram

Δ

is decomposable into interval modules. The mergegram

MG (Δ)

will be interpreted as the persistence diagram of

M (Δ)

.

The following result describes how the persistence diagram

PD

of the distance-based filtration of any point cloud A can be obtained from the mergegram

MG (Δ_{SL} (A))

.

Theorem 1

([7], Theorem 5.3). For any finite set A in a metric space

(X, d)

, let

d_{A} : X \to R

be the distance to A. Let the mergegram

MG (Δ_{S L} (A))

be a multiset

{(b_{i}, d_{i})}_{i = 1}^{k}

, where some pairs can be repeated. Then, the persistence diagram

PD {H_{0} (X_{s}^{d_{A}})}

is the difference of the multisets

{(0, d_{i})}_{i = 1}^{k} - {(0, b_{i})}_{i = 1}^{k}

containing each pair

(0, s)

exactly

# d - # b

times, where

# b

is the number of births

b_{i} = s

, and

# d

is the number of deaths

d_{i} = s

. All trivial pairs

(0, 0)

are ignored, and, alternatively, we take

{(0, d_{i})}_{i = 1}^{k}

only with

d_{i} > 0

.

Theorem 1 is illustrated by Example 1, where

A = {0, 1, 3, 7, 10}

has the diagram

PD (A) = {(0, 0.5), (0, 1), (0, 1.5), (0, 2), (0, + \infty)}

obtained from the mergegram

MG (Δ_{SL} (A)) = {(0, 0.5), (0, 0.5), (0, 1), (0, 1.5), (0, 1.5), (0.5, 1), (1, 2), (1.5, 2), (2, + \infty)}

as follows: one pair

(0, 0.5) \in PD (A)

comes from two deaths and one birth

s = 0.5

in

MG (Δ_{SL} (A))

. Similarly, each of the pairs

(0, 1), (0, 1.5), (0, 2) \in PD (A)

comes from two deaths and one birth equal to the same scale s. The cloud

B = {0, 4, 6, 9, 10} \subset R

in Reference [7] (Example 1.1) has exactly the same

PD (B) = PD (A)

but different

MG (Δ_{SL} (B)) \neq MG (Δ_{SL} (A))

. This example jointly with Theorem 1 justifies that the mergegram is strictly stronger than 0D persistence as an isometry invariant of a point cloud.
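The counting in this illustration can be sketched in a few lines of Python. The function name `persistence_from_mergegram` is an assumption. For every scale s > 0, the pair (0, s) is kept with multiplicity equal to the number of deaths at s minus the number of births at s, which agrees with the worked example, where two deaths and one birth at s = 0.5 give one pair (0, 0.5).

```python
from collections import Counter
from math import inf

def persistence_from_mergegram(mg):
    """0D persistence pairs (0, s) obtained by counting deaths and births at each s."""
    deaths = Counter(d for _, d in mg)
    births = Counter(b for b, _ in mg)
    diagram = []
    for s in sorted(deaths):
        if s > 0:                                   # trivial pairs (0, 0) are ignored
            multiplicity = deaths[s] - births[s]    # a Counter returns 0 for missing keys
            diagram.extend([(0.0, s)] * multiplicity)
    return diagram

# The mergegram of A = {0, 1, 3, 7, 10} as listed in the text.
mg_A = [(0.0, 0.5), (0.0, 0.5), (0.0, 1.0), (0.0, 1.5), (0.0, 1.5),
        (0.5, 1.0), (1.0, 2.0), (1.5, 2.0), (2.0, inf)]
print(persistence_from_mergegram(mg_A))
```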

The New Reconstruction Theorem 2 below can be contrasted with the weakness of 0D persistence

PD {H_{0} (X_{s}^{d_{A}})}

consisting of only pairs

(0, s)

whose finite deaths are half-lengths of edges in a Minimum Spanning Tree

MST (A)

. In Example 1, these scales

s = 0.5, 1, 1.5, 2

are insufficient to reconstruct the SL dendrogram in Figure 2. Such a unique reconstruction is possible by using the richer invariant mergegram as follows.

Theorem 2

(from a mergegram to a dendrogram). Let A be a finite point cloud in general position in the sense that all merge scales of A in a dendrogram Δ from Definition 3 are different. Then, the dendrogram Δ can be reconstructed from its mergegram

MG (Δ)

, uniquely up to a permutation of nodes in Δ at scale

s = 0

.

Proof.

Consider all merge scales one by one in the increasing order starting from the smallest. The general position implies that only two clusters merge at any merge scale. For any current scale s, the mergegram contains exactly two pairs

(b_{1}, s)

and

(b_{2}, s)

.

For a smallest merge scale

s > 0

, the births should be

b_{1} = b_{2} = 0

. We start drawing a dendrogram

Δ

by merging any two points of A at this smallest scale s. To realize a merger at any larger s, we should select two clusters representing

(b_{1}, s)

and

(b_{2}, s)

.

If

b_{i} = 0

, then we take any of the unmerged points of A. If

b_{i} > 0

, then the already constructed dendrogram should contain a unique non-singleton cluster determined by the scale

b_{i} \in (0, s)

. Hence, at any merge scale s, we know how to select two clusters to merge. The only choice comes from choosing points of A or permuting nodes of

Δ

. □

Following the above proof of Theorem 2 for the cloud

A = {0, 1, 3, 7, 10}

in Example 1, the first two pairs

(0, 0.5) \in MG (Δ_{SL} (A))

indicate that we should merge two points of A at

s = 0.5

. The scale

s = 0.5

uniquely determines this 2-point cluster.

The next two pairs

(0, 1), (0.5, 1)

mean that the above cluster born at

s = 0.5

should merge at

s = 1

with a singleton cluster (any free point of A). The resulting 3-point cluster is uniquely determined by its merge scale

s = 1

. The further two pairs

(0, 1.5), (0, 1.5)

say that a new 2-point cluster is formed at

s = 1.5

by the two remaining points of A.

The final pairs

(1, 2), (1.5, 2)

tell us to merge at

s = 2

the two clusters formed earlier at

s = 1

and

s = 1.5

. The resulting dendrogram

Δ

has the expected combinatorial structure as in Figure 2, though we can draw

Δ

in another way by permuting points of A.
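The steps of this reconstruction can be sketched in Python under the general position assumption of Theorem 2. The function name `reconstruct_dendrogram` and the output format are illustrative assumptions: a leaf is an integer label, and a merger at a scale s is a nested tuple (s, child, child). Leaves are labeled in the order in which they are used, which reflects the freedom to permute points of A.

```python
from collections import defaultdict
from math import inf

def reconstruct_dendrogram(mg):
    """Rebuild a dendrogram as nested tuples (scale, child, child) from a mergegram."""
    births_at = defaultdict(list)
    for b, d in mg:
        if d != inf:
            births_at[d].append(b)
    formed = {}        # merge scale -> cluster formed at that scale and not yet merged
    next_leaf = 0
    for s in sorted(births_at):
        births = sorted(births_at[s])
        if len(births) != 2:
            raise ValueError("general position requires two pairs per merge scale")
        children = []
        for b in births:
            if b == 0:
                children.append(next_leaf)      # any unmerged point of the cloud
                next_leaf += 1
            else:
                children.append(formed.pop(b))  # unique cluster born at the scale b
        formed[s] = (s, children[0], children[1])
    if not formed:
        return 0                                # a single point and no mergers
    (root,) = formed.values()
    return root

mg_A = [(0.0, 0.5), (0.0, 0.5), (0.0, 1.0), (0.0, 1.5), (0.0, 1.5),
        (0.5, 1.0), (1.0, 2.0), (1.5, 2.0), (2.0, inf)]
print(reconstruct_dendrogram(mg_A))
```

In the output for this mergegram, the leaves 0 and 1 merge at 0.5, leaf 2 joins them at 1, the leaves 3 and 4 merge at 1.5, and the two clusters merge at 2.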

5. Stability of the Mergegram for Any Single-Linkage Dendrogram

This section fully proves the stability of a mergegram, which was stated in Reference [7] (Theorem 7.4), without proving key Lemmas 2 and 3. For simplicity, we consider vector spaces with coefficients only in

Z_{2} = {0, 1}

, which can be replaced by any field.

Definition 7 introduces homomorphisms between persistence modules, which are needed to state the stability of persistence diagrams

PD {H_{0} (X_{s}^{f})}

under perturbations of a function

f : X \to R

. This result will imply a stability for the mergegram

MG (Δ_{S L} (A))

for the dendrogram

Δ_{S L} (A)

of the single-linkage clustering of a set

A \subset X

.

Definition 7

(a homomorphism between persistence modules). Let

U

and

V

be persistence modules over

R

. A homomorphism

U \to V

of a degree

δ \in R

consists of linear maps

ϕ_{t} : U_{t} \to V_{t + δ}

,

t \in R

, such that the diagram of these maps commutes, so

ϕ_{t} \circ u_{s}^{t} = v_{s + δ}^{t + δ} \circ ϕ_{s}

for all

s \leq t

.

Let

{Hom}^{δ} (U, V)

be all homomorphisms

U \to V

of degree δ. Persistence modules

U, V

are isomorphic if there are inverse homomorphisms

U \to V \to U

of degree 0.

For a persistence module

V

with maps

v_{s}^{t} : V_{s} \to V_{t}

, the simplest example of a homomorphism of a degree

δ \geq 0

is

1_{V}^{δ} : V \to V

defined by the maps

v_{s}^{s + δ}

,

s \in R

. So,

v_{s}^{t}

defining the structure of

V

shift all vector spaces

V_{s}

by the difference

δ = t - s

.

Interleaved modules defined below algebraically generalize a geometric perturbation of a set X in terms of (the homology of) its sublevel sets

X_{s}

.

Definition 8

(interleaving distance ID). Persistence modules

U

and

V

are called δ-interleaved if there are homomorphisms

ϕ \in {Hom}^{δ} (U, V)

and

ψ \in {Hom}^{δ} (V, U)

such that

ϕ \circ ψ = 1_{V}^{2 δ} and ψ \circ ϕ = 1_{U}^{2 δ}

. The interleaving distance between the persistence modules

U

and

V

is

ID (U, V) = inf {δ \geq 0 ∣ U and V are δ - interleaved} .

If

f, g : X \to R

are continuous functions such that

\| f - g \|_{\infty} \leq δ

in the

L_{\infty}

-distance, the modules

H_{k} {f^{- 1} (- \infty, s]}

,

H_{k} {g^{- 1} (- \infty, s]}

are

δ

-interleaved for any k [15]. The last conclusion extends to persistence diagrams for the bottleneck distance below.

Definition 9

(bottleneck distance BD). Let

C, D

be multisets of finitely many points

(p, q) \in R^{2}

,

p < q

, of finite multiplicity and all diagonal points

(p, p) \in R^{2}

of infinite multiplicity. For

δ \geq 0

, a δ-matching is a bijection

h : C \to D

such that

{| h (a) - a |}_{\infty} \leq δ

in the

L_{\infty}

-distance for any point

a \in C

. The bottleneck distance between modules

U, V

is defined as

BD (U, V) = \inf {δ ∣ there is a δ - matching between PD (U), PD (V)}

.
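For very small diagrams, the definition can be tested by brute force. The Python sketch below assumes that all pairs are finite and uses the standard device of padding each diagram with diagonal slots, one for every point of the other diagram. Matching a pair (p, q) to the diagonal costs (q - p)/2 in the L_∞ distance, and two diagonal slots are matched at zero cost. The function names are hypothetical, and the factorial running time makes this suitable only for a handful of points.

```python
from itertools import permutations
from math import inf

DIAGONAL = None   # marker for a slot on the diagonal

def cost(x, y):
    """L_infinity cost of matching x to y, where DIAGONAL means a diagonal point."""
    if x is DIAGONAL and y is DIAGONAL:
        return 0.0
    if x is DIAGONAL or y is DIAGONAL:
        p, q = y if x is DIAGONAL else x
        return (q - p) / 2                 # distance from (p, q) to the diagonal
    return max(abs(x[0] - y[0]), abs(x[1] - y[1]))

def bottleneck_bruteforce(C, D):
    """Smallest delta admitting a delta-matching between finite diagrams C and D."""
    left = list(C) + [DIAGONAL] * len(D)
    right = list(D) + [DIAGONAL] * len(C)
    best = inf
    for perm in permutations(range(len(right))):
        worst = max((cost(left[i], right[j]) for i, j in enumerate(perm)),
                    default=0.0)
        best = min(best, worst)
    return best

print(bottleneck_bruteforce([(0.0, 1.0)], [(0.0, 1.2)]))
print(bottleneck_bruteforce([(0.0, 0.5)], []))
```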

Historically, stability of persistence for sequences of sublevel sets was extended as Theorem 3 to q-tame persistence modules. A persistence module

V

is q-tame if any non-diagonal square in the persistence diagram

PD (V)

contains only finitely many points; see Reference [16] (Section 2.8). Any finitely decomposable persistence module is q-tame.

Theorem 3

(stability of persistence modules [16], isometry Theorem 4.11). Let

U

and

V

be q-tame persistence modules. Then,

ID (U, V) = BD (PD (U), PD (V))

, where

ID

is the interleaving distance, and

BD

is the bottleneck distance between persistence modules.

Definition 10

(merge module

M (Δ)

). For any dendrogram Δ on a finite set X, the merge module

M (Δ)

consists of the vector spaces

M_{s} (Δ)

,

s \in R

, and linear maps

m_{s}^{t} : M_{s} (Δ) \to M_{t} (Δ)

,

s \leq t

. For any

s \in R

and

A \in Δ (s)

, the space

M_{s} (Δ)

has the generator or a basis vector

[A] \in M_{s} (Δ)

. For

s < t

and any set

A \in Δ (s)

, if the image of A under

Δ_{s}^{t}

coincides with

A \subset X

, so

Δ_{s}^{t} (A) = A

, then

m_{s}^{t} ([A]) = [A]

, else

m_{s}^{t} ([A]) = 0

, see Figure 6.
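A small Python sketch can make the merge module concrete for the dendrogram on A = {0, 1, 2} with merge scales 0, 1, 2 discussed for Figure 3. The list `dendrogram` and the function names below are illustrative assumptions. The dimension of M_s (Δ) is the number of sets in the partition Δ (s), and the rank of m_s^t is the number of sets of Δ (s) that remain unchanged in Δ (t), because all other generators are sent to 0.

```python
fs = frozenset
# Pairs (merge scale, partition) sorted by scale; the partition is constant in between.
dendrogram = [
    (0.0, fs({fs({0}), fs({1}), fs({2})})),
    (1.0, fs({fs({0, 1}), fs({2})})),
    (2.0, fs({fs({0, 1, 2})})),
]

def partition_at(dendrogram, s):
    """Partition at scale s; empty for s < 0, where the module is assumed to vanish."""
    current = fs()
    for scale, part in dendrogram:
        if scale <= s:
            current = part
    return current

def merge_module_dim(dendrogram, s):
    """One generator [A] for every set A of the partition at scale s."""
    return len(partition_at(dendrogram, s))

def merge_map_rank(dendrogram, s, t):
    """Number of generators [A] at scale s that are mapped to [A] at scale t >= s."""
    return len(partition_at(dendrogram, s) & partition_at(dendrogram, t))

print(merge_module_dim(dendrogram, 0.5), merge_map_rank(dendrogram, 0.5, 1.5))
```

Here only the set {2} survives unchanged from the scale 0.5 to the scale 1.5, so the rank of the corresponding map is 1.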

In a dendrogram

Δ

from Definition 3, any merge set A of

Δ

has

life (A) = [b, d)

from its birth scale b to its death scale d. Lemmas 2 and 3 are proved for the first time.

Lemma 2

(merge module decomposition). For any dendrogram Δ from Definition 3, the merge module

M (Δ) ≅ ⨁_{A} I (life (A))

decomposes over all merge sets A.

Proof

(Proof of Lemma 2). The goal is to prove that

M (Δ) ≅ ⨁_{A} I (life (A))

. Recall that the interval module

I (life (A))

consists of only vector spaces 0 and

Z_{2}

. For a scale r, let

I_{r} (life (A))

be its vector space, whose generator is denoted by

[I_{r} (life (A))]

. Define

ψ_{r} : M_{r} (Δ) \to ⨁_{A} I_{r} (life (A)) such that [A] \mapsto [I_{r} (life (A))] for all A \in Δ (r),

ϕ_{r} : ⨁_{A} I_{r} (life (A)) \to M_{r} (Δ) such that [I_{r} (life (A))] \mapsto [A] for all life (A) containing r .

We will first prove that

ϕ_{r}

is well-defined. If

r \in life (A)

then

[A] \in M_{r} (Δ)

. We know that

M_{r} (Δ)

is generated by the elements [A] for the sets

A \in Δ (r)

for which

r \in life (A)

. Thus, the compositions satisfy

ϕ_{r} \circ ψ_{r} = {id}_{r}

and

ψ_{r} \circ ϕ_{r} = {id}_{r}

. It remains to prove that morphisms correctly behave under the functors

ψ, ϕ

. The proofs for both cases are essentially the same; thus, we will prove it only for

ψ

. The goal is to prove that the following square commutes for all s \leq t:

ψ_{t} \circ m_{s}^{t} = i_{s}^{t} \circ ψ_{s} .

Here,

i_{s}^{t}

is the direct sum of the corresponding maps of interval modules

⨁_{A} {(i_{s}^{t})}^{A}

. Let

[A]

be an arbitrary generator of

M_{s} (Δ)

. There are two possibilities for how

m_{s}^{t}

can map

[A]

. If

t \in life (A)

, then

m_{s}^{t} ([A]) = [A] \in M_{t} (Δ)

and by definition

ψ_{t} \circ m_{s}^{t} ([A]) = [I_{t} (life (A))] .

Since both

s, t \in life (A)

, we also have that

i_{s}^{t} \circ ψ_{s} ([A]) = [I_{t} (life (A))] = ψ_{t} \circ m_{s}^{t} ([A]) .

Assume now that

t \notin life (A)

. Then,

m_{s}^{t} ([A]) = 0

; thus,

ψ_{t} (m_{s}^{t} ([A])) = 0

. On the other hand,

i_{s}^{t} \circ ψ_{s} ([A]) = i_{s}^{t} ([I_{s} (life (A))]) \in I_{t} (life (A))

. Since

t \notin life (A)

, this space is zero, and we get

i_{s}^{t} \circ ψ_{s} ([A]) = 0

. □
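As a hedged illustration of Lemma 2, the following Python sketch lists every merge set of a single-linkage dendrogram on a finite subset of the real line together with its life interval [b, d). It then reads the dimension of M_s (Δ) as the number of life intervals containing s and evaluates the maps m_s^t on generators. The function names and the parameter scale are assumptions of this sketch, not the authors' code. The default scale = 0.5 assumes that two clusters at a distance d merge at the scale d/2, which corresponds to growing balls of radius s around the points.

```python
import math
from itertools import combinations


def merge_sets_with_life(points, scale=0.5):
    """Map every merge set (a frozenset of point indices) to its life (birth, death).

    Assumption of this sketch: clusters at distance d merge at the scale `scale * d`.
    """
    n = len(points)
    comp = [frozenset([i]) for i in range(n)]   # comp[i] is the current cluster of point i
    birth = {c: 0.0 for c in comp}              # every singleton is born at the scale 0
    life = {}
    # Kruskal-style sweep: process pairwise distances in increasing order.
    edges = sorted((abs(points[i] - points[j]), i, j)
                   for i, j in combinations(range(n), 2))
    for d, i, j in edges:
        a, b = comp[i], comp[j]
        if a == b:
            continue                            # both points are already in one cluster
        s = scale * d
        for c in (a, b):
            if birth[c] < s:                    # skip clusters with an empty life [s, s)
                life[c] = (birth[c], s)
        merged = a | b
        birth[merged] = s
        for k in merged:
            comp[k] = merged
    life[comp[0]] = (birth[comp[0]], math.inf)  # the full set never dies
    return life


def dimension(life, s):
    """dim of M_s: the number of merge sets A whose life [b, d) contains s."""
    return sum(1 for b, d in life.values() if b <= s < d)


def merge_map(life, A, t):
    """m_s^t([A]) for s in life(A) and t >= s: [A] survives iff t is in life(A)."""
    return A if t < life[A][1] else None        # None stands for the zero vector


life = merge_sets_with_life([0, 1, 3, 7, 10])
for A, (b, d) in sorted(life.items(), key=lambda item: item[1]):
    print(sorted(A), (b, d))
print(dimension(life, 0.75))
```

For the cloud {0, 1, 3, 7, 10} under the stated convention, the sketch produces nine life intervals, one per summand in the decomposition of Lemma 2. Clusters whose birth and death scales coincide are skipped because their life [s, s) is empty.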

Lemma 3

(interleaving of merge modules). If subsets

A, B

of a metric space

(X, d)

have

HD (A, B) = δ

, then the merge modules

M (Δ_{S L} (A))

,

M (Δ_{S L} (B))

are δ-interleaved.

Proof

(Proof of Lemma 3). The equality

HD (A, B) = δ

means that A is covered by the union of closed balls that have the radius

δ

and centers at all points

b \in B

. This union is the preimage

d_{B}^{- 1} ([0, δ])

, i.e.,

A \subset d_{B}^{- 1} ([0, δ])

. Extending the distance values by

s \geq 0

, we get

d_{A}^{- 1} ([0, s]) \subset d_{B}^{- 1} ([0, s + δ])

and similarly

d_{B}^{- 1} ([0, s]) \subset d_{A}^{- 1} ([0, s + δ])

.
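The two inclusions above can be checked numerically on a toy example. The sketch below is only an illustration under stated assumptions: the helper names dist_to_set and hausdorff, the clouds A, B and the grid of test points are all hypothetical. It computes HD (A, B) for finite clouds and verifies the pointwise form of the inclusions, namely that the distance functions d_A and d_B differ by at most δ at every test point.

```python
import math


def dist_to_set(x, S):
    """d_S(x): the distance from a point x to the closest point of a finite set S."""
    return min(math.dist(x, p) for p in S)


def hausdorff(A, B):
    """HD(A, B): the smallest delta such that each set lies in the delta-offset of the other."""
    return max(max(dist_to_set(a, B) for a in A),
               max(dist_to_set(b, A) for b in B))


A = [(0.0, 0.0), (4.0, 0.0)]
B = [(0.0, 0.0), (1.0, 0.0), (4.0, 0.0)]
delta = hausdorff(A, B)

# If |d_A(x) - d_B(x)| <= delta for all x, then any x with d_A(x) <= s has
# d_B(x) <= s + delta, and vice versa, which is exactly the pair of inclusions.
grid = [(i / 2, j / 2) for i in range(-4, 13) for j in range(-4, 5)]
ok = all(abs(dist_to_set(x, A) - dist_to_set(x, B)) <= delta + 1e-12 for x in grid)
print(delta, ok)
```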

Let U be an arbitrary set in

Δ_{S L} (A)

. Writing M (A; r) for the space M_{r} (Δ_{S L} (A)), define the map

ϕ_{r} : M (A; r) \to M (B; r + δ)

ϕ_{r} ([U]) = \begin{cases} [U], & \text{if } r + δ \in life (Δ_{S L} (B), U), \\ 0, & \text{otherwise}. \end{cases}

Symmetrically, for any

V \in Δ_{S L} (B)

, we define

ψ_{r} : M (B; r) \to M (A; r + δ)

ψ_{r} ([V]) = \begin{cases} [V], & \text{if } r + δ \in life (Δ_{S L} (A), V), \\ 0, & \text{otherwise}. \end{cases}

In the notation above,

life (Δ_{S L} (B), U)

is the

life (U)

in the dendrogram

Δ_{S L} (B)

. If

U \notin Δ_{S L} (B) (t)

for all values t, then

life (Δ_{S L} (B), U) = \emptyset

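. By symmetry, it is enough to prove that the following two diagrams commute: the square

ϕ_{t} \circ m_{s}^{t} = m_{s + δ}^{t + δ} \circ ϕ_{s}

for all

s \leq t

and the triangle

ψ_{s} \circ ϕ_{s - δ} = m_{s - δ}^{s + δ}

.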

We note first that, if

[a, b) = life (Δ_{S L} (A), U)

, then

life (Δ_{S L} (B), U) \subseteq [a - δ, b + δ)

.

We begin by proving commutativity of the first diagram. Let U be an arbitrary element of

Δ_{S L} (A) (s)

. If

t \notin life (Δ_{S L} (A), U)

then

ϕ_{t} \circ m_{s}^{t} = 0

. If

s + δ \notin life (Δ_{S L} (B), U)

or

t + δ \notin life (Δ_{S L} (B), U)

then we are done. Since

t \notin life (Δ_{S L} (A), U)

, it follows that

t + δ \notin life (Δ_{S L} (B), U)

. Thus, under the given assumptions, the diagram commutes.

Assume now that

t + δ \notin life (Δ_{S L} (B), U)

. Then,

ϕ_{t} (m_{s}^{t} ([U])) = 0 = m_{s + δ}^{t + δ} (ϕ_{s} ([U]))

. In the last case, we assume that

t \in life (Δ_{S L} (A), U)

and

t + δ \in life (Δ_{S L} (B), U)

. In this case, obviously,

s + δ \in life (Δ_{S L} (B), U)

; thus,

ϕ_{t} (m_{s}^{t} ([U])) = [U] = m_{s + δ}^{t + δ} (ϕ_{s} ([U]))

.

For the second diagram, assume now that

U \in M (Δ_{S L} (A)) (s - δ)

. Assume first that

s \notin life (Δ_{S L} (B), U)

, then

s + δ \notin life (Δ_{S L} (B), U)

and

m_{s - δ}^{s + δ} ([U]) = 0 = ψ_{s} (ϕ_{s - δ} ([U]))

.

Assume then that

s \in life (Δ_{S L} (B), U)

. Now, the outcome of both maps

ψ_{s}

and

m_{s - δ}^{s + δ}

depends on whether

s + δ \in life (Δ_{S L} (A), U)

; thus,

m_{s - δ}^{s + δ} ([U]) = ψ_{s} (ϕ_{s - δ} ([U]))

. Since all the diagrams commute, the required conclusion follows. □

Theorem 4

(stability of a mergegram). The mergegrams of any finite point clouds

A, B

in a metric space

(X, d)

satisfy

BD (MG (Δ_{S L} (A)), MG (Δ_{S L} (B))) \leq HD (A, B)

. Hence, any small perturbation of a cloud A in the Hausdorff distance leads to a similarly small perturbation in the bottleneck distance for its mergegram

MG (Δ_{S L} (A))

.

Proof.

The given clouds

A, B \subset X

with

HD (A, B) = δ

have

δ

-interleaved merge modules by Lemma 3, so

ID (M (Δ_{S L} (A)), M (Δ_{S L} (B))) \leq δ

. Any merge module

M (Δ)

is finitely decomposable by Lemma 2 and, hence, q-tame. The corresponding mergegram

MG (M (Δ))

satisfies Theorem 3, so

BD (MG (Δ_{S L} (A)), MG (Δ_{S L} (B))) \leq δ

. □
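Theorem 4 can be sanity-checked on a tiny example. The Python sketch below is an illustration under stated assumptions, not the authors' implementation (their C++ code is at Reference [7]). All function names are hypothetical; scale = 0.5 assumes that clusters at a distance d merge at the scale d/2; the bottleneck distance is computed by brute force, which is practical only for very small diagrams.

```python
import math
from itertools import combinations


def mergegram(points, scale=0.5):
    """(birth, death) pairs of all merge sets of the single-linkage dendrogram.

    Assumption of this sketch: clusters at distance d merge at the scale `scale * d`.
    """
    n = len(points)
    parent = list(range(n))
    birth = [0.0] * n                       # birth[r]: birth scale of the cluster rooted at r

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]   # path halving
            i = parent[i]
        return i

    edges = sorted((math.dist(points[i], points[j]), i, j)
                   for i, j in combinations(range(n), 2))
    pairs = []
    for d, i, j in edges:
        ri, rj = find(i), find(j)
        if ri == rj:
            continue
        s = scale * d
        pairs += [(birth[r], s) for r in (ri, rj) if birth[r] < s]
        parent[ri] = rj
        birth[rj] = s
    pairs.append((birth[find(0)], math.inf))
    return sorted(pairs)


def hausdorff(A, B):
    d = lambda x, S: min(math.dist(x, p) for p in S)
    return max(max(d(a, B) for a in A), max(d(b, A) for b in B))


def bottleneck(D1, D2):
    """Bottleneck distance in the L-infinity metric, by brute force for small diagrams."""
    inf1 = sorted(b for b, d in D1 if d == math.inf)
    inf2 = sorted(b for b, d in D2 if d == math.inf)
    if len(inf1) != len(inf2):
        return math.inf
    cost_inf = max((abs(x - y) for x, y in zip(inf1, inf2)), default=0.0)
    P = [p for p in D1 if p[1] != math.inf]
    Q = [q for q in D2 if q[1] != math.inf]
    n = len(P) + len(Q)     # rows: P, then diagonal slots; columns: Q, then diagonal slots

    def cost(i, j):
        if i < len(P) and j < len(Q):
            return max(abs(P[i][0] - Q[j][0]), abs(P[i][1] - Q[j][1]))
        if i < len(P):
            return (P[i][1] - P[i][0]) / 2      # P[i] is matched to the diagonal
        if j < len(Q):
            return (Q[j][1] - Q[j][0]) / 2      # Q[j] is matched to the diagonal
        return 0.0                              # diagonal slot to diagonal slot

    def perfect_matching_exists(t):
        match = [-1] * n                        # match[j]: the row matched to column j

        def augment(i, seen):
            for j in range(n):
                if not seen[j] and cost(i, j) <= t:
                    seen[j] = True
                    if match[j] == -1 or augment(match[j], seen):
                        match[j] = i
                        return True
            return False

        return all(augment(i, [False] * n) for i in range(n))

    thresholds = sorted({cost(i, j) for i in range(n) for j in range(n)})
    cost_fin = next((t for t in thresholds if perfect_matching_exists(t)), 0.0)
    return max(cost_inf, cost_fin)


A = [(0.0,), (4.0,)]
B = [(0.0,), (1.0,), (4.0,)]
hd = hausdorff(A, B)
bd = bottleneck(mergegram(A), mergegram(B))
print(bd, "<=", hd, bd <= hd)
```

In this example, adding the point 1 to the cloud {0, 4} gives the Hausdorff distance 1, while the mergegrams are at the bottleneck distance 0.5 under the stated convention, which is consistent with the bound of Theorem 4.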

Figure 7 illustrates Theorem 4 on a cloud and its perturbation by showing their close mergegrams. The more extensive experiment on 100 clouds in Reference [7] (Figure 8) similarly confirms that the mergegram is perturbed within expected bounds. The computational complexity of the mergegram

MG (Δ_{S L} (A))

was proved to be near linear in the number n of points in a cloud

A \subset R^{m}

; see Reference [7] (Theorem 8.2). The results above justify that the invariant mergegram satisfies conditions (a,b,c) of Isometry Recognition Problem 1.

6. New Experiments on Isometry Recognition of Substantially Distorted Real Shapes

This section fulfills the final condition (d) of Problem 1 by experimentally comparing the mergegram with 0D persistence and distributions of distances to neighbors on 15,000 clouds. The earlier paper [7] ran experiments only on randomly generated clouds.

We considered 15 classes of shapes represented by black and white images of mythical creatures [9]; see Figure 1. These shapes were chosen to make the shape recognition problem challenging. Indeed, similar creatures from this dataset are represented by slightly different shapes, which can be hard to distinguish up to isometry. For example, several images of a horse differ only in minor features, such as a saddle or a different tail, which makes the horses nearly identical.

Shape generation. For each image, we generated 1000 perturbed images by affine and projective transformations to get 15,000 distorted shapes split into 15 classes.

First, we rotated each image around its central point by an angle generated uniformly in the interval

[0, 2 π)

using the function cv::rotate from the OpenCV library. If needed, we extended the resulting image to fit all black pixels of the rotated shape into a bounding box. Then, both affine and projective transformations distort each image by using a noise parameter

δ

such that the value

δ = 0

represents the identity transformation.
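The rotation step with the extended bounding box can be sketched in plain Python as follows. This is an assumption-laden illustration rather than the authors' code: the function names and the image size are hypothetical. As far as we are aware, cv::rotate in OpenCV accepts only multiples of 90 degrees, so a rotation by an arbitrary angle is usually implemented through cv::getRotationMatrix2D and cv::warpAffine; the sketch below reproduces only the underlying geometry.

```python
import math
import random


def rotated_canvas_size(w, h, theta):
    """Width and height of the smallest axis-aligned box containing a rotated w x h rectangle."""
    c, s = abs(math.cos(theta)), abs(math.sin(theta))
    return w * c + h * s, w * s + h * c


def rotate_about_centre(point, w, h, theta):
    """Rotate a pixel position about the image centre and shift it into the enlarged canvas."""
    new_w, new_h = rotated_canvas_size(w, h, theta)
    x, y = point[0] - w / 2, point[1] - h / 2
    xr = x * math.cos(theta) - y * math.sin(theta)
    yr = x * math.sin(theta) + y * math.cos(theta)
    return xr + new_w / 2, yr + new_h / 2


rng = random.Random(0)
theta = rng.uniform(0.0, 2.0 * math.pi)     # angle sampled uniformly, as in the text
w, h = 640, 480                             # hypothetical image size
print(rotated_canvas_size(w, h, theta))
print([rotate_about_centre(p, w, h, theta) for p in [(0, 0), (w, 0), (0, h), (w, h)]])
```

All four rotated corners land inside the enlarged canvas, so no black pixel of the shape is cropped.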

Figure 8 illustrates how an original image is randomly rotated, and then randomly distorted by affine or projective transformations, depending on the noise parameter

δ

.

Affine transformations are implemented as compositions of the already applied rotations above and the function cv::resize() from the OpenCV library. This function scales an image of size

w \times h

by horizontal and vertical factors

a, b

sampled as follows.

Uniform noise: $a \in [1 - δ w, 1 + δ w]$ , $b \in [1 - δ h, 1 + δ h]$ have uniform distributions.
Gaussian noise: $a \in N (1, δ h) \cap R_{+}$ and $b \in N (1, δ w) \cap R_{+}$ have Gaussian distributions with mean 1 and standard variance $δ h, δ w$ , truncated to positive numbers.

Projective transformations are implemented as compositions of the already applied rotations above and the OpenCV function cv::getPerspectiveTransform(), which is parametrized by a 4-dimensional array

v = (a_{0}, a_{1}, a_{2}, a_{3})

consisting of points

a_{i} \in Z^{2}

,

i = 0, 1, 2, 3

. This function maps the corners of the image as follows:

(0, 0) \mapsto a_{0}, (0, h) \mapsto a_{1}, (w, 0) \mapsto a_{2} and (w, h) \mapsto a_{3} .

The projective transformation of the rectangle

w \times h

is uniquely determined by the above corners. The above points

a_{i}

are randomly sampled by using a noise parameter

δ

.

Uniform noise: each coordinate has a uniform distribution with a noise parameter $δ$

$a_{0} \in [0, δ w] \times [0, δ h], a_{1} \in [0, δ w] \times [h - δ h, h],$

$a_{2} \in [w - δ w, w] \times [0, δ h], a_{3} \in [w - δ w, w] \times [h - δ h, h] .$
Gaussian noise: each coordinate has a Gaussian distribution truncated to the image

$a_{0} \in (N (0, δ w) \cap [0, w]) \times (N (0, δ h) \cap [0, h]),$

$a_{1} \in (N (0, δ w) \cap [0, w]) \times (N (h, δ h) \cap [0, h]),$

$a_{2} \in (N (w, δ w) \cap [0, w]) \times (N (0, δ h) \cap [0, h]),$

$a_{3} \in (N (w, δ w) \cap [0, w]) \times (N (h, δ h) \cap [0, h]) .$
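The uniform sampling rule for the corners and the resulting projective map can be illustrated in plain Python. The sketch below is a minimal stand-in under stated assumptions: the names sample_corners_uniform, solve, homography and apply are hypothetical, the corners are kept as real numbers instead of being rounded to integer pixels, and the 3 × 3 matrix is obtained by solving the standard 8 × 8 linear system for four point correspondences, which is the kind of matrix that cv::getPerspectiveTransform returns.

```python
import random


def sample_corners_uniform(w, h, delta, rng):
    """Targets a_0..a_3 of the corners (0,0), (0,h), (w,0), (w,h) under the uniform rule."""
    dw, dh = delta * w, delta * h
    return [(rng.uniform(0, dw), rng.uniform(0, dh)),
            (rng.uniform(0, dw), rng.uniform(h - dh, h)),
            (rng.uniform(w - dw, w), rng.uniform(0, dh)),
            (rng.uniform(w - dw, w), rng.uniform(h - dh, h))]


def solve(A, b):
    """Solve the linear system A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x


def homography(src, dst):
    """3x3 matrix H with H[2][2] = 1 sending each src[i] to dst[i] projectively."""
    A, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        # u = (h0 x + h1 y + h2) / (h6 x + h7 y + 1), and similarly for v.
        A.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        b.append(u)
        A.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.append(v)
    h = solve(A, b)
    return [h[0:3], h[3:6], h[6:8] + [1.0]]


def apply(H, point):
    """Image of a point under the projective map with the matrix H."""
    x, y = point
    z = H[2][0] * x + H[2][1] * y + H[2][2]
    return ((H[0][0] * x + H[0][1] * y + H[0][2]) / z,
            (H[1][0] * x + H[1][1] * y + H[1][2]) / z)


w, h, delta = 100, 80, 0.25                 # hypothetical image size and noise level
src = [(0, 0), (0, h), (w, 0), (w, h)]
dst = sample_corners_uniform(w, h, delta, random.Random(0))
H = homography(src, dst)
print([apply(H, p) for p in src])           # reproduces dst up to rounding errors
```

For δ = 0 every sampling interval collapses to a single corner, so the computed matrix is the identity, in agreement with the convention that δ = 0 represents the identity transformation.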

Point cloud extraction. For each distorted image, we extract classical Harris point corners [28] due to their simplicity; see the red points in Figure 8. For detecting corner points, the OpenCV function cv::cornerHarris was used with the parameters blockSize = 3, apertureSize = 5, k = 0.04, thresh = 120. However, one can use any reliable algorithm, such as FAST [29] or scale-invariant feature transform (SIFT) [30].

After describing the available point cloud data above, we specify condition (d) of Isometry Recognition Problem 1 in the context of supervised machine learning.

Problem 2

(experimental recognition). Given a labeled dataset split into classes of similar but projectively distorted shapes, develop a learning tool to recognize a class of distorted shapes with a high recognition rate (percentage of correctly recognized classes).

Since all isometry invariants are independent of point ordering, the most suitable neural network is PersLay [20], whose output is invariant under permutations by design. Each layer is a combination of a coefficient layer

w (p) : R^{m} \to R

, a transformation

ϕ (p) : R^{m} \to R^{q}

and a permutation invariant layer op combined as follows

PersLay (D) = op ({w (p) ϕ (p)}_{p \in D}), where D is a diagram or multiset of points in R^{m} .
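The structure of this formula can be illustrated by a few lines of Python. The sketch is not the PersLay library: the function perslay and the fixed weight and transformation below are hypothetical stand-ins for trainable components. It shows only why the output does not depend on the order of points in D when op is a symmetric operation such as a coordinatewise maximum or sum.

```python
def perslay(diagram, weight, transform, op):
    """op applied to the multiset of vectors weight(p) * transform(p) over p in diagram."""
    return op([[weight(p) * c for c in transform(p)] for p in diagram])


def coordinatewise_max(vectors):
    return [max(column) for column in zip(*vectors)]


def coordinatewise_sum(vectors):
    return [sum(column) for column in zip(*vectors)]


# Hypothetical choices for illustration only; in a trained network these are learned.
k = 2.0
weight = lambda p: k * abs(p[0] - p[1])          # w(x1, x2) = k |x1 - x2|
transform = lambda p: [p[0], p[1], p[0] + p[1]]  # a fixed map R^2 -> R^3

D = [(0.0, 0.5), (0.0, 1.0), (0.5, 1.0)]
print(perslay(D, weight, transform, coordinatewise_max))
print(perslay(list(reversed(D)), weight, transform, coordinatewise_max))  # same vector
```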

Coordinates of all input points are linearly normalized to

[0, 1]

. We have used the following parameters of the PersLay network for all experiments below.

The max layer

MAX (q)

consists of the following functions.

The coefficient layer $w : R^{m} \to R$ is the weight $w (x_{1}, \dots, x_{m}) = k | x_{1} - x_{2} |$ , where k is a trainable scalar, and the dimension is typically $m = 2$ .
The transformation layer $ϕ : {diagrams of points in R^{m}} \to R^{q}$ is the function $ϕ (D) = \sum_{p \in D} λ p + γ maxpool (D) + β$ , where $λ$ , $γ$ are $R^{m \times q}$ trainable matrices, $β$ is a $R^{q}$ trainable vector, and maxpool returns a maximum value for every $i = 1, \dots, m$ .
The operational layer $op : R^{q} \to R^{t}$ puts all coordinates in increasing order and composes the result with a standard densely connected layer [31] $Dense : R^{q} \to R^{t}$ .

An output is a vector in

R^{t}

, where

t = 15

is the number of image classes. The final prediction is the class with the largest coordinate in the output vector.

The image layer

Im [x, y]

for integer parameters

x, y

and a multiset of points in the unit square

{[0, 1]}^{2}

consists of the following functions.

The coefficient layer $w : R^{2} \to R$ is a piecewise constant function trained on $x \cdot y$ parameters, defined on the unit square partition

$P (x, y) = \{[\frac{i}{x}, \frac{i + 1}{x}] \times [\frac{j}{y}, \frac{j + 1}{y}] ∣ i = 0, \dots, x - 1 and j = 0, \dots, y - 1\} .$
Let $ϕ_{p} : R^{2} \to R$ be the Gaussian distribution centered at $p \in D$ with a trainable standard deviation $σ$ . The transformation layer $ϕ : R \to R^{x y}$ consists of $x y$ functions $ϕ_{p}$ , where p runs over all centroids of the partition $P (x, y)$ .
The operation layer op takes the sum over the given point cloud. A final prediction is made by composing the operation layer with the Dense layer.
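One possible reading of the image layer is sketched below in Python. This interpretation is an assumption: each point p of the input multiset contributes an unnormalized Gaussian bump evaluated at every centroid of the partition P (x, y), multiplied by the piecewise constant weight of the cell containing p, and the contributions are summed over the cloud. The names cell_index, centroids and image_layer are hypothetical, the weights and σ would be trained in practice, and the final Dense layer is omitted.

```python
import math


def cell_index(p, x, y):
    """Index of the cell of the partition P(x, y) containing a point p of the unit square."""
    i = min(int(p[0] * x), x - 1)           # points with coordinate 1 go to the last cell
    j = min(int(p[1] * y), y - 1)
    return i * y + j


def centroids(x, y):
    """Centres of all cells of P(x, y), listed in the same order as cell_index."""
    return [((i + 0.5) / x, (j + 0.5) / y) for i in range(x) for j in range(y)]


def image_layer(D, weights, sigma, x, y):
    """Vector in R^(x*y): weighted sum over D of Gaussians evaluated at the cell centroids."""
    out = [0.0] * (x * y)
    for p in D:
        w = weights[cell_index(p, x, y)]    # piecewise constant coefficient w(p)
        for c, (cx, cy) in enumerate(centroids(x, y)):
            sq = (p[0] - cx) ** 2 + (p[1] - cy) ** 2
            out[c] += w * math.exp(-sq / (2 * sigma ** 2))
    return out


D = [(0.1, 0.2), (0.8, 0.9), (0.4, 0.6)]    # a toy multiset in the unit square
print(image_layer(D, [1.0] * 4, 0.1, 2, 2))
```

Because the layer is a sum over the points of D, its output is independent of the order of points, as required for permutation invariance.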

Finally, the PersLay network used the Adam optimizer (tf.keras.optimizers.Adam) with the standard learning rate 0.01 and 150 epochs, the loss function SparseCategoricalCrossentropy, an 80:20 split between training and testing, and a 5-fold Monte Carlo cross validation for each run.
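The evaluation protocol can be sketched as follows, assuming that the 5-fold Monte Carlo cross validation means five independent random 80:20 splits of the dataset. This reading and the helper name monte_carlo_splits are assumptions of the sketch; the training of the network on each split is omitted.

```python
import random


def monte_carlo_splits(n_items, n_runs=5, train_fraction=0.8, seed=0):
    """Yield (train, test) index lists for repeated random subsampling validation."""
    rng = random.Random(seed)
    cut = int(round(train_fraction * n_items))
    for _ in range(n_runs):
        indices = list(range(n_items))
        rng.shuffle(indices)                # a fresh random permutation for every run
        yield indices[:cut], indices[cut:]


for train, test in monte_carlo_splits(15000):
    print(len(train), len(test))            # sizes of the training and testing parts
```

A recognition rate would then be averaged over the five runs.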

Figure 9, Figure 10 and Figure 11 show that the mergegram

MG

consistently outperforms two other isometry invariants: 0D persistence and the multiset

N N (4)

consisting of 4-tuple distances to neighbors per given point. The simpler multiset

N N (2)

performed worse. A given cloud

C \subset R^{2}

was considered as a baseline input. The noise factors

δ

reached 25%, which means that original images were distorted up to a quarter of image sizes.

7. A Discussion of Novel Contributions and Further Open Problems

This paper has further demonstrated that the provably stable-under-noise invariant mergegram of a dendrogram is a fast and efficient tool in the challenging problem of isometry shape recognition, especially for substantially distorted images.

In comparison with the conference version [7], Section 4 proved new Theorem 2 describing how to reconstruct a single-linkage dendrogram in general position from its much simpler mergegram. It is hard to define a continuous metric between dendrograms, especially because they can be unstable under perturbations. Theorem 2 allows us to measure a continuous similarity between dendrograms in general position as the bottleneck distance between their unique mergegrams. This distance can be computed in time

O (n^{1.5} log n)

[32] for diagrams consisting of at most n points.

Section 5 provided a full proof of stability of the mergegram under perturbations of points, while the earlier paper [7] only announced this result without proving the highly non-trivial Lemmas 2 and 3, which required heavy algebraic machinery.

In addition to Example 1 and Theorem 1 showing that the mergegram is stronger than 0D persistence, its strength is confirmed by the new experiments on 15,000 point samples from substantially distorted real shapes. In Figure 9, Figure 10 and Figure 11, the mergegram outperformed other isometry invariants. Since the distribution

N N (2)

of distances to two closest neighbors per point performed badly, we have tried distances to four nearest neighbors, but this

N N (4)

performed worse than the original cloud because of noise.

For very high levels of 20% and 25% distortions in projective transformations, the PersLay network trained on a point cloud achieved high recognition rates because we have extensively tried many parameters in the layers MAX(75) and Im[20,20] for a best trade-off between accuracy and speed. The C++ code for the mergegram is at Reference [7].

The recent stronger invariants are Pointwise Distance Distributions [33]. Their generic completeness under isometry holds in a more challenging setting of periodic point sets [34,35,36,37,38].

Author Contributions

Conceptualization, V.K.; data curation, Y.E.; formal analysis, Y.E.; funding acquisition, V.K.; investigation, Y.E.; methodology, V.K.; project administration, V.K.; resources, Y.E.; software, Y.E.; writing—original draft preparation, Y.E.; writing—review and editing, V.K.; supervision, V.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the EPSRC grant ‘Application-driven Topological Data Analysis’, reference EP/R018472/1. The APC was funded by the University of Liverpool.

Data Availability Statement

The original image dataset is available at http://tosca.cs.technion.ac.il (accessed on 25 August 2021).

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Pauly, M.; Gross, M.; Kobbelt, L.P. Efficient simplification of point-sampled surfaces. In Proceedings of the IEEE Visualization, Boston, MA, USA, 27 October–1 November 2002; pp. 163–170.
2. Zwicker, M.; Pauly, M.; Knoll, O.; Gross, M. Pointshop 3D: An interactive system for point-based surface editing. ACM Trans. Graph. (TOG) 2002, 21, 322–329.
3. Boutin, M.; Kemper, G. On reconstructing n-point configurations from the distribution of distances or areas. Adv. Appl. Math. 2004, 32, 709–735.
4. Huttenlocher, D.P.; Klanderman, G.A.; Rucklidge, W.J. Comparing images using the Hausdorff distance. Trans. Pattern Anal. Mach. Intell. 1993, 15, 850–863.
5. Chew, P.; Goodrich, M.; Huttenlocher, D.; Kedem, K.; Kleinberg, J.; Kravets, D. Geometric pattern matching under Euclidean motion. Comput. Geom. 1997, 7, 113–124.
6. Goodrich, M.T.; Mitchell, J.S.; Orletsky, M.W. Approximate geometric pattern matching under rigid motions. Trans. Pattern Anal. Mach. Intell. 1999, 21, 371–379.
7. Elkin, Y.; Kurlin, V. The mergegram of a dendrogram and its stability. In Proceedings of the MFCS 2020, Prague, Czech Republic, 24–28 August 2020.
8. Bronstein, A.M.; Bronstein, M.M.; Kimmel, R. Numerical Geometry of Non-Rigid Shapes; Springer: Berlin/Heidelberg, Germany, 2008.
9. Bronstein, A.M.; Bronstein, M.M.; Bruckstein, A.M.; Kimmel, R. Analysis of two-dimensional non-rigid shapes. Int. J. Comput. Vis. 2008, 78, 67–88.
10. Elad, A.; Kimmel, R. Bending invariant representations for surfaces. In Proceedings of the Computer Vision and Pattern Recognition, Kauai, HI, USA, 8–14 December 2001.
11. Ovsjanikov, M.; Mérigot, Q.; Mémoli, F.; Guibas, L. One point isometric matching with the heat kernel. In Computer Graphics Forum; Blackwell Publishing: Oxford, UK, 2010; Volume 29, pp. 1555–1564.
12. Ovsjanikov, M.; Bronstein, A.M.; Bronstein, M.M.; Guibas, L.J. Shape google: A computer vision approach to isometry invariant shape retrieval. In Proceedings of the ICCV Workshops, Kyoto, Japan, 27 September–4 October 2009; pp. 320–327.
13. Mémoli, F.; Sapiro, G. A theoretical and computational framework for isometry invariant recognition of point cloud data. Found. Comput. Math. 2005, 5, 313–347.
14. Verri, A.; Uras, C.; Frosini, P.; Ferri, M. On the use of size functions for shape analysis. Biol. Cybern. 1993, 70, 99–107.
15. Cohen-Steiner, D.; Edelsbrunner, H.; Harer, J. Stability of persistence diagrams. Discret. Comput. Geom. 2007, 37, 103–120.
16. Chazal, F.; De Silva, V.; Glisse, M.; Oudot, S. The Structure and Stability of Persistence Modules; Springer: Berlin/Heidelberg, Germany, 2016.
17. Parsa, S. A Deterministic O(m log m) Time Algorithm for the Reeb Graph. Discret. Comput. Geom. 2013, 49, 864–878.
18. Morozov, D.; Beketayev, K.; Weber, G. Interleaving distance between merge trees. Discret. Comput. Geom. 2013, 49, 52.
19. Smith, P.; Kurlin, V. Skeletonisation algorithms with theoretical guarantees for unorganised point clouds with high levels of noise. Pattern Recognit. 2021, 115, 107902.
20. Carriere, M.; Chazal, F.; Ike, Y.; Lacombe, T.; Royer, M.; Umeda, Y. PersLay: A Neural Network Layer for Persistence Diagrams and Graph Topological Signatures. In Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS), Palermo, Italy, 26–28 August 2020.
21. Zaheer, M.; Kottur, S.; Ravanbakhsh, S.; Poczos, B.; Salakhutdinov, R.R.; Smola, A.J. Deep sets. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 3391–3401.
22. Fu, G.; Hou, C.; Yao, X. Learning topological representation for networks via hierarchical sampling. In Proceedings of the International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 14–19 July 2019; pp. 1–8.
23. Cirrincione, G.; Ciravegna, G.; Barbiero, P.; Randazzo, V.; Pasero, E. The GH-EXIN neural network for hierarchical clustering. Neural Netw. 2020, 121, 57–73.
24. Karim, M.R.; Beyan, O.; Zappa, A.; Costa, I.G.; Rebholz-Schuhmann, D.; Cochez, M.; Decker, S. Deep learning-based clustering approaches for bioinformatics. Briefings Bioinform. 2021, 22, 393–415.
25. Clough, J.; Byrne, N.; Oksuz, I.; Zimmer, V.A.; Schnabel, J.A.; King, A. A topological loss function for deep-learning based image segmentation. Trans. PAMI 2020.
26. Gabrielsson, R.B.; Nelson, B.J.; Dwaraknath, A.; Skraba, P. A topology layer for machine learning. In Proceedings of the International Conference Artificial Intelligence and Statistics, Virtually, 26–28 August 2020; pp. 1553–1563.
27. Carlsson, G.; Memoli, F. Characterization, stability and convergence of hierarchical clustering methods. J. Mach. Learn. Res. 2010, 11, 1425–1470.
28. Sánchez, J.; Monzón, N.; Salgado De La Nuez, A. An analysis and implementation of the harris corner detector. Image Process. Line 2018, 8, 305–328.
29. Rosten, E.; Porter, R.; Drummond, T. Faster and better: A machine learning approach to corner detection. IEEE Trans. Pattern Anal. Mach. Intell. 2008, 32, 105–119.
30. Lowe, D.G. Object recognition from local scale-invariant features. In Proceedings of the Seventh IEEE International Conference on Computer Vision, Kerkyra, Greece, 20–27 September 1999; Volume 2, pp. 1150–1157.
31. Abadi, M.; Agarwal, A.; Barham, P.; Brevdo, E.; Chen, Z.; Citro, C.; Corrado, G.S.; Davis, A.; Dean, J.; Devin, M.; et al. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv 2016, arXiv:1603.04467.
32. Kerber, M.; Morozov, D.; Nigmetov, A. Geometry Helps to Compare Persistence Diagrams. In Proceedings of the ALENEX, Arlington, VA, USA, 10–13 January 2016; pp. 103–112.
33. Widdowson, D.; Kurlin, V. Pointwise distance distributions of periodic sets. arXiv 2021, arXiv:2108.04798.
34. Mosca, M.; Kurlin, V. Voronoi-Based Similarity Distances between Arbitrary Crystal Lattices. Cryst. Res. Technol. 2020, 55, 1900197.
35. Anosova, O.; Kurlin, V. Introduction to Periodic Geometry and Topology. arXiv 2021, arXiv:2103.02749.
36. Anosova, O.; Kurlin, V. An isometry classification of periodic point sets. In Proceedings of the Discrete Geometry and Mathematical Morphology, Uppsala, Sweden, 24–27 May 2021.
37. Edelsbrunner, H.; Heiss, T.; Kurlin, V.; Smith, P.; Wintraecken, M. The Density Fingerprint of a Periodic Point Set. In Proceedings of the SoCG, Virtually, 7–10 June 2021.
38. Widdowson, D.; Mosca, M.; Pulido, A.; Kurlin, V.; Cooper, A. Average Minimum Distances of Periodic Point Sets. MATCH Communications in Mathematical and in Computer Chemistry. Available online: https://match.pmf.kg.ac.rs (accessed on 6 July 2021).

Figure 1. Images from the dataset of mythical creatures at http://tosca.cs.technion.ac.il (accessed on 25 August 2021) [8,9].

Figure 2. Top: the 5-point cloud

A = {0, 1, 3, 7, 10} \subset R

. Bottom: from left to right: single-linkage dendrogram

Δ_{S L} (A)

from Definition 1, the 0D persistence diagram

PD

in Definition 6, the novel mergegram

MG

from Definition 4, where double circles show pairs of multiplicity 2.

Figure 3. The dendrogram

Δ

on

A = {0, 1, 2}

and its mergegram

MG (Δ)

from Definition 4.

Figure 4. The set

X = {a, b, c, p, q}

has the distance matrix obtained by the shortest path metric.

Figure 5. Left: the dendrogram

Δ

for the single linkage clustering of the set 5-point set

X = {a, b, c, p, q}

in Figure 4. Right: the mergegram

MG (Δ)

with one pair (0,1) of multiplicity 4.

Figure 6. The merge module

M (Δ)

of the dendrogram

Δ

on the set

X = {0, 1, 2}

in Figure 3.

Figure 7. Left: the cloud C of 5 blue points is close to the cloud

C^{'}

of 10 red points in the Hausdorff distance. Right: the mergegrams are close in the bottleneck distance as predicted by Theorem 4.

Figure 8. Generating distorted shapes by applying random rotations, affine and projective transformations, which substantially affect the extracted clouds of Harris corner points [28] in red.

Figure 9. Recognition rates are obtained by training the max layer MAX(75) of PersLay on three isometry invariants and a cloud of corner points extracted from 15,000 affinely distorted images.

Figure 10. Recognition rates are obtained by training the max layer MAX(75) of PersLay on isometry invariants and corner points extracted from 15,000 projectively distorted images.

Figure 11. Recognition rates are obtained by training the image layer IM[20,20] of PersLay on isometry invariants and a cloud of corner points extracted from 15,000 affinely distorted images.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
