Article

A Convergent Algorithm for Equilibrium Problem to Predict Prospective Mathematics Teachers’ Technology Integrated Competency

by Nipa Jun-on 1, Watcharaporn Cholamjiak 2 and Raweerote Suparatulatorn 3,*
1 Department of Mathematics, Faculty of Science, Lampang Rajabhat University, Lampang 52100, Thailand
2 School of Science, University of Phayao, Phayao 56000, Thailand
3 Department of Mathematics, Faculty of Science, Chiang Mai University, Chiang Mai 50200, Thailand
* Author to whom correspondence should be addressed.
Mathematics 2022, 10(23), 4464; https://doi.org/10.3390/math10234464
Submission received: 23 October 2022 / Revised: 20 November 2022 / Accepted: 23 November 2022 / Published: 26 November 2022

Abstract: Educational data classification has become an effective tool for exploring hidden patterns and relationships in educational data and for predicting students' performance or teachers' competency. This study proposes a new method based on machine learning algorithms to predict the technology-integrated competency of pre-service mathematics teachers. In this paper, we modify the inertial subgradient extragradient algorithm for pseudomonotone equilibrium problems and prove a weak convergence theorem under suitable conditions in Hilbert spaces. We then apply the algorithm to data classification by an extreme learning machine, using a dataset comprising the technology-integrated competency of 954 pre-service mathematics teachers at a university in northern Thailand, collected longitudinally over five years. The flexibility of our algorithm is demonstrated by comparing different parameter choices. Its performance was calculated and compared with existing algorithms for the prediction task. The results show that the proposed method achieved a classification accuracy of 81.06%. The predictions were made using ten attributes, including demographic information and the skills and knowledge relating to technology developed throughout the teacher education program. Such data-driven studies are significant for establishing a prospective teacher competency analysis framework in teacher education and for contributing to decision-making in policy design.

1. Introduction

The National Council of Teachers of Mathematics (NCTM) [1] reports that teaching with technology to support conceptual development has been a focus of mathematics education for decades. Utilizing multiple technologies to teach mathematics is significantly more difficult than using technology in everyday life: it involves teachers' perspectives, their recognition of the significance of technology in teaching mathematics, and their confidence in using technology when constructing relevant technology-integrated mathematics classrooms. Therefore, the technology-integrated competency of teachers was defined in this study as the competency to design mathematics lessons that enable students to work on challenging mathematics problems through technology [2,3,4,5].
To unleash the benefits of technology in the mathematics classroom [6,7], teachers require extensive preparation and support. Accordingly, the integration of technology in mathematics education to produce competent prospective mathematics teachers has been incorporated into the most recent mathematics teacher preparation standards [8]. Utilizing validated classification to evaluate prospective mathematics teachers’ technology-integrated competency provides a solid foundation for entry into the profession [9,10].
In recent years, there has been significant interest in applying data classification approaches to the field of education. Classification is a method for identifying the class of given data points, referred to as targets/labels or categories. It can be viewed as the process of discovering new and potentially helpful information or meaningful outcomes from data, and it seeks to uncover new trends and patterns in datasets by employing various categorization techniques. In particular, data classification in the education field is now an effective technique for identifying hidden patterns in educational data, predicting students' academic performance, determining teachers' competency, or enhancing learning and teaching policy plans. Thus, in this study, we focused on prospective mathematics teachers' information, longitudinally collected over five years, as our educational data for classification, in order to identify hidden patterns in their technology-integrated competency development.
First of all, we studied the equilibrium problem (EP), initially introduced by Muu and Oettli [11]. The EP is to find an element $z^*$ in a nonempty closed convex subset C of a real Hilbert space H such that
$$f(z^*, y) \ge 0, \quad \forall y \in C, \tag{1}$$
where $f : H \times H \to \mathbb{R}$ is a bifunction with $f(x, x) = 0$ for all $x \in C$; the solution set of the EP (1) is denoted by $EP(f, C)$. The EP (1) generalizes various mathematical problems in optimization analysis, such as variational inequalities, minimization problems, linear programming problems, and Nash equilibrium problems, among others; see [12,13,14,15].
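For instance (a standard special case, not specific to this paper), choosing $f(x, y) = \langle F(x), y - x\rangle$ for an operator $F : H \to H$ turns the EP (1) into the classical variational inequality: find $z^* \in C$ such that
$$\langle F(z^*), y - z^*\rangle \ge 0, \quad \forall y \in C.$$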
In 2008, Tran et al. [16] presented the two-step extragradient method (TSEM) for solving the EP (1), inspired by the method of Korpelevich [17] for solving variational inequalities. The iterative scheme is formulated as follows: $x_1 \in C$ and
$$y_n = \arg\min_{y \in C}\left\{\lambda f(x_n, y) + \tfrac{1}{2}\|x_n - y\|^2\right\}, \qquad x_{n+1} = \arg\min_{y \in C}\left\{\lambda f(y_n, y) + \tfrac{1}{2}\|x_n - y\|^2\right\}, \tag{2}$$
where λ is a constant chosen in an interval on which the bifunction f satisfies the Lipschitz condition. However, as remarked by the authors of [18], the two projections onto C required by the extragradient algorithm of Korpelevich [17] can be very difficult to compute and can degrade the efficiency of the method if C has a complex structure.
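To see why each step of (2) is a projection in the simplest case, note (assuming f is the variational-inequality bifunction $f(x, y) = \langle F(x), y - x\rangle$ above) that completing the square gives
$$\arg\min_{y \in C}\left\{\lambda f(x_n, y) + \tfrac{1}{2}\|x_n - y\|^2\right\} = \arg\min_{y \in C}\tfrac{1}{2}\left\|y - \left(x_n - \lambda F(x_n)\right)\right\|^2 = P_C\!\left(x_n - \lambda F(x_n)\right),$$
so TSEM coincides with Korpelevich's extragradient method in that case, and the per-iteration cost is governed by the cost of projecting onto C.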
In 2019, Rehman et al. [19] modified the explicit subgradient extragradient algorithm to solve pseudomonotone equilibrium problems. Weak convergence of the algorithm is established with stepsizes that are updated at each iteration without requiring knowledge of a Lipschitz-type constant. The algorithm is generated by arbitrary elements $x_0 \in H$, $\lambda_0 > 0$ and $\mu \in (0, 1)$,
$$y_n = \arg\min_{y \in C}\left\{\lambda_n f(x_n, y) + \tfrac{1}{2}\|x_n - y\|^2\right\}, \qquad x_{n+1} = \arg\min_{y \in H_n}\left\{\lambda_n f(y_n, y) + \tfrac{1}{2}\|x_n - y\|^2\right\}, \tag{3}$$
where $w_n \in \partial_2 f(x_n, y_n)$ satisfies $x_n - \lambda_n w_n - y_n \in N_C(y_n)$, with the half-space
$$H_n = \{z \in H : \langle x_n - \lambda_n w_n - y_n, z - y_n\rangle \le 0\},$$
and
$$\lambda_{n+1} = \min\left\{\lambda_n, \frac{\mu\left(\|x_n - y_n\|^2 + \|x_{n+1} - y_n\|^2\right)}{2\left[f(x_n, x_{n+1}) - f(x_n, y_n) - f(y_n, x_{n+1})\right]}\right\}.$$
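A minimal Python sketch of this stepsize update, assuming a user-supplied bifunction f and adding a guard for a nonpositive denominator (the guard is our assumption, not part of [19]):

```python
import numpy as np

def next_stepsize(lam, mu, x_n, y_n, x_next, f):
    """Adaptive stepsize in the spirit of Rehman et al. [19] (sketch).

    lam: current stepsize, mu in (0, 1), f: bifunction f(u, v) -> float.
    """
    denom = 2.0 * (f(x_n, x_next) - f(x_n, y_n) - f(y_n, x_next))
    if denom <= 0:      # assumed guard: keep the current stepsize
        return lam
    num = mu * (np.linalg.norm(x_n - y_n) ** 2
                + np.linalg.norm(x_next - y_n) ** 2)
    return min(lam, num / denom)
```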
Very recently, Rehman et al. [20] focused on improving the stepsize of the subgradient extragradient method for finding solutions of pseudomonotone equilibrium problems in a real Hilbert space. An inertial term, first proposed by Polyak [21], was added to speed up the convergence of the algorithm. Weak convergence of the method is established under standard assumptions on the bifunction. This algorithm is generated by arbitrary elements $x_0, x_1 \in H$; choose $\varrho \in (0, 1)$, $\sigma < \min\left\{\frac{1 - 3\delta}{(1 - \delta)^2}, \frac{1}{2c_1}, \frac{1}{2c_2}\right\}$, $\mu \in (0, \sigma)$, $\lambda_1 > 0$ and a non-decreasing sequence $0 \le \delta_n \le \delta \in [0, \tfrac{1}{3})$;
$$\rho_n = x_n + \delta_n(x_n - x_{n-1}), \qquad y_n = \arg\min_{y \in C}\left\{\lambda_n f(\rho_n, y) + \tfrac{1}{2}\|\rho_n - y\|^2\right\}, \qquad x_{n+1} = \arg\min_{y \in H_n}\left\{\lambda_n f(y_n, y) + \tfrac{1}{2}\|\rho_n - y\|^2\right\}, \tag{4}$$
where $w_n \in \partial_2 f(\rho_n, y_n)$ satisfies $\rho_n - \lambda_n w_n - y_n \in N_C(y_n)$, with the half-space
$$H_n = \{z \in H : \langle \rho_n - \lambda_n w_n - y_n, z - y_n\rangle \le 0\},$$
and
$$\lambda_{n+1} = \min\left\{\sigma, \frac{\mu f(y_n, x_{n+1})}{\left[f(\rho_n, x_{n+1}) - f(\rho_n, y_n) - c_1\|\rho_n - y_n\|^2 - c_2\|x_{n+1} - y_n\|^2\right]_+}\right\}.$$
Inspired by the above research, we introduce a new modified inertial subgradient extragradient method that converges weakly to a point of the solution set $EP(f, C)$, and we relax the stepsize update so that $\{\lambda_n\}$ can be chosen in many ways. As an application, we apply our algorithm to classification problems in machine learning and demonstrate its performance by comparing it with existing algorithms in predicting prospective mathematics teachers' technology-integrated competency.

2. Preliminaries

In what follows, recall that H is a real Hilbert space and let C be a nonempty, closed and convex subset of H. We denote by ⇀ and → weak and strong convergence, respectively. We next collect some definitions and lemmas needed to prove our main results. For all $u, v \in H$ and any $a \in [0, 1]$,
$$\|au + (1 - a)v\|^2 = a\|u\|^2 + (1 - a)\|v\|^2 - a(1 - a)\|u - v\|^2. \tag{5}$$
The normal cone of C at $x \in C$ is defined by
$$N_C(x) = \{z \in H : \langle z, y - x\rangle \le 0 \text{ for all } y \in C\}.$$
Let $g : C \to \mathbb{R}$ be a convex function; the subdifferential of g at $x \in C$ is defined by
$$\partial g(x) = \{z \in H : g(y) - g(x) \ge \langle z, y - x\rangle \text{ for all } y \in C\}.$$
A bifunction $f : H \times H \to \mathbb{R}$ on C is said to be
(i) pseudomonotone if $f(u, v) \ge 0 \Rightarrow f(v, u) \le 0$ for all $u, v \in C$;
(ii) Lipschitz-like if there exist $c_1, c_2 > 0$ such that
$$f(u, w) \le f(u, v) + f(v, w) + c_1\|u - v\|^2 + c_2\|v - w\|^2 \quad \text{for all } u, v, w \in C.$$
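As a concrete check (a standard computation, assuming $f(u, v) = \langle F(u), v - u\rangle$ with F Lipschitz continuous with constant L), condition (ii) holds with $c_1 = c_2 = L/2$, since
$$f(u, w) - f(u, v) - f(v, w) = \langle F(u) - F(v), w - v\rangle \le L\|u - v\|\,\|w - v\| \le \tfrac{L}{2}\|u - v\|^2 + \tfrac{L}{2}\|v - w\|^2.$$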
Lemma 1
([22]). Let $g : C \to \mathbb{R}$ be a subdifferentiable, convex and lower semicontinuous function on C. An element $x \in C$ is a minimizer of g if and only if
$$0 \in \partial g(x) + N_C(x),$$
where $\partial g(x)$ stands for the subdifferential of g at $x \in C$ and $N_C(x)$ for the normal cone of C at x.
Lemma 2
([23]). Let $\{a_n\}$ and $\{b_n\}$ be nonnegative sequences of real numbers satisfying $\sum_{n=1}^{\infty} b_n < \infty$ and $a_{n+1} \le a_n + b_n$. Then $\{a_n\}$ is a convergent sequence.
Lemma 3
([24], Opial). Let Φ be a nonempty subset of H and $\{x_n\}$ a sequence in H. Suppose the following assertions hold:
(i) for every $x \in \Phi$, the sequence $\|x_n - x\|$ converges;
(ii) every weak sequential cluster point of $\{x_n\}$ belongs to Φ.
Then $\{x_n\}$ converges weakly to a point in Φ.

3. Convergence Theorem

To study the convergence analysis, consider the following conditions.
(C1) The solution set $EP(f, C)$ is nonempty and f is pseudomonotone on C;
(C2) f satisfies the Lipschitz-like condition on H with constants $c_1 > 0$ and $c_2 > 0$;
(C3) $f(z, \cdot)$ is subdifferentiable and convex on H for each fixed $z \in H$;
(C4) $\limsup_{n \to \infty} f(z_n, y) \le f(z^*, y)$ for each $y \in C$ and each $\{z_n\} \subset C$ satisfying $z_n \rightharpoonup z^*$.
Lemma 4.
If $\rho_n = y_n$ in Algorithm 1, then $\rho_n \in EP(f, C)$.
Algorithm 1 Modified inertial subgradient extragradient Mann Algorithm 1
  • Initialization: Select arbitrary elements $x_0, x_1 \in H$.
  • Iterative Steps: Construct $\{x_n\}$ by using the following steps:
    Step 1. Set $\rho_n = x_n + \delta_n(x_n - x_{n-1})$, where $\{\delta_n\} \subset [0, \infty)$, and compute
    $$y_n = \arg\min_{y \in C}\left\{\lambda_n f(\rho_n, y) + \tfrac{1}{2}\|\rho_n - y\|^2\right\},$$
    where $0 < \lambda_n \le \lambda < \min\left\{\frac{1}{2c_1}, \frac{1}{2c_2}\right\}$. If $\rho_n = y_n$, then stop. Otherwise, go to Step 2.
    Step 2. Compute
    $$u_n = \arg\min_{y \in H_n}\left\{\lambda_n f(y_n, y) + \tfrac{1}{2}\|\rho_n - y\|^2\right\},$$
    where $w_n \in \partial_2 f(\rho_n, y_n)$ satisfies $\rho_n - \lambda_n w_n - y_n \in N_C(y_n)$, and the half-space is
    $$H_n = \{z \in H : \langle \rho_n - \lambda_n w_n - y_n, z - y_n\rangle \le 0\}.$$
    Step 3. Compute
    $$x_{n+1} = \theta_n \rho_n + (1 - \theta_n) u_n,$$
    where $\{\theta_n\} \subset (0, 1)$. Replace n with n + 1 and return to Step 1.
Proof. 
By the definition of $y_n$ and Lemma 1, we have
$$0 \in \partial_2\left\{\lambda_n f(\rho_n, \cdot) + \tfrac{1}{2}\|\rho_n - \cdot\|^2\right\}(y_n) + N_C(y_n).$$
Thus, we can write $\lambda_n \tilde{w}_n + y_n - \rho_n + \bar{w}_n = 0$, where $\tilde{w}_n \in \partial_2 f(\rho_n, y_n)$ and $\bar{w}_n \in N_C(y_n)$. Since $\rho_n = y_n$, it follows that $\lambda_n \tilde{w}_n + \bar{w}_n = 0$. Thus, we have
$$\lambda_n \langle \tilde{w}_n, y - y_n\rangle + \langle \bar{w}_n, y - y_n\rangle = 0$$
for all $y \in C$. Since $\bar{w}_n \in N_C(y_n)$, we have $\langle \bar{w}_n, y - y_n\rangle \le 0$ for all $y \in C$, and by the above expression we obtain
$$\lambda_n \langle \tilde{w}_n, y - y_n\rangle \ge 0 \tag{6}$$
for all $y \in C$. Since $\tilde{w}_n \in \partial_2 f(\rho_n, y_n)$, the subdifferential definition gives
$$\langle \tilde{w}_n, y - y_n\rangle \le f(\rho_n, y) - f(\rho_n, y_n) \tag{7}$$
for all $y \in C$. The inequalities (6) and (7) together with $0 < \lambda_n \le \lambda$ imply that $f(\rho_n, y) \ge 0$ for all $y \in C$, that is, $\rho_n \in EP(f, C)$.    □
Lemma 5.
Suppose that $f : H \times H \to \mathbb{R}$ satisfies conditions (C1)–(C3). Then
$$\|u_n - \bar{\xi}\|^2 + (1 - 2c_1\lambda_n)\|\rho_n - y_n\|^2 + (1 - 2c_2\lambda_n)\|y_n - u_n\|^2 \le \|\rho_n - \bar{\xi}\|^2 \tag{8}$$
for all $\bar{\xi} \in EP(f, C)$.
Proof. 
Let $\bar{\xi} \in EP(f, C)$. By Lemma 1, we have
$$0 \in \partial_2\left\{\lambda_n f(y_n, \cdot) + \tfrac{1}{2}\|\rho_n - \cdot\|^2\right\}(u_n) + N_{H_n}(u_n).$$
Thus, we can write $\lambda_n \tilde{w}_n + u_n - \rho_n + \bar{w}_n = 0$, where $\tilde{w}_n \in \partial_2 f(y_n, u_n)$ and $\bar{w}_n \in N_{H_n}(u_n)$. This implies that
$$\langle \rho_n - u_n, y - u_n\rangle = \lambda_n \langle \tilde{w}_n, y - u_n\rangle + \langle \bar{w}_n, y - u_n\rangle$$
for all $y \in H_n$. Since $\bar{w}_n \in N_{H_n}(u_n)$, we have $\langle \bar{w}_n, y - u_n\rangle \le 0$ for all $y \in H_n$. Therefore,
$$\langle \rho_n - u_n, y - u_n\rangle \le \lambda_n \langle \tilde{w}_n, y - u_n\rangle \tag{9}$$
for all $y \in H_n$. Since $\tilde{w}_n \in \partial_2 f(y_n, u_n)$, we have
$$\langle \tilde{w}_n, y - u_n\rangle \le f(y_n, y) - f(y_n, u_n) \tag{10}$$
for all $y \in H$. From (9) and (10), we obtain
$$\langle \rho_n - u_n, y - u_n\rangle \le \lambda_n f(y_n, y) - \lambda_n f(y_n, u_n) \tag{11}$$
for all $y \in H_n$. Substituting $y = \bar{\xi}$ in (11), we obtain
$$\langle \rho_n - u_n, \bar{\xi} - u_n\rangle \le \lambda_n f(y_n, \bar{\xi}) - \lambda_n f(y_n, u_n). \tag{12}$$
Since $\bar{\xi} \in EP(f, C)$, we have $f(\bar{\xi}, y_n) \ge 0$, and condition (C1) gives $f(y_n, \bar{\xi}) \le 0$. Thus, we obtain
$$\langle \rho_n - u_n, u_n - \bar{\xi}\rangle \ge \lambda_n f(y_n, u_n). \tag{13}$$
By condition (C2), we have
$$f(y_n, u_n) \ge f(\rho_n, u_n) - f(\rho_n, y_n) - c_1\|\rho_n - y_n\|^2 - c_2\|y_n - u_n\|^2. \tag{14}$$
Combining (13) and (14), we obtain
$$\langle \rho_n - u_n, u_n - \bar{\xi}\rangle \ge \lambda_n f(\rho_n, u_n) - \lambda_n f(\rho_n, y_n) - c_1\lambda_n\|\rho_n - y_n\|^2 - c_2\lambda_n\|y_n - u_n\|^2. \tag{15}$$
By the half-space definition, we have $\langle \rho_n - \lambda_n w_n - y_n, u_n - y_n\rangle \le 0$, which implies that
$$\langle \rho_n - y_n, u_n - y_n\rangle \le \lambda_n \langle w_n, u_n - y_n\rangle. \tag{16}$$
Since $w_n \in \partial_2 f(\rho_n, y_n)$, we obtain
$$\langle w_n, y - y_n\rangle \le f(\rho_n, y) - f(\rho_n, y_n)$$
for all $y \in H$. Setting $y = u_n$, we obtain
$$\langle w_n, u_n - y_n\rangle \le f(\rho_n, u_n) - f(\rho_n, y_n). \tag{17}$$
It follows from inequalities (16) and (17) that
$$\langle \rho_n - y_n, u_n - y_n\rangle \le \lambda_n f(\rho_n, u_n) - \lambda_n f(\rho_n, y_n). \tag{18}$$
From (15) and (18), we have
$$\langle \rho_n - u_n, u_n - \bar{\xi}\rangle \ge \langle \rho_n - y_n, u_n - y_n\rangle - c_1\lambda_n\|\rho_n - y_n\|^2 - c_2\lambda_n\|y_n - u_n\|^2. \tag{19}$$
Now, we use the following equalities:
$$\|\rho_n - \bar{\xi}\|^2 - \|u_n - \rho_n\|^2 - \|u_n - \bar{\xi}\|^2 = 2\langle \rho_n - u_n, u_n - \bar{\xi}\rangle$$
and
$$\|\rho_n - y_n\|^2 + \|u_n - y_n\|^2 - \|\rho_n - u_n\|^2 = 2\langle \rho_n - y_n, u_n - y_n\rangle.$$
Combining these equalities with expression (19) finalizes the proof.    □
Lemma 6.
Assume that conditions (C1)–(C4) hold. If there is a subsequence $\{\rho_{n_k}\}$ of $\{\rho_n\}$ such that $\rho_{n_k} \rightharpoonup x^* \in H$ and
$$\lim_{k \to \infty} \|\rho_{n_k} - y_{n_k}\| = \lim_{k \to \infty} \|\rho_{n_k} - u_{n_k}\| = \lim_{k \to \infty} \|u_{n_k} - y_{n_k}\| = 0, \tag{20}$$
then $x^* \in EP(f, C)$.
Proof. 
Since $y_n \in C$, $\rho_{n_k} \rightharpoonup x^*$ and $\lim_{k \to \infty} \|\rho_{n_k} - y_{n_k}\| = 0$, we get $y_{n_k} \rightharpoonup x^* \in C$. It follows from $\lim_{k \to \infty} \|u_{n_k} - y_{n_k}\| = 0$ that the subsequence $\{u_{n_k}\}$ is bounded. For any $y \in H_n$, using (11), (14) and (18), we have
$$\begin{aligned}
\lambda_{n_k} f(y_{n_k}, y) &\ge \lambda_{n_k} f(y_{n_k}, u_{n_k}) + \langle \rho_{n_k} - u_{n_k}, y - u_{n_k}\rangle \\
&\ge \lambda_{n_k} f(\rho_{n_k}, u_{n_k}) - \lambda_{n_k} f(\rho_{n_k}, y_{n_k}) - c_1\lambda_{n_k}\|\rho_{n_k} - y_{n_k}\|^2 - c_2\lambda_{n_k}\|y_{n_k} - u_{n_k}\|^2 + \langle \rho_{n_k} - u_{n_k}, y - u_{n_k}\rangle \\
&\ge \langle \rho_{n_k} - y_{n_k}, u_{n_k} - y_{n_k}\rangle + \langle \rho_{n_k} - u_{n_k}, y - u_{n_k}\rangle - c_1\lambda_{n_k}\|\rho_{n_k} - y_{n_k}\|^2 - c_2\lambda_{n_k}\|y_{n_k} - u_{n_k}\|^2.
\end{aligned}$$
By (20) and the boundedness of $\{u_{n_k}\}$, the right-hand side tends to zero. Since $0 < \lambda_{n_k} \le \lambda < \min\left\{\frac{1}{2c_1}, \frac{1}{2c_2}\right\}$, condition (C4) and $y_{n_k} \rightharpoonup x^*$ yield $0 \le \limsup_{k \to \infty} f(y_{n_k}, y) \le f(x^*, y)$ for all $y \in H_n$. Since $C \subset H_n$, we get $f(x^*, y) \ge 0$ for all $y \in C$, that is, $x^* \in EP(f, C)$.    □
With the above results, we are now ready for the main convergence theorem.
Theorem 1.
Suppose that $\sum_{n=1}^{\infty} \delta_n\|x_n - x_{n-1}\| < \infty$, $\liminf_{n \to \infty} \theta_n(1 - \theta_n) > 0$, and conditions (C1)–(C4) are satisfied. Then the sequence $\{x_n\}$ generated by Algorithm 1 converges weakly to a point in $EP(f, C)$.
Proof. 
Let $\bar{\xi} \in EP(f, C)$. Since $0 < \lambda_n \le \lambda < \min\left\{\frac{1}{2c_1}, \frac{1}{2c_2}\right\}$, expression (8) implies that
$$\|u_n - \bar{\xi}\| \le \|\rho_n - \bar{\xi}\|. \tag{21}$$
By the definition of $\rho_n$ and $\sum_{n=1}^{\infty} \delta_n\|x_n - x_{n-1}\| < \infty$, we get
$$\lim_{n \to \infty} \|\rho_n - x_n\| = \lim_{n \to \infty} \delta_n\|x_n - x_{n-1}\| = 0. \tag{22}$$
Next, from the definitions of $\rho_n$ and $x_{n+1}$, and using (21), the following relation is obtained:
$$\|x_{n+1} - \bar{\xi}\| \le \theta_n\|\rho_n - \bar{\xi}\| + (1 - \theta_n)\|u_n - \bar{\xi}\| \le \theta_n\|\rho_n - \bar{\xi}\| + (1 - \theta_n)\|\rho_n - \bar{\xi}\| = \|\rho_n - \bar{\xi}\| \le \|x_n - \bar{\xi}\| + \delta_n\|x_n - x_{n-1}\|. \tag{23}$$
Applying $\sum_{n=1}^{\infty} \delta_n\|x_n - x_{n-1}\| < \infty$ and Lemma 2, we conclude that the sequence $\|x_n - \bar{\xi}\|$ converges. It follows from (22) and (23) that
$$\lim_{n \to \infty} \|x_n - \bar{\xi}\| = \lim_{n \to \infty} \|\rho_n - \bar{\xi}\|.$$
Next, applying the definition of $x_{n+1}$ with (5) and (21), we have
$$\|x_{n+1} - \bar{\xi}\|^2 \le \theta_n\|\rho_n - \bar{\xi}\|^2 + (1 - \theta_n)\|u_n - \bar{\xi}\|^2 - \theta_n(1 - \theta_n)\|\rho_n - u_n\|^2 \le \|\rho_n - \bar{\xi}\|^2 - \theta_n(1 - \theta_n)\|\rho_n - u_n\|^2,$$
which means that
$$\theta_n(1 - \theta_n)\|\rho_n - u_n\|^2 \le \|\rho_n - \bar{\xi}\|^2 - \|x_{n+1} - \bar{\xi}\|^2. \tag{24}$$
Expression (24) and $\liminf_{n \to \infty} \theta_n(1 - \theta_n) > 0$ imply that
$$\lim_{n \to \infty} \|\rho_n - u_n\| = 0. \tag{25}$$
By inequality (8), we obtain
$$(1 - 2c_1\lambda_n)\|\rho_n - y_n\|^2 + (1 - 2c_2\lambda_n)\|y_n - u_n\|^2 \le \|\rho_n - \bar{\xi}\|^2 - \|u_n - \bar{\xi}\|^2 = \left(\|\rho_n - \bar{\xi}\| + \|u_n - \bar{\xi}\|\right)\left(\|\rho_n - \bar{\xi}\| - \|u_n - \bar{\xi}\|\right) \le \bar{M}\|\rho_n - u_n\| \tag{26}$$
for some $\bar{M} > 0$. Using (25) and (26) together with $0 < \lambda_n \le \lambda < \min\left\{\frac{1}{2c_1}, \frac{1}{2c_2}\right\}$, we infer that
$$\lim_{n \to \infty} \|\rho_n - y_n\| = \lim_{n \to \infty} \|y_n - u_n\| = 0. \tag{27}$$
Finally, let $x^* \in H$ be such that $x_{n_k} \rightharpoonup x^*$ as $k \to \infty$ for some subsequence $\{x_{n_k}\}$ of $\{x_n\}$. By (22), we get $\rho_{n_k} \rightharpoonup x^*$ as $k \to \infty$. Then Lemma 6, together with (25) and (27), implies that $x^* \in EP(f, C)$. Using Opial's lemma (Lemma 3), we conclude that $\{x_n\}$ converges weakly to a point in $EP(f, C)$.    □
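To make the scheme concrete, the following is a minimal Python sketch of Algorithm 1 for the illustrative bifunction $f(x, y) = \langle F(x), y - x\rangle$ on $C = \{x : x \ge 0\}$; the operator, the feasible set, and the parameter choices are our own assumptions for demonstration, not the setting of the experiments in Section 4. For this bifunction, both arg min subproblems reduce to projections.

```python
import numpy as np

def project_halfspace(v, a, y):
    """Project v onto H = {z : <a, z - y> <= 0}; returns v if the constraint holds."""
    t = a @ (v - y)
    return v if t <= 0 else v - (t / (a @ a)) * a

def algorithm1(F, proj_C, x0, x1, lam=0.1, n_iter=100):
    """Sketch of Algorithm 1 for f(x, y) = <F(x), y - x>.

    Here w_n = F(rho_n) is the subgradient of f(rho_n, .) at y_n,
    and lam plays the role of the constant stepsize lambda_n = lambda.
    """
    x_prev, x = x0, x1
    for n in range(1, n_iter + 1):
        delta_n = 1.0 / 3**n            # assumed inertial parameters (summable)
        theta_n = 0.5                   # assumed relaxation parameter in (0, 1)
        rho = x + delta_n * (x - x_prev)
        y = proj_C(rho - lam * F(rho))                    # Step 1
        if np.allclose(rho, y):
            return y                    # rho_n = y_n solves the EP (Lemma 4)
        a = rho - lam * F(rho) - y      # normal direction defining H_n
        u = project_halfspace(rho - lam * F(y), a, y)     # Step 2
        x_prev, x = x, theta_n * rho + (1 - theta_n) * u  # Step 3
    return x

# Usage: F(x) = A x with <Ax, x> > 0, a monotone (hence pseudomonotone)
# operator, and C the nonnegative orthant; the solution here is the origin.
A = np.array([[2.0, 1.0], [-1.0, 2.0]])
sol = algorithm1(lambda x: A @ x, lambda x: np.maximum(x, 0.0),
                 np.ones(2), 0.5 * np.ones(2))
print(sol)
```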
We next show that stepsizes $\{\lambda_n\}$ satisfying the Lipschitz-like bound can be constructed in many ways, yielding new algorithms; in this sense, our algorithm is flexible to use. Inspired by the stepsize ideas of Rehman et al. [19,20], we can modify the stepsizes $\{\lambda_n\}$ of Algorithm 1 so that $0 < \lambda_n \le \lambda < \min\left\{\frac{1}{2c_1}, \frac{1}{2c_2}\right\}$ holds; we then obtain the following Algorithms 2 and 3.
Algorithm 2 Modified inertial subgradient extragradient Mann Algorithm 2
  • Initialization: Select arbitrary elements $x_0, x_1 \in H$, $0 < \lambda_1 < \min\left\{\frac{1}{2c_1}, \frac{1}{2c_2}\right\}$ and $\mu \in (0, 1)$.
  • Iterative Steps: Construct $\{x_n\}$ by using the following steps:
    Step 1. Set $\rho_n = x_n + \delta_n(x_n - x_{n-1})$, where $\{\delta_n\} \subset [0, \infty)$, and compute
    $$y_n = \arg\min_{y \in C}\left\{\lambda_n f(\rho_n, y) + \tfrac{1}{2}\|\rho_n - y\|^2\right\}.$$
    If $\rho_n = y_n$, then stop. Otherwise, go to Step 2.
    Step 2. Compute
    $$u_n = \arg\min_{y \in H_n}\left\{\lambda_n f(y_n, y) + \tfrac{1}{2}\|\rho_n - y\|^2\right\},$$
    where $w_n \in \partial_2 f(\rho_n, y_n)$ satisfies $\rho_n - \lambda_n w_n - y_n \in N_C(y_n)$, and the half-space is
    $$H_n = \{z \in H : \langle \rho_n - \lambda_n w_n - y_n, z - y_n\rangle \le 0\}.$$
    Step 3. Compute
    $$x_{n+1} = \theta_n \rho_n + (1 - \theta_n) u_n,$$
    where $\{\theta_n\} \subset (0, 1)$ and
    $$\lambda_{n+1} = \min\left\{\lambda_n, \frac{\mu\left(\|z_n - y_n\|^2 + \|x_{n+1} - z_n\|^2\right)}{2\left[f(y_n, x_{n+1}) - f(y_n, z_n) - f(z_n, x_{n+1})\right]_+}\right\}.$$
  • Replace n with n + 1 and return to Step 1.
Algorithm 3 Modified inertial subgradient extragradient Mann Algorithm 3
  • Initialization: Select arbitrary elements $x_0, x_1 \in H$, $0 < \lambda_1 < \min\left\{\frac{1}{2c_1}, \frac{1}{2c_2}\right\}$ and $\mu \in (0, 1)$.
  • Iterative Steps: Construct $\{x_n\}$ by using the following steps:
    Step 1. Set $\rho_n = x_n + \delta_n(x_n - x_{n-1})$, where $\{\delta_n\} \subset [0, \infty)$, and compute
    $$y_n = \arg\min_{y \in C}\left\{\lambda_n f(\rho_n, y) + \tfrac{1}{2}\|\rho_n - y\|^2\right\}.$$
    If $\rho_n = y_n$, then stop. Otherwise, go to Step 2.
    Step 2. Compute
    $$u_n = \arg\min_{y \in H_n}\left\{\lambda_n f(y_n, y) + \tfrac{1}{2}\|\rho_n - y\|^2\right\},$$
    where $w_n \in \partial_2 f(\rho_n, y_n)$ satisfies $\rho_n - \lambda_n w_n - y_n \in N_C(y_n)$, and the half-space is
    $$H_n = \{z \in H : \langle \rho_n - \lambda_n w_n - y_n, z - y_n\rangle \le 0\}.$$
    Step 3. Compute
    $$x_{n+1} = \theta_n \rho_n + (1 - \theta_n) u_n,$$
    where $\{\theta_n\} \subset (0, 1)$ and
    $$\lambda_{n+1} = \min\left\{\lambda_n, \frac{\mu f(z_n, x_{n+1})}{\left[f(y_n, x_{n+1}) - f(y_n, z_n) - c_1\|y_n - z_n\|^2 - c_2\|x_{n+1} - z_n\|^2\right]_+}\right\}.$$
  • Replace n with n + 1 and return to Step 1.

4. Application to Data Classification Problem of Educational Dataset

The educational dataset used in this classification consists of prospective mathematics teachers' technology-integrated competency levels, identified as A, B, C, and D. Following Niess et al. [25], the levels at which prospective mathematics teachers integrate technology into their teaching were classified in this study as Exploring (A), Adapting (B), Accepting (C), and Recognizing (D).
At level D, prospective mathematics teachers recognize technology usage in the classroom as distinct from pedagogical content knowledge. At level C, prospective mathematics teachers desire to integrate technology into their classrooms but may struggle to find ways to connect it to specific topics. At level B, to determine the use of technology in their classrooms, prospective mathematics teachers begin to make noticeably different adjustments in their pedagogy. Lastly, at level A, prospective mathematics teachers begin seeking more ways to integrate technology throughout the curriculum as another learning tool.
This study is part of a longitudinal study; the research team planned the research design before collecting the data, in accordance with the educational theory of mathematics education. The observation and analysis were validated by three experts in mathematics education, and their reliability was established by three researchers reaching consensus with all experts.
In the very first phase of this research, only four attributes were considered as factors. However, other unobserved factors emerged. Those factors were analyzed statistically to determine whether they affected the competency, and the findings were corroborated by the literature. That analysis belongs to another part of the research and is reported in other contributions. Moreover, all participants (prospective mathematics teachers) entered the program under the same selective examination and were controlled by the exit examination.
Consequently, 954 instances were used in data training, containing ten attributes: major, gender, GPA, IT for learning competency, innovative skill, technology knowledge for a specific subject, number of supplementations, curriculum pattern, selective technology courses, and competency level. A statistical overview of the data is given in Table 1, where CV denotes the coefficient of variation (%) and SD the standard deviation.
Before starting our work, we briefly describe the extreme learning machine (ELM) [26] for data classification problems. Let $U := \{(x_n, b_n) : x_n \in \mathbb{R}^K, b_n \in \mathbb{R}^G, n = 1, 2, \ldots, N\}$ be a training set of N distinct samples, where $x_n$ is the input training data and $b_n$ is the target. The output function of ELM for single-hidden-layer feedforward neural networks (SLFNs) with H hidden nodes and activation function L is
$$O_n = \sum_{i=1}^{H} w_i L(c_i x_n + e_i),$$
where $c_i$ and $e_i$ are the weight and bias parameters, respectively, and $w_i$ is the output weight at the i-th hidden node. The hidden-layer output matrix $\mathbf{L}$ is generated as follows:
$$\mathbf{L} = \begin{bmatrix} L(c_1 x_1 + e_1) & \cdots & L(c_H x_1 + e_H) \\ \vdots & \ddots & \vdots \\ L(c_1 x_N + e_1) & \cdots & L(c_H x_N + e_H) \end{bmatrix}.$$
Solving the ELM amounts to finding the optimal output weight $w = [w_1^T, \ldots, w_H^T]^T$ such that $[O_1^T, \ldots, O_N^T]^T = \mathbf{L}w = B$, where $B = [b_1^T, \ldots, b_N^T]^T$ is the training target data. The solution of the associated least squares problem can be expressed via the Moore–Penrose generalized inverse of $\mathbf{L}$, which may be difficult to find when it does not exist.
To avoid overfitting in machine learning, we use least squares regularization. The problem can be stated as the following convex minimization problem:
$$\min_{w} \left\{\|\mathbf{L}w - B\|_2^2 + \lambda\|w\|_1\right\}, \tag{28}$$
where λ is a regularization parameter. This problem is called the least absolute shrinkage and selection operator (LASSO) [27]. To apply our algorithms, we set the bifunction $f(x, y) = \langle \mathbf{L}^T(\mathbf{L}x - B), y - x\rangle$ for all x, y.
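The following sketch illustrates, under our own simplified assumptions (synthetic data, a random single hidden layer, and only the smooth part of (28)), how the hidden-layer output matrix and the bifunction can be assembled; it is not the authors' experimental code.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins for the training set (N samples, K features, G classes);
# the sizes mirror the paper's setup, but the data here are synthetic.
N, K, G, hidden = 954, 10, 4, 300
X = rng.standard_normal((N, K))            # input training data x_n
B = np.eye(G)[rng.integers(0, G, size=N)]  # one-hot competency labels b_n

# Single hidden layer with sigmoid activation: L has shape (N, hidden).
def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

c = rng.standard_normal((K, hidden))       # hidden-layer weights c_i
e = rng.standard_normal(hidden)            # hidden-layer biases e_i
L = sigmoid(X @ c + e)

# Bifunction casting the smooth part of (28) as an equilibrium problem:
# f(x, y) = <L^T (L x - B), y - x>, with x, y of shape (hidden, G).
def f(x, y):
    return np.sum((L.T @ (L @ x - B)) * (y - x))

# Largest eigenvalue of L^T L, i.e., the "K" used for the stepsize 0.999/K.
lam_max = np.max(np.linalg.eigvalsh(L.T @ L))
step = 0.999 / lam_max
print(step)
```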
We use four evaluation metrics, Accuracy, Precision, Recall, and F1-score [28], to compare the performance of the classification algorithms:
$$Accuracy\,(\%) = \frac{TP + TN}{TP + FP + TN + FN} \times 100\%,$$
$$Precision\,(\%) = \frac{TP}{TP + FP} \times 100\%,$$
$$Recall\,(\%) = \frac{TP}{TP + FN} \times 100\%,$$
$$F1\text{-}score\,(\%) = \frac{2 \times (Precision \times Recall)}{Precision + Recall},$$
where TP, TN, FP, and FN denote the numbers of true positive, true negative, false positive, and false negative results, respectively. For multi-class classification, the multi-class cross-entropy loss is used:
$$Loss = -\sum_{k=1}^{N} y_k \log \hat{y}_k,$$
where $y_k$ is 0 or 1, indicating whether class label k is the correct classification, $\hat{y}_k$ is the predicted probability of class k, and N is the number of scalar values in the model output.
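A small sketch of these evaluation formulas (the function names are ours; the clipping constant in the loss is an assumed numerical safeguard):

```python
import numpy as np

def classification_metrics(tp, tn, fp, fn):
    """Accuracy, precision, recall and F1-score, all in percent."""
    accuracy = 100.0 * (tp + tn) / (tp + fp + tn + fn)
    precision = 100.0 * tp / (tp + fp)
    recall = 100.0 * tp / (tp + fn)
    f1 = 2.0 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1

def cross_entropy_loss(y_true, y_pred, eps=1e-12):
    """Multi-class cross entropy: y_true one-hot, y_pred probabilities."""
    return -np.sum(y_true * np.log(np.clip(y_pred, eps, 1.0)))

print(classification_metrics(tp=81, tn=0, fp=19, fn=0))  # example counts only
```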
To start our computation, we set the activation function to sigmoid, the number of hidden nodes to 300, and the regularization parameter to λ = 10. We set $\delta_n = 1/3^n$ and $\alpha_n = \frac{5n}{10n+1}$ for Algorithms 1–3, μ = 0.99 for Algorithm 2, and $\mu = \lambda_1/2$ for Algorithm 3. The stopping criterion is the best accuracy of the training process (81.06%). The comparison of all algorithms with different parameters $\lambda_n$ of Algorithm 1 and $\lambda_1$ of Algorithms 2 and 3 is presented in Table 2.
Here, $K = \max(\mathrm{eigenvalue}(\mathbf{L}^T\mathbf{L}))$. We can see that the choice $0.999/K$ (for $\lambda_n$ in Algorithm 1 and $\lambda_1$ in Algorithms 2 and 3) yields the least training time and the fewest iterations, which markedly improves the performance of the algorithms. Next, we consider different choices of the parameters $\delta_n$ and $\alpha_n$ for Algorithms 1–3 in Table 3 and Table 4, respectively, with $\lambda_1 = 0.999/K$. Setting $\alpha_n = \frac{5n}{10n+1}$, we obtain the numerical results for different parameters $\delta_n$ shown in Table 3.
Using the best parameters from Table 3, $\delta_n = 0.99$ for Algorithms 1 and 2 and $\delta_n = 1/3^n$ for Algorithm 3, and setting $\lambda_1 = 0.999/K$ and $\mu = \lambda_1/2$, we obtain the numerical results for different parameters $\alpha_n$ shown in Table 4.
From Table 4, we see that $\alpha_n = \frac{9n}{10n+1}$ markedly improves the performance of Algorithms 1 and 2, and $\alpha_n = \frac{3n}{10n+1}$ markedly improves the performance of Algorithm 3. We next show the performance of our Algorithms 1–3 compared with the existing Algorithms (2)–(4) in Table 5.
Table 5 demonstrates that our algorithms are among those with the highest precision, recall, F1-score, and accuracy. Additionally, Algorithm 3 requires the lowest number of iterations; although the reduction in training time over the existing methods is slight, it has the best chance of correctly categorizing prospective mathematics teachers' technology-integrated competency level. Moreover, we report the training and validation loss together with the training accuracy to show that our algorithm does not overfit the training dataset.
From Figure 1 and Figure 2, we see that our model from Algorithm 3, with the suitable parameters in Table 2, Table 3 and Table 4, yields a well-fitting model, that is, one that generalizes well to similar data. Based on Figure 1 and Figure 2, the overfitting problem can be controlled by finding the best parameters of our algorithms for solving the least squares regularization problem (28).
We implemented an inertial subgradient extragradient method for the equilibrium problem on an educational dataset of 954 instances containing ten attributes, including major, gender, GPA, IT for learning competency, innovative skill, technology knowledge for a specific subject, number of supplementation, curriculum pattern, selective technology courses and competency level. The accuracy of classification achieved by the proposed machine learning algorithm was evaluated and 81.06% of the dataset was classified accurately with fewer iterations compared to other methods.

5. Conclusions and Discussion

This study proposes a new method based on machine learning algorithms to predict the technology-integrated competency level of prospective mathematics teachers, taking their data related to different aspects as the source. The performance of an extragradient method for the equilibrium problem was calculated and compared for predicting the technology-integrated competency. This study had two focuses: first, the prediction of competency based on the skills and knowledge developed throughout teacher education programs; second, the comparison of the performance of machine learning algorithms.
The results show that the proposed method achieved a classification accuracy of 81.06%. Accordingly, it can be said that major, gender, GPA, IT for learning competency, innovative skill, technology knowledge for a specific subject, number of supplementations, curriculum pattern, and selective technology courses are significant predictors of technology-integrated competency.
Even though this study focused on technology-integrated competency in mathematics classrooms, it was noticed that the major of prospective teachers was one of the predictors. Because a large number of prospective teachers in Thailand may teach out of their field upon entering the profession [29], the major of the teacher education program was also analyzed to determine whether technology-integrated competency differs when graduates are required to teach mathematics in the future [30].
Comparing the results of this study to other studies on technology integration by mathematics teachers, it was discovered that gender is one of the best predictors of teachers’ intentions to implement technology in their classes [31,32]. In addition, general technological skills and knowledge, such as IT for learning competency and innovative skill, are without a doubt effective predictors of technology-integrated competency [25,33,34]. Additionally, the integration of technology knowledge and content knowledge, also known as Technological Content Knowledge (TCK), was represented by the attribute of technology knowledge for a specific subject, which is a predictor for predicting technology-integrated competency [25,34,35].
The curriculum pattern, according to the educational dataset analyzed in this study, is a new finding that distinguishes this study from others. The curriculum pattern attribute takes three values, Pattern 1, Pattern 2, and Pattern 3, denoting the patterns in which prospective teachers study the most courses in pedagogical knowledge, content knowledge, and technological knowledge, respectively. This finding indicates that when prospective teachers were trained under different knowledge patterns, their technology-integrated competency also developed differently [36,37].
Using this approach, it is possible to anticipate future technology-integrated competency based on these findings. By projecting prospective teachers' technology-integrated competency, pre-service teachers can examine and improve their working techniques and proficiency. Given that teacher education programs span about four years, the significance of the proposed strategy is easy to comprehend.
The practical achievement of this study is a curriculum revision policy for university-level mathematics education programs. Particularly, the program should offer additional TCK courses, and the redesigned curriculum should place a greater emphasis on technology knowledge for mathematics. In addition, the result specifies the concept of the required curriculum pattern in terms of weighing pedagogical knowledge, content knowledge, and technology knowledge courses.
In the comparison of the machine learning performance with other methods, it was found that our Algorithm 3 uses fewer iterations than most of the existing algorithms while attaining the same highest precision, recall, F1-score, and accuracy; it matches the iteration count of the existing Algorithm (3), although it takes slightly less time to train on the data. This means that either algorithm can be chosen to work with these data.
The results demonstrate that machine learning techniques can be used to predict the technology-integrated competency of prospective mathematics teachers. The results of this study can assist educators in identifying pre-service teachers with below- or above-average technology integration. In addition, such data-driven studies are very significant for establishing a prospective teacher competency analysis framework in teacher education and contributing to decision-making for policy design.
Future research can be undertaken by incorporating additional input attributes and machine learning methods into the modeling procedure. In addition, it is crucial to leverage the efficacy of the extragradient method in order to analyze the learning patterns of individuals, address their issues, enhance the educational environment, and enable data-driven decision-making for the policy design of teacher education in Thailand.

Author Contributions

Conceptualization, W.C. and N.J.-o.; methodology, W.C.; software, W.C. and R.S.; validation, R.S. and N.J.-o.; formal analysis, W.C. and N.J.-o.; investigation, R.S.; resources, N.J.-o.; data curation, N.J.-o. and W.C.; writing—original draft preparation, N.J.-o.; writing—review and editing, R.S.; supervision, W.C.; project administration, N.J.-o. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

Watcharaporn Cholamjiak would like to thank the National Research Council of Thailand and the University of Phayao (N42A650334), and Thailand Science Research and Innovation, University of Phayao (FF66-UoE). The authors would like to thank Lampang Rajabhat University for providing the raw dataset analyzed in the main result of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. National Council of Teachers of Mathematics. Principles to Actions: Ensuring Mathematical Success for All; National Council of Teachers of Mathematics: Reston, VA, USA, 2014.
  2. Graham, C.R.; Burgoyne, N.; Cantrell, P.P.; Smith, L.M.; Clair, L.S.; Harris, R. TPACK development in science teaching: Measuring the TPACK confidence of inservice science teachers. TechTrends 2009, 53, 70–79.
  3. Roschelle, J.; Leinwand, S. Improving student achievement by systematically integrating effective technology. NCSM J. Math. Educ. Leadersh. 2011, 13, 3–9.
  4. Niess, M.L.; Roschelle, J. Transforming Teachers' Knowledge for Teaching Mathematics with Technologies through Online Knowledge-Building Communities. In Proceedings of the 40th Annual Meeting of the North American Chapter of the International Group for the Psychology of Mathematics Education, Greenville, SC, USA, 15–18 November 2018; pp. 44–62.
  5. Ahshan, R. A framework of implementing strategies for active student engagement in remote/online teaching and learning during the COVID-19 pandemic. Educ. Sci. 2021, 11, 483.
  6. Hill, H.C.; Rowan, B.; Ball, D.L. Effects of teachers' mathematical knowledge for teaching on student achievement. Am. Educ. Res. J. 2005, 42, 371–406.
  7. Barlovits, S.; Jablonski, S.; Lázaro, C.; Ludwig, M.; Recio, T. Teaching from A Distance—Math Lessons during COVID-19 in Germany and Spain. Educ. Sci. 2021, 11, 406.
  8. National Council of Teachers of Mathematics. Catalyzing Change in Middle School Mathematics: Initiating Critical Conversations; National Council of Teachers of Mathematics: Reston, VA, USA, 2020.
  9. Adipat, S. Developing Technological Pedagogical Content Knowledge (TPACK) through Technology-enhanced Content and Language-Integrated Learning (T-CLIL) instruction. Educ. Inf. Technol. 2021, 26, 6461–6477.
  10. Thomas, M.O.J.; Hong, Y.Y. Teacher integration of technology into mathematics learning. Int. J. Technol. Math. Educ. 2013, 20, 69–84.
  11. Muu, L.D.; Oettli, W. Convergence of an adaptive penalty scheme for finding constrained equilibria. Nonlinear Anal. Theory Methods Appl. 1992, 18, 1159–1166.
  12. Blum, E.; Oettli, W. From optimization and variational inequalities to equilibrium problems. Math. Stud. 1994, 63, 123–145.
  13. Tan, B.; Cho, S.Y. Strong convergence of inertial forward-backward methods for solving monotone inclusions. Appl. Anal. 2021, 101, 1–29.
  14. Rehman, H.U.; Kumam, W.; Sombut, K. Inertial modification using self-adaptive subgradient extragradient techniques for equilibrium programming applied to variational inequalities and fixed-point problems. Mathematics 2022, 10, 1751.
  15. Muangchoo, K. Three novel two-step proximal-like methods for solving equilibrium and fixed point problems in real Hilbert spaces. Comp. Appl. Math. 2022, 41, 374.
  16. Tran, D.Q.; Dung, M.L.; Nguyen, V.H. Extragradient algorithms extended to equilibrium problems. Optimization 2008, 57, 749–776.
  17. Korpelevich, G. The extragradient method for finding saddle points and other problems. Matecon 1976, 12, 747–756.
  18. Censor, Y.; Gibali, A.; Reich, S. Algorithms for the split variational inequality problem. Numer. Algorithms 2012, 59, 301–323.
  19. Rehman, H.U.; Kumam, P.; Cho, Y.J.; Yordsorn, P. Weak convergence of explicit extragradient algorithms for solving equilibrium problems. J. Inequalities Appl. 2019, 2019, 1–25.
  20. Rehman, H.U.; Kumam, P.; Kumam, W.; Shutaywi, M.; Jirakitpuwapat, W. The inertial sub-gradient extra-gradient method for a class of pseudo-monotone equilibrium problems. Symmetry 2020, 12, 463.
  21. Polyak, B.T. Some methods of speeding up the convergence of iteration methods. USSR Comput. Math. Math. Phys. 1964, 4, 1–17.
  22. Tiel, J.V. Convex Analysis: An Introductory Text, 1st ed.; Wiley: New York, NY, USA, 1984.
  23. Auslender, A.; Teboulle, M.; Ben-Tiba, S. A logarithmic-quadratic proximal method for variational inequalities. Comput. Optim. Appl. 1999, 12, 31–40.
  24. Bauschke, H.H.; Combettes, P.L. Convex Analysis and Monotone Operator Theory in Hilbert Spaces, 2nd ed.; CMS Books in Mathematics; Springer: Cham, Switzerland, 2017.
  25. Niess, M.L.; Ronau, R.N.; Shafer, K.G.; Driskell, S.O.; Harper, S.R.; Johnston, C.; Browning, C.; Özgün-Koca, S.A.; Kersaint, G. Mathematics teacher TPACK standards and development model. Contemp. Issues Technol. Teach. Educ. 2009, 9, 4–24.
  26. Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501.
  27. Tibshirani, R. Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. Ser. B Stat. Methodol. 1996, 58, 267–288.
  28. Han, J.; Kamber, M.; Pei, J. Data Mining: Concepts and Techniques, 3rd ed.; Morgan Kaufmann Publishers: Waltham, MA, USA, 2012; p. 978.
  29. Ingersoll, R. A comparative study of teacher preparation and qualifications in six nations. CPRE Policy Briefs 2007, 47, 1–16.
  30. Ndlovu, M.; Ramdhany, V.; Spangenberg, E.D.; Govender, R. Preservice teachers' beliefs and intentions about integrating mathematics teaching and learning ICTs in their classrooms. ZDM 2020, 52, 1365–1380.
  31. Anderson, S.E.; Maninger, R.M. Preservice teachers' abilities, beliefs, and intentions regarding technology integration. J. Educ. Comput. Res. 2007, 37, 151–172.
  32. Raman, A.; Thannimalai, R. Importance of technology leadership for technology integration: Gender and professional development perspective. SAGE Open 2019, 9, 1–13.
  33. Niess, M.L. Investigating TPACK: Knowledge growth in teaching with technology. J. Educ. Comput. Res. 2011, 44, 299–317.
  34. Bonafini, F.C.; Lee, Y. Investigating prospective teachers' TPACK and their use of mathematical action technologies as they create screencast video lessons on iPads. TechTrends 2021, 65, 303–319.
  35. Mouza, C.; Karchmer-Klein, R.; Nandakumar, R.; Ozden, S.Y.; Hu, L. Investigating the impact of an integrated approach to the development of preservice teachers' Technological Pedagogical Content Knowledge (TPACK). Comput. Educ. 2014, 71, 206–221.
  36. Durak, H.Y. Preparing pre-service teachers to integrate teaching technologies into their classrooms: Examining the effects of teaching environments based on open-ended, hands-on and authentic tasks. Educ. Inf. Technol. 2021, 26, 5365–5387.
  37. Ratnayake, I.; Thomas, M.; Kensington-Miller, B. Professional development for digital technology task design by secondary mathematics teachers. ZDM 2020, 52, 1423–1437.
Figure 1. Accuracy plots of the iteration of Algorithm 3 from Table 5.
Figure 2. Loss plots of the iteration of Algorithm 3 from Table 5.
Table 1. Overview of all attributes used to train the models.

Attributes | Mean | SD | CV | Min | Max
Major | 2.54 | 1.08 | 42.72 | 1 | 4
Gender | 1.77 | 0.42 | 23.71 | 1 | 2
GPA | 3.55 | 0.68 | 19.24 | 2 | 5
IT for learning competency | 3.81 | 0.99 | 26.05 | 1 | 5
Innovative skill | 3.97 | 0.88 | 22.25 | 1 | 5
Technology knowledge for a specific subject | 4.38 | 0.92 | 21.00 | 1 | 5
Number of supplementations | 1.16 | 0.41 | 35.76 | 1 | 4
Curriculum pattern | 2.09 | 0.73 | 35.27 | 1 | 3
Selective technology courses | 7.34 | 2.12 | 28.91 | 2 | 13
Table 2. Numerical results for different parameters $\lambda_n$, $\lambda_1$.

$\lambda_n$, $\lambda_1$ | Alg. 1: Training Time (s) | Alg. 1: Iter. | Alg. 2: Training Time (s) | Alg. 2: Iter. | Alg. 3: Training Time (s) | Alg. 3: Iter.
$0.999/\|L\|^2$ | 0.1329 | 44 | 0.1508 | 44 | 0.0636 | 2
$0.999/K$ | 0.1226 | 44 | 0.1464 | 44 | 0.0498 | 2
$0.9999/\|L\|^2$ | 0.1497 | 48 | 0.1629 | 48 | 0.0678 | 2
$0.9999/K$ | 0.1363 | 48 | 0.1585 | 48 | 0.0519 | 2
$1/\|L\|^2$ | 0.1465 | 49 | 0.1660 | 49 | 0.0596 | 2
$1/K$ | 0.1364 | 49 | 0.1599 | 49 | 0.0507 | 2
Table 3. Numerical results for different parameters $\delta_n$.

$\delta_n$ | Alg. 1: Training Time (s) | Alg. 1: Iter. | Alg. 2: Training Time (s) | Alg. 2: Iter. | Alg. 3: Training Time (s) | Alg. 3: Iter.
0.2 | 0.1114 | 37 | 0.1902 | 37 | 0.0593 | 2
0.99 | 0.0411 | 11 | 0.0420 | 11 | 0.0714 | 2
$1/3^n$ | 0.1552 | 44 | 0.1800 | 44 | 0.0479 | 2
$1/10^n$ | 0.1746 | 45 | 0.2034 | 45 | 0.0572 | 2
$\frac{1}{n^2\|x_n - x_{n-1}\| + 1}$ | 0.0430 | 14 | 0.0675 | 14 | 0.0573 | 2
Table 4. Numerical results for different parameters $\alpha_n$.

$\alpha_n$ | Alg. 1: Training Time (s) | Alg. 1: Iter. | Alg. 2: Training Time (s) | Alg. 2: Iter. | Alg. 3: Training Time (s) | Alg. 3: Iter.
$n/(10n+1)$ | 0.1210 | 26 | 0.1099 | 26 | 0.0524 | 2
$3n/(10n+1)$ | 0.0543 | 15 | 0.0569 | 15 | 0.0479 | 2
$5n/(10n+1)$ | 0.0411 | 11 | 0.0420 | 11 | 0.0498 | 2
$7n/(10n+1)$ | 0.0418 | 9 | 0.0373 | 9 | 0.0487 | 2
$9n/(10n+1)$ | 0.0298 | 8 | 0.0324 | 8 | 0.0505 | 2
Table 5. The performance of our Algorithms 1–3 compared with the other existing algorithms.

 | Iter. No. | Training Time (s) | Precision (%) | Recall (%) | F1-Score (%) | Accuracy (%)
Algorithm 1 | 8 | 0.0298 | 81.06 | 100 | 89.54 | 81.06
Algorithm 2 | 8 | 0.0324 | 81.06 | 100 | 89.54 | 81.06
Algorithm 3 | 2 | 0.0479 | 81.06 | 100 | 89.54 | 81.06
Algorithm (2) | 23 | 0.0711 | 81.06 | 100 | 89.54 | 81.06
Algorithm (3) | 2 | 0.0480 | 81.06 | 100 | 89.54 | 81.06
Algorithm (4) | 23 | 0.0787 | 81.06 | 100 | 89.54 | 81.06
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
