Next Article in Journal
Wearable Technologies in Field Hockey Competitions: A Scoping Review
Previous Article in Journal
Discovering Vegetation Recovery and Landslide Activities in the Wenchuan Earthquake Area with Landsat Imagery
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Algorithm of a Game-Based System in the Relation between an Operator and a Technical Object in Management of E-Commerce Logistics Processes with the Use of Machine Learning

1
Faculty of Management and Finance, WSB University in Gdańsk, 80-266 Gdańsk, Poland
2
Faculty of Management, Czestochowa University of Technology, 42-201 Częstochowa, Poland
3
The Briese Schiffahrts GmbH & Co. KG, Hafenstraße 12, 26789 Leer, Germany
4
Institute of Environmental Engineering and Biotechnology, University of Opole, 45-040 Opole, Poland
5
Department of Information System and Technology, Poltava State Agrarian Academy, 36003 Poltava, Ukraine
*
Author to whom correspondence should be addressed.
Sensors 2021, 21(15), 5244; https://doi.org/10.3390/s21155244
Submission received: 6 July 2021 / Revised: 28 July 2021 / Accepted: 30 July 2021 / Published: 3 August 2021
(This article belongs to the Section Internet of Things)

Abstract

:
Machine learning (ML) is applied in various logistic processes utilizing innovative techniques (e.g., the use of drones for automated delivery in e-commerce). Early challenges showed the insufficient drones’ steering capacity and cognitive gap related to the lack of theoretical foundation for controlling algorithms. The aim of this paper is to present a game-based algorithm of controlling behaviours in the relation between an operator (OP) and a technical object (TO), based on the assumption that the game is logistics-oriented and the algorithm is to support ML applied in e-commerce optimization management. Algebraic methods, including matrices, Lagrange functions, systems of differential equations, and set-theoretic notation, have been used as the main tools. The outcome is a model of a game-based optimization process in a two-element logistics system and an algorithm applied to find optimal steering strategies. The algorithm has been initially verified with the use of simulation based on a Bayesian network (BN) and a structured set of possible strategies (OP/TO) calculated with the use of QGeNie Modeller, finally prepared for Python. It has been proved the algorithm at this stage has no deadlocks and unforeseen loops and is ready to be challenged with the original big set of learning data from a drone-operating company (as the next stage of the planned research).

1. Introduction: E-Commerce Development Ways and Challenges

The convenience, security and safety, and user experience of e-commerce have improved exponentially since its inception in the 1970s; however, its sense continues to change as new techniques, tools, and possibilities are gradually introduced. Unfortunately, this observation does not refer to the field of scientific e-commerce definitions and its range of understanding.
According to very first definitions from the beginning of modern electronic commerce by Roger Clarke, e-commerce is defined as the conduct of commerce in goods and services, with the assistance of telecommunication and telecommunication-based tools [1]. Jerry Allison also defines e-commerce as electronic contracting for the exchange of value through the use of information technology (IT), computing, and communication technology (ICT) [2]. According to Vladimir Zwass [3], e-commerce shares business information, maintaining business relationships and conducting business transactions by means of telecommunication networks. E-commerce has also been defined as doing business over the internet, selling goods and services that are delivered offline as well as products which can be “digitized” and delivered online, such as computer software. Anil Khurana defines e-commerce as using the power of computer, the internet, and shared software to send and receive product specifications and drawings; bids, purchase orders, and invoices; and any other type of data that needs to be communicated to customers, suppliers, employees, or the public. A similar but more developed definition is presented by Dan Cudjoe stating that e-commerce is “[…] business, technology, society, and skills of buying and selling of products and services with the aid of internet and computer or handheld and automated devices which involves the process of ordering products or services to the time of delivery to the consumer or customer” [4].
Unfortunately, there is a significant lack of definitions that reflect the newest “achievements” of e-commerce based on artificial intelligence (AI) implementation in the automation of transportation as a part of e-commerce supply/delivery processes [5]. This is relevant especially considering the hierarchical framework of e-commerce, consisting of three meta-levels: infrastructure, services, and products [6]. Additionally, several nodal problems have defined future developments in e-commerce, including integrating electronic platforms and electronic data interchange (EDI), new intermediation, and automation and structurally integrated autonomous transportation processes [7]. According to recent research findings, this is a way e-commerce can develop [8]. However, apart from a few attempts at the enlargement of e-commerce application to automated processes in warehousing [9] and management [10], there is a lack of developed definitions covering automated delivery processes (with use of AI/ML) as a vital and modern part of e-commerce.
Utilising this cognitive gap, the Authors make an attempt to prove that automation and autonomous transportation in e-commerce logistic processes have become practical activities and should be therefore treated as a part of contemporary e-commerce [11]. As a logic consequence, this type of e-commerce activity should be supported by the properly prepared ML algorithms with control applications (prepared in the field of applied theory) for better use of automated guided vehicles (AGV), including drones within the range of B2C e-commerce delivery processes [12].
The organization of this paper is based on the architecture of the presented research. The domain of the entire research is set on the modern e-commerce activities (depicted in the Introduction), and it assumes utilization of AI and ML in the field of e-commerce development and optimization (paragraph 2). The discussion (part 3) shows a cybernetic foundation of the selected elements of a game-based system (GS) and its usability in ML in the activities related to e-commerce automated delivery systems (drones). The solution is presented in paragraph 4. It involves application of algebraic methods viewed from the game-based perspective of e-commerce logistics processes (entities: OP/TO). The findings related to this part of the research are presented in paragraph 5 (Results): an original model of a game-based process as a logistics zero-sum game with an algorithm of choice optimization for e-commerce services has been developed. In order to verify the presented algorithm, a simulation based on a Bayesian Network has been introduced. A structured set of possible strategies (OP/TO) as a matrix (C) has been calculated and presented with the use of QGeNie Modeller, and finally the algorithm has been prepared with Python (only the initial dataset). The presented paper and this stage of research have been eventually concluded in paragraph 6. The research will be continued with the use of a big dataset (learning data) obtained from a drones-operating company (flights of unmanned aerial systems (UAS) beyond visual line of the operator’s sight (BVLOS) with full information about flight conditions e.g., weather, cargo, directions, battery status, geographical conditions, topography, etc.—the Authors’ note).

2. Artificial Intelligence and Machine Learning as Tools for E-Commerce Development

The past decade has been a time of unprecedented development of AI—including not only breakthrough research on ML algorithms [13] but also more common application of intelligent machines in various fields of human life [14]. Considering their widespread application in the pragmatics of economic processes, logistics processes that take place in e-commerce constitute an excellent testing field for the implementation of AI and ML solutions. ML(the term of machine learning (ML) was introduced in 1959 by Arthur Samuel who defined it as a field of study that gives computers the ability to learn without being explicitly programmed—the Authors’ note) is a branch of AI (the term was introduced by a computer scientist and a pioneer in the field of machine learning, Tom Mitchell—the Authors’ note) defined as “[…] developing and testing computer algorithms which allow computer software to improve through experience” [15]. ML is a method by which we expect to achieve AI. In practice, ML involves working with sets of data in order to find common and universal behaviour patterns, while deep learning (DL) [16] is based on multilayer perceptrons (MLPs) [17] understood as a mathematical function mapping some set of input values to output values [18].
According to Tom Mitchell [19], we can state that a machine learns how to perform a “T” task on the basis of experience “E” and a measure of performance quality “P” if the quality of performing the “T” task is improved with an increase in experience “E”, measured with “P”. Hence, the main aim of ML solutions is the practical application of algorithms to develop automatic systems that can improve themselves using experience (collected data) and, based on that, can acquire new knowledge.
ML can be classified by various criteria. Considering the type of examples and information provided [20], it is possible to distinguish supervised learning and unsupervised learning. Some representatives of the AI theoretical stream [21,22] indicate another method, namely, reinforcement learning.
In accordance with the assumed classification of ML methods, supervised learning can be observed when a set of data provided to a machine to learn also includes expected answers, whereas unsupervised learning takes place when there are no answers provided except for a set of data [23,24]. Reinforcement learning is observed when a system operates in a completely unknown environment. There are no specific input and output data. The only piece of information received by a learning machine is a reinforcement signal. The reinforcement signal can be either positive (reward) or negative (punishment)(an example can be a new game, the rules of which we do not know. After the game has been finished, we find out whether we have won or lost (reward or punishment). Hence, in the subsequent games, the level of the player’s advancement should be higher (as it results from the increase in experience)—the Authors’ note).
The algorithm of reinforcement learning can be generally defined as a repetitive (recursive) procedure of knowledge acquisition by the trial-and-error method. In applied terminology, in the control theory [25,26], it is possible to state that a controller (a regulator) interacts with a controlled object (environment or process), using three signals:
  • The status of the system (x),
  • Control (action–u),
  • Reward–r (or control cost).
According to the assumed scheme of the procedures [27], at each stage of the algorithm, the controller observes the status of the object and then performs an action which takes the object to the next stage. At the same time, the controller receives a signal that evaluates (assesses) the performed action in the form of a reward. After the reward has been received, the controller takes the next step in the algorithm.
Understood in such a way, the model of procedures in reinforcement learning algorithms is applied in various systems, e.g., Game-based Systems (GS) (two-entity zero-sum games in the environment of the theory of logistics games), which can be utilized in e-commerce automated delivery systems [28].

3. Discussion: Selected Elements of a Game-Based System (GS) and Its Usability in ML in the Field Related to E-Commerce Automated Delivery Systems

Recently, the development of e-commerce has dramatically increased [29], especially considering the pandemic (covid-19) change in customer behaviours and a need for contactless logistic processes mainly in purchasing and delivery phases of such specific (adopted to the pandemic situation) chains of supply.
To achieve this goal, several innovative delivery techniques have been introduced and practically challenged. Apart from the wide use of AGV within logistic processes, there is an idea of automated drone delivery. Prime Air is a drone delivery service currently in development by Amazon. Operations were expected to begin in selected cities in late 2019; however, in December 2020 the service was still not available (Amazon received US Federal Aviation Administration approval to operate its fleet of Prime Air; however, it is not the only company seeking to expand commercial drone delivery; Alphabet-owned Wing and the US UPS have also been granted FAA approval, and overall status of the concept at the end of May 2021 is still experimental —the Authors’ note.)
This service is designed to use delivery drones to autonomously fly individual packages to customers (within e-commerce chain of supply) within 30 min from ordering. To be qualified for such a service, the order must be small and light enough (up to 5 pounds—2.25 kg) to fit the drone cargo box and delivery location must be within a 10-mile (16 km) radius of the participating Amazon order fulfilment centre. The biggest challenge is, however, to introduce an intelligent delivery drone control algorithm based on the game theory and use of AI within logistic e-commerce system.
Considering various approaches towards the definition of a system [30,31,32,33], it is possible to identify a category related to this term, which is referred to GS. Such systems, in which there are two entities participating in a game, can be theoretically and practically applied in the environment of logistics processes. The functioning of GS involves two types of information [34]:
  • Indispensable initial information about the properties of the process (IIPP) ( t θ ) , which is to form a set of parameters referring to the properties of the process implemented in (GS), which are necessary to identify a possible set of solutions and tasks of the convenience function (fv);
  • Initial (a priori) information referring to the properties of the object (IIPO) ( t θ ) , which will become a predefined set of the object parameters, information about limitation, coercion, and other factors indispensable to the implementation of the process in (GS).
Obtaining the IIPP and IIPO information also assumes the use of the Internet sources and big data [35]. It allows the object and the function of purpose to have an indispensable scope of information indicated. Hence, the information about the object includes:
  • Information about the operator of the object
    -
    Information about external factors and interference affecting the object;
    -
    Information about the response of the object to signals.
  • Information that defines the purpose includes
    -
    The criterion defining the quality of the (GS) operation,
    -
    The satisfying signal determining the purpose of operation;
  • Working information WI ( t θ ) , a set of information about the status of the process acquired.
An identifier of GS is using orders generated from the minimal piece of working information, according to the algorithm of playing the game (G). In accordance with the principles [36], the game consists of numerous subsequent stages, and it is reflected in a given game-based process. As a result, in GS orders are most often generated by one of the entities participating in the game. The finished game (G) is presented with the use of matrices [M]. A matrix is a function f : { 1 , 2 , , m ¯ } × [ 1 , 2 , n } R that is defined with the following equation [33]:
f : ( i ,   j ) = m ¯ i j
This function is explicitly defined by its values, which are presented as the following matrix [33]:
[ M ] = [ m ¯ 11 ,     m ¯ 1 w ,   ,   m ¯ 1 n m ¯ 21 ,     m ¯ 22 ,   ,   m ¯ 2 n                                                                                                                                 m ¯ m 1 ,     m ¯ m 2 ,   ,   m ¯ m n ]
The main principle of GS is selecting orders on the basis of a comparison performed in a set of solutions—possible choices at each stage of the process in the system. The criterion for comparing various possible solutions is the convenience function fv. This function is given during the development of the (GS), based on the analysis of the process. Raj, Biswas, and Srivastava [36] and Kordel and Kuriata [33] refer to the solutions that correspond to the extreme value fv as to quasi-optimal. The main link of a game-based system is a digital machine, which implements a game-based algorithm: it defines a set of possible choices and a quasi-optimal choice in that set. The game-based system (GS) and its essence are presented in Figure 1.
At the beginning of the process in a game-based system, some working information about the A entity is put into a digital machine implementing a game-based algorithm. Based on the working information and initial information (indispensable initial information), the digital machine defines a quasi-optimal choice and generates relevant orders. The orders affect the process implemented in (GS). The indispensable working information about action or about the status of the B entity at the initial moment (generating the first order) can be the complete IIPO ( t θ ) .
The most important parameters of the dynamics of processes implemented in the (GS) can be expressed with the use of two basic characteristics: a time-varying value of the expected convenience function E ( f v ( t θ ) ) and the dispersion of the convenience function R ( f v ( t θ ) ) defined by the form of that function.
The sequence of the values of the expected convenience function E ( f v ( t θ ) ) at the particular stages of the operation is characterised by the main expected relation of the operation and its change in time. The discussed sequence is a discrete function of time that defines the expected value of the convenience function after the impact on the particular n-stages of the operation is finished. The process implemented in the (GS) is always a random process. The convenience function fv is also a discrete function of time. The second characteristic of the dynamics of the process implemented in the (GS) is a change in the dispersion of the function in time. The dispersion of the convenience function R ( f v ( t θ ) ) can be characterised by the entropy or the distribution of that function.
Based on the above-mentioned considerations and general statements [36,37], it is possible to state that the notion of a game-based system can be expressed with the use of its semantic model which is a set-theoretic model. Hence, a model of the (GS) notion can be the following system of notions [33]:
G S = G S { { A ( I I I A E ( t ) ,   I I A E ( t ) ,   W I A E ( t ) ;   B ( I I B E ( t ) ,   W I B E ( t ) ;   G A { [ M } ,   X ( t ) } S A ( t ) ;   S B ( t ) ; S A * ( t ) ;   S A * ( t ) ;   W W A ( t ) ;   W W A ( t ) ;   t θ ;   R } }
where GA—Is a game-based algorithm; SA(t)—is a set of possible strategies of the A entity in the (GS) at the particular moment ( t θ ) ; SB(t)—is a set of possible strategies of the B entity in the (GS) at the particular moment ( t     θ ) ; S A * ( t ) —is a set of mixed strategies of the A entity in the (GS) at the particular moment ( t     θ ) ; S B * ( t ) —is a set of mixed strategies of the B entity in the (GS) at the particular moment ( t     θ ) ; W W A ( t ) —is the level of the payoff obtained by the A entity if the A entity applies the S A i ( t ) strategy and the B entity applies the S B j ( t ) strategy in (GS); W W B ( t ) —is the level of the payoff obtained by the B entity if the B entity applies the S B j ( t ) strategy and the A entity applies the S A i ( t ) strategy in the (GS); and R—is the set of relations observed among the elements in the (GS).
Presented in the Equation (3), the structure of the (GS) may become a foundation for theoretical considerations including also logistics games. It may be also treated as a basis for decisions, which are comprised of a set of possible decisions and are made after the implementation of the process in a defined logistics system, for example, viewed from the perspectives of (OP) and (TO) in the particular time ( t   Θ ) . The information model of such a relation is presented in Figure 2.
Provided above, the presentation of the (GS) relation for (OP) and (TO) in the time makes it ( t Θ ) possible to present the processes that can be observed there in more detail, through the algebraisation [38] of a game-based system in the relation between an operator and a technical object with the use of matrices, Lagrangian functions, systems of differential equations, and set-theoretic notations, with the assumption that the considerations refer to the theory of logistics games.

4. Solution: Application of Algebraic Methods Viewed from the Game-Based Perspective of E-Commerce Logistics Processes (Entities: An Operator (OP)—A Technical Object (TO) e.g., a Drone and Its Operator)

A problem to be considered is a description of the course of a process under the conditions of uncertain information between two logistics-oriented entities: an operator (an agent) (OP) and a technical object (TO), which makes a drone and its operator a very suitable example. The course of the process must include potential sets of strategies for both entities (SOP, STO) and the level of interferences (Z) defined under the IPSA ( t Θ ) . A process-based approach toward the discussed system for the reality of a logistics game (LG) is presented in Figure 3.
Hence, it is possible to assume that the LG is a sum of elements, which match the following equation [33]:
L G = L G { ( O P ) , ( T O ) ;   ( G ) ; S O P ( t ) ;   S T O ( t ) ; S T O * ( t ) ;   S T O * ( t ) ;   W W O P ( t ) ;   W W T O ( t ) ;   t θ ;   R
A finished game (G) is given with the use of the matrices:
G ( i , j ) { 1 , 2 ,   ,   n } 2 c i j C
where ( i ,   j ) = 1 ,   4   ¯   ,   because it is assumed that the sets of possible strategies of (TO) and of (OP) have four elements each, hence: Sensors 21 05244 i001
Considering a statement referring to the game theory [39], it is assumed that each two-entity zero-sum game has at least one solution that defines the value of the game and the entities’ optimal strategies if:
  • α = ν = β —It is a closed game with optimal pure strategies for the (OP) and (TO) sub-systems in the (GS);
  • α ν β —It is an open game and sets of mixed strategies for the (OP) and (TO) sub-systems in the (GS).
The sets of mixed strategies for the (OP) and (TO) sub-systems can be presented as follows [33]:
S O P *   ( t ) = ( S O P 1   ( t ) , S O P 2   ( t ) ,   S O P 3   ( t ) ,   S O P 4   ( t )   p O P 1   ( t ) , p O P 2   ( t ) ,   p O P 3   ( t ) ,   p O P 4   ( t ) )   S T O *   ( t ) = ( S T O 1   ( t ) , S T O 2   ( t ) ,   S T O 3   ( t ) ,   S T O 4   ( t )   q T O 1   ( t ) , q T O 2   ( t ) ,   q T O 3   ( t ) ,   q T O 4   ( t ) )  
where: p O P 1   ( t ) , p O P 2   ( t ) ,   p O P 3   ( t ) ,   p O P 4   ( t ) —is the frequency of selection from the set of possible strategies for the (OP) sub-system in the (GS) during the time period t θ , q T O 1   ( t ) , q T O 2   ( t ) ,   q T O 3   ( t ) ,   q T O 4   ( t ) —is the frequency of selection from the set of possible strategies for the (TO) sub-system in the (GS) during the time period t θ , and:
p T O 1   ( t ) , p T O 2   ( t ) ,   p T O 3   ( t ) ,   p T O 4   ( t ) = 1
q T O 1   ( t ) , q T O 2   ( t ) ,   q T O 3   ( t ) ,   q T O 4   ( t ) = 1
and, as a consequence [33]:
  • i = 1 4 c i 1 p O P i —Is the expected value of the gain of the (OP) sub-system in the (GS) if the (TO) sub-system in the (GS) selects the first strategy S O P 1   ( t ) in the time period t θ ,
  • i = 1 4 c i 2 p O P i —Is the expected value of the gain of the (OP) sub-system in the (GS) if the (TO) sub-system in the (GS) selects the second strategy S O P 2   ( t ) in the time period t θ ,
  • i = 1 4 c i 3 p O P i —Is the expected value of the gain of the (OP) sub-system in the (GS) if the (TO) sub-system in the (GS) selects the third strategy S O P 3   ( t ) during the time period t θ ,
  • i = 1 4 c i 4 p O P i —Is the expected value of the gain of the (OP) sub-system in the (GS) if the (TO) sub-system in the (GS) selects the fourth strategy S O P 4   ( t ) in the time period t θ . Sensors 21 05244 i002 and:
  • j = 1 4 c j 1 q T O i —Is the expected value of the loss of the (OP) sub-system in the (GS) if the (TO) sub-system in the (GS) selects the first strategy S T O 1   ( t ) in the time period t θ ;
  • j = 1 4 c 2 j q T O i —Is the expected value of the loss of the (OP) sub-system in the (GS) if the (TO) sub-system in the (GS) selects the second strategy S T O 2   ( t ) in the time period t θ ;
  • j = 1 4 c 3 j q T O i —Is the expected value of the loss of the (OP) sub-system in the (GS) if the (TO) sub-system in the (GS) selects the third strategy S T O 3   ( t ) in the time period t θ ;
  • j = 1 4 c 4 j q T O i —Is the expected value of the loss of the (OP) sub-system in the (GS) if the (TO) sub-system in the (GS) selects the fourth strategy S T O 4   ( t ) in the time period t θ .
Sensors 21 05244 i003
The expected value of the gain of the (OP) sub-system in the (GS) in the time period t θ is defined as:
( i = 1 4 c i 1 p O P i )   q T O 1 ( t ) + ( i = 1 4 c i 2 p O P i )   q T O 2 ( t ) + ( i = 1 4 c i 3 p O P i )   q T O 3 ( t ) + ( i = 1 4 c i 4 p O P i )   q T O 4 ( t )
The expected value of the loss of the (OP) sub-system in the (GS) in the time period t θ is defined as:
( j = 1 4 c i j q T O 1 ) p O P i ( t ) + ( j = 1 4 c j 2 q T O i ) p O P 2 ( t ) + ( j = 1 4 c 3 j q T O i ) p O P 3 ( t ) + ( j = 1 4 c i 4 q T O j ) p O P 4 ( t )
In order to define the value of the game (v), to identify the optimal strategy from a set of possible strategies of the (OP) sub-system in the (GS) and the (TO) sub-system in the (GS) and to define the level of the payoff, it is necessary to [33]
  • Identify the values p T O 1 , p T O 2 ,   p T O 3 ,   p T O 4 and the values q T O 1 , q T O 2 ,   q T O 3 ,   q T O 4 , for which the value of the expected gain of the (OP) sub-system in the (GS) is maximal with the use of the Lagrangian function [40]:
    L ( p O P 1 , p O P 2 ,   p O P 3 ,   p O P 4 , q T O 1 , q T O 2 ,   q T O 3 ,   q T O 4 ; λ 1 , λ 2 , ) = ( i = 1 4 c i 1 p O P i ) q T O 1 + ( i = 1 4 c i 2 p O P i ) q T O 2 + ( i = 1 4 c i 3 p O P i ) q T O 3 + ( i = 1 4 c i 4 p O P i ) q T O 4 + λ 1 ( p O P 1 , p O P 2 ,   p O P 3 ,   p O P 4 1 ) + λ 2 ( q T O 1 , q T O 2 ,   q T O 3 ,   q T O 4 1 )
    where λ 1 ,   λ 2 are the Lagrangian multipliers, and
  • Equate the partial derivatives to zero, obtaining the following [33]:
    { L p O P 1 = 0 .......... .......... L p O P 4 = 0       { L q T O 1 = 0 .......... .......... L q T O 4 = 0     { L λ 1 = 0 .......... .......... L λ 2 = 0
  • Calculate the values q T O 1 , q T O 2 ,   q T O 3 ,   q T O 4 and p T O 1 , p T O 2 ,   p T O 3 ,   p T O 4 , for which the expected value of the loss of the (TO) sub-system in the (GS) reaches its minimum with the use of the Lagrangian function described with the Equation (11) and so:
    L ( q T O 1 , q T O 2 ,   q T O 3 ,   q T O 4 ; p O P 1 , p O P 2 ,   p O P 3 ,   p O P 4 , λ 1 , λ 2 , ) = ( j = 1 4 c 1 j q T O 1 ) p O P 1 + ( j = 1 4 c 2 j q T O 1 ) p O P 2 + ( j = 1 4 c 3 j q T O 1 ) p O P 3 + ( j = 1 4 c 4 j q T O 1 ) p O P 4 + λ 1 ( q T O 1 + q T O 2 + q T O 3 , + q T O 4 1 ) + λ 2 ( p O P 1 , p O P 2 ,   p O P 3 ,   p O P 4 1 )
  • To equate the partial derivatives to zero (in a way analogical to the system of Equation (12)).

5. Results: An Original Model of a Game-Based Process as a Logistics Zero-Sum Game with an Algorithm of Choice Optimisation for E-Commerce Services

The above-mentioned considerations allow the Authors to present:
  • A model of a game-based process as an approach toward the theory of logistics games (LG) in the game-based system (GS), as presented in Figure 4;
  • An algorithm applied to search for optimal strategies in a set of possible strategies in the (OP) and (TO) sub-systems in the game-based system (GS), as presented in Figure 5.
In order to verify the presented algorithm, a simulation based on a Bayesian network has been developed. Considering data availability, only an experimental (reduced) dataset has been used; however, all the variables have been discretized and selected for the Bayesian network structure learning and parameter learning, according to the BN rules. The structured set of possible strategies (OP/TO) as a matrix (Cxj) has been calculated and presented in Table 1.
Aiming to achieve the maximum level of simplicity at this stage, all the preferences excluding diagonal of the matrix have been calculated as zero. All the attributes are discrete, and there is no missing value. A 10-fold cross-validation method has been employed in this procedure, and selected results (optimization of possible strategies) are depicted with use of QGeNie Modeller in Figure 6.
Finally, the algorithm has been prepared for Python (only the initial dataset) as illustrated in Figure 7.
The entire research has been planned in two stages: the first one, as concluded in this article, has been based on theoretical foundation for developing an optimizing algorithm, and it has been proved that the algorithm at this stage has no deadlocks and unforeseen loops and is ready to be challenged with the original learning data from drones-operating company (as the second stage of the research).
Considering the pragmatics of the entity operating drones, based on the matrix of the strategy C x, j, in the first stage of the research an answer is provided to the following question: which drone of the whole fleet can be approved to fly, and which drone is not approved to fly because of its technical condition, remaining minimal battery range, and some maintenance procedures it has to undergo? The second stage of the research will provide answers to more complicated questions related to optimisation because in the fleet of drones having operational capabilities, each drone has different parameters related to its operation time (remaining battery capacity). The optimisation task at this stage is to assign a drone with a task (a delivery flight) in which the existing capabilities can be used in the most efficient way. Each flight covers a different distance, flight conditions can change (temperature, wind direction, and altitude), cargo weight may vary (maximum 2.5 kg), or other parameters that determine the consumption of energy from the battery during the performance of the task may vary (so, only one drone can be the most efficient one to perform a particular task; the system should handle such choices for at least 100 drones simultaneously). The Authors hope that the suggested algorithm will allow the interested parties to considerably increase the optimisation level in decisions they have to make and will also allow entities that use drones to deliver goods within the automated e-commerce supply chain to improve their operational and economic efficiency.
The development of the algorithm presented above has been critical, considering the purpose function of the scientific proof, and it has also allowed the Authors to draw general conclusions.

6. Conclusions: The Value of the Discussed Model for the Algorithmisation of E-Commerce Logistics Processes

The use of opportunities related to ML in logistics processes taking place in e-commerce supply chains can, first of all, provide effectiveness, efficiency, reliability, and reduction of costs generated by processes and their control (management) systems.
Therefore, there has been a continuous search for solutions that can not only collect all information coming from a logistics process but also provide its detailed analysis in real time. Algorithmisation of processes can be applied to achieve that aim. It is dedicated to their further application in control (management) systems that use ML elements. Specific to e-commerce logistics processes (B2C automated delivery), the way in which the OP-TO relation functions in the GS (as one of the fundamental models of logistics-oriented behaviour and processes which may refer both to control and management aspects in supply chains) has been analysed and mathematically proved. Algorithmisation of GS processes has allowed the Authors to develop a model of a game-based process in a GS, with consideration of the theory of LG. It has also been possible for the Authors to develop an algorithm applied to search for optimal strategies in an identified set of possible solutions in the OP and TO sub-systems for GS (based on possible automated drone delivery).
Considering the fact that there are various types of algorithms that describe logistics processes and that can be applied in ML (classification algorithms, regression algorithms, and grouping algorithms), it should be indicated that the results of the research and the scientific proof presented in this paper may be used in further work on development of an applicative ML regression algorithm for the GS OP-TO. In practice, it may transpire that it is necessary to provide further predictive modelling based on a sequence of actions presented in Figure 8. This should accelerate the introduction of pragmatic processes involved in automated drone delivery in e-commerce chains of supply.
The results of the research, the review of scientific literature [41,42,43,44,45,46,47], and business studies [48] allow the Authors to draw more general conclusions on advantages related to the use of ML algorithms in pragmatics of control and management of logistics processes, which are summarized below. When considered from a synthetic perspective, the following statements can be assumed to be true:
  • ML algorithms are able to establish priorities and to automate the process of making managerial decisions (also in the context of control) in complex and simple logistics systems (e.g.,: OP-TO);
    ML algorithms are able to establish priorities and to automate the process of making managerial decisions (also in the context of control) in complex and simple logistics systems (e.g.,: OP-TO);
  • ML uses historical and real-time generated data for learning; this fact defines flexibility of management systems that apply ML;
  • Algorithm-based business uses advanced ML algorithms to achieve a high level of automation—transition to this type of activity makes way for new innovative business models;
  • ML provides the possibility to analyse large resources of complex data and streaming data and to draw conclusions—also from predictive analysis—that can be unavailable to the human mind;
  • Intelligent, ML-supported business processes can considerably increase efficiency of logistics-oriented systems; they make it possible to develop precise plans and forecasts, to automate tasks, to reduce costs, and to eliminate most human errors;
  • As there is an increasing interest in the development of ML systems to include a function for explainability, it may be further developed in cognitive computing systems.
To sum up, ML based on GS algorithms can help to achieve better business results in the field of logistics, starting with initiating proper action, based on new chances and risk factors and culminating in the ability to make precise forecasts of results before costly and consequential e-commerce supply-chain decisions are made. Focused on more pragmatic processes associated with automated and autonomous transportation as a part of e-commerce chains of supply/delivery, ML can significantly improve the learning curve and radically accelerate real and wide implementation of such a solution into the market. As has been already mentioned, the continuation of the stage 2 and future research is needed to develop and test further predictive modelling to optimize e-commerce supply-chain management. The early adopters of ML for supply-chain logistics will be able to exponentially grow their efficiency and productivity. Hence, their superior supply (and production) will quickly outperform other outdated operations and will shut down the more inefficient and costlier outdated competitors.

Author Contributions

Formal analysis and methodology, R.K.M., A.K. (Andrzej Kuriata), A.B., A.K. (Antonina Kalinichenko); investigation, R.K.M., A.K. (Andrzej Kuriata); writing—Original draft, R.K.M., A.K. (Andrzej Kuriata), A.B., A.K. (Antonina Kalinichenko); writing—Review and editing, R.K.M., A.K. (Andrzej Kuriata), A.B., A.K. (Antonina Kalinichenko); algorithm simulation with use of BN, QGeNie, Python—A.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Acronym

Sets and Graphs, Indexing, linear Algebra Operations, Probability and Game Theory, Functions, Datasets and Distribution:
AA set
RThe set of real numbers
{0, …, n}The set containing all integers between 0 and n
[a, b]The real interval including a and b
aiElement i of a
MijElement i, j of the M matrix
det(A)Determinant of A
P(a)A probability distribution over a discrete variable
p(a)A probability distribution over a continuous variable
MTTranspose of the M matrix
( t θ )Time calculated in given timeframe
( t Θ )The particular time
λ1, λ2The Lagrangian multipliers
f(x; θ)A function x parametrized by θ; often f(x) argument θ is omitted to lighten notation
f: ABThe function f with domain A and range B
abThe random variables a and b are independent
fvThe convenience function
XA set of training examples
x(i)The ith example (input) from a dataset
y(i)The target associated with x(i) for supervised learning
SSet of strategies

References

  1. Clarke, R. Electronic Commerce Definition. 1999. Available online: http://www.rogerclarke.com/EC/ECDefns.html (accessed on 31 May 2021).
  2. Wang, B.; Liu, Y.; Parker, K.S. How does the use of information communication technology affect individuals? A work design perspective. Annals 2020, 14, 695–725. [Google Scholar] [CrossRef]
  3. Zwass, V. Electronic commerce: Structure and issues. Int. J. Electron. Commer. Res. 1996, 1, 3–23. [Google Scholar] [CrossRef]
  4. Cudjoe, D. Electronic Commerce: State-of-the-Art. Am. J. Intell. Syst. 2014, 4, 135–141. [Google Scholar] [CrossRef]
  5. Gong, Y.; van Engelenburg, S.; Janssen, M. A Reference Architecture for Blockchain-Based Crowdsourcing Platforms. J. Appl. Electron. Commer. Res. 2021, 16, 937–958. [Google Scholar] [CrossRef]
  6. Du, X.; Gao, Y.; Chang, L.; Lang, X.; Xue, X.; Bi, D. Assessing the application of big data technology in platform business model: A hierarchical framework. PLoS ONE 2020, 15, e0238152. [Google Scholar] [CrossRef]
  7. Pabian, A.; Pabian, B.; Reformat, B. E-Customer Security as a Social Value in the Sphere of Sustainability. Sustainability 2020, 12, 10590. [Google Scholar] [CrossRef]
  8. Jiang, Y.; Lai, P.; Chang, C.-H.; Yuen, K.F.; Li, S.; Wang, X. Sustainable Management for Fresh Food E-Commerce Logistics Services. Sustainability 2021, 136, 3456. [Google Scholar] [CrossRef]
  9. Khrais, L.T.; Alghamdi, A.M. The Role of Mobile Application Acceptance in Shaping E-Customer Service. Future Internet 2021, 13, 77. [Google Scholar] [CrossRef]
  10. Hussain, A.; Shahzad, A.; Hassan, R. Organizational and Environmental Factors with the Mediating Role of E-Commerce and SME Performance. J. Open Innov. Technol. Mark. Complex. 2020, 6, 196. [Google Scholar] [CrossRef]
  11. Wang, C.N.; Dang, T.T.; Nguyen, N.A.T. Outsourcing Reverse Logistics for E-Commerce Retailers. A Two-Stage Fuzzy Optimization Approach. Axioms 2021, 10, 34. [Google Scholar]
  12. Akkol, E. Analysis of Customers’ Reactions to E-Commerce Companies Using Instagram Data. Proceedings 2021, 74, 12. [Google Scholar] [CrossRef]
  13. Bonaccorso, G. Algorytmy Uczenia Maszynowego. Zaawansowane Techniki Implementacji. Helion, Warszawa. 2018. Available online: https://pdf.helion.pl/alguma/alguma.pdf (accessed on 31 May 2021).
  14. Szeliga, M. Praktyczne Uczenie Maszynowe; Wydawnictwo PWN: Warszawa, Poland, 2019; ISBN 9788301207847. [Google Scholar]
  15. Nielsen, M. Neural Networks and Deep Learning. 2018. Available online: http://neuralnetworksanddeeplearning.com/ (accessed on 31 May 2021).
  16. Mathrani, S.; Lai, X. Big Data Analytic Framework for Organizational Leverage. Appl. Sci. 2021, 11, 2340. [Google Scholar] [CrossRef]
  17. Domingos, E.; Ojeme, B.; Daramola, O. Experimental Analysis of Hyperparameters for Deep Learning-Based Churn Prediction in the Banking Sector. Computation 2021, 9, 34. [Google Scholar] [CrossRef]
  18. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016; Available online: http://www.deeplearningbook.org (accessed on 31 May 2021).
  19. Mitchell, T. Machine learning and data mining. Commun. ACM 1999, 42, 30–36. [Google Scholar] [CrossRef]
  20. Kelton, D. Methodological Expectations for Studies Using Computer Simulation. J. Bus. Logist. 2016, 37, 82–86. [Google Scholar] [CrossRef]
  21. Guido, S.; Mueller, A.C. Introduction to Machine Learning with Python: A Guide for Data Scientists; O’Reilly Publishing: Newton, MA, USA, 2018; ISBN 978-1449369415. [Google Scholar]
  22. Raschka, S.; Mirjalili, V. Python Machine Learning: Machine Learning and Deep Learning with Python, Scikit-Learn, and Tensor Flow 2, 3rd ed.; Packt Publishing: Birmingham, UK, 2019; ISBN 9781789955750. [Google Scholar]
  23. Górecki, P. Uczenie Maszynowe, Sztuczna Inteligencja i (Samo)Świadomość. 2019. Available online: https://www.tabletowo.pl/uczenie-maszynowe-sztuczna-inteligencja-i-samoswiadomosc/ (accessed on 31 May 2021).
  24. van Engelen, J.E.; Hoos, H.H. A survey on semi-supervised learning. Mach. Learn 2020, 109, 373–440. [Google Scholar] [CrossRef] [Green Version]
  25. Gautier, N.; Aider, J.; Duriez, T.; Noack, B.; Segond,, M.; Abel, M. Closed-loop separation control using machine learning. J. Fluid Mech. 2015, 770, 442–457. [Google Scholar] [CrossRef] [Green Version]
  26. Miller, J.W.; Golicic, S.; Fugate, B.S. Reconciling Alternative Theories for the Safety of Owner–Operators. J. Busin. Logist. 2018, 39, 101–122. [Google Scholar] [CrossRef]
  27. Shalev-Shwartz, S.; Ben-David, S. Understanding Machine Learning: From Theory to Algorithms; University Press: Cambridge, UK, 2014; pp. 167–178. [Google Scholar] [CrossRef]
  28. Wang, C.N.; Dang, T.T.; Nguyen, N.A.T.; Le, T.T.H. Wspieranie lepszego podejmowania decyzji: Połączony szary model i analiza zakresu danych do oceny wydajności na rynkach handlu elektronicznego. Sustainability 2020, 12, 10385. [Google Scholar] [CrossRef]
  29. Kumar, S.; Lim, W.M.; Pandey, N.; Westland, J.C. 20 years of Electronic Commerce Research. Electron. Commer. Res. 2021, 21, 1–40. [Google Scholar] [CrossRef]
  30. Suh, N.P. Axiomatic Design Theory for Systems. Res. Eng. Des. 1998, 10, 189–209. [Google Scholar] [CrossRef]
  31. Lisowski, J. Game control methods in navigator decision support system. Arch. Transp. 2005, 17, 133–147. [Google Scholar] [CrossRef] [Green Version]
  32. Deptuła, A.; Partyka, M.A. Application of game graphs in optimization of dynamic system structures. Int. J. Appl. Mech. Eng. 2010, 15, 647–656. Available online: https://www.infona.pl/resource/bwmeta1.element.baztech-article-BPZ5-0004-0042 (accessed on 31 May 2021).
  33. Kuriata, A.; Kordel, Z. Logistyka i Transport. Teoria Oraz Praktyczne Zastosowania; CeDeWu: Warszawa, Poland, 2019; ISBN 978-83-8102-286-6. Available online: https://cedewu.pl/Logistyka-i-transport-Teoria-oraz-praktyczne-zastosowania-p2679 (accessed on 31 May 2021).
  34. Kordel, Z.; Kuriata, A. Logistyka i Transport w Ujęciu Systemowym; CeDeWu: Warszawa, Poland, 2018; ISBN 978-83-8102-138-8. Available online: https://cedewu.pl/Logistyka-i-transport-w-ujeciu-systemowym-p2199 (accessed on 31 May 2021).
  35. Massimino, B. Accessing Online Data: Web-Crawling and Information-Scraping Techniques to Automate the Assembly of Research Data. J. Busin. Logist. 2016, 37, 34–42. [Google Scholar] [CrossRef]
  36. Raj, A.; Biswas, I.; Srivastava, K.S. Designing supply contracts for the sustainable supply chain using game theory. J. Clean Prod. 2018, 185, 275–284. [Google Scholar] [CrossRef]
  37. Hamister, J.; Magazine, M.; Polak, G. Integrating Analytics through the Big Data Information Chain: A Case from Supply Chain Management. J. Busin. Logist. 2018, 39. [Google Scholar] [CrossRef]
  38. Beyerer, J.; Nagel, M.; Richter, M. Pattern Recognition; De Gruyter Oldenbourg Karlsruhe: München, Germany, 2017. [Google Scholar] [CrossRef]
  39. Chena, Y.-W.; Larbani, M. Two-person zero-sum game approach for fuzzy multiple attribute decision making problems. Fuzzy Sets Syst. 2006, 157, 34–51. [Google Scholar] [CrossRef]
  40. Haller, G. Lagrangian Coherent Structures. Ann. Rev. Fluid Mechan 2015, 47, 137–162. [Google Scholar] [CrossRef] [Green Version]
  41. Goldsby, T.J.; Zinn, W. Technology Innovation and New Business Models: Can Logistics and Supply Chain Research Accelerate the Evolution? J. Busin. Logist. 2016, 37, 80–81. [Google Scholar] [CrossRef]
  42. Sanders, N.R.; Boone, T.; Ganeshan, R.; Wood, J.D. Sustainable Supply Chains in the Age of AI and Digitization: Research Challenges and Opportunities. J. Busin. Logist. 2019, 40, 229–240. [Google Scholar] [CrossRef]
  43. Teschendorff, A.E. Avoiding common pitfalls in machine learning omic data science. Nat. Mater. 2019, 18, 422–427. [Google Scholar] [CrossRef] [PubMed]
  44. Klumpp, M.; Zijm, W.H.M. Logistics Innovation and Social Sustainability: How to Prevent an Artificial Divide in Human–Computer Interaction. J. Busin. Logist. 2019, 40, 265–278. [Google Scholar] [CrossRef]
  45. Hong, P.; Schoenherr, T.; Hult, G.T.M.; Zinn, W.; Goldsby, T. Cross-functional Management and Base-of-Pyramid Issues in Logistics and Supply Chain Management. J. Busin. Logist. 2019, 40, 76–80. [Google Scholar] [CrossRef]
  46. Miller, J.; Ganster, D.; Griffis, S. Leveraging Big Data to Develop Supply Chain Management Theory: The Case of Panel Data. J. Busin. Logist. 2018, 39, 182–202. [Google Scholar] [CrossRef]
  47. Kopishynska, O.; Utkin, Y.; Kalinichenko, A.; Jelonek, D. Efficacy of the cloud computing technology in the management of communication and business processes of the companies. Pol. J. Manag. Stud. 2016, 14, 104–114. [Google Scholar] [CrossRef]
  48. Larrañaga, P.; Atienza, D.; Diaz-Rozo, J.; Ogbechie, A.; Puerto-Santana, C.; Bielza, C. Industrial Applications of Machine Learning, 1st ed.; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar]
Figure 1. Information model of a game-based system (GS) [33].
Figure 1. Information model of a game-based system (GS) [33].
Sensors 21 05244 g001
Figure 2. Information model (OP) and (TO) in the time ( t Θ ) : adapted from [33].
Figure 2. Information model (OP) and (TO) in the time ( t Θ ) : adapted from [33].
Sensors 21 05244 g002
Figure 3. Process-based approach toward the (GS) for (OP) and (TO) during the time for a ( t Θ ) logistics game (LG), adapted from [33].
Figure 3. Process-based approach toward the (GS) for (OP) and (TO) during the time for a ( t Θ ) logistics game (LG), adapted from [33].
Sensors 21 05244 g003
Figure 4. Model of a game-based process as a logistics game (LG) in the game-based system (GS) for the relation of the (OP) and (TO) during the time period ( t Θ ) : adapted from [33].
Figure 4. Model of a game-based process as a logistics game (LG) in the game-based system (GS) for the relation of the (OP) and (TO) during the time period ( t Θ ) : adapted from [33].
Sensors 21 05244 g004
Figure 5. Algorithm applied to search for optimal strategies in the set of possible strategies of the (OP) and (TO) sub-systems in the game-based system (GS), adapted from [33].
Figure 5. Algorithm applied to search for optimal strategies in the set of possible strategies of the (OP) and (TO) sub-systems in the game-based system (GS), adapted from [33].
Sensors 21 05244 g005
Figure 6. Selected results (optimization of possible strategies (OP) and (TO) from the matrix (Cxj)) with use of QGeNie Modeller.
Figure 6. Selected results (optimization of possible strategies (OP) and (TO) from the matrix (Cxj)) with use of QGeNie Modeller.
Sensors 21 05244 g006
Figure 7. Proposed algorithm (selected elements) prepared for Python (without real data).
Figure 7. Proposed algorithm (selected elements) prepared for Python (without real data).
Sensors 21 05244 g007
Figure 8. Predictive modelling, adapted from [22].
Figure 8. Predictive modelling, adapted from [22].
Sensors 21 05244 g008
Table 1. Matrix (Cxj) set of possible strategies (OP) and (TO).
Table 1. Matrix (Cxj) set of possible strategies (OP) and (TO).
(OP)
STBUSEPREPMTE
(TO)STB1000
USE0100
PREP0010
MTE0001
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Miler, R.K.; Kuriata, A.; Brzozowska, A.; Akoel, A.; Kalinichenko, A. The Algorithm of a Game-Based System in the Relation between an Operator and a Technical Object in Management of E-Commerce Logistics Processes with the Use of Machine Learning. Sensors 2021, 21, 5244. https://doi.org/10.3390/s21155244

AMA Style

Miler RK, Kuriata A, Brzozowska A, Akoel A, Kalinichenko A. The Algorithm of a Game-Based System in the Relation between an Operator and a Technical Object in Management of E-Commerce Logistics Processes with the Use of Machine Learning. Sensors. 2021; 21(15):5244. https://doi.org/10.3390/s21155244

Chicago/Turabian Style

Miler, Ryszard K., Andrzej Kuriata, Anna Brzozowska, Akram Akoel, and Antonina Kalinichenko. 2021. "The Algorithm of a Game-Based System in the Relation between an Operator and a Technical Object in Management of E-Commerce Logistics Processes with the Use of Machine Learning" Sensors 21, no. 15: 5244. https://doi.org/10.3390/s21155244

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop