Next Article in Journal
On 3D Reconstruction Using RGB-D Cameras
Next Article in Special Issue
When BERT Started Traveling: TourBERT—A Natural Language Processing Model for the Travel Industry
Previous Article in Journal / Special Issue
ACUX Typology: A Harmonisation of Cultural-Visitor Typologies for Multi-Profile Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Greek Hotels’ Web Traffic: A Comparative Study Based on Search Engine Optimization Techniques and Technologies

by
Konstantinos I. Roumeliotis
,
Nikolaos D. Tselikas
* and
Christos Tryfonopoulos
Department of Informatics and Telecommunications, University of Peloponnese, 22100 Tripoli, Greece
*
Author to whom correspondence should be addressed.
Digital 2022, 2(3), 379-400; https://doi.org/10.3390/digital2030021
Submission received: 9 May 2022 / Revised: 1 July 2022 / Accepted: 7 July 2022 / Published: 11 July 2022
(This article belongs to the Collection Digital Systems for Tourism)

Abstract

:
Currently, websites rely heavily on digital marketing, notably search engine optimization (SEO), for success. In the COVID-19 era, hotels have to employ every feasible means to stay afloat despite the bleak business and travel conditions. Many of them have already invested in digital marketing, especially SEO, by applying SEO techniques to their websites to attract more visitors and bookings. This research examines hotels’ websites regarding the SEO techniques they have applied and their impact on web traffic to their websites. During a one-year observation period (February 2021–February 2022), we collected and analyzed web data from 309 top-listed Greek hotels using our own-developed software. By creating and following a specific methodology, we came to valuable conclusions. In addition, we used fuzzy cognitive mapping to develop an exploratory model. From the descriptive analysis and technical SEO perspective, we have concluded that hotels websites’ traffic and, by extension, their long-term viability are inextricably intertwined. Existing and future SEO marketers may benefit from our research’s time-accurate insights on hotel SEO tactics.

1. Introduction

Today, it is more important than ever to apply SEO techniques and technologies to websites, regardless of their industry. A website’s survival in today’s competitive online environment will require it to adapt to several new factors, including the increasing prevalence of the web, changing web search habits, the predominance of eCommerce and online bookings, increased competition, the importance of organic traffic as an unpaid form of advertisement, and changes to search engines algorithms.
A website’s traffic could be increased in various ways. Organic search, social media, direct, referral, and paid traffic are the most important, according to Google Analytics [1,2]. Users who utilize search engines to locate what they are searching for (free of charge) are the organic traffic. Each keyword search returns a list of web pages that use the term in question [3]. Thus, its search engine rankings and organic visitors influence a website’s traffic. Since ranking in organic results cannot be paid, a high organic search ranking is in most cases hard to achieve [1]. To rank highly in search engine results, a website must fulfill specific criteria—ranking factors defined in the rules that search engine owners frequently publish [2]. These factors influence the search rankings. Still, search engines do not reveal their ranking algorithms or provide information about the criteria employed in the order [4]. Several studies [5,6] have examined the most common SEO tactics and how they affect organic traffic.
Through our study, we conduct a thorough investigation of 309 Greek hotels websites which are characterized as top-listed by the Google Map’s reviewers. To determine how hotels design their SEO strategy and which particular SEO strategies are utilized to impose themselves against the competition, each website is put through a stress test using various SEO techniques and technologies checks. We use a fuzzy cognitive map to identify the correlations between SEO techniques and the traffic they generate. Lastly, in our descriptive analysis, we use research hypotheses and problem-solving methodologies to link the hotel sector’s sustainability to SEO’s role in business success.
A PHP-based software tool has been designed from scratch to check for SEO strategies applied in the source code of each website and provide the related reports used in our descriptive analysis [7]. Proxy servers and desktop user agents were used to get through the websites’ firewalls and obtain access.
The remainder of the paper is organized as follows: Section 2 analyzes on-page and off-page SEO techniques mentioned on Google Search Central documentation [2]. Section 3 describes the materials and methods used to retrieve and analyze our dataset, such as the implementation of the PHP-based web crawler tool, the APIs used to gather more information regarding our dataset, the problem formulation, and the research hypotheses. Section 4 refers to the data normalization and grouping of our dataset into smaller groups, the limitations we faced, and the data analysis performed using the SPSS software. In Section 5 we discuss about the valuable outcomes based on our study, such as the most used SEO techniques and the contribution of each SEO technique to organic traffic, while the paper concludes in Section 6.

2. Related Work

Search engine optimization (SEO) is the procedure used by SEO specialists to increase both the quality and the number of visitors from search engines to a website [1]. Keywords are words or phrases that online visitors type into search engines while looking for information. Users expect search engines to respond to their queries when they type them into the search box. A list of websites relating to the term is provided to offer the user the solution they are looking for. Search engine results pages (SERPs) are the pages that surface following a search (SERPs). It is common to see sponsored adverts in specific keywords’ results; on the contrary, organic search results appear strictly algorithmically without advertising payments. Search engine results that have been paid for show up at the top of the list and are clearly labeled as “Ad”. On the other hand, organic results are displayed below the ads. As a consequence, some website owners pay per click (PPC) to appear higher in the search results, while others rank naturally underneath the adverts without any cost (organic results).
Over 200 parameters are taken into consideration by Google’s artificial intelligence (AI) algorithms to rank websites in the organic results section for each search query [2,3]. Search engines are not publicly revealing their algorithmic way of ranking websites [8]. Consequently, search engine optimization (SEO) professionals have to follow every accessible guideline, modifying the website’s source code to discover their efficiency [8]. To help website owners meet the requirements of each ranking factor, SEO specialists have developed SEO techniques and technologies that aim to boost a website’s position in search results for specified target keywords [4].
On-page and off-page SEO are the two main instances of SEO techniques and technologies. The adjustments and additions to the website’s source code that must be made to accomplish each ranking criterion are referred to as on-page SEO techniques. On the contrary, a website’s reputation may be enhanced by using off-page SEO techniques, which are actions that must be taken on third-party online sources [8].
In this article, we focus on the on-page SEO Techniques from a technical point of view.

2.1. On-Page SEO Techniques and Technologies

On-page SEO techniques, as previously discussed, refer to the required website’s source code modifications, in order to meet the criteria of each ranking factor. Below, we will focus on 15 on-page SEO techniques and technologies recommended by Webmaster guidelines in the Google Search Central documentation [2].

2.1.1. Descriptive Title Elements

The title tag (<title>) is an essential aspect of a website, according to the World Wide Web Consortium (W3C). The web page’s title is a mix of words and phrases that describe the page’s content [9]. Search engines will include a website in their results if its title is clear and descriptive [2]. With the title information, users may make an informed decision about which results they should click on.
The title tag should have a specific length size to positively affect both web users’ experience and search engines. According to the industry standard, no more than 78 characters should be used in the title [10]. There are many cases where the title tag is misleading for ranking purposes including keyword stuffing techniques. In these cases, search engines replace the search result title with a tag considered more relevant to the content of the web page [2]. Using keywords that are towards the beginning of the title tag may have a significant impact on search engine results, according to Moz’s research and experience [11].

2.1.2. URL Structure

URL is a human-readable string of text that identifies the hierarchy of files on a web server. Each URL comprises three parts and specifically by the access protocol, the domain name, and the path [12].
This ranking factor helps search engines to determine whether to connect to the content of a website and its URL or not [5,6]. It is much easier for both search engines and site users to realize the content of a well-formatted URL [13]. Concise, brief and understandable URLs are essential for search engine optimization (SEO). URL should be consisted of words, hyphens, and slashes [3,8]. In contrast to past popular belief, URLs including ampersand symbols, numerals, terms, and question marks are not considered SEO friendly [14].
In order to be rendered in all browsers and be indexed by search engines, a URL must be less than 2083 characters long, according to Moz SEO learning center [15].

2.1.3. Image Optimization and Alternative Tags

The file size of the images used on a website affects its loading speed. Although Internet speeds have improved in recent years, it is still difficult for end users to view an image with a file size of more than 100 kb [6]. Compression techniques and formats are used to comply with this direction, such as Google’s new WebP image format, which provides up to 26% more efficient lossless compression in original images [16].
In addition to the file size, a key role in SEO has the alternative (ALT) attribute in image tags (alt tags). Even though search engines utilize computer vision and machine learning algorithms to evaluate images’ content, there are several instances in which these systems fail [17]. Search engine bots and humans with visual impairments may benefit from an image’s alt tag description [2,18,19]. The SEO score may be improved by incorporating images within the text and adding relevant keywords to the alt tag [8].
Furthermore, images’ filenames have a significant impact on SEO. Image file names can be optimized for search engines the same way as the URL structure.

2.1.4. HREF Alternative Title Tags

Internal and external links may be found on every page. In many cases, when users spot a link on a website, they hover their mouse over it before clicking. They probably want to know in advance where this link points to before click on it. The same happens to search engines too, i.e., search engines prefer to know about the landing page’s content of the redirection link [20]. As a result, the title tag is essential for website’s internal and external linking and structure to be understood by both users and search engines.

2.1.5. Descriptive Meta Tags

The meta description tag follows the title tag in the head container [9]. The title of the page, the URL, and the meta description may all be seen by searching and then clicking on one of the search results. Meta description consists of content’s related simple sentences, which search engines utilize to create an abstract of the web page [4]. What visitors will discover on this page is described in the meta description [8]. SEO specialists optimize meta description to ensure that the listing is relative with the website’s content [20]. It should not be more than 350 characters in length, and it must be at least 51 characters long [6]. A meta description including the target keyword might boost SEO results [17]. Websites enriched with metadata are rated higher in search results, than those without metadata [1]. Even when the meta description tag is present on a web page, the description shown on SERPs may differ from website’s meta description. Search engines, and mostly Google, usually characterize the meta description as a non-representative factor, because it is often misleading [21].

2.1.6. Heading Tags

As specified by the World Wide Web Consortium (W3C), HTML has six levels of heading tags, with H1 being the highest level (most significant) and H6 being the lowest one [22]. Using header tags to draw attention to the essential portions of a piece of content is recommended by the Google Central Blog [2]. Heading labels usually describe website’s highlights and this is the reason that search engines place greater importance on heading names than on the rest of the content [9]. Screen readers may also use heading tags as a valuable route for visually impaired people. When a web crawler scans a web page, it pays particular attention to the H1 and H2 tags. H1 and H2 subheadings should include the primary keyword, and their length should range between 10 and 13 words for SEO reasons [13]. In SERPs, the search engines may bypass the website’s title tag and substitute it by the H1 one.

2.1.7. Minify Resources (HTML, CSS, and JavaScript)

Code comments, formatting, shorter variables and function names, as well as other unnecessary data are all examples of minification process. This data may be removed without impacting how the browser processes the actual resource [2]. All files on a website, apart from images, should be as light as possible to improve page loading speed and user experience (UX), as mentioned in Section 2.1.3 [6]. Websites’ loading speed and SEO results may be enhanced by minifying files such as Javascript (.js), Cascading Style Sheets (.css), and HyperText Markup Language (.html). Minification has been demonstrated to reduce loading time by up to 16% and reduce the file size by up to 70% [23]. In addition to speeding up a website, minification may also improve its security. Short and meaningless names for variables and functions are used during minification to make JS code hardly readable by humans [24].

2.1.8. Sitemaps and RSS Feed

In a sitemap file, the webmaster provides a list of all of a website’s pages and how they are all linked together. In order to properly index websites, search engines need to know which pages are the most significant, how often they are updated, and whether any other language versions of those pages exist or not [20]. For large websites, with over 500 pages, it is critical to provide sitemaps to web crawlers and notify them in advance about their backbones [3,20]. Sitemaps may be in RSS or plain text, while XML is the most often supported format by search engines and, in particular, by Google. Search engines encourage web admins to submit their sitemaps by using sitemap submission tools in order to speed up the indexing of their websites [3].
A website’s search engine visibility may also be improved by using an RSS feed [5]. In most cases, RSS feeds are developed to notify visitors when new information is published [25]. RSS feeds are also used to provide a more dynamic depiction of the website’s internal links, as well as by search algorithms to promptly identify new content posted to the website [2].

2.1.9. Robots.txt

Robots exclusion protocol, or robots.txt, is a list of URLs that should not be accessible by search engine crawlers [2]. For example, web crawlers should not be able to access the admin panels of websites built with content management systems and several other pages on a website. By creating a list of URLs in the robots.txt, file we indicate that either the web access to this list or its indexing in the search results is not allowed. However, there are several search engine indexers which violate the robots.txt protocol [26].

2.1.10. Mobile-Friendliness

Mobile devices are used by about half of online users, while two out of three of them are finally coming to a purchase [26]. Smartphones have overtaken personal desktops in several countries [2]. Having a mobile-friendly website has become more critical as the number of people using mobile devices grows [2]. Aligned with this, Google has created the mobile-friendly test tool, which evaluates all the components of a website to define it as a mobile-friendly one or not [27]. Bootstrap, Materialise, Bulma, UIKIT and other design CSS frameworks provide a wide range of ready-made design templates based on responsive web techniques for content provision to any device [14].

2.1.11. Website Speed

Website loading speed is an essential factor to rank higher in the SERPs [28]. Search engines consider page speed as one of the key elements of a website. Google Lighthouse, Pingdom, GTMetrix, and SiteAnalyzer, to name a few, are some indicative available tools used for websites’ speed testing purposes [29].
Google’s Lighthouse and PageSpeed Insights tools help web admins in monitoring and optimizing the speed of their websites. Data from a website is used to provide a performance rating and an estimation of potential savings. Lighthouse uses lab data to assess the performance of a single device under a defined set of network settings. In contrast, PageSpeed uses both lab and field data to determine a website’s performance under a wide range of devices in real-world conditions. According to the website’s performance score, the webmaster may proceed with recommended modifications to improve the website’s performance. Similar strategies are used by the Pingdom tool [30].

2.1.12. Add Security with HTTPS

Internet communication protocols such as HTTPS (Hypertext Transfer Protocol Secure) ensure the integrity and privacy of data sent between user’s browser and a web server. Users expect a secure and private online experience during their navigation in a website [2]. Websites with SSL certificates are ranked higher on search engine results pages, due to their increased security [29]. The Google Chrome browser informs users when the visited website does not provide an SSL certificate (as a non-secure destination) to underline the necessity of SSL certificates [31].

2.1.13. Accelerated Mobile Pages (AMP)

Mobile phone users are about 7.26 billion today [32]. Google launched the Accelerated Mobile Project (AMP) in order to improve mobile browsing experience [33]. To this direction, Google identifies the pages that follow the AMP standards and keeps a cached copy of their source code on Google’s web servers [34]. Cached copy of the AMP page from Google’s servers is sent to mobile users when they click on an AMP result in Google’s search results [2]. Consequently, mobile users obtain requested data right away, with no additional network delay. Web pages that follow the AMP standards rank higher in mobile SERPs [33]. Although AMP technology is effective for both search engine rankings and web page performance, achieving the AMP standards is a demanding task [35,36].

2.1.14. Structured Data and Rich Snippets

An open-source group called Schema.org aims to create and promote structured data schemas on the Internet, on websites, in emails, and elsewhere [37,38]. Structured data is a standardized format for providing information about a page and classifying the page content [2]. The Schema.org vocabulary may be utilized with RDFa, Microdata, and JSON-LD [14]. Semantic annotations may be used to add schema markups to a website’s HTML source code, which search engines can then parse to offer users more relevant search results [39]. This is the main reason that major search engines support the use of markup formats [40]. To this direction, websites that use rich snippets, such as the price and the availability of a product, are rewarded by Google for using structured data [6].

2.1.15. Open Graph Protocol (OGP)

OGP is also a kind of structured data created by Facebook to integrate external material into the social networking platform [41]. Using the OGP, every web page may become a social graph object. For instance, OGP may be utilized on Facebook to enable any web page to operate in the same way as any other object on Facebook [41]. When a URL is posted on social media, social media bots visit the website and search for three essential parts: the title, the picture, and a summary of the content.
By using OGP to pre-mark these components makes it simpler for social media bots to obtain the results they search for. OGP, like any other structured data format, provides richer results and it is numbered among the decisive factors for search engine and social media end-users regarding clicking on and visiting a website, or not [14].

2.2. Off-Page SEO Techniques

A website’s ranking is also affected by variables known as off-page factors, which are independent of the page’s content and are based on a variety of external influences [1,29]. Creating backlinks to the targeted website from high-ranking websites is a crucial component of off-page SEO [3]. The volume and the quality of backlinks to the owner’s website are most important in off-page SEO. For example, ranking could be affected by the provision of helpful and remarkable content that people want to link to. Consequently, search engines treated backlinks as “up-votes,” giving sites with more links a boost in the rankings [1]. There are more than 12 off-page tactics that can impact a website’s visibility and SERP ranking. Guest posts, profile backlinks, Q&A, and comment backlinks are some of the fundamental approaches [8]. According to Google’s creators in their 1998 article, anchored links are connections to a website containing the destination website’s goal term. Anchors often provide more accurate descriptions of Web pages than the pages themselves [42]. The website must incorporate the target keyword on both on-page and off-page SEO techniques to appear higher in SERPs [1]. While using off-page SEO techniques are beneficial for website ranking, search engines advise web admins that any effort to influence website’s search engine rankings by establishing backlinks may be a criterion for degrading or removing the website from the SERPs [6].

3. Materials and Methods

The purpose of this paper is to propose an innovative methodology for implementing an efficient framework for comprehending the strong correlation between search engine optimization and web traffic, thereby providing valuable insights for hotels’ websites. We followed seven research steps to achieve the desired outcomes.
  • Problem Formulation and Research Hypotheses. In this stage, we clearly define the necessity of earning more organic traffic using SEO techniques along with five hypotheses that must be confirmed from our descriptive analysis.
  • Data Retrieval. In this stage, we describe the software developed for the purpose of this paper, and the external APIs used to collect additional information regarding our dataset.
  • Data Normalization and Limitations. In this stage, we divide our dataset into three groups (Metrics, SEO Techniques, and Web Traffic) facilitating the descriptive analysis that follows. Each group is analyzed using different procedures depending on its type. Metrics and SEO Techniques were analyzed against the dependent variable Web Traffic.
  • Descriptive Statistics. Using the SPSS software, we perform descriptive analysis on our dataset.
  • Inferential Statistics
    • T-Tests on SEO Techniques. In this stage, we perform independent t-tests on each SEO technique by using the SPSS software.
    • ANOVA, Coefficients and Scatterplots on SEO Metrics. In this stage, independent ANOVA, Coefficients and Scatterplots performed on each SEO Metric by using the SPSS software.
  • Understand cause-and-effect relationship of SEO Techniques and Web Traffic. In this stage, we conclude that the majority of SEO Techniques positively affect web traffic.
  • Understand cause-and-effect relationship of SEO Metrics and Web Traffic. In this stage, we conclude that the majority of SEO Metrics positively affect web traffic.

3.1. Websites’ Traffic Sources and Key Performance Indicators

Hotels’ online presence and bookings are directly affected by various factors that should be analyzed and factored into our estimation. To obtain a more detailed picture of the hotels’ online presence, we use their organic web traffic sources, which indicate the primary traffic source to their websites. As mentioned in Section 1, organic traffic is the traffic generated by users who use search engines (e.g., Google, Yahoo, etc.). It is an unpaid form of web traffic that plays a significant role in business success. Most websites rely on organic traffic results, which account for more than 60% of total website traffic [43]. As an unpaid source of traffic, organic traffic offers numerous benefits to businesses, including a lower initial investment than PPC campaigns, long-lasting results that can be enhanced year after year with the appropriate effort, and a significant increase in brand awareness [44].
Organic keywords, backlinks, website speed, and domain authority (DA) are four additional metrics that search engines consider when evaluating a website before ranking it in the SERPs [45,46]. Organic keywords are those for which websites are ranked in search engines’ organic SERPs [47]. Backlinks are links from third-party websites to the target page that endorse that page [1]. Website speed is the time it takes for a web page to fully download and load from the webserver to the user’s browser and it is measured in seconds. Domain authority (DA) is a Moz-developed search engine ranking score that forecasts a website’s strength and its ability to rank in SERPs. The domain authority score is a numeric value between 1 and 100, with higher values indicating a greater likelihood of ranking [48]. Domain authority is not a metric that search engines can see. It attempts to mimic how search engines rank web pages. In the majority of cases, the domain authority score is indicative. The domain authority (DA) is critical when comparing it to other websites. Websites with a higher domain authority (DA) are more likely to appear higher in SERPs than those with a lower domain authority (DA).
Table 1 summarizes the web traffic metrics proposed as key performance indicators (KPIs) in this paper.
A performance indicator, also known as a key performance indicator (KPI), is a type of performance measurement that assesses a business’s success in specific activities that it manages [47,49]. Table 1 contains the KPIs examined in this paper in addition to the web metrics. Monthly traffic analysis is beneficial for exporting more accurate results about a website’s traffic and ranking performance.

3.2. Sample Selection, Data Retrieval

We obtained data for this paper from Google Maps and the Google Maps API [53]. The sample consists of 309 Greek hotels’ websites which have user ratings greater than three out of five stars. For the above hotels’ firms, data were collected concerning their websites’ organic traffic, website speed, number of organic keywords, reviews, and their domain authority. Sample considered as representative providing with knowledge about the search engine optimization applied by hotels as well as how this translated into traffic. Monthly data collection from hotels websites allowed for a more precise examination and comprehension of web metrics variance. The testing period has been extended to 12 months, with specific dates between February 2021 and February 2022. Because this period corresponds to the COVID-19 transportation restrictions, the results will provide a clear picture of the decline in bookings during this period.
We developed a PHP-based web crawler tool to collect the sample that extracted information about each hotel listed on Google Maps website, including its legal name, and Website [7,53]. Following data extraction, we stored the entire dataset in a MySQL database, later used to perform SEO checks [7].
The software tool created for the purpose of this article draws the dataset from Google Maps API and stores each result in a MySQL database. It then traverses the dataset and, using cURL, extracts the source code from each hotel’s website. The cURL project is computer software that includes both a library (libcurl) and a command-line tool (curl) for data transfer over a variety of network protocols [54]. Each of the fifteen SEO techniques discussed in Section 2 has its function that searches the source code of the targeted website to determine whether the SEO Technique has been applied. Finally, the database is updated with the results of the checks. Figure 1 illustrates the generated code and its primary functions in detail.
To supplement our research and to validate our findings, we used four third-party API tools: the Mobile-Friendly Test Tool API, the PageSpeed Insights API, the Mozscape API, and the Ubersuggest Traffic Analytic tool. Each of these tools obtains the URL of the website and returns results based on its measurements.
  • The Google-created Mobile-Friendly Test Tool API validates a URL against responsive techniques. It returns a list of any mobile usability issues that may affect a user visiting the page on a mobile device [55].
  • Google’s PageSpeed Insights API analyses the performance of a web page and provides recommendations on how to improve various aspects of the page’s performance, including page speed, accessibility, and SEO [56].
  • The Mozscape API, developed by MOZ, accepts a website’s URL as an input and returns accurate metrics such as Domain Authority [48].
  • Ubersuggest, developed by Neil Patel, retrieves and returns a 360-degree view of any website’s metrics and its sources [52].
After completing SEO audits on 309 hotels’ websites, the exported data was imported into IBM SPSS 27. IBM SPSS 27 is one of the most widely used statistical software for ad hoc analysis, hypothesis testing, and predictive analytics. IBM SPSS Statistics is used by organizations to comprehend data, analyze trends, forecast, and plan to validate assumptions and reach accurate conclusions [57]. A descriptive analysis of various correlations confirmed the following research hypotheses. Finally, a fuzzy cognitive map (FCM) was created to visualize the relationship between SEO techniques, traffic, and backlinks, demonstrating the direct correlation between SEO techniques and traffic and, thus, with hotels’ bookings and sustainability.

3.3. Problem Formulation and Research Hypotheses

Increased competition, the COVID-19 era, and global travel restrictions tend to increase businesses’ efforts to be more efficient in their digital marketing campaigns. Hotels make extensive use of every available tool, from copywriting to email marketing, social media marketing, and search engine optimization, to reach out to potential customers in various ways. By understanding the consumer habits of their audience, hotel marketing agents develop marketing strategies to target this audience more effectively.
Hotels’ websites are the primary source of bookings, and companies devote resources to organizing and optimizing the efficiency and effectiveness of their websites. Their position on the search engine results pages (SERPs) plays a crucial role in gaining organic visitors. The web page title, meta description, and even the rich results must all be applied with care to earn the coveted click. The content, user interface, and website speed work together to pique the visitor’s interest and convert them into loyal customers. Given that hotel websites’ primary objective is traffic and conversion to clients, we define five hypotheses and attempt to confirm them to expand practical knowledge about the significance and impact of search engine optimization on hotel websites traffic.
Hypothesis 1 (H1).
SEO techniques affect the amount of traffic that hotels’ websites receive.
The primary objective of this paper is to determine whether search engine optimization has an impact on the amount of traffic that hotels’ websites receive. By deciding which SEO techniques hotels have implemented on their websites and comparing the results to the increase in traffic, we can gain valuable insight into the potential benefits of SEO techniques. As mentioned in Section 3.2, a dataset was selected from Google Maps consisting of 309 Greek hotels’ websites, which have user ratings greater than three out of five stars. Scanning and performing SEO tests on each website source code, using our own developed software [7], we gather information regarding the SEO techniques each website has implemented. Using third-party software tools described in Section 3.2, we scan each website, gathering information concerning their domain authority, website speed, organic keywords, organic traffic, and backlinks. All the data collected is stored in an excel sheet. Using the IBM SPSS 27 software, a descriptive analysis was conducted in order to find connections/patterns between SEO techniques used by websites and their organic traffic. The ultimate goal of this research is to uncover which SEO techniques used by Greek hotels have a more positive effect on organic traffic, assuming that the websites with the highest organic traffic have applied the appropriate SEO techniques in their source code.
Hypothesis 2 (H2).
On-page SEO techniques affect the ranking of a hotel’s website.
Hypothesis 2 is a segment of Hypothesis 1, targeting the on-page SEO techniques. Our second hypothesis is based on the effects of on-page SEO techniques on website ranking. Multiple keyword targeting and on-page SEO techniques are assumed to increase organic results.
Hypothesis 3 (H3)
Off-Page SEO techniques—backlinks affect the ranking of a hotel’s website.
Hypothesis 3 is a segment of Hypothesis 1, targeting the off-page SEO techniques. Hypothesis 3 examines off-page SEO techniques and their effect on the traffic to hotel websites. We assume that creating backlinks to third-party websites will boost a website’s domain authority, which will result in higher rankings in the SERPs.
Hypothesis 4 (H4).
Hotels’ organic keywords affect their website’s ranking.
We assume in Hypothesis 4 that organic keywords are valuable for the hotels’ website, resulting in higher SERP rankings and increased web traffic.
Hypothesis 5 (H5).
Hotels’ backlinks affect their domain authority.
We assume in Hypothesis 5 that backlinks are intrinsically linked to domain authority.

4. Results

4.1. Data Normalization

In this section we examine the collected data. As mentioned in Section 3.2, our tool performed SEO checks on 309 websites. The results were stored in excel so that we can use them in further data analysis [58].
Collected data are separated into three Groups.
  • Group 1: Metrics are checks performed by third-party tools and are not SEO techniques. The contents of the Metrics Group are as follows: DA (integer 0 to 100), speed_test (decimal), number of keywords (integer), and number of backlinks (integer).
  • Group 2: SEO Techniques are checks performed by our tool regarding the existence (or not) of SEO techniques on each website of the dataset. All metrics returned 1 if the SEO technique has been applied to the website, 0 otherwise. The contents of the SEO Techniques Group are as follows: images_alt, links_title, rss, sitemap, robots, heading1, heading2, web_ssl, meta_description, opengraph, url_seo_friendly, amp, minified_css, minified_js, title, structured_data, and responsive_test.
    • To extend the results of our research, by giving readers additional insights related to SEO, we introduce two variables which are not SEO techniques but can be treated statistically in the same way. The first variable is the well-known opensource CMS WordPress. We present this variable to identify if WordPress CMS can provide better results regarding SEO and organic traffic than other web platforms. The second variable is the Yoast SEO plugin. Yoast SEO plugin is a WordPress extension that uses advanced suggestion tools to help website’s administrator improve website’s content and structure proposing changes such as keyword targeting, internal linking, structured data, sitemap, etc. We present this variable to determine if the Yoast SEO plugin can deliver better results regarding SEO and organic traffic if installed in a WordPress CMS. In Group 2, two more parameters are added, «is_yoast» and «is_wp», with a value of 1 for those websites that use them and a value of 0 for those websites that do not use them.
  • Group 3: Web Traffic (organic traffic) are the monthly statistical data collected for each website. All web traffic data are integers and refer to websites’ data for the last twelve months. The contents of the Web Traffic Group are as follows: traffic_1, traffic_2, traffic_3, traffic_4, traffic_5, traffic_6, traffic_7, traffic_8, traffic_9, traffic_10, traffic_11, and traffic_12.

4.2. Limitations

During the examination of the responsive_test technique, it was found that the Mobile-Friendly Test Tool API created by Google considers non-responsive those pages that have even one non-responsive element. Manual checks revealed that these specific websites partially adopt responsive techniques and have several elements in their source code that are non-responsive. These pages are marked with the number 0 which means that they do not follow that specific SEO technique. Additionally, in 22 out of the 309 websites, the speed measurements were performed manually with the tool Pingdom website speed test by Solar Winds, because the PageSpeed Insights API by Google could not operate speed test on these pages.
During the AMP and Sitemap SEO technique tests found that none of the websites have been applied these techniques, although they are suggested by Google on Webmaster Guidelines [2]. Consequently, t-tests cannot be performed on these techniques excluding them from our final results.
We collected the monthly organic traffic for 12 months to have a more representative view of the website traffic. For the data analysis that follows, we calculated the mean value of 12 months of organic traffic for each website and saved it in a new column called Web Traffic. All the following comparisons regarding web traffic refer to the mean value of organic traffic for each website.
Raw data collected from multiple websites do not convey any significant trends or the behavior of the individuals before the corresponding analysis. Data analysis refers to the process of converting the raw data into meaningful information, using the mathematical, statistical, or computational algorithms for better comprehension [59]. For the quantitative data analysis, some major steps need to be considered to firmly execute the assessment and generate the intended results. The procedure consists of two major steps: (1) reviewing the data by descriptive analysis and (2) conducting inferential and descriptive statistics to answer the research questions [60]. Τhe study is presented in the following sections of the chapter and each section is intended to answer the research questions as the analysis must be aligned with the research objectives.

4.3. Descriptive Statistics

Descriptive analysis helps to describe, demonstrate or summarize the collected data in a constructive manner so that the trends and patterns can be easily observed and analyzed [61]. Mean, median, mode, standard deviation, skewness, and kurtosis are some of the important measures. Mean, median and mode measure the central tendency of the variable to typify the whole set of data [62]. Similarly, skewness is the measure of the degree of lopsidedness in the distribution on the frequency, and kurtosis is the measure of the degree of tailedness in the frequency distribution [63]. Standard deviation measures the risks, volatility, scatteredness, or variability in the data [64]. Table 2 indicates that the maximum web traffic, domain authority, speed test, keywords and backlinks of this study are 1,058,500, 90, 15.4, 2,898,545 and 32,965,989, respectively. The values of skewness or kurtosis within −1.5 and +1.5 are considered right skewed distribution of the data [65].

4.4. Inferential Statistics

Inferential statistics refers to the procedure of using data analysis to infer attributes of an underlying probability distribution. Inferential statistics deal with the information acquired from the sample of the population to draw conclusive statements about the entire population [66]. Inferential statistics make use of statistical models to compare the sample data to that of earlier studies. It varies from descriptive statistics in that it allows one to draw conclusions based on extrapolations rather than merely reporting the data that has been seen, like descriptive statistics does [67]. From our hypotheses, we assume that SEO Metrics and SEO Techniques are positively affecting the dependent variable Web Traffic. Using inferential statistics, we compare each SEO metric and technique against the web traffic and—without loss of generality—we extend the corresponding conclusion to the larger population.

4.4.1. Impact of SEO Techniques Group on the Web Traffic

At this point, 15 different t-tests were performed for each one of the 15 SEO techniques (Group 2) using the SPSS 27 software by IBM. After this, the t-tests mean values observed. The mean value is equal to the sum of all the values in the data set divided by the total number of values. The “YES” mean reflects the average web traffic from the websites that applied the technique. The “NO” mean reflects the average web traffic from the websites that have not applied the technique. If the “YES” mean is greater than “NO” mean the websites which have implemented this technique in their source code have greater traffic and the technique is valuable for the website. Another component to consider when comparing Yes or No is the p-value. In statistics, the p-value is the probability of obtaining results at least as extreme as the observed results of a statistical hypothesis test, assuming that the null hypothesis is correct. The p-value is used as an alternative to rejection points to provide the smallest level of significance at which the null hypothesis would be rejected. A smaller p-value means that there is stronger evidence in favor of the alternative hypothesis [68]. A p-value less than 0.05 (typically ≤ 0.05) is statistically significant. A p-value higher than 0.05 (>0.05) is not statistically significant and indicates strong evidence for the null hypothesis [69]. The results from 2 of 15 t-tests are presented in detail below, and the rest are presented briefly in Table 3 for the sake of brevity.
Impact of meta description technique on the web traffic
H0. 
There is no difference in websites traffic due to the meta description technique (null hypothesis).
H1. 
There is a difference in websites traffic due to the meta description technique.
In this study, an independent samples t-test is applied to draw a comparison of the mean web traffic between websites that implement meta description technique (n = 193) and websites that did not implement the meta description technique (n = 116) (Table 4). The t-test was statistically significant, with the mean web traffic of meta description technique-implemented websites (M = 8739.23, SD = 77,063.28) being higher than the meta description technique unimplemented websites: (M = 6608.39, SD = 53,677.81), t = −0.260, p < 0.05, two-tailed. Therefore, the null hypothesis, H0, which states that there is no difference in websites traffic due to the meta description technique, can be rejected. Consequently, it can be concluded that websites which have implemented the meta description technique have greater traffic, and by extension, this technique is valuable for the websites.
Impact of Robots.txt technique on the web traffic
H0. 
There is no difference in websites traffic due to the Robots.txt technique (null hypothesis).
H1. 
There is a difference in websites traffic due to the Robots.txt technique.
In this study, an independent samples t-test is applied to draw a comparison of the mean web traffic between websites that implement the Robots.txt technique (n = 196) and websites that did not implement the Robots.txt technique (n = 131). The t-test was statistically significant, with the mean web traffic of robot technique-implemented websites (M = 6094.49, SD = 42,905.73) being lower than the robot technique unimplemented websites: (M = 11,139.16, SD = 99,602.74), t = 0.617, p < 0.05, two-tailed. Therefore, the null hypothesis that there is no difference in websites traffic due to the robot technique was rejected. Therefore, it can be concluded that websites that have implemented Robots.txt technique have failed to reach more web traffic and the null hypothesis H0 cannot be rejected.
Table 5 presents the 15 t-tests performed for the 15 corresponding SEO techniques. In the last column of Table 5, the SEO techniques that resulted in providing a positive impact on the traffic are marked as 1, while the techniques that do not have a significant contribution to the traffic are marked as 0. At the end of Table 5, two more t-tests were performed regarding WordPress CMS and Yoast SEO against the dependent variable Web Traffic.

4.4.2. Impact of SEO Metrics Group on the Web Traffic

Regression is a statistical technique to formulate the model and analyze the relationship between the dependent and independent variables. It aims to study the degree of relationship between two or more variables. This is achieved with the help of hypothesis testing.
In this part of our research, three different tests, one-way analyses of variance (ANOVA), coefficients and scatterplots, were performed for each of the four Metrics (Group 1) using the SPSS 27 software by IBM. After the analysis, it is intended to determine the relationship between Metrics and Web Traffic.
The one-way ANOVA is used to determine whether there are any statistically significant differences between the means of two or more independent (unrelated) groups, or not. Unstandardized coefficients indicate how much the dependent variable varies with an independent variable when all other independent variables are held constant. A simple scatterplot can be used to (a) determine whether a relationship is linear, (b) detect outliers and (c) graphically present a relationship between two continuous variables [70].
The null and alternate hypotheses remain the same as the Techniques.
H0: 
[Metric] doesn’t have positive impact on the web traffic.
H1: 
[Metric] has a positive impact on the web traffics.
The model summary table reports the strength of the relationship between the model and the dependent variable (Table 6).
  • Domain Authority: The regression model summary has been explained by 16.7% in variability of the web traffic, R2 = 0.167, adjusted R2 = 0.165.
  • Website Speed: The regression model summary has been explained by only 0.1% in variability of the web traffic, R2 = 0.001, adjusted R2 = −0.002.
  • Keywords: The regression model summary has been explained by 72.7% in variability of the web traffic, R2 = 0.727, adjusted R2 = 0.726.
  • Backlinks: The regression model summary has been explained by 45.2% in the variability of the web traffic, R2 = 0.452, adjusted R2 = 0.450.
The ANOVA table represents the overall significance of the model, which is determined by the web traffic (Table 7). The F ratio is the ratio of two mean square values. Degrees of freedom (df) of error has been calculated by Total sample (n = 309)–2 = 307 and total degrees of freedom (df) has been calculated by Total sample (n = 309)–1 = 308.
  • Domain Authority: The result shows F (1, 307) = 61.738, p < 0.05, which suggests the model is highly significant. Therefore, the independent variable in this model is related to the dependent variable.
  • Website Speed: The result shows F (1, 307) = 0.420, p > 0.05, which suggests the model is not statistically significant. Therefore, the independent variable in this is not related to the dependent variable.
  • Keywords: The result shows F (1, 307) = 816.754, p < 0.05, which suggests the model is highly significant. Therefore, the independent variable in this is related to the dependent variable.
  • Backlinks: The result shows F (1, 307) = 252.759, p < 0.05, which suggests the model is highly significant. Therefore, the independent variable in this model is related to the dependent variable.
The coefficient table represents how much the dependent variable is expected to increase when that independent variable increases by one, holding all the other independent variables constant [71] (Table 8). When interpreting the coefficient table, the B variable is important, representing the increase in the dependent variable as soon as the independent increases by one unit. On the other hand, the beta variable compares the strength of the effect of each individual independent variable to the dependent variable. The higher the value of the beta coefficient, the stronger the effect, based on absolute numbers [72].
  • Domain authority: When the beta value of domain authority is increased by one-unit, web traffic will have increased by 2465.645 visitors per month. Therefore, domain authority does not have a positive impact on the web traffic null hypothesis is rejected. It can be concluded that the domain authority scores have a significant positive effect on the web traffic.
  • Website speed: Since the p value of the following table is greater than 0.05, the impact of speed is not statistically significant. Therefore, the null hypothesis that speed test does not have a positive impact on the web traffic is true and can’t be rejected.
  • Keywords: When the beta value of keywords is increased by one-unit, the web traffic increases by 0.224 visitors per month. Therefore, keywords do not have a positive impact on the web traffic, and thus, the null hypothesis is rejected. It can be concluded that the keywords have a significant positive effect on the web traffic.
  • Backlinks: When the beta value of backlinks is increased by one-unit, the web traffic increases by 0.020 visitors per month. Therefore, backlinks do not have a positive impact on the web traffic, and thus, the null hypothesis is rejected. It can be concluded that the backlinks have a significant positive effect on the web traffic.
Scatterplots are essential for identifying trends and patterns. In a scatterplot, each observation (or point) has two coordinates [73]. The strength of the link between the variables is determined by calculating the correlation coefficient. The plot shows the first variable’s value on the X axis and the second variable’s value on the Y axis for each data point [74]. The relationship between two quantitative variables is shown in a scatter plot (Figure 2).
  • Domain Authority: The following graph indicates the coordinates of domain authority and web traffic, since the web traffic has been gradually increased because of the increase in DA scores, a indicating linear relationship between domain authority and the web traffic.
  • Keywords: The following graph indicates that the more keywords that have been used in the website, the more traffic it will get. Therefore, there is a strong positive linear relationship between keywords and web traffic.
  • Website speed: The following shows the relationship between speed test and web traffic. It shows that the increase in speed test does not result in a proportional increase or decrease in the web traffic. Therefore, there is a constant and nonlinear relationship between these two variables.
  • Backlinks: Like keywords, the following graph indicates the more backlinks that have been used in the website, the more traffic it will get. Therefore, it can be concluded that there is a strong positive linear relationship between keywords and web traffic.
We used exactly the same data analysis methodology as the SEO metrics to identify the correlation between backlinks and domain authority. The findings of the regression model have found that backlinks have a strong connection with domain authority. When the beta value of backlinks is increased by one unit, on average, the domain authority will increase by 2.214 units.

4.5. Diagnostic Exploratory Model Development

Results of the t-tests on SEO Techniques and of the ANOVA, coefficients and scatterplots on SEO Metrics present significant relationships with the Web Traffic. Thus, we aim to provide additional insights to marketing strategists over the impact of SEO Techniques and Metrics usage on hotels’ websites, in favor of their digital marketing strategy. Therefore, we deployed fuzzy cognitive mapping (FCM), which will implement selected web metrics of both SEO Techniques and Metrics depicting the relationships to Web Metrics, so as to obtain a more adaptive model for process assessment. Fuzzy cognitive mapping deploys a descriptive and consolidated stochastic classification methodology, mainly used to represent the correlations between hotels’ web metrics [75]. We use the orange color to mark the SEO Techniques, green color to mark the SEO Metrics, and blue color to mark the hotels’ Web Traffic. Blue arrows in Figure 3 illustrate variables with positive relations with Web Traffic, and black arrows illustrate variables with neutral relations with Web Traffic. Development of the FCM has been conducted via the Mental Modeler cloud-based application [76].
Fuzzy cognitive maps are fuzzy graph structures that represent causal reasoning. Exploratory modelling implementation is vital when digital marketing agencies need to make a decision or a digital marketing plan [77].
As we notice in Figure 3, most of the SEO Techniques and Metrics positively affect Web Traffic. Some of them have a relationship with each other, such as Backlinks and DA.

5. Discussion

The primary objective of this paper is to develop a precise methodology grounded in a pioneering context, to provide actionable insights into the use of search engine optimization and its contribution to the growth of hotels’ web traffic and sustainability. We retrieve hotel data from Google Maps API using a tool we developed. We performed SEO checks on each hotel’s website (309 in total), evaluating the SEO techniques implemented in their source code. We also collected data for each hotel’s website using four SEO third-party APIs, including 12-month web traffic, domain authority, website speed, and organic keywords. The average domain authority of our dataset’s hotels’ websites is 28, the average page load time is 3.5 s, and the average monthly web traffic is 7939 users.
We used descriptive analysis and t-tests to determine the relative contribution of each technique and metric to hotel web traffic.
Figure 4 illustrates the adoption rates of SEO techniques and hotel websites. The majority of hotels’ websites implemented SEO techniques such as SEO-friendly URLs (99.68%), SSL certificates (95.15%), title tag (72.49%), and Heading 1 (65.70%). As previously stated, the AMP and Sitemap SEO technique tests revealed that none of the hotels’ websites utilized these techniques, even though Google recommends them in its Webmaster Guidelines [3]. The descriptive analysis also uncovered that hotel websites that are based on the open-source CMS WordPress (51.78% of our dataset) do not present more organic traffic than other websites. Additionally, the websites that use the Yoast SEO plugin (23.30% of our dataset) do not show more organic traffic than the other websites, which is in contrast to the so-called plugin that promotes SEO.
Section 4 confirms the research hypotheses listed in Section 3.3 by conducting descriptive, regression, and t-test analysis. Our study demonstrates that on-page SEO has a sizable impact on the hotels’ website traffic. The same is true for off-page SEO, where we confirm that backlinks affect hotels’ websites. More precisely, increasing the beta value of backlinks by one unit results in an increase in web traffic of 0.020 visitors per month. When the beta value of keywords is increased by one unit, the hotels’ web traffic increases by 0.224 visitors per month. On the contrary, despite being mentioned as a critical SEO factor in Section 2.1.11, website speed appears not to affect hotels’ web traffic. Additionally, our analysis revealed a strong correlation between backlinks and domain authority.

6. Conclusions

Travel restrictions and decline during the COVID-19 era have compelled hotel companies to develop new digital marketing strategies to compete with rising competition. The majority of hotels have invested in search engine optimization to improve their websites’ rankings on search engine results pages (SERPs) and, as a result, their organic traffic.
This study aims to ascertain the SEO strategies used by hotels’ websites. We scan each website’s source code, mining the corresponding SEO techniques used by hotels, using our own-developed tool. Simultaneously, we collect valuable time-accurate data from four third-party APIs, including domain authority, organic keywords, backlinks, and 12-month web traffic. Meanwhile, through our data analysis, we determine the most widely used SEO techniques and the impact on the traffic to hotels’ websites. We conclude that SEO techniques and metrics strongly correlate with web traffic, which results in increased conversions and bookings for hotels.

Author Contributions

Conceptualization, K.I.R. and N.D.T.; methodology, K.I.R. and N.D.T.; software, K.I.R.; validation, K.I.R. and N.D.T.; formal analysis, K.I.R. and N.D.T.; investigation, K.I.R. and N.D.T.; resources, K.I.R.; data curation, K.I.R.; writing—original draft preparation, K.I.R., N.D.T. and C.T.; writing—review and editing, K.I.R., N.D.T. and C.T.; visualization, K.I.R.; supervision, N.D.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by project ENIRISST+ under grant agreement No. MIS 5047041 from the General Secretary for ERDF & CF, under Operational Programme Competitiveness, Entrepreneurship and Innovation 2014–2020 (EPAnEK) of the Greek Ministry of Economy and Development (co-financed by Greece and the EU through the European Regional Development Fund).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data supporting reported results can be found, at https://github.com/rkonstadinos/seo-techniques-hotels (accessed on 5 May 2022).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Matoševic, G.; Dobša, J.; Mladenic, D. Using Machine Learning for Web Page Classification in Search Engine Optimization. Future Internet 2021, 13, 9. [Google Scholar] [CrossRef]
  2. Webmaster Guidelines, Google Search Central, Google Developers. Available online: https://developers.google.com/search/docs/advanced/guidelines/webmaster-guidelines (accessed on 5 May 2022).
  3. Luh, C.-J.; Yang, S.-A.; Huang, T.-L.D. Estimating Google’s search engine ranking function from a search engine optimization perspective. Online Inf. Rev. 2016, 40, 239–255. [Google Scholar] [CrossRef]
  4. Gandour, A.; Regolini, A. Web site search engine optimization: A case study of Fragfornet. Libr. Hi Tech News 2011, 28, 6–13. [Google Scholar] [CrossRef]
  5. Ziakis, C.; Vlachopoulou, M.; Kyrkoudis, T.; Karagkiozidou, M. Important Factors for Improving Google Search Rank. Future Internet 2019, 11, 32. [Google Scholar] [CrossRef] [Green Version]
  6. Roumeliotis, K.; Tselikas, N. An effective SEO techniques and technologies guide-map. J. Web Eng. 2022, in press. [Google Scholar]
  7. PHP-Based Software to Retrieve Data and Process Hotels’ SEO Data. Available online: https://github.com/rkonstadinos/seo-techniques-hotels (accessed on 5 May 2022).
  8. Patil, V.M.; Patil, A.V. SEO: On-Page + Off-Page Analysis. In Proceedings of the International Conference on Information, Communication, Engineering and Technology (ICICET), Pune, India, 29–31 August 2018. [Google Scholar]
  9. Kumar, G.; Paul, R.K. Literature Review on On-Page & Off-Page SEO for Ranking Purpose. United Int. J. Res. Technol. 2020, 1, 30–34. [Google Scholar]
  10. Wang, F.; Li, Y.; Zhang, Y. An empirical study on the search engine optimization technique and its outcomes. In Proceedings of the 2nd International Conference on Artificial Intelligence, Management Science and Electronic Commerce (AIMSEC), Zhengzhou, China, 8–10 August 2011. [Google Scholar]
  11. (Meta) Title Tags + Title Length Checker [2021 SEO]–Moz. Available online: https://moz.com/learn/seo/title-tag (accessed on 5 May 2022).
  12. Van, T.L.; Minh, D.P.; Le Dinh, T. Identification of paths and parameters in RESTful URLs for the detection of web Attacks. In Proceedings of the 4th NAFOSTED Conference on Information and Computer Science, Hanoi, Vietnam, 24–25 November 2017. [Google Scholar]
  13. Zhu, C.; Wu, G. Research and Analysis of Search Engine Optimization Factors Based on Reverse Engineeing. In Proceedings of the Third International Conference on Multimedia Information Networking and Security, Shanghai, China, 4–6 November 2011. [Google Scholar]
  14. Roumeliotis, K.I.; Tselikas, N.D. Search Engine Optimization Techniques: The Story of an Old-Fashioned Website. In Business Intelligence and Modelling. IC-BIM 2019. Paris, France, 12–14 September 2019; Springer Book Series in Business and Economics; Springer: Cham, Switzerland, 2019. [Google Scholar]
  15. URL Structure [2021 SEO]–Moz SEO Learning Center. Available online: https://moz.com/learn/seo/url (accessed on 5 May 2022).
  16. An Image Format for the Web|WebP|Google Developers. Available online: https://developers.google.com/speed/webp (accessed on 5 May 2022).
  17. Hui, Z.; Shigang, Q.; Jinhua, L.; Jianli, C. Study on Website Search Engine Optimization. In Proceedings of the International Conference on Computer Science and Service System, Nanjing, China, 11–13 August 2012. [Google Scholar]
  18. Hernandez, C.C.; Palos-Sánchez, P.; Rios, M.A. Website Quality Assessment: A Case Study of Chinese Airlines. Indian J. Mark. 2020, 50, 42–64. [Google Scholar] [CrossRef]
  19. Roumeliotis, K.I.; Tselikas, N.D. Evaluating Progressive Web App Accessibility for People with Disabilities. Network 2022, 2, 350–369. [Google Scholar] [CrossRef]
  20. Zhang, S.; Cabage, N. Does SEO Matter? Increasing Classroom Blog Visibility through Search Engine Optimization. In Proceedings of the 47th Hawaii International, Conference on System Sciences, Wailea, HI, USA, 7–10 January 2013. [Google Scholar]
  21. Best Practices for Creating Quality Meta Descriptions. Available online: https://developers.google.com/search/docs/advanced/appearance/snippet (accessed on 5 May 2022).
  22. All Standards and Drafts-W3C. Available online: https://www.w3.org/TR/ (accessed on 5 May 2022).
  23. Shroff, P.H.; Chaudhary, S.R. Critical rendering path optimizations to reduce the web page loading time. In Proceedings of the 2nd International Conference for Convergence in Technology (I2CT), Mumbai, India, 7–9 April 2017. [Google Scholar]
  24. Tran, H.; Tran, N.; Nguyen, S.; Nguyen, H.; Nguyen, T.N. Recovering Variable Names for Minified Code with Usage Contexts. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering (ICSE), Montreal, QC, Canada, 25–31 May 2019. [Google Scholar]
  25. Ma, D. Offering RSS Feeds: Does It Help to Gain Competitive Advantage? In Proceedings of the 42nd Hawaii International Conference on System Sciences, Waikoloa, HI, USA, 5–8 January 2009. [Google Scholar]
  26. Gudivada, V.N.; Rao, D.; Paris, J. Understanding Search-Engine Optimization. Computer 2015, 48, 43–52. [Google Scholar] [CrossRef]
  27. Mobile-Friendly Test Tool. Available online: https://search.google.com/test/mobile-friendly (accessed on 5 May 2022).
  28. MdSaidul, H.; Abeer, A.; Angelika, M.; Prasad, P.W.C.; Amr, E. Comprehensive Search Engine Optimization Model for Commercial Websites: Surgeon’s Website in Sydney. J. Softw. 2018, 13, 43–56. [Google Scholar]
  29. Kaur, S.; Kaur, K.; Kaur, P. An Empirical Performance Evaluation of Universities Website. Int. J. Comput. Appl. 2016, 146, 10–16. [Google Scholar] [CrossRef]
  30. Pingdom Website Speed Test. Available online: https://tools.pingdom.com/ (accessed on 5 May 2022).
  31. Google Chrome Help. Available online: https://support.google.com/chrome/answer/95617?hl=en (accessed on 5 May 2022).
  32. Forecast Number of Mobile Users Worldwide from 2020 to 2025. Available online: https://www.statista.com/statistics/218984/number-of-global-mobile-users-since-2010/ (accessed on 5 May 2022).
  33. Roumeliotis, K.I.; Tselikas, N.D. Accelerated Mobile Pages: A Comparative Study. Business Intelligence and Modelling. In Proceedings of the IC-BIM, Los Cabos, Mexico, 21–25 October 2019; Springer Proceedings in Business and Economics Book Series (SPBE). Springer: Chem, Switzerland, 2019. [Google Scholar]
  34. Start Building Websites with AMP. Available online: https://amp.dev/documentation/ (accessed on 5 May 2022).
  35. Jun, B.; Bustamante, F.; Whang, S.; Bischof, Z. AMP up your Mobile Web Experience: Characterizing the Impact of Google’s Accelerated Mobile Project. In Proceedings of the MobiCom’19: The 25th Annual International Conference on Mobile Computing and Networking, Los Cabos, Mexico, 21–25 October 2019. [Google Scholar]
  36. Phokeer, A. On the potential of Google AMP to promote local content in developing regions. In Proceedings of the 11th International Conference on Communication Systems & Networks (COMSNETS), Bengaluru, India, 7–11 January 2019. [Google Scholar]
  37. Welcome to Schema.org. Available online: https://schema.org/ (accessed on 5 May 2022).
  38. Guha, R.; Brickley, D.; MacBeth, S. Schema.org: Evolution of Structured Data on the Web: Big data makes common schemas even more necessary. Queue 2015, 13, 10–37. [Google Scholar] [CrossRef]
  39. Navarrete, R.; Lujan-Mora, S. Microdata with Schema vocabulary: Improvement search results visualization of open educational resources. In Proceedings of the 13th Iberian Conference on Information Systems and Technologies (CISTI), Caceres, Spain, 13–16 June 2018. [Google Scholar]
  40. Navarrete, R.; Luján-Mora, S. Use of embedded markup for semantic annotations in e-government and e-education websites. In Proceedings of the Fourth International Conference on eDemocracy & eGovernment (ICEDEG), Quito, Ecuador, 19–21 April 2017. [Google Scholar]
  41. The Open Graph Protocol. Available online: https://ogp.me/ (accessed on 5 May 2022).
  42. Sergey, B.; Lawrence, P. The anatomy of a large-scale hypertextual Web search engine. Comput. Netw. ISDN Syst. 1998, 30, 107–117. [Google Scholar]
  43. Krohn, S. Organic Traffic and Why It Is Important. 2016. Available online: https://www.linkedin.com/pulse/organic-traffic-why-important-krohn-online-traffic-generation (accessed on 5 May 2022).
  44. Jeffers, J. Is Direct Traffic an Indicator of Brand Strength? 2019. Available online: https://www.portent.com/blog/analytics/is-direct-traffic-an-indicator-of-brand-strength.htm (accessed on 5 May 2022).
  45. Gokhan, E.; Coskun, B. The Role of Search Engine Optimization on Keeping the User on the Site. Procedia Comput. Sci. 2014, 36, 335–342. [Google Scholar]
  46. SEO in eCommerce: Everything You Need to Improve 2019. Available online: https://www.cloudoe.gr/en/article/seo-in-ecommerce-everything-you-need-to-improve (accessed on 5 May 2022).
  47. Saura, J.R.; Palos-Sánchez, P.; Cerdá Suárez, L.M. Understanding the Digital Marketing Environment with KPIs and Web Analytics. Future Internet 2017, 9, 76. [Google Scholar] [CrossRef] [Green Version]
  48. Mozscape API. Available online: https://moz.com/products/api (accessed on 5 May 2022).
  49. Sakas, D.P.; Giannakopoulos, N.T. Harvesting Crowdsourcing Platforms’ Traffic in Favour of Air Forwarders’ Brand Name and Sustainability. Sustainability 2021, 13, 8222. [Google Scholar] [CrossRef]
  50. What Is Organic Search? Everything You Need to Know. 2021. Available online: https://ahrefs.com/blog/organic-search/ (accessed on 5 May 2022).
  51. Organic Keywords: SEO for Beginners. Available online: https://www.semrush.com/blog/organic-keywords/ (accessed on 5 May 2022).
  52. Ubersuggest. Available online: https://neilpatel.com/ubersuggest/ (accessed on 5 May 2022).
  53. Google Maps Platform. Available online: https://developers.google.com/maps (accessed on 5 May 2022).
  54. Command Line Tool and Library for Transferring Data with URLs. Available online: https://curl.se/ (accessed on 5 May 2022).
  55. Mobile-Friendly Test Tool. Available online: https://support.google.com/webmasters/answer/6352293?hl=en (accessed on 5 May 2022).
  56. Pagespeedapi Runpagespeed. Available online: https://developers.google.com/speed/docs/insights/v4/reference/pagespeedapi/runpagespeed (accessed on 5 May 2022).
  57. IBM SPSS 27. Available online: https://www.ibm.com/support/pages/downloading-ibm-spss-statistics-27 (accessed on 5 May 2022).
  58. Hotels Dataset Excel. Available online: https://github.com/rkonstadinos/seo-techniques-hotels/blob/main/hotels.xlsx (accessed on 5 May 2022).
  59. Ahrens, J.P. Visualization and Data Analysis at the Exascale; Los Alamos National Laboratory: Los Alamos, NM, USA, 2011. [CrossRef]
  60. Zikmund, W.G.; D’Alessandro, S.; Winzar, H.; Lowe, B.; Babin, B. Marketing Research: Cengage Learning, 4th Asia-Pacific ed.; Victoria Cengage Learning: South Melbourne, VIC, Australia, 2017; ISBN 9780170369824. [Google Scholar]
  61. Shrestha, S.K. Brand Loyalty of Baby Diaper Products. Manag. Dyn. 2018, 21, 79–88. [Google Scholar] [CrossRef]
  62. Park, H.M. Univariate Analysis and Normality Test Using SAS, Stata, and SPSS; The University Information Technology Services (UITS) Center for Statistical and Mathematical Computing; Indiana University: Bloomington, IN, USA, 2018. [Google Scholar]
  63. Measures of Skewness and Kurtosis. Available online: https://www.itl.nist.gov/div898/handbook/eda/section3/eda35b.htm (accessed on 5 May 2022).
  64. Ali, E.; Hacer, C. Literature Search Consisting of the Areas of Six Sigma’s Usage. Procedia-Soc. Behav. Sci. 2015, 195, 695–704. [Google Scholar]
  65. Tabachnick, B.G.; Fidell, L.S.; Ullman, J.B. Using Multivariate Statistics, 7th ed.; Pearson Boston: Boston, MA, USA, 2019; ISBN-13: 9780135350904. [Google Scholar]
  66. Dukkipati, P.R.V. Probability and Statistics for Scientists and Engineers; New Academic Science Ltd.: London, UK, 2011; Volume 27, p. 1, ISBN-13: 978-1906574833. [Google Scholar]
  67. Asadoorian, M.O.; Kantarelis, D. Essentials of Inferential Statistics, 4th ed.; University Press of America: Lanham, MD, USA, 2004; ISBN-13: 978-0761830306. [Google Scholar]
  68. p-Value. Available online: https://www.investopedia.com/terms/p/p-value.asp (accessed on 5 May 2022).
  69. What a p-Value Tells You about Statistical Significance. Available online: https://www.simplypsychology.org/p-value.html (accessed on 5 May 2022).
  70. The Ultimate IBM SPSS Statistics Guides. Available online: https://statistics.laerd.com (accessed on 5 May 2022).
  71. Clogg, C.C.; Petkova, E.; Haritou, A. Statistical methods for comparing regression coefficients between models. Am. J. Sociol. 1995, 100, 1261–1293. [Google Scholar] [CrossRef]
  72. Standardized Beta Coefficient: Definition & Example. Available online: https://www.statisticshowto.com/standardized-beta-coefficient/ (accessed on 5 May 2022).
  73. Ceci, M.; Hollmén, J.; Todorovski, L.; Vens, C.; Džeroski, S. (Eds.) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2017. Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2017. [Google Scholar]
  74. Keim, D.A.; Hao, M.C.; Dayal, U.; Janetzko, H.; Bak, P. Generalized Scatter Plots. Inf. Vis. 2010, 9, 301–311. [Google Scholar] [CrossRef] [Green Version]
  75. Sakas, D.P.; Giannakopoulos, N.T. Big Data Contribution in Desktop and Mobile Devices Comparison, Regarding Airlines’ Digital Brand Name Effect. Big Data Cogn. Comput. 2021, 5, 48. [Google Scholar] [CrossRef]
  76. Gray, S.A.; Gray, S.; Cox, L.J.; Henly-Shepard, S. Mental Modeler: A Fuzzy-Logic Cognitive Mapping Modeling Tool for Adaptive Environmental Management. In Proceedings of the 46th Hawaii International Conference on System Sciences, Wailea, HI, USA, 7–10 January 2013. [Google Scholar]
  77. Salmeron, J.L. Supporting Decision Makers with Fuzzy Cognitive Maps. Research-Technology Management 2009, 52, 53–59. [Google Scholar] [CrossRef]
Figure 1. PHP-based SEO software.
Figure 1. PHP-based SEO software.
Digital 02 00021 g001
Figure 2. Scatterplot of each Metric by web traffic.
Figure 2. Scatterplot of each Metric by web traffic.
Digital 02 00021 g002
Figure 3. Using fuzzy cognitive map to locate the correlations between SEO Techniques/Metrics and Web Traffic.
Figure 3. Using fuzzy cognitive map to locate the correlations between SEO Techniques/Metrics and Web Traffic.
Digital 02 00021 g003
Figure 4. SEO Techniques adoption percentage.
Figure 4. SEO Techniques adoption percentage.
Digital 02 00021 g004
Table 1. Description of the examined web analytics metrics and suggested KPIs and Performance Measurements.
Table 1. Description of the examined web analytics metrics and suggested KPIs and Performance Measurements.
KPIs (Unit)Description
Organic Traffic/Month (number of organic visitors per month)Organic search refers to the non-paid search results from a search engine. These results cannot be bought or influenced by advertisers, so they are the ones the search engine considers most relevant to the user’s search query [50]. Organic traffic is described by the number of unique visitors per unit of time resulting from users’ queries on search engines.
Domain Authority—DA (score in the range [0–100], integer)DA score can be used when comparing websites or tracking the “ranking strength” of a website over time [48]. Websites with higher DA are more likely to rank higher in SERPs than those with lower DA. Search engines do not use the DA metric to rank web pages to the search results. DA is a simulation metric made by Moz’s SEO experts to model search engines’ algorithms behavior [6,11].
Website Speed (seconds)Website speed refers to how quickly a browser can load fully functional web pages from a given website.
Organic Keywords (number of keywords)Organic keywords are keywords used in SEO to attract “free” traffic. When a user types a keyword on the search engines, it returns as a result websites which rank for this specific keyword [51]. SEO experts are constantly trying to detect high-traffic keywords that users prefer to search on search engines. The keywords are selected based on the website’s niche. By creating landing pages, and using copywriting and on-page SEO techniques, SEO experts attempt to rank the website to specific keyword queries on search results. The more keywords rank for, the higher organic traffic received. SEO software tools, such as Ubersuggest [52], are used by SEO experts to monitor in which exactly keywords the website is ranking for and in which specific ranking position.
Backlinks (number of backlinks)Backlinks are links that the considered page receives from other page(s). According to Google’s founders, anchored links are links to a website containing the target keyword of the destination website. Anchors often provide more accurate descriptions of Web pages than the pages themselves [42]. Backlinks help secure a higher SERP ranking [26]. Backlinks affect mostly the position of a website followed by relevant content [5].
Table 2. Descriptive statistics on SEO metrics.
Table 2. Descriptive statistics on SEO metrics.
NMinimumMaximumMeanStd. DeviationSkewnessKurtosis
StatisticStatisticStatisticStatisticStatisticStatisticStd. ErrorStatisticStd. Error
Web Traffic30921,058,500.007939.309169,130.0135613.3200.139188.7690.276
Domain Authority30939028.8111.4720.9070.1394.1530.276
Website Speed3090.415.43.5012.54621.7810.1393.5890.276
Organic Keywords30902,898,54527,009.03263,236.71410.1530.139102.4110.276
Backlinks309332,965,989.00218,197.97732,315,388.3512.2460.139157.3480.276
Table 3. Independent t-test on the meta description technique.
Table 3. Independent t-test on the meta description technique.
TechniqueMean (Number of Visitors per Month)Standard Deviation (Number of Visitors per Month)t-Valuep-Value
Yes (n = 193)No (n = 116)Yes (n = 193)No (n = 116)
Meta Description8739.236608.3977,063.2853,677.81−0.2620.02
Table 4. Independent t-test on the Robots.txt technique.
Table 4. Independent t-test on the Robots.txt technique.
TechniqueMean (Number of Visitors per Month)Standard Deviation (Number of Visitors per Month)t-Valuep-Value
Yes (n = 196)No (n = 131)Yes (n = 196)No (n = 131)
Robots.txt6094.4911,139.1642,905.7399,602.740.6170.04
Table 5. T-test on 15 SEO Techniques.
Table 5. T-test on 15 SEO Techniques.
TechniqueMean (Number of Visitors per Month)Standard Deviation (Number of Visitors per Month)t-Valuep-ValueValuable
YesNoYesNo
Meta Description8739.236608.3977,063.2853,677.81−0.2620.021
Image Alt16,867.691791.89107,719.974978.16−1.8920.031
SEO friendly URL7964.38215.4269,241.100−0.1120.011
Minified CSS19,867.592315.97121,203.187434.48−2.0940.001
Title10,237.101883.9281,027.666470.74−0.9480.001
Structured data13,017.175092.62101,275.8041,495.26−0.9670.001
Responsive11,117.371764.7784,892.545268.46−1.1270.021
Heading19941.264105.3684,441.1516,683.35−0.7040.041
Heading29711.046061.2784,777.0647,439.21−0.4630.001
SSL8324.65386.5670,855.70563.44−0.4330.011
Open graph9066.936680.3783,100.8349,322.96−0.3030.021
Link Title11,925.116877.5171,897.1368,486.96−0.5220.001
RSS29,721.212690.65154,236.5611,543.47−2.740.001
Robots6094.4911,139.1642,905.7399,602.740.6170.040
Minified JS23,685.242251.34132,989.817181.28−2.4250.001
Is WordPress2697.3113,568.2912,955.1298,506.261.3830.1680
Is Yoast2762.929511.887731.0078,792.800.7250.4690
Table 6. Model Summary. Strength of the relationship between the model and the dependent variable.
Table 6. Model Summary. Strength of the relationship between the model and the dependent variable.
MetricModelRR SquareAdjusted R SquareStd. Error of the Estimate
Domain Authority10.4090.1670.16563,180.47283
Website Speed10.0370.001−0.00269,195.23812
Organic Keywords10.8530.7270.72636,191.48936
Backlinks10.6720.4520.45051,279.24795
Table 7. ANOVA results. Overall significance of the model.
Table 7. ANOVA results. Overall significance of the model.
Model Sum of SquaresdfMean SquareFp-Value
Domain AuthorityRegression246,445,253,450.7131246,445,253,450.71361.7380.000
Residual1,225,474,049,215.8563073,991,772,147.283
Total1,471,919,302,666.569308
Website SpeedRegression2,009,142,195.17112,009,142,195.1710.4200.518
Residual1,469,910,160,471.3983074,787,980,978.734
Total1,471,919,302,666.569308
Organic KeywordsRegression1,069,803,364,626.08611,069,803,364,626.086816.7540.000
Residual402,115,938,040.4833071,309,823,902.412
Total1,471,919,302,666.569308
BacklinksRegression664,643,992,835.4791664,643,992,835.479252.7590.000
Residual807,275,309,831.0903072,629,561,269.808
Total1,471,919,302,666.569308
Table 8. Coefficient results. Results show how much dependent variable is expected to increase when that independent variable increases by one.
Table 8. Coefficient results. Results show how much dependent variable is expected to increase when that independent variable increases by one.
ModelBStd. ErrorBetatp-Value
(Constant)63,093.6009728.574 6.4850.000
Domain Authority2465.645313.8000.4097.8570.000
(Constant)4427.2106700.015 0.6610.509
Website Speed1003.0861548.4920.0370.6480.518
(Constant)1892.3202069.706 0.9140.361
Organic Keywords0.2240.0080.85328.5790.000
(Constant)3561.6082930.141 1.2160.225
Backlinks0.0200.0010.67215.8980.000
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Roumeliotis, K.I.; Tselikas, N.D.; Tryfonopoulos, C. Greek Hotels’ Web Traffic: A Comparative Study Based on Search Engine Optimization Techniques and Technologies. Digital 2022, 2, 379-400. https://doi.org/10.3390/digital2030021

AMA Style

Roumeliotis KI, Tselikas ND, Tryfonopoulos C. Greek Hotels’ Web Traffic: A Comparative Study Based on Search Engine Optimization Techniques and Technologies. Digital. 2022; 2(3):379-400. https://doi.org/10.3390/digital2030021

Chicago/Turabian Style

Roumeliotis, Konstantinos I., Nikolaos D. Tselikas, and Christos Tryfonopoulos. 2022. "Greek Hotels’ Web Traffic: A Comparative Study Based on Search Engine Optimization Techniques and Technologies" Digital 2, no. 3: 379-400. https://doi.org/10.3390/digital2030021

Article Metrics

Back to TopTop