Next Article in Journal
Evaluation and Spatiotemporal Differentiation of Cultural Tourism Development Potential: The Case of the Middle and Lower Reaches of the Yellow River
Previous Article in Journal
A Fine-Grained Simulation Study on the Incidence Rate of Dysentery in Chongqing, China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Random Forest Variable Importance Measures for Spatial Dynamics: Case Studies from Urban Demography

by
Marina Georgati
1,*,
Henning Sten Hansen
1 and
Carsten Keßler
1,2
1
Department of Sustainability and Planning, Aalborg University, 2450 København, Denmark
2
Department of Geodesy, Bochum University of Applied Sciences, 44801 Bochum, Germany
*
Author to whom correspondence should be addressed.
ISPRS Int. J. Geo-Inf. 2023, 12(11), 460; https://doi.org/10.3390/ijgi12110460
Submission received: 6 September 2023 / Revised: 24 October 2023 / Accepted: 31 October 2023 / Published: 9 November 2023

Abstract

:
Population growth in urban centres and the intensification of segregation phenomena associated with international mobility require improved urban planning and decision-making. More effective planning in turn requires better analysis and geospatial modelling of residential locations, along with a deeper understanding of the factors that drive the spatial distribution of various migrant groups. This study examines the factors that influence the distribution of migrants at the local level and evaluates their importance using machine learning, specifically the variable importance measures produced by the random forest algorithm. It is conducted on high spatial resolution ( 100 × 100 grid cells) register data in Amsterdam and Copenhagen, using demographic, housing and neighbourhood attributes for 2018. The results distinguish the ethnic and demographic composition of a location as an important factor in the residential distribution of migrants in both cities. We also examine whether certain migrant groups pay higher prices in the most attractive areas, using spatial statistics and mapping for 2008 and 2018. We find evidence of segregation in both cities, with Western migrants having higher purchasing power than non-Western migrants in both years. The method sheds light on the determinants of migrant distribution in destination cities and advances our understanding of the application of geospatial artificial intelligence to urban dynamics and population movements.

1. Introduction

Residential location modelling plays an important role in urban planning and policy-making [1,2]. As the world’s population grows and becomes more concentrated in urban centres, effective urban planning requires improved forecasting and a deeper understanding of the factors that influence residential locations. Inner city movements and residential relocations trigger urban dynamics with implications for economic development, socio-demographic structures, spatial segregation and inequalities [3]. Housing availability and prices, commuting time to work, proximity to services or recreation, and the quality of the urban environment are some of the determinants that influence our next residential choices [3]. However, which of these factors are the most important? And for whom?
Several studies have examined the interaction between purchase prices or rental costs and household income [2,4]. Other studies have analysed the importance of transport accessibility in the choice of where households without cars (or bikes) live, or in relation to access to work and commuting time [5]. Chen et al. [6] found interactions between households with children and neighbourhoods with good schools, while Mulder [7] and Mulder and Cooke [8] highlighted the role of family ties in residential location choices. Research has also identified proximity to various amenities as an important attraction factor. Examples include educational [9,10] and recreational facilities [10,11], services and retail [5], and transport systems [12,13]. Others have emphasised the role of previous residential location when it comes to moving out of or staying in the same neighbourhood [6].
However, little is known about the factors that specifically influence migrant’s location choices [14,15], particularly when it comes to identifying ’place-specific attributes’ [16] in destination cities and assessing their importance for certain migrant groups. This challenge is complex and multidimensional, with limited empirical research to date [14,17,18]. Its complexity is amplified in modern multicultural urban centres with increased population density and diversity [19,20,21].
Thus, several studies have focused on the residential distribution of migrants, examining spatial patterns of representation, segregation or integration [14,22,23,24,25,26,27,28,29], and have highlighted the need for novel theories to explain the residential outcomes of migrants [29]. They have also emphasised the influence of cultural background or diversity on migrants’ residential distribution [23,24,25], but the specific role of the presence of co-ethnics has not yet been quantified. We use the term co-ethnics to refer to individuals who may belong to the same ethnic group but come from different countries and share a common cultural background, language and/or religion.
To address the above research gaps and to gain a deeper understanding of the role of ethnic composition of neighbourhoods in migrant residential location modelling, we conducted a data-driven study that related several determinants of residential preferences, including the effect of migration background. Quantifying the influence of co-ethnics involved examining how the presence of individuals from the same ethnic background affected, among other factors, the distribution of migrants.
We assessed the importance of these factors using novel machine learning (ML) techniques, in particular random forest (RF; [30]) regression on gridded population and topographic data at high spatial granularity. RF variable importance measures (VIMs; [31,32])—an innovative method previously used mainly in biological studies [33,34]—assess the importance of the examined variables and quantify their impact on model performance. While they have been incorporated into geospatial studies, their use has typically been limited to certain contexts, including disaggregation methods [35,36], poverty [37] or flood zone modelling [38]. Our analysis focused on the factors influencing the residential location of migrants in two case studies in Europe with a rich migration history (i.e., Amsterdam and Copenhagen) in 2018, and unfolded in two stages, for aggregated groups and individual countries of origin (CoOs). We also explored the distribution of migrants in relation to house prices in 2008 and 2018, and whether certain migrant communities paid more in the most desirable locations.

2. Background

For households, residential choices describe the decision-making process of choosing between housing and location options [39]. Modelling residential choices refers to maximising utility for each of its members [6]. It provides the means to explain and predict discrete choices after weighing the contributions of various measures that define the decision, such as housing and location attributes [40]. It is well known that trade-offs are usually made where the utility of one attribute is reduced to gain the utility of another (e.g., longer commuting times for lower house prices).
Several variables determine households’ residential location choices [3], and many models have been developed to explain the relationships among them [41]. Although most studies provide a broad overview of the characteristics that influence household decisions, they do not address differences among migrant groups and natives. These differences in housing and location choices reflect spatial, social and economic conditions, including segregation and discrimination. They are the result of complex urban dynamics and are explained by theories such as spatial assimilation [42,43,44], place stratification [45,46], ethnic preferences [47], housing career and structural changes in the economy and labour market [48].
Spatial assimilation theory suggests that recently arrived migrants prefer affordable, diverse, potentially disadvantaged locations that already accommodate a high proportion of migrants. These new migrants adopt local attitudes, integrate over time, establish their socio-economic position and move to more desirable areas with larger native populations [48,49,50]. In contrast, according to place stratification theory, migrant groups face challenges in assimilating due to discrimination, especially in neighbourhoods dominated by the host population [46,47]. The theory of ethnic preference suggests that not all migrants seek spatial assimilation; they may instead prefer to live close to co-ethnics, even if they could afford more desirable neighbourhoods [47,51]. This decision is influenced by the benefits of maintaining cultural traditions, forming strong social connections and having access to ethnic shops and services.
In studies across northern European countries, findings based on different types of regression models are consistent with spatial assimilation theory [50,51,52,53]. While these studies highlight the importance of ethnic composition in shaping migrants’ residential decisions, they primarily use discrete choice models to measure the impact of different factors on choice probabilities. This modelling approach is confined to the use of survey data in which respondents navigate hypothetical situations and has limitations related to survey design, respondent honesty and the strong assumptions they often make. It also requires data over multiple time periods and is not suitable for capturing complex interactions between factors.
Recently, there has been a growing trend in the use of geocoded, register and census population data in segregation analysis. These applications cover a wide methodological range, including sequential studies [47], longitudinal analyses [28,54], the use of indices [55,56] and the adoption of the individualised neighbourhood concept [57,58,59,60]. Notably, the latter examples simultaneously address issues related to the modifiable aerial unit problem (MAUP; [61,62]), which has raised difficulties because the spatial units for which segregation is measured influence segregation outcomes [62,63].
Given the emerging demand for new theories and models to explain migrant residential outcomes [29], it is essential to quantify the specific impact of ethnic composition on residential modelling. This requires a deeper understanding of which ethnic groups primarily influence each other. To this end, our study departed from examining residential trajectories and changes in segregation over time. Instead, we focused on a specific time period to explore the variables in detail for the current situation, as a result of years of residential mobility up to that point. We proposed the use of innovative ML modelling on fine-grained register data, which can effectively handle large datasets and capture complex relationships between multiple factors. This allowed us to analyse, in depth, the factors involved and the population dynamics.

3. Materials and Methods

3.1. Case Studies and Population Datasets

The analysis focused on the foreign populations residing in the municipality of Amsterdam and the capital region of Copenhagen. The population datasets describe the distribution of the population by CoO in 2008 and 2018 at a resolution of 100 m. The data were provided by the municipality of Amsterdam (Onderzoek, Informatie en Statistiek (OIS; https://data.amsterdam.nl) and Statistics Denmark (DST; https://www.dst.dk) and are not publicly available as they contain sensitive information in sparsely populated areas. The spatial representation of migrants varies between the two cities (Table A1). In 2018, migrants occupied 90% of the inhabited cells in Amsterdam and 78% in Copenhagen. In both cities, only a negligible proportion of cells was inhabited exclusively by migrants.
We examined the residential distribution of both aggregated groups and individual CoOs. In total, 304 CoOs in Amsterdam and 178 CoOs in Copenhagen were aggregated into seven regions of origin (RoOs). RoOs refer to broader geographical areas from which individual CoOs were grouped on the basis of shared characteristics, such as geography or culture. The classification (Table A2) differs between the cities because of their different histories and settings as migration destinations; Amsterdam, with its colonial past, has long been an established migration hub for many nationalities; Copenhagen, following restrictive policies, has been a less accessible destination. The aggregation into groups considered these differences, the social discourse and the largest migrant groups in each case. For reference, the lower part of Table A3 and Table A4 in the appendix show the statistical summary of the migrant groups by RoO and CoO used in Amsterdam in 2018.

3.2. Selected Determinants

We adopted the classification of determinants proposed by Schirmer et al. [3] into two broad categories related to building characteristics (e.g., value of residence, size) and location/neighbourhood attributes (e.g., proximity to services, points of interest). In addition, we included socio-demographic attributes describing the population by age, income and education, where data were available from the corresponding statistical offices for 2018.
Case-specific layers were produced at a fine-grained resolution, considering the main factors influencing the residential choices of different demographic groups. Most of these layers were common to both cases, but additional socio-demographic variables were included for Copenhagen. For the analysis at CoO level, we included data on the distribution of migrants by CoO where the population was greater than 50 persons. We used the same socio-demographic, housing and neighbourhood factors but replaced the two generic layers for places of worship (i.e., Christian, non-Christian) with religion-specific layers. In Amsterdam, there were 110 places of worship—2 for Buddhism; 7 for Islam; 3 for Sikhism; 6 for Judaism; 7 for Hinduism; and 86 for Christianity (5 classified as Catholic, 1 as Lutheran and 27 as Protestant churches [64]). In Copenhagen, there were 217 churches [65], 1 place of worship for Buddhists and 2 for Muslims [64]. Table 1 describes the layers, their sources and their processing details, while Table A3 shows the statistical summary of the corresponding layers in Amsterdam in 2018.

3.3. Random Forest Variable Importance Measures

Different demographic groups are influenced by a range of socio-economic and urban variables. We aimed to assess the relationships between migrants from different CoOs/RoOs and the distribution of one group as a determinant of the residential distribution of others, together with population and topographic characteristics. The RF algorithm was applied because it is more suitable for capturing complex relationships between dependent variables, such as different population groups, than the linear and logit models used in previous studies [50,52,53]. These models are oversimplified and unsuitable for detecting such patterns, especially when a much wider range of factors is examined; they are prone to underfitting, sensitive to outliers and assume that the data are independent—which is not the case for the data at hand.
Furthermore, the RF regression algorithm can capture non-linear relationships among the investigated factors. The inclusion of non-linear associations is essential for our study as migration patterns and residential locations are often the result of complex interactions of socio-economic and cultural factors, which may not always follow a strictly linear trajectory. In this way, we examine how strong the links among various migrant groups are in supporting the relocation of others. Unlike other studies that examine mobility patterns in multiple time steps, we take a snapshot of the year in question (2018) and conduct a data-driven study to examine the outcomes of years of spatial mobility. This can serve as a baseline for a future longitudinal study, allowing us to highlight existing patterns and explore, in depth, the variables at that particular point in time.
The RF, an ‘ensemble learning’ technique, combines multiple decision trees and aggregates their predictions for enhanced accuracy [30,69,70]. It supports the interpretability and identification of informative variables [31,71] through VIMs—built-in functions that rank the training features in terms of prediction accuracy [32]. By ranking the contribution of the training variables, they capture the strength of the dependency between features and predictions [32,72].
Mean decrease in impurity (MDI) and mean decrease accuracy (MDA) are two commonly used RF VIMs supported in Python and the Pedregosa et al. [73] implementation. While MDA is typically more reliable [74], being unbiased to variables with numerous split points, it can overvalue highly correlated variables and, for high-dimensional data, its rankings are less robust to perturbations in the data than those obtained with the MDI [72,75]. For our analysis, given continuous data layers with potential correlations, we chose the MDI score.
Since the distribution of population groups was available, we employed a supervised learning framework to make predictions. The training data consisted of the layers of demographic, building and neighbourhood attributes, as well as data on the distribution of other migrant groups. These inputs were paired with the target values representing the specific migrant group under consideration. For model training, we only used data from 2018, which provided a more concise and detailed perspective on the variables of interest. During training, we retrieved the importance of each layer in predicting the target variable and distinguished the stronger relationships between migrants based on their RoO and then by individual CoO.
Our goal was not necessarily to develop the most accurate predictive model but to use it as a tool to explain a complicated phenomenon. Therefore, after experimenting with different hyperparameters (i.e., the number of trees, the maximum depth) to ensure the model’s robustness and consistency, we adopted a simple RF implementation based on Python frameworks (i.e., scikit-learn; [73]). We used a small number of trees as the relative value of feature importance did not change with the depth of the model. We allocated 25% of the data for testing and used the R-squared value to evaluate the prediction performance. The variable importance scores were extracted into EXCEL tables and ranked by determinant category. The CoO scores were further related to the corresponding population sizes and visualised using the Matplotlib [76] and Seaborn [77] libraries after eliminating scores lower than 7%.

3.4. Migrant Distribution and Real Estate Prices

We also investigated the relationship between the distribution of migrants and house prices. We examined whether migrants bought more expensive properties in areas that were more attractive to them or where their residential density was higher. We estimated the sum of purchase prices by migrant group per grid cell along the inhabited cells as described in Equation (1), where i is the individual grid cell, Nim is the migrant population in cell i, Nm is the total migrant population in the examined area and Pi is the real estate price in the individual grid cell. The bivariate maps show the distribution of the share of migrants in the total population in relation to house prices (see Figures 4 and 5). That way, we explored the proportions of migrants living in zones of expensive/inexpensive housing and how these tendencies were represented spatially in each city.
For validation purposes, we compared two different time periods from 2008 and 2018. As the Danish property value layer represents the sales prices of only those properties sold in the respective year, we aggregated the gridded layer to the district level and selected the average value among the purchased properties. In Amsterdam, we used the mean real estate prices interpolated to contours. Due to the differences between the original datasets and the studied areas—in Amsterdam we focused on only one central municipality, while in Copenhagen we included two central and 15 suburban municipalities—we expected variations, with central Amsterdam being more expensive and Copenhagen having lower average house prices due to the cheaper suburbs. For this reason, the comparison between the two cities cannot be direct.
P m = i = 1 N I ( N i m N m × P i )

4. Results

4.1. Results on Aggregated Groups

Table 2 and Table 3 show the results of the aggregated groups by RoO in Amsterdam and Copenhagen with 29 and 34 determining factors, respectively. Each column refers to a target RoO with the importance of the factor as a percentage, and each row contains the factor. The italicised headings group the factors into classes (i.e., age groups, building and neighbourhood features, and the presence of native/migrant groups) and summarise their individual importance. For example, in Table 2, the distribution of children receives a high importance score for predicting the distribution of Turks and Moroccans (66.12%) but less than 1% for predicting the distribution of natives and of Western and Eastern EU migrants. The R-squared value in the last row assesses the performance of the regression model.
In the Dutch case, dwelling features are most important for natives and migrants from the Western EU (WEU) and the rest of the world. For migrants from the Eastern EU (EEU), Turkey and Morocco, the distribution by age is the most important. For migrants from the former colonies, the Middle East and Africa (MEA), the presence of other migrant groups ranks higher; each of these groups plays a significant role in the distribution of the other. The relationship between WEU migrants, natives and the rest of the world population is noteworthy. More details on specific groups by CoO follow in Section 4.2. Neighbourhood attributes are less important for migrants from former colonies, Turkey and Morocco.
In the Danish case, building characteristics are again the most important features for the native and WEU population. Age groups are the most influential factors for migrants from EEU, MENAP, non-EU European and other non-Western countries. Other Western migrants are significantly influenced by the presence of WEU migrants. The distribution of non-EU European and MENAP migrants is closely related to the presence of other non-Western migrants and vice versa. Information on the distribution of the population by social status has a limited impact on the residential locations of MENAP and other non-Western migrants. Finally, neighbourhood characteristics have little effect on other Western migrants, who may be more attracted to city centre leisure activities, such as restaurants and bars, than other groups.

4.2. Results on Individual CoO

The previous section showed that in Amsterdam migrants from the former colonies are influenced by the presence of MEA migrants (62.15%) and vice versa with a similarly high weight (61.40%). In Copenhagen, the presence of WEU migrants is primarily important (69.94%) for other Western migrants. This subsection focuses on individual CoOs; we examine whether specific migrant groups explain the above-mentioned trends and how binational relations are expressed in the destination cities.

4.2.1. The Impact of Natives and Other Migrant Groups

Most groups examined by CoO (≈85%) in both cities have an accumulated influence of more than 50% from the presence of other migrant groups. The strongest interactions (>7% importance) between migrants by CoO are shown in Figure 1 and Figure 2. The figures reveal important relationships among migrant groups and patterns that are not clear from the previous tables. They show relationships among nations in geographical proximity and with shared cultural backgrounds, traditions and languages. These links are manifested beyond the borders of their geographical region through their residential locations in the destination cities to which they migrated in their current country of residence, regardless of whether their choices were made consciously or not.
Specifically, the figures illustrate the influence of one CoO on another as a function of their population size ratios. The factor-country and the target-country are plotted on the horizontal and vertical axes, respectively. On both axes, the countries are coloured by their RoO. A high vertical concentration of bubbles indicates the influence of the factor on several other groups. The size of the bubbles illustrates the importance of the factor-country on the horizontal axis in predicting the population of the target-country on the vertical axis; the larger the bubble, the greater the importance. Their colour indicates the ratio of the population of the factor-country to the target-country; oranges indicate that the factor-population is larger than the target-population, and blues indicate that the factor-population is significantly smaller than the target-population.
The size of the population is important in understanding whether large groups have a large or low impact on smaller groups. The presence of large groups indicates established centres and results in diversity, safety or familiarity, and ready-made facilities that consequently attract multiple smaller groups. Conversely, if small groups are more influential on equal-sized or larger groups, family ties, developed bonds or cultural relationships are more likely reasons to explain these residential locations.
I. Amsterdam.
In Figure 1, migrants from Ghana or its predecessor state (Gold Coast) have the strongest influence on each other, followed by another African state. Ghana is also an important determinant for other Africans, Latin Americans and Caribbeans (e.g., migrants from Nigeria, Dominican Republic, Suriname, Ecuador, Sierra Leone, Liberia and Cameroon; A1). The largest in population groups—the natives, Surinamese and Moroccans—have a medium to high impact on several groups. Specifically, the presence of the natives is an attractive factor primarily for migrants from the EU (e.g., Belgians, Irish, Swedes), Senegal and Israel (A2); the Surinamese are important for migrants from other former colonies, the Caribbean and Africa (A3); the Moroccans are important for other MEA migrants (A4).
In the highlighted rectangle of the graph, a list of MEA countries appears to influence migrant groups from the same region or from countries in the rest of the world (A6, A7), with interesting interactions between populations of the same size (e.g., Upper Volta–Costa Rica, Iraq–Afghanistan), or with origins with geographical or cultural proximity (e.g., Somalia–British Somalia, Azerbaijan–Ukraine, and UK–USA–Canada). Although the populations are small, the model identifies links between migrants from Sri Lanka and its predecessor Ceylon—A8.
II. Copenhagen.
The links between migrant groups in Copenhagen are much smaller in number and strength than in Amsterdam (Figure 2). Yugoslavs are influenced by the presence of North Macedonians and vice versa (51–53%; A1). Another Balkan relationship is noticed between Bosnians and Croats (A1), whose distribution is influenced by the presence of Italians. Natives are again an influential factor for WEU (e.g., Icelanders, Irish, Norwegians and Austrians; A2) and some non-Western migrants (e.g., Filipinos). Romanians are important for other EEU Europeans (e.g., Slovakians, Bulgarians), and neighbouring Moldovans. Apart from their strong internal impact, non-EU Europeans are important for the distribution of MENAP migrants (e.g., Yugoslavs–Israelis, Belarusians–Tunisians, and Bosnians–Syrians).
In MENAP relations, reciprocal links are noted both internally and with non-Western migrants (A3). The presence of Iraqis and Somalis influences the largest number of target-CoO with smaller or almost equal populations (A4). Pakistanis and Afghans are influenced by the presence of Turks (30–48%); Lebanese, Somalis and Syrians by Iraqis; Moroccans, Lebanese and Iraqis by Somalis; Eritreans by Syrians; and Jordanians by Kuwaitis. There are strong ties between African and Caribbean migrants, with the following indicative examples: Zimbabwe–Gambia, Libya–Côte d’Ivoire, and Trinidad and Tobago–Liberia (A6).
Given the high impact of the WEU on other Western migrants in Table 3, the Australians confirm the findings (A5). Although the residential area receives the highest score, it is followed by the presence of migrants from the UK, the USA, China and Belgium. The presence of Australians is significant for migrants from Switzerland, Canada, Singapore and Japan. Similar trends can be seen for Canadians, who are influenced by the presence of Australians and French.

4.2.2. The Impact of Other Factors

Regarding the influence of other factors, in Amsterdam age-related variables have a strong influence (30–76%) on migrants from another EU country (e.g., Poland, Germany, Romania, Portugal, Italy, Greece and Spain—PIGS), Indonesia, Dutch East Indies, Brazil, Vietnam, India, Yugoslavia and Czechoslovakia. The distributions of young and mobile adults are the most important.
In Copenhagen, the impact of age is low, apart from Portugal, where the non-mobile adults layer gains the highest score. The importance of higher education for the Turkish population—the largest migrant group—explains the high significance in the MENAP group in the aggregated analysis of Section 4.1. High educational attainment also impacts migrants from New Zealand, Montenegro and France, while the low-income layer has impact on people coming from Azerbaijan, Angola and the Philippines.
In addition to the native Dutch population, migrants from the Soviet Union, the UK, France, Brazil and Belgium are strongly influenced by the size of the residential area; the number of dwellings is important for migrants from Iran, Colombia, Thailand and Chile. Similarly, in Copenhagen, the residential area is significant for the native, British, Swedish, German, Russian, French, Brazilian, Dutch and American populations, while the number of dwellings influences migrants from Thailand, Poland, Iran, China and Vietnam.
Bearing in mind the housing career theory and the fact that some studies have linked migrants’ housing choices and segregation to housing markets and tenure [78,79,80,81], we find no evidence that ownership status is exceptionally important for any of the examined groups. The rented properties layer has low importance for some MEA and non-EU European groups in Amsterdam and only for Yugoslavs in Copenhagen.
The impact of neighbourhood attributes on migrants from individual CoOs is negligible in both cases. House prices have low significance for people from Israel, the USA, Bolivia, Suriname and Burundi in Amsterdam and from Trinidad and Tobago, Taiwan, Mozambique and Luxembourg in Copenhagen.

4.3. Real Estate Prices by Migrant Group

The fact that the neighbourhood attributes, and especially the value of the residence, rank so low in their influence for all migrant groups is unanticipated and raises questions. We further investigate whether migrants pay higher prices to buy property in the most appealing areas where diversity or the migrant concentration of co-ethnics is higher.
Table 4 presents the sum of purchase prices by migrant groups in the occupied cells, as estimated by Equation (1). It also shows the difference and the percentage change between the two time periods for each group in the right columns. We can immediately recognise the dramatic increase in the property market in both cities over the decade under consideration, ranging from 1600 to 2230 €/m2 group and constituting a 41–56.5% increase in Amsterdam. The percentage increase in the Danish capital was lower: 32–42.5% and 7500–11,100 DKK/m2 group (≈1000–1500 €/m2 group). We highlight in bold the two groups paying the highest prices for property.
In Amsterdam, these groups are, in order, the WEU, the natives, the group coming from the rest of the world, and the EEU migrants in both examined years. Meanwhile, migrants from the former colonies and the MEA region seem to purchased property at the lowest prices. Although Turks and Moroccans had slightly higher purchasing power than the above groups, they, along with migrants from the rest of the world and the MEA region, showed the largest percentage change over the decade.
In the Danish capital, the patterns are similar; WEU and other Western migrants had the highest purchasing power, followed by the native population and non-Western migrants. EEU and non-EU European migrants paid similar amounts. MENAP migrants paid the least and, together with EEU migrants, showed the smallest percentage change over the decade.
The stacked bar charts in Figure 3 display the shares of migrants and house prices by migrant group. On the horizontal axis, the graphs show the migrant groups, and each bar is scaled to 100% for the total population of the group. The colours illustrate the house price categories. The price ranges are measured in thousands €/m2 and DKK/m2 in Amsterdam and Copenhagen, respectively. For reference, the average sales price of existing residential property in 2021 was 3250 €/m2 in the Netherlands and 20,180 DKK/m2 in Denmark [82].
The charts illustrate that large proportions of all groups moved upwards at least one price category. In both cases, most natives and migrants lived in areas with house prices in the two lowest price intervals in 2008 (<6000 €/m2 and <25,000 DKK/m2). On the contrary, in Copenhagen in 2018 the vast majority of the population belonged to the three highest intervals. The shares of the population paying more than 35,000 DKK/m2 increased exponentially (purple and light blue bars in Figure 3c,d). In 2018, half of the Western and WEU migrants lived in areas with house prices between 35,000 and 45,000 DKK/m2, and 10% of them in even more expensive areas. Most migrants from MENAP and non-EU European countries lived in the areas with the lowest house prices.
In Amsterdam, the changes were not as radical, but variations are observed among the groups. Two-thirds of migrants from the former colonies, the MEA region, Turkey and Morocco would still pay less than 6000 €/m2 in 2018, while at the top of the scale, most natives and migrants from the WEU and the rest of the world would pay more than 6000 €/m2.
The bivariate maps in Figure 4 and Figure 5 illustrate these patterns and trends spatially. Each figure shows the distribution of migrants with the highest and lowest purchasing power, as reported in Table 4, as a percentage of the total population. The expensive areas are shown in dark brown in central Amsterdam and central and north-eastern Copenhagen. Dark blue cells display areas with high proportions of the corresponding migrant group and low real estate prices. Similarly, light yellow cells show areas with low house prices and low shares of migrants.
Figure 4a,b show that only a small share of MEA migrants lived in the central part of Amsterdam in both 2008 and 2018. The majority of this population lived in Zuidoost—the south-eastern exclave of the municipality with the largest community of migrants from Suriname [83,84] and apparently from MEA—Noord and Nieuw-West. In these regions, house prices remained at the lower end of the scale, while the concentration of MEA migrants is the main determinant of the distribution of migrants from the former colonies and vice versa (Table 4). It might not be clear whether the high concentration of migrants maintained the stable price trend or whether the cheap housing has attracted more migrants from these regions, but their combination does not attract WEU Europeans, who prefer the city centre, where, quaintly, prices are considerably higher (Figure 4c,d).
In Copenhagen, the patterns are much clearer; despite the small number of western migrants, their shares are high in the north-eastern municipalities, where the house prices are substantially higher (Figure 5c,d). These areas are primarily chosen by natives and high-income migrants. Lower shares are located in the municipalities of Copenhagen and Frederiksberg (light yellow), where most WEU migrants are located [85], explaining their high influence on other Westerners in the modelling (Table 3). On the contrary, the majority of MENAP migrants resided in cheaper districts.

5. Discussion

The results of this work highlight the diverse determinants of residential location among people from different ethnic and cultural backgrounds. They underline the complexity behind residential decisions and the fact that no single factor determines the preferences of all groups. Migrants are influenced by a number of factors, but the ethnic composition of the neighbourhood is an important one.
The study introduced a novel methodology using ML techniques and high spatial resolution data to improve our understanding of complex urban dynamics at the local level. We examined how different demographic and socio-economic factors influence the residential distribution of migrants and proposed a data-driven approach to quantify the weight of their importance. Quantifying the weight of neighbourhoods’ ethnic composition is the main contribution of this study. Although the percentages of the weights may be spurious, the methodology highlighted connections between ethnic minorities and captured their relative importance. We have attempted to interpret complex urban phenomena and cultural bonds among migrant groups and historical patterns of migration, and to contribute to the emerging field of explainable artificial intelligence (XAI; [86,87]) by relating the obtained importance scores to the geographical or cultural proximity of the CoOs. A major strength of the study is its detailed geographical scale, using 100 metre grid cells, and its extensive examination of a large number of CoOs that have not been studied before. To date, most studies have focused on broad regions of origin or racial composition and various aggregated administrative units [24,28,55,88].
Potential interpretations based on the above observations and the similarities of the patterns of the two case studies were generalised and placed into the following categories:
  • I. Influences of co-ethnics. High importance scores were found among migrants from the MEA region and the former colonies in Amsterdam, as well as among WEU and other Westerns in Copenhagen.
The findings suggested a relationship between the presence of co-ethnics or minorities with similar cultural backgrounds and the residential patterns of various migrant groups. This observation was evident for migrants originating from regions that are geographically adjacent or historically united (e.g., Ghana–Gold Coast, Yugoslavia–North Macedonia, and Bosnia and Herzegovina–Croatia). These groups may share ethnic identities and display interconnected residential patterns influenced by factors such as the formation of social connections or access to ethnic shops.
Apart from the geographical neighbourhood of the CoOs, the data also suggested patterns of common residential distributions across broader cultural identities in the two cities under study. For example, there were associations in the residential distributions between the WEU and other Western groups, as well as between several Asian and African groups. The MENAP groups also showed a correlation in their residential patterns, which appeared to influence the distribution of other non-Western migrants from Africa and Latin America. These observations may contribute to our understanding of residential segregation and socio-economic inequalities within these communities.
Moreover, the observed association between the presence of Danes in Copenhagen and the residential patterns of Filipinos—a potential result of the significant number of Filipinos working and residing in the city’s more affluent areas [85,89]—may reveal a link between housing pathways and labour market patterns. However, it is crucial to treat these findings with caution and avoid drawing direct causal relationships without further investigation.
  • II. Influences of dwelling type. The residential area was more important for predicting the distribution of natives and WEU migrants; the number of dwellings was more important for non-Western migrants.
It is a truism that housing features such as the number of dwellings and rooms, or the residential area and house prices, play a significant role in population density and in the distribution of different migrant groups. Although we cannot directly explain whether the concentration of certain migrant groups drove house prices or vice versa, the model accounted for variations in building types rather than property values and migrant groups.
Although the residential area and number of dwellings were correlated, the variations in their influence on migrants and natives provided clues to the groups’ locations and may not be coincidental. Large numbers of dwellings, usually associated with large residential areas, indicated small dwelling units, more affordable housing and compact urban development. Conversely, small residential areas with a low number of dwellings per grid cell indicated large residential units, detached houses and more open space. The model related Western migrants and natives—more affluent groups—to the residential area layer, and non-Western migrant groups—less prosperous groups—to the number of dwellings, regardless of the group’s population size. Indeed, Section 4.3 showed that Western migrants lived in expensive areas, taking advantage of a central location in Amsterdam or enjoying spacious dwellings and greener areas in Copenhagen. This allows for careful assumptions to be made about the economic status of the groups, their preferences and their quality of life, even in areas where the economic prospects of the groups are not known.
  • III. Influences of age as an indicator of migrants’ initial settlement. Distribution by age was the most important factor for migrants derived from EEU and the MEA region.
Before or in the early 2000s, large migration waves from Turkey and Morocco arrived in Amsterdam, consisting mainly of young adults. We can safely assume that these groups had families and children of their own by 2018 and indeed maintained high fertility rates [90], thus establishing major groups of non-mobile adults with children. The same can be assumed for migrants from the MENAP region in Copenhagen. More recent waves of migrants have arrived in both cities from EEU and Mediterranean countries. Motivated by unrestricted mobility, financial and educational incentives after their countries joined the EU or after the 2008 financial crisis, these waves consisted of young and mobile adults who migrated to study and work. These observations may be useful in interpreting a link between migration and the period of initial settlement of the corresponding group.
Despite the valuable insights and methodological contributions of this study, some limitations should also be discussed. Our study captured a specific snapshot in time and focused on the distribution of migrants at that moment. This approach provided a deeper understanding of the underlying factors. However, it is essential to examine several time steps to gain a comprehensive overview of whether the data and results are consistent with any of the theories discussed in Section 2. Such an analysis may challenge existing theories and open up possibilities for highlighting unexplored dimensions and developing new conceptual frameworks. It is also important to note that the observed correlations do not necessarily imply direct causality or explicit residential preferences.
The study is also limited in allowing a direct comparison between the two case studies, particularly in relation to house prices. The datasets for these regions differ significantly, affecting the conclusions of the comparisons. In future research, consistent datasets would allow for more accurate cross-study comparisons. Additionally, more detailed data on the age and gender of migrants could help in understanding individual preferences, while combining geographically weighted regression with the RF model or incorporating geographic weights for the variables would improve the analyses. Further quantitative research should also include the role of previous location and social networks in residential location modelling. The visual representation of the distribution of selected migrant groups, along with their key determinants, opens up new potential avenues for research at the local level in these destination cities.

6. Conclusions

The growing population in urban centres and the exacerbation of inequalities and segregation associated with international mobility require better local planning; better urban planning and decision-making require more efficient residential location modelling. Therefore, a deeper understanding of the factors that influence migrants’ residential locations and the reasons that enable them is crucial. This study provided data-driven evidence on the importance of the determinants of migrants’ residential locations. We showed that by using a straightforward modelling approach and a wide range of geospatial data, we can extract rich information about the distribution of migrant groups and the factors that drive their residential distributions.
The study contributed to estimating the impact of cultural background and ethnic diversity on migrants’ residential locations. We ranked which migrant groups are the most influential for other minorities and evaluated the relationship between the distribution of co-ethnics and the cultural and geographical proximity of their CoOs in 2018. We found indicative migrant groups from neighbouring CoOs with mutually high importance scores, with examples from the Balkans in Copenhagen and from Africa and Latin America in Amsterdam. Analysing the distribution of migrant groups by housing prices, we found separate clusters of WEU and non-EU migrants in both cases, with the groups with the financial means living in more desirable areas with higher housing quality and a low concentration of non-Western migrants.
To the best of our knowledge, this is the first study to perform ML-based analysis for residential location modelling using such a variety of detailed geospatial variables. Although RF and VIMs have primarily been adopted in bioinformatics, they can also be used in other research fields, such as geography and demography, to identify features that are significant in improving the explainability of complicated urban phenomena that cannot be explained by simple regression models.
The present work contributed to research in XAI, where we interpreted the decisions of the ML model by confirming existing knowledge about the residential distribution of migrants. The model captured patterns across groups of various sizes, revealing hidden relationships that might otherwise be assumed. It presented a relative association of factors and ordered them, identifying extraneous noise. The results allowed us to infer the economic status of these groups and their quality of life, and potentially find evidence of residential segregation for the year in question. We highlighted the role of diversity and ethnic composition for different minorities and their residential locations, along with their ability to acquire property in the most desirable areas.
Such an approach provides important insights for decision-making and drawing guidelines for inclusive urban development, with safeguards to limit the exploitation of foreign populations. Examples include locating not only safe but also desirable spaces for the initial settlement of new waves of migrants or refugees (e.g., Ukrainians moving to Europe), thus preventing the phenomena of segregation and ghettoisation.

Author Contributions

Conceptualization, Marina Georgati, Henning Sten Hansen and Carsten Keßler; methodology, Marina Georgati, Henning Sten Hansen and Carsten Keßler; software, Marina Georgati; validation, Marina Georgati; formal analysis, Marina Georgati; investigation, Marina Georgati; data curation, Marina Georgati; writing—original draft preparation, Marina Georgati; writing—review and editing, Marina Georgati, Henning Sten Hansen and Carsten Keßler; visualization, Marina Georgati; and supervision, Carsten Keßler and Henning Sten Hansen. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by EU Horizon 2020 Programme grant number 870649.

Data Availability Statement

The supporting computational workflow is available at Figshare (https://figshare.com/s/78580b2609f00216b34c) (https://doi/org/10.6084/m9.figshare.21983807) and can be openly re-used for similar analyses. Several of the used datasets are publicly available.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A. Details on the Case Studies

Table A1. Variations between the case studies areas in 2018.
Table A1. Variations between the case studies areas in 2018.
FeatureAmsterdamCopenhagen
Spatial extent219.5 km2527.9 km2
Cells42,244114,367
Inhabited cells751221,403
Inhabited cells by natives728821,233
Inhabited cells by migrants678216,818
Table A2. Classification of CoO to RoO.
Table A2. Classification of CoO to RoO.
Case StudyGroupCountries
AmsterdamNLDThe Netherlands.
EEUBulgaria, Hungary, Czech Republic, Poland, Roumania, Slovakia, Estonia, Latvia, Lithuania, Croatia and Slovenia.
WEUThe rest of EU and the United Kingdom (UK).
Turkey + MoroccoTurkey and Morocco.
Middle East + AfricaAll countries of Middle East and Africa.
Former ColoniesAruba, Bonaire, Curaçao, Indonesia, Saba, Saint Maarten-Dutch part, Sint Eustatius and Suriname.
Other Europe etc.Non-EU European and European Free Trade Association (EFTA) countries, Asia, Oceania and the Americas.
CopenhagenDNKDenmark.
EEUBulgaria, Hungary, Poland, Roumania, Slovakia, Estonia, Latvia, Lithuania, Croatia, Slovenia and Czechoslovakia.
WEUThe rest EU and the EFTA countries, the UK, the Faroe Islands and Greenland.
Non-EU EuropeYugoslavia, Serbia and Montenegro, and the Soviet Union.
MENAPTurkey, Algeria, Egypt, Libya, Morocco, Tunisia, Iraq, Iran, Bahrain, Israel, Palestine, Jordan, Kuwait, Lebanon, Oman, Qatar, Saudi Arabia, United Arab Emirates, Syria, Yemen and Pakistan.
Other WesternCanada, United States of America (USA), Australia and New Zealand.
Non-WesternAll other countries.
Table A3. Summary statistics of determinants for Amsterdam in 2018, including age groups, building and neighbourhood features, and native/migrant presence by region of origin. Abbreviations: Min.—minimum, Max—maximum, Std.—standard deviation and RoO—region of origin.
Table A3. Summary statistics of determinants for Amsterdam in 2018, including age groups, building and neighbourhood features, and native/migrant presence by region of origin. Abbreviations: Min.—minimum, Max—maximum, Std.—standard deviation and RoO—region of origin.
VariablesMin.Max.MeanStd.Range
Age groups
Children02953.9412.70295
Young adults04943.9315.72494
Mobile adults06085.0416.70608
Non-mobile adults05834.9715.05583
Elderly02362.569.01236
Building features
Not used01850.593.20185
Rented05767.2125.18576
Private01643.1211.06164
Residential area026,681631.351854.1526,681
Number of dwellings057610.3432.39576
Number of rooms0114533981145
Building height089.323.455.5789.32
Building oldness02016782.88958.992016
Real estate prices012,064.05891.342339.1212,064.05
Neighbourhood features
Proximity to restaurants and bars0104614.8675.411046
Proximity to bus stops0291.373.0229
Proximity to culture0420.572.8342
Proximity to supermarkets0641.996.7064
Proximity to a Christian place of worship0313.396.1931
Proximity to a non-Christian place of worship0211.613.7021
Proximity to a Buddhist place of worship020.130.402
Proximity to a Christian place of worship0202.124.0320
Proximity to a Christian place of worship (Catholic)030.200.573
Proximity to a Christian place of worship (Lutheran)010.070.251
Proximity to a Christian place of worship (Protestant)0911.719
Proximity to a Hindu place of worship030.280.723
Proximity to a Jewish place of worship040.280.864
Proximity to a Muslim place of worhsip090.691.559
Proximity to a Sunni Muslim place of worship010.060.241
Proximity to a Sikh place of worship030.170.493
Proximity to schools0262.474.1726
Proximity to train stations070.741.447
Proximity to university040.220.834
RoO
Natives079620.7663.49796
Former Colonies04992.048.42499
EU West01401.596.02140
EU East0680.472.0368
Middle East + Africa03551.376.56355
Turkey + Morocco04342.6513.40434
Other Europe etc03771.907.07377
Table A4. Summary statistics of determinants for Amsterdam in 2018, including the presence of natives/migrants by country of origin. Abbreviations: Min.—minimum, Max.—maximum, Std.—standard deviation and CoO—Country of origin.
Table A4. Summary statistics of determinants for Amsterdam in 2018, including the presence of natives/migrants by country of origin. Abbreviations: Min.—minimum, Max.—maximum, Std.—standard deviation and CoO—Country of origin.
VariablesMin.Max.MeanStd.RangeVariablesMin.Max.MeanStd.Range
CoO
Afghanistan0380.070.6638Kroatia020.000.042
Albania090.010.159Letland040.010.134
Algerije090.030.279Libanon070.020.197
Angola060.010.136Liberia060.010.126
Argentinia060.030.266Libia0110.000.1111
Aruba030.000.053Litouwen060.020.186
Australia0140.050.3414Luxemburg040.000.094
Azerbeidzjan0110.000.0811Malakka050.010.095
Bangladesh080.010.148Maleisia050.010.115
Belgisch-Congo020.000.062Malta030.000.043
Belgia0130.100.5213Marokko03491.689.02349
Bolivia040.010.104Mauritius030.000.053
Bondsrepubliek Duitsland0440.211.0844Mexico070.030.237
Brazilia0170.120.6517Moldavia030.000.063
Brits Oost-Afrika030.000.053Mongolia040.000.064
Brits West-India040.000.074Nederland079620.7663.49796
Brits-Guyana080.020.208Nederlands Nieuw-Guinea040.010.124
Brits-India080.040.288Nederlands-India0290.291.0829
Brits-Somaliland050.000.085Nederlandse Antillen0880.221.2988
Bulgarije0550.110.9955Nepal0100.010.2310
Burma060.000.106Nicaragua030.000.043
Burundi050.000.075Nieuw-Zeeland060.020.196
Cambodja070.000.067Nigeria0360.050.5036
Canada0160.040.3316Noorwegen060.010.166
Canarische Eilanden030.000.053Oekraine0230.010.1623
Ceylon050.010.135Onbekend0380.040.5038
Chili060.020.216Oostenrijk080.030.258
China0670.161.0567Opper-Volta060.000.076
Colombia0100.060.3910Pakistan0380.110.8138
Congo-Kinshasa060.000.076Palestina030.000.073
Congo060.000.086Panama030.000.053
Costa Rica040.000.064Peru070.020.207
Cuba050.010.105Polen0120.130.6712
Cyprus040.000.094Portugal0130.080.4913
Denemarken080.020.218Portugees Oost-Afrika030.000.053
Dominicaanse Republiek0490.060.6249Portugees West-Afrika040.000.064
Duitse Democratische Republiek060.010.166Puerto Rico030.000.043
Duitsland0150.150.6315Rhodesia030.000.043
Ecuador0300.020.2930Roemenia0170.090.5617
Egypte0360.130.8836Rusland (oud)020.000.052
El Salvador040.000.054Rusland0160.010.1616
Eritrea0230.020.3423Rwanda050.000.065
Estland040.010.104Saoedi-Arabia060.010.116
Ethiopia0260.060.5126Senegal070.010.127
Filipijnen0550.060.5455Siam020.000.052
Finland070.020.227Sierra Leone090.010.169
Frankrijk0180.170.8518Singapore070.010.157
Frans West-Afrika050.000.065Slovenia030.000.043
Frans-Guyana040.000.084Slowakije050.000.065
Gambia050.000.095Soedan0130.020.2313
Ghana01040.212.22104Somalia0200.030.3920
Goudkust0390.080.8239Sovjet-Unie0300.190.9430
Griekenland0230.060.4523Spanje0260.140.7126
Groot-Brittannia0350.281.2935Sri Lanka050.000.095
Guadeloupe030.000.043Suriname03921.396.61392
Guatemala030.000.063Syria0430.090.7943
Guinee080.010.148Taiwan070.010.167
Guyana060.010.116Tanzania040.000.064
Haiti050.000.075Thailand0100.030.2610
Honduras030.000.053Tibet060.010.146
Hongarije0100.050.3510Togo090.010.149
Hongkong0140.050.3614Trinidad en Tobago060.000.086
Ierland080.040.298Tsjechia050.000.065
IJsland050.000.075Tsjecho-Slowakije070.040.297
India03260.171.98326Tunesia0100.020.2610
Indonesia0170.120.5917Turkije01190.975.03119
Irak0220.070.5322Uganda0140.010.1914
Iran0470.090.5347Uruguay040.010.104
Israal0110.050.3711Venezuela070.020.207
Italiaans-Somaliland030.000.063Verenigde Arabische Emiraten030.000.063
Italia0280.211.0228Verenigde Staten van Amerika0630.211.1863
Ivoorkust090.010.129Vietnam060.020.206
Jamaica060.000.096Zambia040.000.064
Japan0260.040.4226Zaire060.000.106
Joegoslavia0150.150.7415Zimbabwe050.000.055
Jordania070.000.117Zuid-Afrika0180.050.3618
Kaapverdia070.010.177Zuid-Korea0100.010.1810
Kameroen070.010.127Zuid-Rhodesia040.000.054
Kenya060.010.116Zweden0100.030.2710
Koeweit050.000.085Zwitserland080.030.248
Korea060.000.096

References

  1. Pagliara, F.; Wilson, A. The State-of-the-Art in Building Residential Location Models. In Residential Location Choice; Springer: Berlin/Heidelberg, Germany, 2010; pp. 1–20. [Google Scholar] [CrossRef]
  2. Zolfaghari, A.; Sivakumar, A.; Polak, J.W. Choice set pruning in residential location choice modelling: A comparison of sampling and choice set generation approaches in greater London. Transp. Plan. Technol. 2012, 35, 87–106. [Google Scholar] [CrossRef]
  3. Schirmer, P.M.; Eggermond, M.A.B.V.; Axhausen, K.W. The role of location in residential location choice models. J. Transp. Land Use 2014, 7, 3–21. [Google Scholar] [CrossRef]
  4. Walker, J.L.; Li, J. Latent lifestyle preferences and household location decisions. J. Geogr. Syst. 2007, 9, 77–101. [Google Scholar] [CrossRef]
  5. Zondag, B.; Pieters, M. Influence of Accessibility on Residential Location Choice. Transp. Res. Rec. 2005, 1902, 63–70. [Google Scholar] [CrossRef]
  6. Chen, J.; Chen, C.; Timmermans, H.J.P. Accessibility trade-offs in household residential location decisions. Transp. Res. Rec. 2008, 2077, 71–79. [Google Scholar] [CrossRef]
  7. Mulder, C.H. The family context and residential choice: A challenge for new research. Popul. Space Place 2007, 13, 265–278. [Google Scholar] [CrossRef]
  8. Mulder, C.H.; Cooke, T.J. Family ties and residential locations. Popul. Space Place 2009, 15, 299–304. [Google Scholar] [CrossRef]
  9. Axhausen, K.W.; Scott, D.M.; König, A.; Jürgens, C. Locations, commitments and activity spaces. In Human Behaviour and Traffic Networks; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar] [CrossRef]
  10. Pinjari, A.R.; Bhat, C.R.; Hensher, D.A. Residential self-selection effects in an activity time-use behavior model. Transp. Res. Part B Methodol. 2009, 43, 729–748. [Google Scholar] [CrossRef]
  11. Pinjari, A.R.; Pendyala, R.M.; Bhat, C.R.; Waddell, P.A. Modeling the choice continuum: An integrated model of residential location, auto ownership, bicycle ownership, and commute tour mode choice decisions. Transportation 2011, 38, 933–958. [Google Scholar] [CrossRef]
  12. de Palma, A.; Motamedi, K.; Picard, N.; Waddell, P. A Model of Residential Location Choice with Endogenous Housing Prices and Traffic for the Paris Region. Eur. Transp. 2005, 31, 67–82. [Google Scholar]
  13. Muhammad, A.H.; Miller, E.J. Reference-Dependent Residential Location Choice Model within a Relocation Context. Transp. Res. Rec. 2009, 2133, 92–99. [Google Scholar]
  14. Beckers, P.; Boschman, S. Residential choices of foreign highly skilled workers in the Netherlands and the role of neighbourhood and urban regional characteristics. Urban Stud. 2019, 56, 760–777. [Google Scholar] [CrossRef]
  15. Frenkel, A.; Bendit, E.; Kaplan, S. Residential location choice of knowledge-workers: The role of amenities, workplace and lifestyle. Cities 2013, 35, 33–41. [Google Scholar] [CrossRef]
  16. Florida, R. The Rise of the Creative Class: And How It’s Transforming Work, Leisure, Community and Everyday Life; BasicBooks: New York, NY, USA, 2002. [Google Scholar]
  17. Dimou, M.; Ettouati, S.; Schaffar, A. From dusk till dawn: The residential mobility and location preferences of immigrants in France. Ann. Reg. Sci. 2020, 65, 253–280. [Google Scholar] [CrossRef]
  18. Richardson, K. Attracting and Retaining Foreign Highly Skilled Staff in Times of Global Crisis: A Case Study of Vancouver, British Columbia’s Biotechnology Sector. Popul. Space Place 2016, 22, 428–440. [Google Scholar] [CrossRef]
  19. International Organization for Migration IOM. World Migration Report 2022. 2021. Available online: https://publications.iom.int/books/world-migration-report-2022 (accessed on 23 July 2023).
  20. World Bank. Urban Development. 2023. Available online: https://www.worldbank.org/en/topic/urbandevelopment/overview (accessed on 23 July 2023).
  21. United Nations Human Settlements Programme UN-Habitat. World Cities Report 2022: Envisaging the Future of Cities; Technical Report; United Nations Human Settlements Programme: Nairobi, Kenya, 2022. [Google Scholar]
  22. Aslund, O. Now and forever? Initial and subsequent location choices of immigrants. Reg. Sci. Urban Econ. 2005, 35, 141–165. [Google Scholar] [CrossRef]
  23. Perez, L.; Dragicevic, S.; Gaudreau, J. A geospatial agent-based model of the spatial urban dynamics of immigrant population: A study of the island of Montreal, Canada. PLoS ONE 2019, 14, e0219188. [Google Scholar] [CrossRef] [PubMed]
  24. Søholt, S.; Lynnebakke, B. Do Immigrants’ Preferences for Neighbourhood Qualities Contribute to Segregation? The Case of Oslo. J. Ethn. Migr. Stud. 2015, 41, 2314–2335. [Google Scholar] [CrossRef]
  25. Ibraimovic, T.; Hess, S. A latent class model of residential choice behaviour and ethnic segregation preferences. Hous. Stud. 2018, 33, 544–564. [Google Scholar] [CrossRef]
  26. Robinson, D.; Reeve, K.; Casey, R. The Housing Pathways of New Immigrants; Joseph Rowntree Foundation: London, UK, 2007. [Google Scholar]
  27. Wang, C.; Li, C.; Wang, M.; Yang, S. Transnational migrants in Shanghai: Residential spatial patterns and the underlying driving forces. Popul. Space Place 2020, 26, e2272. [Google Scholar] [CrossRef]
  28. Andersen, H.S. Spatial assimilation? The development in immigrants’ residential career with duration of stay in Denmark. J. Hous. Built Environ. 2016, 31, 297–320. [Google Scholar] [CrossRef]
  29. Chihaya, G.K.; Marcińczak, S.; Strömgren, M.; Lindgren, U.; Tammaru, T. Trajectories of Spatial Assimilation or Place Stratification? A Typology of Residence and Workplace Histories of Newly Arrived Migrants in Sweden. Int. Migr. Rev. 2022, 56, 433–462. [Google Scholar] [CrossRef]
  30. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  31. Hapfelmeier, A.; Ulm, K. A new variable selection approach using Random Forests. Comput. Stat. Data Anal. 2013, 60, 50–69. [Google Scholar] [CrossRef]
  32. Couronné, R.; Probst, P.; Boulesteix, A.L. Random forest versus logistic regression: A large-scale benchmark experiment. BMC Bioinform. 2018, 19, 270. [Google Scholar] [CrossRef]
  33. Boulesteix, A.L.; Janitza, S.; Kruppa, J.; König, I.R. Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2012, 2, 493–507. [Google Scholar] [CrossRef]
  34. Voges, L.F.; Jarren, L.C.; Seifert, S. Exploitation of surrogate variables in random forests for unbiased analysis of mutual impact and importance of features. Bioinformatics 2023, 39, btad471. [Google Scholar] [CrossRef]
  35. Stevens, F.R.; Gaughan, A.E.; Linard, C.; Tatem, A.J. Disaggregating Census Data for Population Mapping Using Random Forests with Remotely-Sensed and Ancillary Data. PLoS ONE 2015, 10, e0107042. [Google Scholar] [CrossRef] [PubMed]
  36. Georgati, M.; Monteiro, J.; Martins, B.; Keßler, C. Spatial Disaggregation of Population Subgroups Leveraging Self-Trained Multi-Output Gradient Boosting Regression Trees. AGILE GISci. Ser. 2022, 3, 5. [Google Scholar] [CrossRef]
  37. Luo, Y.; Yan, J.; McClure, S.C.; Li, F. Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model. Environ. Sci. Pollut. Res. Int. 2022, 29, 33205–33217. [Google Scholar] [CrossRef]
  38. Kopeć, A.; Trybała, P.; Głąbicki, D.; Buczyńska, A.; Owczarz, K.; Bugajska, N.; Kozińska, P.; Chojwa, M.; Gattner, A. Application of remote sensing, gis and machine learning with geographically weighted regression in assessing the impact of hard coal mining on the natural environment. Sustainability 2020, 12, 9338. [Google Scholar] [CrossRef]
  39. Cosacov, N. Residential Choices. In The Wiley Blackwell Encyclopedia of Urban and Regional Studies; John Wiley & Sons: Hoboken, NJ, USA, 2019; pp. 1–5. [Google Scholar] [CrossRef]
  40. Hunt, L.M.; Boots, B.; Kanaroglou, P.S. Spatial choice modelling: New opportunities to incorporate space into substitution patterns. Prog. Hum. Geogr. 2004, 28, 746–766. [Google Scholar] [CrossRef]
  41. Pagliara, F.; Simmonds, D.; Preston, J. Residential Location Choice: Models and Applications; Springer: Berlin/Heidelberg, Germany, 2010. [Google Scholar] [CrossRef]
  42. Wen, T. Spatial Assimilation. In The Wiley Blackwell Encyclopedia of Urban and Regional Studies; John Wiley & Sons: Hoboken, NJ, USA, 2019. [Google Scholar] [CrossRef]
  43. Massey, D.S.; Mullan, B.P. Processes of Hispanic and Black Spatial Assimilation. Am. J. Sociol. 1984, 89, 836–873. [Google Scholar] [CrossRef]
  44. Massey, D.S.; Denton, N.A. Spatial Assimilation as a Socioeconomic Outcome. Am. Sociol. Rev. 1985, 50, 94–106. [Google Scholar] [CrossRef]
  45. Alba, R.D.; Logan, J. Minority proximity to whites in suburbs: An individual-level analysis of segregation. Am. J. Sociol. 1993, 98, 1388–1427. [Google Scholar] [CrossRef]
  46. Charles, C.Z. The Dynamics of Racial Residential Segregation. Annu. Rev. Sociol. 2003, 29, 167–207. [Google Scholar] [CrossRef]
  47. Vogiazides, L. Exiting distressed neighbourhoods: The timing of spatial assimilation among international migrants in Sweden. Popul. Space Place 2018, 24, e2169. [Google Scholar] [CrossRef]
  48. Thomas, R. Viewing immigrants’ neighbourhood and housing choices through the lens of community resilience. Sapiens 2013, 6. [Google Scholar]
  49. Burgess, E.W. The Growth of the City: An Introduction to a Research Project. In Urban Ecology; Springer: Boston, MA, USA, 2008; pp. 71–78. [Google Scholar] [CrossRef]
  50. Zorlu, A.; Mulder, C.H. Initial and Subsequent Location Choices of Immigrants to the Netherlands. Reg. Stud. 2008, 42, 245–264. [Google Scholar] [CrossRef]
  51. Bolt, G.; van Kempen, R. Ethnic Segregation and Residential Mobility: Relocations of Minority Ethnic Groups in the Netherlands. J. Ethn. Migr. Stud. 2010, 36, 333–354. [Google Scholar] [CrossRef]
  52. Hedman, L. Moving Near Family? The Influence of Extended Family on Neighbourhood Choice in an Intra-urban Context. Popul. Space Place 2013, 19, 32–45. [Google Scholar] [CrossRef]
  53. Coenen, A.; Verhaeghe, P.; de Putte, B.V. Ethnic residential segregation: A matter of ethnic minority household characteristics? Popul. Space Place 2019, 25, e2244. [Google Scholar] [CrossRef]
  54. Pais, J.; South, S.J.; Crowder, K. Metropolitan Heterogeneity and Minority Neighborhood Attainment. Soc. Probl. 2012, 59, 258–281. [Google Scholar] [CrossRef] [PubMed]
  55. Martori, J.C.; Madariaga, R. Residential segregation by nationalities: A global and multilevel approach to Barcelona and Madrid (2008–2018). Popul. Space Place 2023, 29, e2655. [Google Scholar] [CrossRef]
  56. Catney, G.; Lloyd, C.D. Population Grids for Analysing Long-Term Change in Ethnic Diversity and Segregation. Spat. Demogr. 2020, 8, 215–249. [Google Scholar] [CrossRef]
  57. Malmberg, B.; Andersson, E.K.; Nielsen, M.M.; Haandrikman, K. Residential Segregation of European and Non-European Migrants in Sweden: 1990–2012. Eur. J. Popul. 2018, 34, 169–193. [Google Scholar] [CrossRef]
  58. Sleutjes, B.; de Valk, H.A.G.; Ooijevaar, J. The Measurement of Ethnic Segregation in the Netherlands: Differences between Administrative and Individualized Neighbourhoods. Eur. J. Popul. 2018, 34, 195–224. [Google Scholar] [CrossRef] [PubMed]
  59. Imeraj, L.; Willaert, D.; de Valk, H.A.G. A comparative approach towards ethnic segregation patterns in Belgian cities using multiscalar individualized neighborhoods. Urban Geogr. 2018, 39, 1221–1246. [Google Scholar] [CrossRef]
  60. Rogne, A.F.; Andersson, E.K.; Malmberg, B.; Lyngstad, T.H. Neighbourhood Concentration and Representation of Non-European Migrants: New Results from Norway. Eur. J. Popul. 2020, 36, 71–83. [Google Scholar] [CrossRef] [PubMed]
  61. Openshaw, S. The Modifiable Areal Unit Problem; CATMOG, 38; Geo: Norwich, UK, 1980. [Google Scholar]
  62. Andersson, E.K.; Lyngstad, T.H.; Sleutjes, B. Comparing Patterns of Segregation in North-Western Europe: A Multiscalar Approach. Eur. J. Popul. 2018, 34, 151–168. [Google Scholar] [CrossRef] [PubMed]
  63. Jones, K.; Johnston, R.; Manley, D.; Owen, D.; Charlton, C. Ethnic Residential Segregation: A Multilevel, Multigroup, Multiscale Approach Exemplified by London in 2011. Demography 2015, 52, 1995–2019. [Google Scholar] [CrossRef] [PubMed]
  64. OpenStreetMap Contributors. 2022. Available online: http://download.geofabrik.de/europe.html (accessed on 3 February 2022).
  65. Bygnings-og Boligregistret (BBR). Forside—BBR Teknik. 2022. Available online: https://teknik.bbr.dk/forside (accessed on 28 April 2021).
  66. PDOK. PDOK—3D Downloads. 2022. Available online: https://3d.kadaster.nl/basisvoorziening-3d/ (accessed on 28 April 2021).
  67. Gemeente Amsterdam. Maps Amsterdam. Available online: https://maps.amsterdam.nl/open_geodata/?LANG=en (accessed on 3 February 2022).
  68. Movia Trafik. Movia. 2022. Available online: https://www.moviatrafik.dk/ (accessed on 28 April 2021).
  69. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning Data Mining, Inference, and Prediction, 2nd ed.; Springer: New York, NY, USA, 2009. [Google Scholar] [CrossRef]
  70. Géron, A. Hands-on Machine Learning with Scikit-Learn and TensorFlow; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2009. [Google Scholar]
  71. Liaw, A.; Wiener, M. Classification and regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
  72. Nembrini, S.; König, I.R.; Wright, M.N. The revival of the Gini importance? Bioinformatics 2018, 34, 3711–3718. [Google Scholar] [CrossRef]
  73. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  74. Nicodemus, K.K. Letter to the Editor: On the stability and ranking of predictors from random forest variable importance measures. Brief. Bioinform. 2011, 12, 369–373. [Google Scholar] [CrossRef]
  75. Calle, M.L.; Urrea, V. Letter to the Editor: Stability of Random Forest importance measures. Brief. Bioinform. 2011, 12, 86–89. [Google Scholar] [CrossRef]
  76. Matplotlib—Visualization with Python. 2023. Available online: https://matplotlib.org/ (accessed on 5 September 2023).
  77. Seaborn: Statistical Data Visualization. 2023. Available online: https://seaborn.pydata.org/ (accessed on 5 September 2023).
  78. Andersen, H.S.; Andersson, R.; Wessel, T.; Vilkama, K. The impact of housing policies and housing markets on ethnic spatial segregation: Comparing the capital cities of four Nordic welfare states. Eur. J. Hous. Policy 2016, 16, 1–30. [Google Scholar] [CrossRef]
  79. Andersson, E.K.; Malmberg, B.; Costa, R.; Sleutjes, B.; Stonawski, M.J.; de Valk, H.A.G. A Comparative Study of Segregation Patterns in Belgium, Denmark, the Netherlands and Sweden: Neighbourhood Concentration and Representation of Non-European Migrants. Eur. J. Popul. 2018, 34, 251–275. [Google Scholar] [CrossRef]
  80. Arbaci, S. Ethnic segregation, housing systems and welfare regimes in Europe. Eur. J. Hous. Policy 2007, 7, 401–433. [Google Scholar] [CrossRef]
  81. Tammaru, T.; Marcińczak, S.; Aunap, R.; van Ham, M. Inequalities and Segregation across the Long-Term Economic Cycle: An Analysis of South and North European Cities; IDEAS Working Paper Series from RePEc; IZA—Institute of Labor Economics: Bonn, Germany, 2007. [Google Scholar]
  82. Deloitte. Average Sales Price of New and Existing Residential Property in Europe in 2021, per Country (In Euros Per Square Meter). 2022. Available online: https://www.statista.com/statistics/722905/average-residential-square-meter-prices-in-eu-28-per-country/ (accessed on 5 September 2023).
  83. Kveladze, I.; Georgati, M.; Kessler, C.; Hansen, H.S. Analytics of historical human migration patterns: Use cases of Amsterdam and Copenhagen. J. Locat. Based Serv. 2023. [Google Scholar] [CrossRef]
  84. Gemeente Amsterdam, Bureau Onderzoek en Statistiek. Stadsdelen in Cijfers 2013; Technical Report; Gemeente Amsterdam, Bureau Onderzoek en Statistiek: Amsterdam, The Netherlands, 2013.
  85. Georgati, M.; Keßler, C. Spatially Explicit Population Projections: The case of Copenhagen, Denmark. AGILE GISci. Ser. 2021, 2, 1–6. [Google Scholar] [CrossRef]
  86. Lent, M.V.; Fisher, W.; Mancuso, M. An explainable artificial intelligence system for small-unit tactical behavior. In Proceedings of the National Conference on Artificial Intelligence, San Jose, CA, USA, 25–29 July 2004. [Google Scholar]
  87. Carvalho, D.V.; Pereira, E.M.; Cardoso, J.S. Machine learning interpretability: A survey on methods and metrics. Electronics 2019, 8, 832. [Google Scholar] [CrossRef]
  88. Erickson, B.O.; Esser, D.E.; Turner, K.L. Spatial Dynamics of Racialized Income Inequality in the Washington, DC, Metropolitan Area. Prof. Geogr. 2021, 73, 72–84. [Google Scholar] [CrossRef]
  89. Andersen, N.T. Organizing against all Odds: Filipina chambermaids in Copenhagen, 1970s–1990s. Women’s Hist. Rev. 2021, 30, 241–255. [Google Scholar] [CrossRef]
  90. Garssen, J.; Nicolaas, H. Fertility of Turkish and Moroccan women in the Netherlands. Demogr. Res. 2008, 19, 1249–1280. [Google Scholar] [CrossRef]
Figure 1. Feature significance by CoO and proportional population size in Amsterdam (>7%). The figure shows the influence of each source-CoO on the horizontal axis on the target-country on the vertical axis in relation to the ratio of their population sizes. The size and colour of the bubble indicate the value of the influence and the relative size of the two populations. We notice that i. the most influential pairs are migrants from Ghana–Gold Coast, Turkey–Morocco, Suriname–Netherlands Antilles, and ii. larger migrant groups from Western and EU countries have a large influence on smaller ones from the same regions.
Figure 1. Feature significance by CoO and proportional population size in Amsterdam (>7%). The figure shows the influence of each source-CoO on the horizontal axis on the target-country on the vertical axis in relation to the ratio of their population sizes. The size and colour of the bubble indicate the value of the influence and the relative size of the two populations. We notice that i. the most influential pairs are migrants from Ghana–Gold Coast, Turkey–Morocco, Suriname–Netherlands Antilles, and ii. larger migrant groups from Western and EU countries have a large influence on smaller ones from the same regions.
Ijgi 12 00460 g001
Figure 2. Feature significance by CoO and proportional population size in Copenhagen (>7%). As in the previous figure, we notice that i. the most influential pairs are migrants from Yugoslavia–North Macedonia, Iran–Lebanon, Turkey–Pakistan, and ii. migrants from the MENAP region have a high influence on migrants from other MENAP and non-Western CoO.
Figure 2. Feature significance by CoO and proportional population size in Copenhagen (>7%). As in the previous figure, we notice that i. the most influential pairs are migrants from Yugoslavia–North Macedonia, Iran–Lebanon, Turkey–Pakistan, and ii. migrants from the MENAP region have a high influence on migrants from other MENAP and non-Western CoO.
Ijgi 12 00460 g002
Figure 3. Shares of migrants and house prices by migrant group in Amsterdam (top) and in Copenhagen (bottom) in 2008 (left) and 2018 (right). The graphs show the share of migrants in the corresponding group residing in areas with different ranges of house prices, indicated by the different colours. Prices are given in €/m2 for Amsterdam and in DKK/m2 for Copenhagen.
Figure 3. Shares of migrants and house prices by migrant group in Amsterdam (top) and in Copenhagen (bottom) in 2008 (left) and 2018 (right). The graphs show the share of migrants in the corresponding group residing in areas with different ranges of house prices, indicated by the different colours. Prices are given in €/m2 for Amsterdam and in DKK/m2 for Copenhagen.
Ijgi 12 00460 g003aIjgi 12 00460 g003b
Figure 4. Distribution of migrants from MEA (top) and WEU countries (bottom) in relation to average house prices in Amsterdam in 2008 (left) and 2018 (right). (a) Migrants from MEA countries and average house prices in 2008. (b) Migrants from MEA countries and average house prices in 2018. (c) Migrants from WEU countries and average house prices in 2008. (d) Migrants from WEU countries and average house prices in 2018.
Figure 4. Distribution of migrants from MEA (top) and WEU countries (bottom) in relation to average house prices in Amsterdam in 2008 (left) and 2018 (right). (a) Migrants from MEA countries and average house prices in 2008. (b) Migrants from MEA countries and average house prices in 2018. (c) Migrants from WEU countries and average house prices in 2008. (d) Migrants from WEU countries and average house prices in 2018.
Ijgi 12 00460 g004aIjgi 12 00460 g004b
Figure 5. Distribution of migrants from MENAP (top) and other Western countries (bottom) in relation to average house prices at district level in Copenhagen in 2008 (left) and 2018 (right). (a) Migrants from MENAP countries and average house prices in 2008. (b) Migrants from MENAP countries and average house prices in 2018. (c) Migrants from other Western countries and average house prices in 2008. (d) Migrants from other Western countries and average house prices in 2018.
Figure 5. Distribution of migrants from MENAP (top) and other Western countries (bottom) in relation to average house prices at district level in Copenhagen in 2008 (left) and 2018 (right). (a) Migrants from MENAP countries and average house prices in 2008. (b) Migrants from MENAP countries and average house prices in 2018. (c) Migrants from other Western countries and average house prices in 2008. (d) Migrants from other Western countries and average house prices in 2018.
Ijgi 12 00460 g005
Table 1. The selected determinants with their data sources and description of the pre-processing. The data are provided for 2018, with real estate prices also provided for 2008.
Table 1. The selected determinants with their data sources and description of the pre-processing. The data are provided for 2018, with real estate prices also provided for 2008.
DeterminantsDescriptionData Source
Demographic attributes
AgeThe total population divided into five groups: children, students, mobile adults, non-mobile adults and the elderly.Amsterdam: OIS, Copenhagen: DST
MarriagesThe number of marriages.Copenhagen: DST
Educational attainmentThe share of people with tertiary education.Copenhagen: DST
High and low incomeThe population in the highest and lowest decile of the yearly equivalent disposable income.Copenhagen: DST
Building attributes
Building height and construction yearThe average height and year of construction of the buildings in the corresponding grid cell.Amsterdam: PDOK [66], Copenhagen: Bygnings-og Boligregistret (BBR) [65]
Dimensions, usage and ownership statusThe total number of residential dwellings and rooms, the size of residential area in m2, the share of unoccupied or rented units and the share of private or public year-round dwellings.Amsterdam: OIS, Copenhagen: DST
Neighbourhood attributes
Real estate pricesFor Copenhagen, the layer represents the average purchase price per square metre of the dwellings sold. For Amsterdam, the property value map is the output of an interpolation process disaggregated to grid cells from the neighbourhood level, taking the average value in the provided range.Amsterdam: Gemeente Amsterdam [67], Copenhagen: Bygnings-og Boligregistret (BBR) [65]
Proximity to bus stops, train stations, schools, universities, leisure and cultural facilities (e.g., restaurants, cinemas), and places of worship by religionThe total number of accessible services/
facilities within walking or cycling distance.
Amsterdam: OpenStreetMap contributors [64], Copenhagen: OpenStreetMap contributors [64], Bygnings-og Boligregistret (BBR) [65], Movia Trafik [68]
Table 2. Importance of determinants of migrants’ residential locations by RoO in percent ratio in Amsterdam.
Table 2. Importance of determinants of migrants’ residential locations by RoO in percent ratio in Amsterdam.
DeterminantsNativesFormer ColoniesEU WestEU EastMiddle East
+ Africa
Turkey + MoroccoRest
Age groups4.1112.639.1978.3124.8569.6122.44
Children0.251.600.350.9611.0066.121.90
Young adults0.300.756.3551.598.181.7211.91
Mobile adults0.351.512.3424.691.350.188.23
Non-mobile adults3.128.450.090.313.770.960.30
Elderly0.090.320.060.760.550.630.10
Building features92.6310.1768.025.927.257.0262.22
Residential area72.571.0661.880.410.500.1745.89
Number of dwellings4.584.343.621.250.800.1014.34
Number of rooms15.133.560.150.830.481.230.34
Building height0.010.010.150.902.150.030.63
Building oldness0.240.310.250.430.550.260.54
Not used0.000.060.470.490.090.010.07
Rented0.010.830.220.271.614.840.23
Private0.090.001.281.341.070.380.18
Neighbourhood features0.8711.372.843.781.8515.380.67
Real estate prices0.651.250.160.880.200.110.02
Prox. to supermarkets0.010.020.120.170.130.400.13
Prox. to restaurants and bars0.020.080.490.470.180.500.18
Prox. to cultural spaces0.000.010.020.030.030.020.04
Prox. to bus stops0.000.000.030.200.210.080.06
Prox. to train stations0.025.030.020.440.0711.280.02
Prox. to schools0.030.110.540.660.610.440.10
Prox. to university0.000.000.220.020.000.000.02
Prox. to a place of Christian worship0.040.031.190.200.080.160.06
Prox. to a place of non-Christian worship0.104.840.050.710.342.390.04
Presence of natives/migrant groups2.3965.8319.9611.9866.17.9514.67
Natives 0.448.601.051.476.730.48
Former Colonies0.45 0.260.1461.400.630.21
EU West1.060.09 3.140.970.1013.19
EU East0.160.050.63 0.180.050.36
Middle East + Africa0.4062.150.050.16 0.300.24
Turkey + Morocco0.212.060.091.871.44 0.19
Other Europe etc0.111.0410.335.620.640.14
R-squared0.9380.8040.8250.5260.7360.7920.663
The italicised text indicates the classes of factors, such as age groups, building/neighbourhood characteristics and the presence of native/migrant groups, summarising their importance and highlighting the two highest scores per class in bold.
Table 3. Importance of determinants of migrants’ residential locations by RoO in percent ratio in Copenhagen.
Table 3. Importance of determinants of migrants’ residential locations by RoO in percent ratio in Copenhagen.
DeterminantsNativesEEUWEUMENAPNon-EU EuropeOther Non-WesternOther Western
Age groups1.3161.7927.9252.0555.2547.696.17
Children0.250.561.7646.058.572.801.09
Young adults0.3242.8415.913.675.557.072.30
Mobile adults0.3018.007.450.7840.6135.401.05
Non-mobile adults0.440.262.721.540.462.391.28
Elderly0.000.130.080.010.060.030.45
Social features0.453.5711.425.64.225.876.23
Marriages0.110.141.440.520.162.481.05
Higher education0.140.164.7518.580.570.782.67
High income0.010.110.780.430.350.011.00
Low income0.193.164.436.073.1222.601.51
Building features96.422.843.851.594.740.69
Residential area66.550.1342.470.100.340.042.69
Number of dwellings0.751.340.220.071.500.211.14
Number of rooms28.980.080.340.011.440.031.36
Number of floors0.000.040.030.070.160.060.35
Building oldness0.010.570.380.950.370.090.80
Not used0.100.170.160.070.080.051.36
Rented0.000.210.140.120.490.080.50
Private0.030.260.100.200.360.040.64
Public0.000.000.010.000.000.000.16
Neighbourhood features0.024.981.070.612.631.028.65
Real estate prices0.000.050.020.070.030.010.29
Prox. to supermarkets0.001.030.120.040.080.040.65
Prox. to bus stops0.000.030.040.020.130.010.91
Prox. to train stations0.000.600.070.050.980.060.53
Prox. to cultural spaces0.000.670.210.060.200.180.28
Prox. to schools0.000.980.090.130.460.140.65
Prox. to places of leisure0.000.150.060.090.400.170.27
Prox. to bars and restaurants0.020.250.380.090.150.134.56
Prox. to a place of Christian worship0.001.180.050.050.180.280.48
Prox. to a place of non-Christian worship0.000.040.030.010.020.000.03
Presence of natives/migrant groups1.826.8415.7620.1533.1724.8369.94
DNK 1.392.041.760.670.772.14
EEU0.26 3.380.560.945.320.60
WEU0.312.30 0.610.883.4463.93
MENAP0.480.282.06 25.4714.520.14
Non-EU Europe0.300.772.070.91 0.660.61
Other Non-Western0.2622.030.9816.215.12 2.52
Other Western0.190.075.230.100.090.12
R-squared0.9610.5170.6820.7850.4570.6660.552
The italicised text indicates the classes of factors, such as age groups, building/neighbourhood characteristics and the presence of native/migrant groups, summarising their importance and highlighting the two highest scores per class in bold.
Table 4. Total purchase prices by migrant group in 2008 and 2018 in Amsterdam and Copenhagen.
Table 4. Total purchase prices by migrant group in 2008 and 2018 in Amsterdam and Copenhagen.
CityRoOPrice per Square MeterDifferencePercentage
20082018Change
Amsterdam (€/m2)NLD3999.156198.262199.1154.99
Former Colonies3087.104687.161600.0651.83
EU West4292.566521.692229.1351.93
EU East3875.815501.211625.4041.94
Middle East + Africa3025.404728.881703.4556.31
Turkey + Morocco3112.104857.101745.0056.07
Other Europe etc3801.015898.652097.6455.19
Copenhagen (DKK/m2)DNK23,963.2433,144.639181.3938.31
EEU23,895.3331,706.377811.0432.69
WEU25,494.2136,254.4910,760.2842.21
MENAP21,687.9429,275.377587.4334.98
Non-EU Europe22,473.4831,272.988799.5039.16
Other Non-Western23,595.7332,188.458592.7236.42
Other Western26,554.5637,619.5111,064.9541.67
The bold indicates the two highest values per column.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Georgati, M.; Hansen, H.S.; Keßler, C. Random Forest Variable Importance Measures for Spatial Dynamics: Case Studies from Urban Demography. ISPRS Int. J. Geo-Inf. 2023, 12, 460. https://doi.org/10.3390/ijgi12110460

AMA Style

Georgati M, Hansen HS, Keßler C. Random Forest Variable Importance Measures for Spatial Dynamics: Case Studies from Urban Demography. ISPRS International Journal of Geo-Information. 2023; 12(11):460. https://doi.org/10.3390/ijgi12110460

Chicago/Turabian Style

Georgati, Marina, Henning Sten Hansen, and Carsten Keßler. 2023. "Random Forest Variable Importance Measures for Spatial Dynamics: Case Studies from Urban Demography" ISPRS International Journal of Geo-Information 12, no. 11: 460. https://doi.org/10.3390/ijgi12110460

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop