1. Introduction
Island energy systems are often seen as test beds for novel technological solutions [
1], which in turn allows for the early evaluation of relevant merits and challenges in view of their application at a greater scale. A similar course has been followed in the case of wind energy [
2], with many island regions determined by high-quality wind potential nowadays exhibiting significant wind energy capacity. This, however, may cause stresses in local energy systems, especially in islands where the variation of load demand presents high levels of seasonality [
3,
4], mainly due to their touristic character. In more detail and owing to the non-dispatchable character of wind power, it is oftentimes that, under such conditions, local wind parks are faced with increased energy curtailments [
5].
To address similar challenges, island energy systems need to become more flexible. Towards this direction, different solutions have gained maturity in recent years. While energy storage technologies remain at the forefront [
6,
7,
8,
9], new flexibility options emerge, such as with demand response [
10,
11] and, in a broader sense, advanced energy management. This strongly relates to the progress met in the fields of artificial intelligence and data sciences, enabling the development of advanced forecasting techniques that could be of support in improving dispatching terms of wind power.
To that end, artificial intelligence and, more precisely, artificial neural networks (ANNs) are currently used in the wind energy sector, helping to address aspects in both the design and operation stages of different wind energy applications [
12,
13]. Interest in the integration of artificial intelligence in the sector is also reflected in the recent body of scientific literature, with emphasis put on the prediction of wind power generation for different time horizons [
14,
15], as well as on other aspects, such as the predictive maintenance of wind turbines [
16].
Development of optimal wind power forecasting models to that end is not strictly limited by the need to generate high-accuracy predictions of the wind power component, or, as Wahndany et al. [
17] put it, “a more or less conservative forecast may be preferred over pure accuracy”. Instead, the development of optimal models may extend to also incorporate additional aspects, such as the operational requirements of wider system boundaries and the interplay of different components, aiming to serve the purpose of overarching optimization objectives at the system level. Similar research is topical in the area of wind-based microgrids [
18,
19,
20,
21,
22], where wind energy penetration is considerable and where the output of wind power forecasting models is used to inform the operation of advanced energy management systems.
In relation to the above and with the goal to further advance research in the field, the present study focuses on settings of energy-constrained island systems that are determined by high shares of wind energy and the experience of significant wind energy curtailments. More specifically, our research offers a new viewpoint on the exploitation of wind power forecasting, with the aim to address aspects of interest not only to system operators, but also to wind energy actors locally involved. With that in mind, we put forward the design, development, and performance evaluation of ANN models for the day-ahead prediction of the theoretical (expected) and actual (dispatched) wind power and proceed to a broader analysis on the interpretation of results obtained towards their meaningful exploitation in the framework of day-ahead dispatch scheduling in similar systems.
To that end, in
Section 2 of the paper, we present the relevant input data and provide the problem definition, methods, and performance metrics, focusing on the design of ANN models for the hourly, day-ahead prediction of wind power. Accordingly, in
Section 3, we provide the main results of our research and proceed further with their analysis on the basis of a post-processing exercise, seeking to introduce a meaningful interpretation in relation to the requirements of day-ahead dispatch scheduling in energy-constrained island systems. Next,
Section 4 discusses the broader meaning of the theoretical and actual wind power forecasting in similar systems, elaborating on the emergence of new actor schemes and markets locally, while
Section 5 provides the main conclusions of our research.
2. Materials and Methods
2.1. Input Data
For the purpose of our analysis, we use the Aegean Archipelagos as the study area and apply our methodology for the non-interconnected island system of Kos and Kalymnos. The latter is found in the SE region of the Aegean and comprises nine electrically interconnected islands. The local electricity system features a peak demand of ~105 MW and an annual consumption of ~360 GWh, and hosts a total of four wind parks. These are determined by a total capacity of 15.2 MW and, owing to the limited flexibility locally, are set to face energy curtailments that even exceed 25% of the respective theoretical energy yield on an annual basis [
5]. Input data used for our analysis include detailed, hourly estimations of the wind capacity factor (
CF) for each of the four wind parks examined and for a time period of two years (2019–2020), arguing that, with the use of the wind power
CF as the main feature of our predictions, we are able to better harvest spatial correlation between the examined wind parks’ specific energy performance, effectively addressing any impact of unscaled input variables (different power generation scales).
In this context, we estimate two different expressions of wind power
CF: the theoretical and the actual one (see also Equations (1) and (2), with
Etheor and
Eactual standing as the theoretical and actual (or absorbed by the grid) output of wind energy over a period of Δt, and with
No being the operational capacity of the wind park). Estimated time series of the theoretical wind power
CF are generated with the use of open wind speed datasets, available from the MERRA2 database [
23]. These are properly adjusted to the hub height of installed wind turbines for each of the examined wind parks (see also
Table 1), which allows for the estimation of the anticipated, ex ante
CF. In the same context, wind power curves of the given wind turbines are also used (
Figure 1a), considering at the same time the maximum permitted (licensed) wind power output at the wind park level, with the respective time series firstly generated at the level of each different wind park (
Figure 1b).
On the other hand, estimation of the actual, ex post wind power
CF is based on aggregate wind power generation data provided by the local Distribution System Operator (DSO) over the same time period (i.e., two consecutive years: 2019–2020). Estimation of the actual wind power
CF to that end captures in situ, layout-related efficiency characteristics of the examined wind parks, as well as aspects such as wind turbines’ downtime periods and periods of wind energy curtailments for the purpose of grid decongestion. Thus, it may provide wind actors with useful information for the investigation of relevant hedging strategies and the recovery of wind energy curtailments. Accordingly, and in
Figure 2, the aggregate, weighed output of the four wind parks is compared against the respective actual wind power
CF values for the Kos and Kalymnos system, with the 480-h comparison (
Figure 2b) designating a very similar trend and revealing a positive difference between the theoretical and actual wind power
CF.
The latter is also reflected in
Figure 3a, and is explanatory of wind energy curtailments occurring in the local system, with the overall 2-year period
CF for the theoretical and actual series, being equal to 32.5% and 23.7%, respectively. Finally, and in
Figure 3b, we also present probability density distributions in relation to different wind power
CF classes for both the theoretical and actual
CF series, with the comparison between the two patterns again suggesting the occurrence of extended curtailments in the local energy system.
2.2. Problem Definition and Methods
In order to produce day-ahead predictions of the theoretical and actual wind power generation on an hourly basis, we proceeded to the development of several ANN models, capturing different types and also architectures of ANNs. The different types of ANNs considered include time-lagged recurrent networks (TLRNs), multilayer perceptron (MLP), and general feed-forward networks (GFFNs). At the same time, different architectures were tested on the basis of a trial-and-error method, under which we evaluated a total of 15 main models for the prediction of the theoretical wind power CF, applying different training algorithms, features, sets of hidden layers and also numbers of input/hidden layers’ neurons. In this context, out of the different ANN models examined per different types of ANNs, we excluded the models that were found to be sub-optimal in terms of either accuracy or time complexity, and currently present the results for a subset of four models. The latter includes three adequately performing models (one for each type of ANNs studied), as well as a fourth, best-performing model, that is also applied in the prediction of the actual wind power CF accordingly.
With regard to the development of the models, training parameters taken into account consider input deriving from the previous five days (D-1, D2, D-3, D-4, and D-5), with the aim to provide a total of 24 distinct hourly predictions for the day ahead (D+1). In that way, the required input for the models’ execution becomes independent of data associated with day (D-0). As a result, and in the context of a rolling, day-ahead (D+1) scheduling pattern, increased flexibility is provided concerning the models’ execution time over the previous day (D-0). This is considered fundamental for the development of the models, allowing for the incorporation of new, rolling historical data of each (D-1) day, normally cleared and published every other (D-0) day (i.e., one day after). A reasonable assumption to that end is that the selected model is executed at 12:00 of day (D-0), allowing for the incorporation of (D-1) data on the one hand and for the delivery of forecasting results in view of configuring the day-ahead dispatching schedule on the other.
With this in mind, in
Table 2, we present the inputs used for the training of the given ANN models, together with the respective generated output. In more detail, and as already seen, out of the initial set of developed models, a total of four models are currently presented, with the fourth model also capturing additional input parameters, which are highlighted in
Table 2 (in grey). To that end, the first three models exploit a total of 28 inputs, forming an input layer of 28 artificial neurons, while the fourth one considers 38 inputs and, thus, an input layer of 38 neurons, with the overall aim being to generate a model of both increased accuracy and of a wide prediction span regarding different
CF classes (see also
Section 3.1).
Model 1: The first model developed is a time-lagged recurrent network (TLRN) model [
24,
25,
26]. It comprises a 28-neuron input layer and is combined with a 4-neuron hidden layer and a single-neuron output layer, using the back-propagation training algorithm and the activation function of tanhaxon.
Model 2: The second model developed is a multilayer perceptron (MLP) ANN model [
27,
28], which, in addition to the 28-neuron input layer, also carries one hidden layer of 24 neurons and an output layer of a single neuron. Moreover, it uses the same training algorithm and the same activation function as the TLRN model.
Model 3: The third model is a general feed-forward network (GFFN) model [
29,
30], having the same structure, training algorithm, and activation function as the MLP.
Model 4: The fourth model is also a GFFN model, but with a different structure. More specifically, it also incorporates the max and min
CF values of the last 12 h, up to the given prediction hour, considering each of the five previous days and aiming to the generation of predictions both of higher accuracy and for a wider span of wind power
CF bins. As such, it employs a total of 38 neurons for the input layer. Moreover, and due to the inclusion of extra features, it comprises two hidden layers, with 35 and 17 neurons, respectively, and has an output layer of a single neuron (see also
Figure 4). The activation function applied remains the same as before (tanhaxon), while for the learning rule, we used the gradient descent with momentum.
With regard to the training of the presented models, we used the time series of 2019 and developed two subsets of data. The first subset included 80% of the available training dataset, with the rest 20% of the second subset used for the purpose of cross-validation. The subsets were created randomly, while for the performance evaluation of the models, we used the data series of 2020, not used in the training stage of the models. Finally, training, validation, and also performance evaluation of the models were performed with the support of MATLAB software (ed. MATLAB R2022b).
2.3. Performance Indicators
Concerning the evaluation of the developed ANN models, appropriate statistical performance indicators were used [
31,
32], such as the correlation coefficient (
R), the mean bias error (
MBE), the root mean square error (
RMSE) the index of agreement (
IA), and the normalized mean absolute error (
NMAE), analyzed in the following and represented by the set of Equations (3) to (7), where
Pi concerns the predicted values and
Oi the corresponding observed values, and where
n is the population of predictions (pairs).
R measures how strong a relationship is between two variables, in our case, between the predicted and the observed values of CF. Values of R range from −1 to +1, with −1 indicating a perfectly linear and negative, i.e., inverse, correlation and +1 indicating a perfectly linear and positive correlation. In the case of ANN models’ prediction, the best score of R is 1, indicating that all of the predicted values are equal to the corresponding observed values.
MBE shows the degree of correspondence between the mean forecast and the mean observation. MBE is used to quantify whether the model underestimates or overestimates the observed data. Positive values indicate an overestimation, and negative values indicate an underestimation.
RMSE is a quadratic scoring rule that also measures the average magnitude of the error. More concretely, it is the square root of the average of squared differences between predictions and observations. RMSE provides the average of model prediction errors expressed in the units of the variable of interest.
IA gives a measure of how close the predicted Pi and the observed Oi values are, taking values from 0 to 1. Values of IA close to 1 indicate that the predicted values are close to the observed ones, and as such, a small prognostic error occurs.
NMAE is the average of the absolute differences between the observed and predicted values, normalized on the maximum measured value, which is currently equal to the max value of CF, i.e., 100%.
Results obtained to that end for the prediction of the theoretical and actual wind power CF for the 24 h day-ahead horizon studied are presented in the following section.
3. Results
3.1. Theoretical CF
Following the training and validation stages for the tested models, their performance was evaluated on the basis of data series from the year 2020, through the application of statistical indicators analyzed previously. Evaluation results obtained to that end concerning the prediction of the theoretical wind power
CF are summarized in
Table 3, where, as one may note, the second GFFN model (hereafter symbolized with GFFN-opt) comprises the model determined by the best performance across all of the involved metrics. In the same context, and although a direct comparison between different cases cannot be easily achieved, results obtained from other studies in the field indicate similar evaluation metrics for optimal, ANN-based day-ahead wind power prediction models addressing wind power installations of analogous scale (e.g., an
NMAE of ~17% is recorded in [
33] regarding a wind park of 15.8MW), neglecting at this stage any spatial smoothing effect [
34] due to the aggregation of four distinct wind parks in our case.
Returning to the comparison made between the different models examined, this is better reflected in
Figure 5a, where a two-week period comparison between the observed and predicted values generated by each of the models is provided for the first two weeks of 2020. As can easily be concluded, the TLRN model captures the average trend of the observed pattern, with the MLP and GFNN models presenting a better and similar behavior that outperforms TLRN. At the same time, the second GFFN model (GFFN-opt) is found to present the best fit, which is justified by the consideration, during the training stage, of the last 12 h maximum and minimum
CF values. Furthermore,
Figure 5b provides the probability distribution of residuals for each of the models examined, referring to the entire dataset of the year 2020. Residuals refer to the difference between the observed and the predicted values. As such, a positive difference suggests the underestimation of generated predictions, and a negative one overestimation. In this context, the distribution of the GFFN-opt model appears rather symmetrical around the minus 10% bin, and also narrower, opposite to the case of the other models, peaking close to the residual bin of minus 30% and presenting a wider shape. Actually, beyond the +/−30% range, cumulative probability drops to less than 15% for the GFNN-opt model, with the rest of the models presenting higher and more persisting probability values in both the lower and higher residual bins. To that end, it is for all four models that the probability value almost zeroes for residual bins that are lower than minus 70% and higher than 70%, demonstrating that even the least performing ones are able to provide predictions of moderate, but still reasonable accuracy, for the day-ahead time horizon examined.
Moreover, and by also analyzing residuals on the basis of different classes of theoretical wind power
CF (
Figure 6), the superiority of the GFNN-opt model is again demonstrated.
This is also supported by the fact that the GFFN-opt model captures a wider span of predictions of theoretical wind power CF values, which also includes the CF class of 80–90%. Meanwhile, the TLRN model is limited in the range of CF classes between 10% and 60%, while the GFFN and MLP models, although capturing predictions for CF classes reaching 80%, are also outperformed by the GFFN-opt model, especially in the area of higher-order CFs.
3.2. Actual vs. Theoretical CF
Following the comparison between the performance of the different models concerning the prediction of theoretical
CF values, we then proceed to compare the performance of the best-fit model (GFNN-opt) with regard to the prediction of the theoretical and actual wind power
CF. The latter, as already seen, takes into account wind power that is absorbed by the local energy system and thus incorporates instances of wind energy curtailments. Evaluation results on the performance of the GFNN-opt model for the actual
CF are given in
Table 4 against the respective of the theoretical
CF.
As it may be concluded from the above table, performance of the developed model is more limited in the case of the actual wind power CF. At this point, it must be noted that the actual wind power CF pattern presents a greater coefficient of variation (87.8%) in relation to the theoretical one (78.8%), which can be explained by the exercise of technical factors, which both limit the levels of CF values overall and introduce features of more stochastic character. These factors include grid constraints, such as the dynamic penetration constraint, and operational limitations of involved thermal power generators, associated with the technical characteristics of the thermal fleet, the appearing levels of load demand, and the overarching dispatching strategy adopted by the local DSO. As such, further details would be required in order to adequately describe the influence of these parameters as well.
Acknowledging the above, two different sets of curves are given in
Figure 7a, representing the observed and predicted values of the theoretical and actual wind power
CF, again for the first two weeks of the year 2020. If looking into the resulting trends, the theoretical
CF model seems to adjust better to the respective observed values.
On the other hand, and owing to the fact that the actual
CF series is determined, on average, by values that are ~25% lower than the respective of the theoretical series, the probability distributions of residuals for the two cases are almost identical (
Figure 7b).
3.3. Post-Processing and Analysis
Further elaborating on the results obtained, we accordingly proceed with a mapping exercise. The latter aims to provide meaningful post-processing and interpretation that may enable informed decision making regarding the day-ahead dispatch scheduling of power generation assets in similar, energy-constrained island systems, determined by high shares of wind energy and also the experience of increased wind energy curtailments for local wind energy actors.
3.3.1. Residuals’ Matrices
Acknowledging the above, first, we create different
CF classes (predicted values) and assess the range of anticipated residuals per different classes and hours of the day. In that way, predictions generated are also defined by the range of residuals attributed to the given hour and class of
CF, thus offering the potential for the introduction of probabilistic signals concerning wind power forecasting. Two different matrices are developed to that end. The first concerns the prediction of the theoretical wind power
CF (
Figure 8) and the second prediction of the actual wind power
CF (
Figure 9).
As far as the first matrix is concerned, it can be seen that in the range of 0%<
CF <20%, predictions generated tend to underestimate the observed values, while the opposite is valid for
CF values kept higher than 50–60%. At the same time, the greater (absolute) residual values are noted for the higher
CF bins (>70%), where overestimation of observed values may even reach 80%. On the other hand, pronounced overestimation in the case of actual wind power
CF (
Figure 9) presents itself for values greater than 40%, while classes of
CF higher than 80% are not made available, since
CF predictions generated remain below 80%, even in the case of observed values reaching 100%. As can be seen (
Figure 8), the same is valid in the case of the theoretical wind power
CF as well, with prediction results this time limited by a ceiling value of 90%.
In addition to the box plot analysis provided in the previous figures, we next proceed to the synthetic presentation of residuals per different
CF classes and hours of the day. We do so indicatively for the 25th, the 50th, and the 75th percentiles of residuals, and in the form of heatmaps presented in
Figure 10.
According to the heatmaps, significant underestimation is noted in the lower ranges of CF values, and for the higher percentiles (50th and 75th), for both the theoretical and actual wind power CF predictions, especially for CF values below 20%. At the same time, and as already seen, actual CF predictions appear to be overestimating the respective observed values of higher CF, in a rather pronounced way and for CF values exceeding 40–50%.
3.3.2. Residuals’ Weights
Accordingly, by considering the annual probability distributions for the predicted values of the theoretical and actual
CF, per class of
CF and per hour of the day, different classes of predictions carry different probability weights on an annual basis (
Figure 11). For example, although overestimation with regard to the observed, actual wind power
CF values falling in the range of 70–80% even reaches ~68%, the probability of encountering similar predictions is almost zero.
Arguing to that end that the weight of residuals estimated should also factor in the annual probability of encountering a given
CF prediction class, the heatmaps of
Figure 10 are accordingly rebuilt (
Figure 12), using the product of residual percentiles’ and predictions’ probabilities. The latter is currently assumed to better represent the significance of each given residual, per different day hours and
CF bins, which, as expected, causes the weights of the residuals to strengthen for the lower of
CF bins, weakening them for the higher ones.
3.3.3. Analysis per Class of CF and Day-Ahead Hours
A more abstract view of results is provided in
Figure 13 and
Figure 14, where the average weights of residuals per different
CF classes and day hours are provided. Average values are estimated on the basis of heatmaps given in
Figure 12 (per column and row, respectively), while the weight of residuals is provided in a normalized fashion, relative to the maximum absolute value for each different (25
th, 50
th, and 75
th percentiles) pair of theoretical and actual heatmaps. According to
Figure 12, the relative weight of residuals appears more important in the case of the actual
CF for
CF values up to 30%. On the other hand, for
CF values exceeding 60%, the relative weight of residuals zeroes, owing to the annual probability attached to the respective predictions (
Figure 12b).
Moreover, by comparing the different percentile curves, the 25th percentile introduces negative weights that peak around CF of 30%, while the 75th percentile yields positive values that maximize around 20%. As such, in the given area of CF, both underestimation and overestimation of observed values amplify due to the impact of factoring in probability. The same trends apply in the case of the theoretical CF, with non-zero relative weights extending to the CF bins of 70–80%, and with maximum and minimum residual weights appearing milder. At the same time, concerning percentiles, the 75th percentile curve presents an early peak, the 25th one minimizes in the CF area of 35–40%, and the 50th presents a similar behavior to the respective of the actual CF, with residual weights moving around 0% for CF values greater than 20%.
Furthermore, a comparison on the basis of day-ahead hours between the relative weights of theoretical and actual
CF residuals is given in
Figure 14. Results are presented per different percentile, with the first of graphs (25
th percentile) demonstrating that the weight of residuals (weighed overestimation) for the actual and the theoretical
CF becomes less important between 3:00 am and 9:00 am and between 15:00 pm and 20:00 pm, respectively. Under the 50
th percentile, the weight of residuals ranges between −15% and +15%, with positive values presented (a) between 19:00 am and 2:00 pm and between 8:00 pm and 11:00 pm for the actual
CF, and (b) between 16:00 pm and 20:00 pm for the theoretical
CF. Finally, in the case of the 75
th percentile, the weight of residuals (weighed underestimation) becomes more important between 20:00 pm and 24:00 pm for the actual CF, and less important between 10:00 am and 14:00 pm for the theoretical
CF.
In the meantime, and in relation to the post-processing analysis of the prediction results, it can also be inferred that decision making on the selection of best-performance models should not be restricted to the results of evaluation metrics covering the entire problem space alone (see also
Table 3); rather, it can also relate to specific application needs, narrower problem spaces, and also to the weight of importance assigned to different classes of results (see also
Figure 15). In this context, and although not in the core scope of the present study, parallel exploitation of different models under this view may also add to the analysis towards the direction of ensemble modeling [
35].
Having said that, results provided in the previous, exemplary
Figure 15 gather the residuals of theoretical wind power
CF predictions for a certain range of
CF prediction values (between 20% and 30%) and for all four ANN models currently examined, indicating that independently of their overall performance scores (see also
Table 3 and
Figure 4), different models may find themselves outperforming one another at a lower level of analysis, for which performance per different hours of the day is also taken into account.
4. Discussion
Following the analysis of results, we currently place our arguments on their meaningful character for both local system operators and wind energy actors.
The measured ability to predict the theoretical, hourly day-ahead wind power generation CF, appreciating at the same time a fair estimation of the respective CF residuals’ probability, provides local system operators with sufficient information in order to proceed to relevant decision making concerning day-ahead dispatching schedules, aiming to the maximization of absorbed wind energy by the local, island energy systems and to the minimization of system costs. Moreover, since the corresponding results may be generated at any point of the day (D-0), being dependent only on values deriving from days (D-1) to (D-5), sufficient flexibility is also provided to island system operators in terms of execution time for the selected models.
To that end, and given also the accuracy levels of optimum forecasting models developed, a considerable reduction of spinning reserve requirements is also possible for local system operators, which in turn reduces system operational costs. At this point, it is important to mention that in energy-constrained island systems, where, as anticipated, wind farms are geographically concentrated, the rule of thumb for the determination of spinning reserve requirements concerning the component of wind power generation assumes, in the absence of reliable predictions, a rather moderate guaranteed share of wind power generation that varies, depending on the island system examined, between 0% and 20% of the respective wind capacity [
36].
Furthermore, the ability to predict the actual wind power CF is of added value concerning the design and valuation of dedicated storage systems, used for the recovery of wind energy curtailments. Such storage entities may be seen as a means of transforming existing wind parks into dispatchable units, which introduces different working schemes. The latter may comprise a single actor (coupled wind and storage with common coupling), multiple actors, or independent actors. In the last two scenarios, centralized storage that may stand as either a completely independent entity or an asset that is co-owned by multiple wind actors implies the prospect of partial storage leasing on the one hand and of cooperative models on the other. Further on, under the prospect of looking at an independent, centralized storage entity, day-ahead forecasting of actual and theoretical wind power could inform price offers of wind actors on the commitment of, e.g., daily storage capacity shares, building also on the establishment of a similar, local-scale market between wind power and storage actors. The same is of course valid for the case of emerging demand response services, with relevant assets broadening the context of such markets and, thus, increasing competition.
Finally, and as far as forecasting results are concerned, further work in the given area could address ensemble modeling as well as the development of additional models, e.g., on a seasonal basis, that could better capture long-term, daily changes in the patterns of wind power generation and especially in those of the actual wind power CF. Moreover, the same is also valid for clustering-based forecasting, with a more detailed time series pre-processing potentially leading to the generation of more meaningful model features. At the same time, the development of wind park level, rather than system-level forecasting models, adds to the discussion of hedging strategies for existing and future wind park actors, allowing also for a comparative evaluation of local and aggregate-scale wind power forecasting.
5. Conclusions
In the present study, we examined the problem of day-ahead wind power forecasting in the context of saturated, in terms of wind power capacity, island electricity systems. Using ANNs, we developed and evaluated different day-ahead models for the hourly prediction of the theoretical and actual wind power CF in a given island system of the Aegean Sea, arguing that such prediction signals are of importance for both the local DSOs and wind park actors. Next, we proceeded with a post-processing of results. The latter focused on the analysis of prediction residuals with the aim of generating an extensive map for informing probabilistic decision making of day-ahead dispatch scheduling in similar island systems, looking also into the significance (probability of occurrence) of different scale residuals, projected to the dimensions of CF class and day-time hours. At the same time, optimal ANN models developed provided a fairly good accuracy over the biggest span of CF values for both the theoretical and actual wind power CF, allowing at the same time for a (D-0)-independent model execution, which adds considerably in terms of operational flexibility. Finally, in the context of a broader interpretation of the research output, exploitation of results was discussed, with an outlook on the emergence of local-scale markets and new actor schemes in similar, energy-constrained island systems.