Next Article in Journal
Assessing the Effect of Land Use Change on Surface Runoff in a Rapidly Urbanized City: A Case Study of the Central Area of Beijing
Previous Article in Journal
Hydrological Control of Vegetation Greenness Dynamics in Africa: A Multivariate Analysis Using Satellite Observed Soil Moisture, Terrestrial Water Storage and Precipitation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Understanding Landscape Influences on Aquatic Fauna across the Central and Southern Appalachians

1
Department of Forestry and Environmental Conservation, Clemson University, Clemson, SC 29634, USA
2
Office of Science Applications, U.S. Fish and Wildlife Service, Fairbanks, AK 99701, USA
*
Author to whom correspondence should be addressed.
Submission received: 6 December 2019 / Revised: 6 January 2020 / Accepted: 7 January 2020 / Published: 10 January 2020
(This article belongs to the Section Landscape Ecology)

Abstract

:
For the success of aquatic conservation efforts, it is imperative for there to be an understanding of the influences multiple stressors across the landscape have on aquatic biota, as it provides an understanding of spatial patterns and informs regional stakeholders. The central and southern Appalachians contain biodiversity hotspots for aquatic fauna. Therefore, we sought to create a comprehensive multimetric model that is based on the influence of abiotic factors on fish and aquatic macroinvertebrates that could predict watershed quality. Good spatial coverage exists for land use/land cover (LULC) and other physicochemical components throughout the region, yet biological data is unevenly distributed, which creates difficulties in making informed management and conservation decisions across large landscapes. We used boosted regression trees (BRT) to model a variety of biological responses (fish and aquatic macroinvertebrate variables) to abiotic predictors and by combining model outputs created a single score for both abiotic and biotic values throughout the region. The mean variance that was explained by BRT models for fish was 73% (range = 48–85%) and for aquatic macroinvertebrates was 81% (range = 76–89%). We categorized both predictor and response variables into themes and targets, respectively, to better understand large scale patterns on the landscape that influence biological condition of streams. The most important themes in our models were geomorphic condition for fish and water quality for aquatic macroinvertebrates. Regional models were developed for fish, but not for aquatic macroinvertebrates due to the low number of sample sites. There was strong correlation between regional and global watershed scores for fish models but not between fish and aquatic macroinvertebrate models. We propose that the use of such multimetric scores can inform managers, NGOs, and private land owners regarding land use practices, thereby contributing to large landscape scale conservation efforts.

1. Introduction

Aquatic biodiversity is declining faster than terrestrial biodiversity globally and one of the contributing factors is the condition of the landscapes in which aquatic systems are embedded, coupled with the aquatic environments themselves [1,2]. There is a need to understand the drivers of this decline across a geography of concern in order to reverse or stabilize this trend and conserve aquatic species and ecosystems (e.g., the central and southern Appalachians). Biological sampling efforts can be geographically extensive (e.g., National Fish Habitat Partnership, state agency sampling, etc.), yet it is intractable to consider biological sampling of every stream segment or small watershed across large geographies. Therefore, there is a need to develop an understanding of the underlying physical, chemical, and biological influences that affect the population and community condition, model those effects, and extend predictions to unsampled stream segments or watersheds.
Assessing the biological condition of rivers and streams has been performed at various spatial scales and grains. As data availability and computing power both increase, the ability to perform spatial analysis across large geographic extents and at small grain size is increasing. Examples of large scale spatial analyses include Esselman et al. [3], who used fish as indicators, the US Environmental Protection Agency’s Wadeable Stream Assessment [4], which included benthic macroinvertebrates and fish as indicators, and Hill et al. [5], who modeled biological stream condition across the conterminous U.S. based on anthropogenic and natural watershed features. Multimetric indices (MMIs) are commonly used in studies that attempt to understand anthropogenic influences on the condition of biological aquatic communities over vast geographies [6,7,8,9]. Multimetric indices use taxonomic and functional metrics that are known to be sensitive to gradients of anthropogenic disturbance as a means of assessing the health of aquatic systems.
Landscape-scale analyses are important, as they allow for a more complete understanding of anthropogenic stressors on biological and ecological needs of species, populations, and communities across broad spatial extents. We define landscape scale as the spatio-temporal extent that allows for multigenerational population processes to occur [10]. Such landscape scale analyses require extensive data that are representative across taxonomic groups through both space and time, and physicochemical conditions must be represented, which range from natural to extensive anthropogenic disturbance [11]. It is often difficult to ensure that data sets are consistent across regions, representative taxa (response variables), and physicochemical factors (predictor variables). These difficulties arise due to biological and environmental datasets being expensive to obtain, varying goals and objectives from entities collecting data, and varying monetary support for such collections. This often results in either poor spatial coverage and spatially clumped data or both. Data sets are often combined from various collection programs to overcome these difficulties. However, this can result in data existing at varying taxonomic levels, problems with standardization of sampling protocols, and biases that are associated with site selection. For example, variability in site densities may result in biases towards trends that are prevalent in heavily sampled areas. Angermeier and Smogor [12] found that sampling effort influenced accuracy and precision of estimates of biological community attributes.
Anthropogenic induced stressors often correlate strongly with biological conditions of aquatic systems and, thus, landscape-scale analyses typically aim to use land cover and land use practices over large geographies as surrogates for local environmental conditions to provide insight into biological conditions of streams and rivers. The use of such analyses allow for broad geographic scale understanding of the existing gradients of poor-to-good biological condition and the environmental factors that affect them. This aids in identifying the landscape-scale stressors that further the ability of conservation practitioners, managers, and regulators to take actions that positively impact aquatic fauna through a greater understanding of large-scale stressors. However, anthropogenic stressors and their biological responses are heterogenous across large areas, but they are the result of local biogeographical interactions with both natural and anthropogenic stressors over expansive time periods [3].
Indices for evaluation of biological condition across large geographies have become standard procedures, enabling transferability across regions through the use of statistically robust methods [3,11,13,14]. Biological condition indices should sufficiently cover the range of environmental gradients, exhibit low temporal variability to repeated visits to the same site(s), and demonstrate high responsiveness to environmental degradation. Indicators that do not meet these requirements should be excluded from analyses [11,15]. The use of the screening methods provided by Whittier et al. [15] and Stoddard et al. [11] allow for a large set of contending predictor variables to be winnowed down to a smaller, manageable group of indicators used to describe the biological condition gradient in a consistent manner across regions.
Multimetric indices (MMI) combine a variety of biological indicators and indices. While using boosted regression trees (BRT; [16]), we developed watershed-based MMIs that combined fish and aquatic macroinvertebrates across the Appalachian Landscape Conservation Cooperative (App LCC) region (see methods for full description) using the methods that were developed by Stoddard et al. [11] and Whittier et al. [15]. Additionally, we aimed to identify large scale themes that structure biological communities and thus biological condition across this large geography. The App LCC encompasses biologically important aquatic ecosystems (e.g., Tennessee and Mobile Rivers) and it is predicted to undergo continued land conversion, resulting particularly in increased urbanization [17]. Using broad scale land use information to develop models to predict biological condition of streams and watersheds provides the opportunity to predictively map spatially explicit biological condition across large geographic regions [5,18]. While studies have been performed to understand the response of fish and aquatic macroinvertebrates for the conterminous United States [3,4,5], a study that is spatially explicit to the central and southern Appalachians that could inform aquatic conservation within the App LCC region is lacking. An understanding of the anthropogenic threats that exist in the Appalachians is an important endeavor, as more than half of the U.S. population lives within a day’s drive of some part of the Appalachians [19]. Thus, our objective was to model and map the current biological condition and develop an understanding of the threats to conservation across the region as an important step towards identifying conservation opportunities and needs, so that conservation practitioners can act accordingly.

2. Materials and Methods

Our study focused on the rivers, streams, and landscape encompassed by the App LCC. The App LCC comprised 592,129 km2, 266,307 HUC 12 watersheds, and intersects 15 states that range from central New York to the north, and central Georgia, Alabama, and Mississippi to the south [20] (Figure 1).
The eastern most extent reaches western New Jersey in the north and the western most portions of North and South Carolina, while reaching as far west as southern Illinois. The topography and habitat of the region are complex over such a large land area, which is comprised of six Level II ecoregions (U.S. EPA), 11 Freshwater Ecoregions, and seven major river basins and has been greatly influenced by the lack of glaciation and ancient geology, which gave rise to high levels of aquatic biodiversity. The App LCC region encompasses some of the most biologically diverse freshwater fish resources in the United States [23]. This is particularly true in the southwestern portion of the region [24]. As the 19th and 20th centuries have progressed, aquatic diversity has been impacted by anthropogenic activities that range from agricultural practices, land conversion, and extractive industries.
Through the aid of a steering committee comprised of 60 regional experts, which include academic scientists, state wildlife agency managers, non-governmental organization scientists, and conservation practitioners, we identified 52 predictor variables comprising six themes that are known to influence fish and aquatic macroinvertebrate communities (Table A1). The themes included streamflow, geomorphic condition, connectivity, water quality, non-point source pollution, and point source pollution, and they were established and evaluated to understand large scale thematic processes that influence aquatic biota across the App LCC. The predictor data were compiled from regional and national datasets at the National Hydrography Dataset version 2 (NHD + V2) HUC12 watershed level (Final predictors used in BRT models are shown in Table 1; a comprehensive list of predictor data with source is provided in Appendix A).
Riparian corridors are known to be influential for developing ecological diversity [25] and in structuring aquatic communities; hence, we used The Nature Conservancy’s Active River Area [26] as a buffer into which land cover variables were clipped for modeling. The active river area framework provides an ecologically sound modelling mechanism to capture important variables that structure aquatic ecosystems [26]. We centered and scaled all of the predictor variables to overcome instances where predictor variables had wide ranges resulting single predictors dominating models.
Our fish and aquatic macroinvertebrate community data were assembled from federal databases of national datasets (United States Environmental Protection Agency’s National Rivers and Streams Assessment and the National Fish Habitat Partnership). We used these community datasets as the predictors to assess the status of biological communities within HUC12 watersheds (Table 2).
The sample point locations were spatially joined to HUC12 watersheds by projecting into a conic equal area projection in Quantum GIS [27]. Our fish community dataset was comprised of 2991 sample sites across the region and was somewhat sparse (2991 locations within 266,309 HUC12 watersheds = 1.1%), but it exhibited good spatial distribution (Figure 2).
Our aquatic macroinvertebrate dataset was comprised of only 194 sampling locations and it had poor spatial coverage (194 locations of 266,309 HUC12 watersheds = 0.07%) and distribution across the region (Figure 2).
For fish, we calculated taxonomic richness and diversity and several functional group metrics while using the fish traits database that was provided by Frimpong and Angermeier [28] and the tolerance database of Stoddard et al. [29]. In total, we calculated 45 metrics for further analysis of fish responses to our predictor dataset; however, our final analysis used only nine of these metrics and these were further aggregated into three groupings that we called targets (e.g., Shannon diversity, functional group, fish taxa quality) (Table 2). For aquatic macroinvertebrates the metrics modeled as response variables were % EPT (i.e., percent Ephemeroptera/Plecoptera/Trichoptera), % of total abundance comprised by the five most dominant taxa, % of total abundance that were tolerant taxa, and % of the total abundance that were intolerant taxa (Table 2). These four metrics were aggregated into a target called macroinvertebrate taxa quality (Table 2).
Prior to model development, we successively screened all of the predictor variables (e.g., [3,11]). Screening included evaluating predictor variables for: 1) range, where predictor variables with small range (e.g., <4) were removed, 2) percent zero or NAs, where variables with >33% zero or NA were removed, and 3) percent values equal to the mode, where the variables with >75% of the values were the same were removed. After screening, the 52 predictor variables were reduced to 24, which comprised six themes (Table 1). Subsequently, we employed boosted regression trees (BRTs) to predict fish and aquatic macroinvertebrate response variables at the watershed scale (Table 2). Boosted regression trees are robust to missing data, have both explanatory and predictive power, and they are capable of handling complex relationships that are both non-linear and interactive without the need for prior data transformation. By iteratively fitting and combining simple regression models, BRT models improve model structure and predictive performance. Tree complexity and learning rate were systematically altered to identify the optimal model structure. We reduced model complexity by removing the redundant variables (r > 0.80) and variables with minimum variation among sites. Global models were developed with the remaining variables and then further simplified while using scree plots of the predictor relative influence. Simplified models were retained if their cross-validation was greater than or equal to that of the global model. BRT models were developed with methods and code that were provided by Elith et al. [16] while using Program R: A language and environment for statistical computing [30]. Initial analyses were performed across the entire geography without accounting for regional differences (i.e., ecoregions), but we used Freshwater Ecoregions of the World [31] to understand these regional differences due to a clear need for the inclusion of regional differences within the models. Freshwater ecoregions attempt to place aquatic ecosystems in a biogeographical framework that is based on ecological and evolutionary patterns of fish distributions [31]. We aggregated the 11 ecoregions that comprised the App LCC into four due to low sample site numbers in some Freshwater Ecoregions. The four aggregated ecoregions and their composite Freshwater Ecoregions were Atlantic Slope (Appalachian Piedmont, Chesapeake Bay, Northeast US and Southeast Canada Atlantic Drainages), Cumberland (Cumberland River Basin), Ohio (Teays-Ohio and Laurential Great Lakes River Basins), and Tennessee (Tennessee and Mobile Bay River Basins). The regional models were successfully run for fish, but we were unable to model aquatic macroinvertebrates regionally due to the lack of spatial coverage, and we therefore retained models for the entire geography for aquatic macroinvertebrates.
Predictions of response variables from BRT models were centered and scaled. As a multimetric approach, the predicted responses (biological metrics in Table 1) were then averaged within their target group (see Table 1) and further condensed via averaging into overall fish and aquatic macroinvertebrate scores. Boosted regression trees provide the relative influence of predictor variables in the modelling of responses, and we averaged these relative influences within our thematic framework (see above and Table 1) for each fish and aquatic macroinvertebrate response variable.

3. Results

The screening process reduced the total number of predictor variables to 37 from 52. When the predictor variables were correlated, we attempted to retain the variable that was most responsive to our models; however, this was not formally tested. The themes with the greatest number of predictor variables were non-point source pollution and point source pollution (each with eight variables). While the theme with the least number of predictor variables was geomorphic condition (two variables) (Table 1).
Below, we present the results from the boosted regression tree models where the models were run with all ecoregions together for both fish and aquatic macroinvertebrates. We refer to this as a “Global” model (see below). After which, we present the results where separate models were run for each Freshwater Ecoregion (defined in the materials and methods section). We refer to these models as regional models and, because there were not enough data points within the regional data sets for aquatic macroinvertebrates, they were developed for fish only.

3.1. Global Models

Boosted regression tree models were developed for 45% (28/62) of the response variables; however, we only retained 12 (eight and four response variables for fish and aquatic macroinvertebrates, respectively) for our final multimetric index of biotic condition (Table 2). Response variables were collapsed into targets, where, for the fish category, we retained three targets (Shannon diversity, functional group, and taxa quality) and for aquatic macroinvertebrates, we retained macroinvertebrate taxa quality as the only target.
The range of variance that was explained by the BRT models was 48% for percentage of invertivore taxa to 85% for fish taxa diversity ( x ¯ = 73%) for fish and 76% for percentage of the five dominant taxa to 89% for percentage of tolerant taxa ( x ¯ = 81%) for aquatic macroinvertebrates. For the fish BRT models, the cross-validated deviance ranged from 32% for percentage of invertivore taxa to 69% for fish taxa diversity ( x ¯ = 52%), while it ranged from 24% for percentage of the five dominant taxa to 76% for percentage of tolerant taxa for aquatic macroinvertebrates ( x ¯ = 41%) (Table 3).
The lower cross validated deviances for aquatic macroinvertebrates relative to those of fish are likely due to the lack of spatial data coverage in the aquatic macroinvertebrate data (Figure 2b).
The predictor variables that had the greatest relative influence were freshwater ecoregion and network percent impervious surface for fish and aquatic macroinvertebrates based on the results of the BRT models, respectively. Those with the least relative influence were how confined a stream reach was (CONF_CL) and watershed mine density for fish and aquatic macroinvertebrates, respectively (Table 4).
The theme with the greatest cumulative relative influence for fish was geomorphic condition (46.6%) and for aquatic macroinvertebrates was water quality (41.7%), while point source pollution was the least important theme for both fish (1.4%) and aquatic macroinvertebrates (0.8%). The relative thematic influence was variable between response variables; where, for example, the flow theme was the most influential for % EPT (27.4%), but the most influential theme was water quality for % intolerant taxa (87.9%) (Table 5).

3.2. Regional Models

Regional differences were evident in the results of the relative influence of predictor variables on responses. For example, the variable with the greatest relative influence for richness in the Atlantic Slope data was July temperature (22.8%), for the Cumberland River Basin, was ground flow recharge (17.6%), for the Ohio River Basin, was average catchment elevation (21.4%), and for the Tennessee River Basin, it was dam storage capacity (8.2) (Table 6).
Additionally, the relative influence varied widely within a given region for the same predictor variable, but for different response variables. For example, k-factor (a measure of soil erodibility) had the 24th most influence on diversity in the Atlantic Slope, but it was the fourth most influential variable in the % herbivore taxa model. The standard deviation that was associated with the relative influence of predictor variables for a specific region was also variable; where, for example, for invertivore diversity the standard deviation was 2.4, 2.6, 5.1, and 4.3 for the Atlantic Slope, the Cumberland River Basin, the Ohio River Basin, and the Tennessee River Basin, respectively.
The mean relative influence for themes were also somewhat variable across regions; where, for example, flow was the most influential theme for each region for the BRT model for the % of species that prefer coarse sediment. However, for the % of tolerant fish species, the flow theme represented the fourth most influential theme for the Atlantic Slope ( x ¯ = 1.7%), third most for the Cumberland River Basin ( x ¯ = 2.5%), second most for the Ohio River Basin ( x ¯ = 2.7%), and first most for the Tennessee River Basin ( x ¯ = 3.6%) (Table 7).
Generally, geomorphic condition was the most influential theme followed by flow, non-point source pollution, water quality, connectivity, and point source pollution when averaged across all of the response variables. With little variation, the themes were consistent in terms of their relative influence across regions; for example, non-point source pollution was the fourth most influential theme for the Atlantic Slope, the Ohio River Basin, and the Tennessee River Basin, but the fifth most influential for the Cumberland River Basin.

3.3. Fish Models

The model results suggest there are distinct areas of the App LCC region that exhibit characteristics of poor biological condition (i.e., watershed score) (Figure 3).
For example, western Tennessee (i.e., the Southeastern Plains) and the northern-most portion of the region (i.e., the Atlantic Highlands). Additionally, large rivers tended to exhibit poor biological condition. This was especially true in areas with high agricultural land use (i.e., east Tennessee). Regions where the models displayed relatively good biological condition were largely in areas free from agricultural land use. However, large rivers exhibited poor biological condition, even in areas with good biological condition.
Pearson correlations between the global and regional models were variable and ranged from 0.33 (% intolerant taxa) to 0.99 (% invertivore taxa) for pairwise comparisons. The overall indices for watershed scores (i.e., biological condition) were moderately correlated (0.69). The targets were highly correlated for the functional group (0.98) but, were relatively poorly correlated for taxa quality (0.34) (Table 8) (Figure 4).

3.4. Aquatic Macroinvertebrate Model

We were unable to develop regional models for aquatic macroinvertebrates due to the data set being sparse in some regions. The global model for aquatic macroinvertebrates showed poor ability to identify areas with a good and bad biological condition. Only very large geographic regions were distinguishable from one another. For example, the western portion of the App LCC region was identified as having relatively higher biological condition when compared to the eastern and northern portions of the region. Where the fish models identified large rivers as having poor biological condition, the aquatic macroinvertebrate models were unable to identify the biological condition at this scale (Figure 5).
Additionally, the Pearson correlation coefficient between the aquatic macroinvertebrate watershed score and those of the global (Figure 4) and regional fish watershed scores was 0.01 for both.

4. Discussion

Freshwater ecosystems are highly threatened worldwide. In North America, the Appalachian LCC region is a particularly valuable aquatic ecosystem due, in large part, to its high aquatic biodiversity. It is also a region of great conservation concern due to current and projected rapid land cover change (i.e., urbanization), biodiversity loss, habitat fragmentation, and climate change [17,20]. Models, such as the ones presented here, and others (e.g., [3,5]), which provide insight into how land use affects aquatic resources (e.g., fish and aquatic macroinvertebrates), are valuable tools for identifying areas of greatest concern for restoration activities and those areas in greatest need of increased and/or continued protection [32,33].
Multimetric indices may be beneficial in identifying a biological condition across large geographies. Our results suggest that multimetric scores (i.e., watershed scores) corroborate one another at the global and regional scales (see Table 8), which indicate that the predictor variables we used were responsive in a consistent manner across scales. This is an important finding, as the ability to rely on a consistent set of landscape variables enables cross-regional and cross-scale comparisons, which can be particularly important when data sets are assembled from multiple sources [3].
The individual predictors that most commonly had high relative influence for fish metrics (i.e., was ranked in the top ten percent of predictors within an individual metric) were primarily related to geomorphic condition (e.g., erosive and resistive forces) and flow regime (e.g., storage and recharge capacity). However, those with the highest relative influence for aquatic macroinvertebrates were primarily related to sources of pollution (e.g., road crossings, nitrogen, and silt). Our results reinforce those of Hill et al. [5], who found that agriculture and urbanization were the primary drivers of landscape impacts on aquatic macroinvertebrates. The differences in driving factors for fish and aquatic macroinvertebrates suggest that, at a large spatial scale, such as that presented here, the important land use factors that influence aquatic communities may differ somewhat and mitigation efforts to overcome these large-scale stressors likely need to have some taxonomic specificity.
The relative influence of individual predictors to our boosted regression tree models was variable between regions in our regional models. This indicates the high variability between regions regarding what environmental factors influence community structure. These findings are not surprising, as they are congruent with other large-scale biological condition assessments [3,5] and have been found to be a shortcoming of MMIs [34,35]. While individual predictors were relatively variable between regions, our models suggested themes (i.e., combined predictors) were more consistent; yet, some discrepancies between regions still existed. We chose to condense predictor variables into themes as a higher order mode of understanding of how these groups of stressors impacted fish and aquatic macroinvertebrates. The themes that exhibited the greatest influence on fish watershed scores across the entire region were geomorphic condition and flow regime. The geomorphic condition in our analysis was a measure of soil stability and erosivity. This finding is unsurprising, as aquatic taxa are known to be sensitive to sedimentation. Surprisingly, point source pollution and connectivity were the least influential. This is likely due to these two factors being measured somewhat sparsely (i.e., few watersheds have NPDES sites) on the landscape and not necessarily in close proximity to biological samples. For example, NPDES density, which is a measure of point source pollution, is likely to exhibit strong influence on aquatic biota in the immediate area, but these sites are scattered across the landscape and, thus, across a large region, such as the App LCC region, they were less influential in our models than predictors that have greater spatial representation (e.g., soil erosivity).
A high correlation between the global and regional fish multimetric indices suggests that the predictor variables used were responsive across both scales. While there was high correlation between global and regional fish multimetric indices (i.e., watershed score), there was very low correlation between multimetric indices from either of the fish models and the global aquatic macroinvertebrate model. The low correlation between the fish watershed scores and aquatic macroinvertebrate watershed scores is most likely due to the low number of sites (N = 334), which results in poor spatial coverage for our aquatic macroinvertebrate data set, as compared to the good spatial distribution and relatively high number of sites (N = 8355) in our fish data set. Correlations between the fish and aquatic macroinvertebrate indices are likely to be higher when the same, or nearby, sites are sampled for both. For example, at 18 sites in the River Raisin watershed in Michigan, correlations between fish and aquatic macroinvertebrate indices ranged from 0.21 to 0.51 [36]. Additionally, stressors have been found to affect fish and aquatic macroinvertebrates similarly. Free flowing stream sections of the Fox River, IL exhibited higher index scores for both fish and aquatic macroinvertebrates when compared to sections of the river above low-head dams [37].
Esselman et al. [3] suggested regional stratification for analyses that span large geographies, while also including multiple biological response variables, and using statistical modeling techniques that are non-linear would lead to improved multimetric indices. This is precisely the approach that we took modeling biological condition across the 15 state App LCC region. However, one major difference between our study and previous studies is that previous work to develop multimetric indices across broad geographies included regional influences by using Omernick ecoregions [3,5,15] (Whittier et al., 2007; Esselman et al. 2013; and, Hill et al. 2017). Omernick ecoregions are a spatial framework that includes aquatic and terrestrial ecosystems [38]; however, we used Freshwater Ecoregions of the World for our analysis, because they were developed on a framework that focused exclusively on aquatic ecosystems with fish as the focal taxonomic group [31].
Another difference between our work and previous work is that we aggregated both predictor and response variables into themes and targets respectively. By aggregating predictor variables into themes, we were able to identify categories of predictors that were influential in structuring fish and aquatic macroinvertebrate communities in aggregate (e.g., watershed score for fish), as well as for individual response variables (e.g., % EPT). Response variable aggregation allows for a greater understanding of how individual and aggregate predictors (i.e., themes) structure fish and/or aquatic macroinvertebrate communities in a more functional manner. For example, two targets (e.g., fish functional group and fish taxa quality) may respond differently to the same stressor. Understanding which higher order stressors (theme) affect our targets (fish functional group) might lend insight into more effective regional management and conservation strategies. Our results suggest that regional variability of the relative influence of predictor variables should garner significant consideration as the stressors that affect fish and aquatic macroinvertebrates were not the same from ecoregion to ecoregion. Furthermore, care should be taken when using large scale modeling efforts such as ours to make management recommendations across large geographies that span more than one ecoregion, as thematic importance was variable between regions.
While we were able to develop regional models for fish indices, unfortunately, we were unable to develop regional aquatic macroinvertebrate models due to low numbers of sample sites. As aquatic macroinvertebrates are known to be good indicators of biological condition [39,40,41], we believe that there needs to be increased systematic sampling efforts that result in an increased number of sites that are spatially dispersed across multiple freshwater ecoregions and that include known areas of both high and low biological integrity. In doing so, models such as the one provided here (see also Hill et al. 2017 and Esselman et al. 2013 [3,5] can be improved in their ability to provide managers with information to help guide local plans within a regional context. For example, mitigation efforts that aim to provide ecological lift to a watershed might look for areas that have been identified as having poor condition but surrounded by watersheds with good condition or, alternatively, these models could be used to identify good quality watersheds that are in need of protection from multiple encroaching stressors.

5. Conclusions

We combined and used multiple biotic and abiotic datasets to model stream condition across the Appalachian LCCs 15 state boundary (i.e., central and southern Appalachians). We developed separate fish and aquatic macroinvertebrate steam condition models highlighting the existence of both similarities and differences in how environmental factors structure these two aquatic taxonomic communities while using BRTs. Overall, correlation existed between fish and aquatic macroinvertebrate models and global and regional fish models. Furthermore, there was high correlation between global and regional fish models for most of the fish metrics evaluated. We were unable to create aquatic macroinvertebrates regional models due to low sample size regionally; however, for fish, separate models for each of the Freshwater Ecoregions varied in what environmental factors were most important in structuring aquatic communities, which exemplified the need to carefully consider ecoregional factors when developing cross-ecoregion spatial plans and management suggestions. Biological samples are difficult and expensive to collect, and effort varies between taxa, and this results in uneven distributions in both the number of samples and spatial distribution between taxa making it difficult to create MMIs that can better represent the condition of aquatic resources across broad geographies. Careful selection of biological sample locations that are representative of the spectrum spanning from high quality to low quality environmental conditions that are spatially well dispersed can greatly enhance the ability of models such as ours to inform spatial and management plans. Additionally, common sample locations for various taxa would be useful to improve the comparison of environmental responses among and between taxa.
We recommend a centralized database where these records are made available for public download, where efforts such as ours can be more easily replicated, improved upon, and then performed at various spatial scales. With an extensive and systematic sampling approach that includes both fish and aquatic macroinvertebrates and it is integrated across jurisdictional boundaries the ability to develop accurate models of large geographies will improve. Such improved spatial models will lend greater confidence in such indices, which will only increase the ability of managers, conservation practitioners, and scientists alike to address and meet regional conservation goals.

Author Contributions

For research articles with several authors, a short paragraph specifying their individual contributions must be provided. The following statements should be used “conceptualization, R.D.H., P.B.L. and R.F.B.; methodology, R.D.H. and P.B.L.; formal analysis, R.D.H.; writing—original draft preparation, R.D.H.; writing—review and editing, R.D.H., P.B.L., and R.F.B.; visualization, R.D.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Appalachia Landscape Conservation Cooperative through the Wildlife Management Institute.

Acknowledgments

We would like to thank the U.S. Fish and Wildlife Service for funding this research along with the Appalachian Landscape Conservation Cooperative staffs, partners, and collaborators that provided great insight and guidance throughout this project. We also acknowledge the Department of Forestry and Environmental Conservation at Clemson University. We would like to acknowledge the US Environmental Protection Agency’s work and for providing access to the StreamCat data set. Technical Contribution No. 6820 of the Clemson University Experiment Station.

Conflicts of Interest

The authors declare that they have no competing interests. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A

Table A1. Full list of predictors, showing each variable’s thematic grouping and the source for each data set.
Table A1. Full list of predictors, showing each variable’s thematic grouping and the source for each data set.
AttributeMetricVariable NameSource
GeneralCatchment areaareasqkmTNC
Upstream Drainage Area (Net Catch Area)TotDASqKmTNC
Flow RegimeBase Flowbfi_avgTNC
Base Flowbfi_avg_NTNC
Rechargerchrg_avgTNC
Rechargerchrg_avgNTNC
Total Storage/mean annual flownid_storNTNC
Density of Large DamsDamDensCatStreamC
Density of Large DamsDamDensWsStreamC
Municipal Water Withdrawaldom_wdcNFHAP
Ag Water Withdrawalag_wdcNFHAP
Industrial Water Withdrawalind_wdcNFHAP
Ag Water Withdrawal Normalized w/Summer flowag_wdc_Snorm
Ag Water Withdrawal Normalized w/Annual Flowag_wdc_norm
Municipal Water Withdrawal Normalized w/ Annual Flowdom_wdc_norm
Industrial Water Withdrawal Normalized w/Annual Flowind_wdc_norm
ConnectivityDensity of Dams UpstreamupnetdensrNFHAP
Density of Dams DownstreamdownmaindensNFHAP
Density of Road Crossings (Tiger 2010)RdCrsCatStreamC
Density of Road Crossings (Tiger 2010)RdCrsWsStreamC
Water Q (Pop)Athro Nitrogenn_yieldNFHAP
Anthro Phosphorusp_yieldNFHAP
Anthro Sediment Yields_yieldNFHAP
Water Q (LU)Impervious (NLCD 2011)imp11_perTNC
Impervious (NLCD 2011)imp11_perNTNC
Riparian Buffer (NLCD 2011) 100mPctDecid2011CatRp100 StreamC
Riparian Buffer (NLCD 2011) 100mPctDecid2011WsRp100StreamC
Riparian Buffer (NLCD 2011) 100mPctConif2011CatRp100StreamC
Riparian Buffer (NLCD 2011) 100mPctConif2011WsRp100StreamC
Riparian Buffer (NLCD 2011) 100mPctMxFst2011CatRp100StreamC
Riparian Buffer (NLCD 2011) 100mPctMxFst2011WsRp100 StreamC
% Crop (NLCD 2011)nlcd11_81N nlcd11_82NTNC
% Crop (NLCD 2011)nlcd11_81 nlcd11_83TNC
Water Q (Pollution)CERCcerc_denCNFHAP
Permit Compliancepcs_denCNFHAP
Toxic Release Sitestri_denCNFHAP
Coal Mine Densityall_col_denCNFHAP
OtherTemperatureIncrTempMM07TNC
TemperatureIncrTempMM08TNC
Elevationelevcm_avgTNC
Elevationelevcm_avgNTNC
Slopeslope_avgTNC
Slopeslope_avgNTNC
R-factorrf30_avgTNC
R-factorrf30_avgNTNC
K-factorkfact_avgTNC
K-factorkfact_avgNTNC
Sandsand_avgTNC
Sandsand_avgNTNC
Siltsilt_avgTNC
Siltsilt_avgNTNC
Landscape ConditionPercent Agricultural Land within ARA (HUC 8)Perc_Ag_HUC8
Percent Natural Land within ARA (HUC 8)Perc_Nat_HUC8
Percent Impervious Surface within ARA (HUC 8)Perc_Imp_HUC8
Percent Agricultural Land within ARAPerc_Ag
Percent Natural Land within ARAPerc_Nat
Percent Impervious Surface within ARAPerc_Imp

References

  1. Vörösmarty, C.J.; Mcintyre, P.B.; Gessner, M.O.; Dudgeon, D.; Prusevich, A.; Green, P.; Glidden, S.; Bunn, S.E.; Sullivan, C.A.; Liermann, C.R.; et al. Global threats to human water security and river biodiversity. Nature 2010, 467, 555–561. [Google Scholar] [CrossRef]
  2. Dudgeon, D.; Arthington, A.H.; Gessner, M.O.; Kawabata, Z.I.; Knowler, D.J.; Lévêque, C.; Naiman, R.J.; Prieur-Richard, A.H.; Soto, D.; Stiassny, M.L.J.; et al. Freshwater biodiversity: Importance, threats, status and conservation challenges. Biol. Rev. Camb. Philos. Soc. 2006, 81, 163–182. [Google Scholar] [CrossRef]
  3. Esselman, P.C.; Infante, D.M.; Wang, L.; Cooper, A.R.; Wieferich, D.; Tsang, Y.-P.; Thornbrugh, D.J.; Taylor, W.W. Regional fish community indicators of landscape disturbance to catchments of the conterminous United States. Ecol. Indic. 2013, 26, 163–173. [Google Scholar] [CrossRef]
  4. USEPA. Wadeable Streams Assessment: A Collaborative Survey of the Nation’s Streams; USEPA: Washington, DC, USA, 2006.
  5. Hill, R.A.; Fox, E.W.; Leibowitz, S.G.; Olsen, A.R.; Thornbrugh, D.J.; Weber, M.H. Predictive mapping of the biotic condition of conterminous U.S. rivers and streams. Ecol. Appl. 2017, 27, 2397–2415. [Google Scholar] [CrossRef]
  6. Harris, J.H.; Silveira, R. Large-scale assessments of river health using an Index of Biotic Integrity with low-diversity fish communities. Freshw. Biol. 1999, 41, 235–252. [Google Scholar] [CrossRef]
  7. Hering, D.; Moog, O.; Sandin, L.; Verdonschot, P.F.M. Overview and application of the AQEM assessment system. Hydrobiologia 2004, 516, 1–20. [Google Scholar] [CrossRef]
  8. Pont, D.; Hugueny, B.; Beier, U.; Goffaux, D.; Melcher, A.; Noble, R.; Rogers, C.; Roset, N.; Schmutz, S. Assessing river biotic condition at a continental scale: A European approach using functional metrics and fish assemblages. J. Appl. Ecol. 2006, 43, 70–80. [Google Scholar] [CrossRef]
  9. Pont, D.; Hughes, R.M.; Whittier, T.R.; Schmutz, S. A Predictive Index of Biotic Integrity Model for Aquatic-Vertebrate Assemblages of Western U.S. Streams. Trans. Am. Fish. Soc. 2009, 138, 292–305. [Google Scholar] [CrossRef]
  10. Hanski, I. Metapopulation dynamics. Nature 1998, 396, 41–49. [Google Scholar] [CrossRef]
  11. Stoddard, J.L.; Herlihy, A.T.; Peck, D.V.; Hughes, R.M.; Whittier, T.R.; Tarquinio, E. A process for creating multimetric indices for large-scale aquatic surveys. J. N. Am. Benthol. Soc. 2008, 27, 878–891. [Google Scholar] [CrossRef]
  12. Angermeier, P.L.; Smorgor, R.A. Estimating number of species and relative abundances in stream-fish communities: Effects of sampling effort and discontiuous spatial distributions. Can. J. Fish. Aquat. Sci. 1995, 52, 936–949. [Google Scholar] [CrossRef]
  13. Bramblett, R.G.; Johnson, T.R.; Zale, A.V.; Heggem, D.G. Development and Evaluation of a Fish Assemblage Index of Biotic Integrity for Northwestern Great Plains Streams. Trans. Am. Fish. Soc. 2005, 134, 624–640. [Google Scholar] [CrossRef]
  14. McCormick, F.H.; Hughes, R.M.; Kaufmann, P.R.; Peck, D.V.; Stoddard, J.L.; Herlihy, A.T. Development of an Index of Biotic Integrity for the Mid-Atlantic Highlands Region. Trans. Am. Fish. Soc. 2001, 130, 857–877. [Google Scholar] [CrossRef] [Green Version]
  15. Whittier, T.R.; Hughes, R.M.; Stoddard, J.L.; Lomnicky, G.A.; Peck, D.V.; Herlihy, A.T. A Structured Approach for Developing Indices of Biotic Integrity: Three Examples from Streams and Rivers in the Western USA. Trans. Am. Fish. Soc. 2007, 136, 718–735. [Google Scholar] [CrossRef]
  16. Elith, J.; Leathwick, J.R.; Hastie, T. A working guide to boosted regression trees. J. Anim. Ecol. 2008, 77, 802–813. [Google Scholar] [CrossRef] [PubMed]
  17. Terando, A.J.; Costanza, J.; Belyea, C.; Dunn, R.R.; McKerrow, A.; Collazo, J.A. The Southern Megalopolis: Using the Past to Predict the Future of Urban Sprawl in the Southeast U.S. PLoS ONE 2014, 9. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Carlisle, D.M.; Wolock, D.M.; Meador, M.R. Alteration of streamflow magnitudes and potential ecological consequences: A multiregional assessment. Fornt. Ecol. Environ. 2011, 9. [Google Scholar] [CrossRef]
  19. Ford, T.R. The Southern Appalachian Region: A Survey; University of Kentucky Press: Lexington, KY, USA, 2015. [Google Scholar]
  20. Leonard, P.B.; Baldwin, R.F.; Hanks, R.D. Landscape-scale conservation design across biotic realms: Sequential integration of aquatic and terrestrial landscapes. Sci. Rep. 2017, 7, 1–12. [Google Scholar] [CrossRef]
  21. Baldwin, R.F.; Trombulak, S.C.; Leonard, P.B.; Noss, R.F.; Hilty, J.A.; Possingham, H.P.; Scarlett, L.; Anderson, M.G. The Future of Landscape Conservation. Bioscience 2018, 68, 60–63. [Google Scholar] [CrossRef]
  22. ESRI. ArcGIS Desktop: Release 10; Environmental Systems Research Institute: Redlands, CA, USA, 2011. [Google Scholar]
  23. Sutherland, A.B.; Meyer, J.L.; Gardiner, E.P. Effects of land cover on sediment regime and fish assemblage structure in four southern Appalachian streams. Freshw. Biol. 2002, 47. [Google Scholar] [CrossRef]
  24. Neves, R.J.; Angermeier, P.L. Habitat alteration and its effects on native fishes in the upper Tennessee River system, east-central USA. J. Fish. Biol. 1990, 37, 45–52. [Google Scholar] [CrossRef]
  25. Naiman, R.J.; Decamps, H.; Pollock, M. The role of riparian corridors in maintaining regional biodiversity. Ecol. Appl. 1993, 3, 209–212. [Google Scholar] [CrossRef] [PubMed]
  26. Smith, M.P.; Schiff, R.; Olivero, A.; MacBroom, J.G. The Active River Area: A Conservation Framework for Protecting Rivers and Streams; The Nature Conservancy: Boston, MA, USA, 2008.
  27. Team, Q.D. QGIS Geographic Infomration System; QSGeo: Chicago, IL, USA, 2016. [Google Scholar]
  28. Frimpong, E.A.; Angermeier, P.L. Fish traits: a database of ecological and life-history traits of freshwater fishes of the United States. Fisheries 2009, 34, 487–495. [Google Scholar] [CrossRef]
  29. Stoddard, J.L.; Peck, D.V.; Larsen, D.P.; Van Sickle, J.; Hawkins, C.P.; Hughes, R.M.; Whittier, T.R.; Lomnicky, G.; Herlihy, A.T.; Kaufmann, P.R. Environmental Monitoring and Assessemnt (EMAP): Western Streams and Rivers Statistical Summary; US Environmental Protection Agency, Office of Research and Development: Washington, DC, USA, 2005. [Google Scholar]
  30. R Core Team. R: A Language and Environment for Statistical Computing; R Core Team: St. Louis, MI, USA, 2018. [Google Scholar]
  31. Abell, R.; Thieme, M.L.; Revenga, C.; Bryer, M.; Kottelat, M.; Bogutskaya, N.; Coad, B.; Mandrak, N.; Balderas, S.C.; Bussing, W.; et al. Freshwater Ecoregions of the World: A New Map of Biogeographic Units for Freshwater Biodiversity Conservation. Bioscience 2008, 58, 403–414. [Google Scholar] [CrossRef] [Green Version]
  32. Carlisle, D.M.; Falcone, J.; Meador, M.R. Predicting the biological condition of streams: Use of geospatial indicators of natural and anthropogenic characteristics of watersheds. Environ. Monit. Assess. 2009, 151, 143–160. [Google Scholar] [CrossRef] [PubMed]
  33. Villeneuve, B.; Souchon, Y.; Usseglio-Polatera, P.; Ferréol, M.; Valette, L. Can we predict biological condition of stream ecosystems? A multi-stressors approach linking three biological indices to physico-chemistry, hydromorphology and land use. Ecol. Indic. 2015, 48, 88–98. [Google Scholar] [CrossRef]
  34. Mazor, R.D.; Rehn, A.C.; Ode, P.R.; Engeln, M.; Schiff, K.C.; Stein, E.D.; Gillett, D.J.; Herbst, D.B.; Hawkins, C.P. Bioassessment in complex environments: Designing an index for consistent meaning in different settings. Freshw. Sci. 2016, 35, 249–271. [Google Scholar] [CrossRef]
  35. Cao, Y.; Hawkins, C.P.; Olson, J.; Kosterman, M.A. Modeling natural environmental gradients improves the accuracy and precision of diatom-based indicators. J. N. Am. Benthol. Soc. 2007, 26, 566–585. [Google Scholar] [CrossRef] [Green Version]
  36. Lammert, M.; Allan, J.D. Assessing biotic integrity of streams: Effects of scale in measuring the influence of land use/cover and habitat structure on fish and macroinvertebrates. Environ. Manag. 1999, 23, 257–270. [Google Scholar] [CrossRef] [Green Version]
  37. Santucci, V.J.; Gephard, S.R.; Pescitelli, S.M. Effects of Multiple Low-Head Dams on Fish, Macroinvertebrates, Habitat, and Water Quality in the Fox River, Illinois. N. Am. J. Fish. Manag. 2005, 25, 975–992. [Google Scholar] [CrossRef]
  38. Omernik, J.M.; Griffith, G.E. Ecoregions of the Conterminous United States: Evolution of a Hierarchical Spatial Framework. Environ. Manag. 2014, 54, 1249–1266. [Google Scholar] [CrossRef] [PubMed]
  39. Pond, G.J.; Passmore, M.E.; Borsuk, F.A.; Reynolds, L.; Rose, C.J. Downstream effects of mountaintop coal mining: Comparing biological conditions using family- and genus-level macroinvertebrate bioassessment tools. Am. Benthol. Soc. 2008, 27, 717–737. [Google Scholar] [CrossRef]
  40. Pond, G.J.; McMurray, S.E. A Macroinvertebrate Bioassessment Index for Headwater Streams in the Eastern Coalfield Region, Kentucky; Kentucky Department for Environmental Protection, Division of Water: Frankfort, KY, USA, 2002.
  41. Gerritsen, J.; Burton, J.; Barbour, M.T. A Stream Condition Index for West Virginia Wadeable Streams; US EPA: Washington, DC, USA, 2000; Volume 3.
Figure 1. Overview map of the Appalachian Landscape Conservation Cooperative [21]. Map created by R. Daniel Hanks using ArcGIS v10.3 [22].
Figure 1. Overview map of the Appalachian Landscape Conservation Cooperative [21]. Map created by R. Daniel Hanks using ArcGIS v10.3 [22].
Land 09 00016 g001
Figure 2. Side-by-side comparison of response data for (a) fish and (b) aquatic macroinvertebrates showing spatial distribution of sample locations for both. Data source: United States Environmental Protection Agency’s National Rivers and Streams Assessment and the National Fish Habitat Partnership. Map created by R. Daniel Hanks using ArcGIS v10.3 [22].
Figure 2. Side-by-side comparison of response data for (a) fish and (b) aquatic macroinvertebrates showing spatial distribution of sample locations for both. Data source: United States Environmental Protection Agency’s National Rivers and Streams Assessment and the National Fish Habitat Partnership. Map created by R. Daniel Hanks using ArcGIS v10.3 [22].
Land 09 00016 g002
Figure 3. Comparison of the watershed score for the global and regional models. The watershed scores were centered and scaled. The regional watershed scores were centered and scaled within each respective region within the Appalachian Landscape Conservation Cooperative (LCC) region. The global watershed scores were centered and scaled across the entire Appalachian LCC region. Map created by R. Daniel Hanks using ArcGIS v10.3 [22].
Figure 3. Comparison of the watershed score for the global and regional models. The watershed scores were centered and scaled. The regional watershed scores were centered and scaled within each respective region within the Appalachian Landscape Conservation Cooperative (LCC) region. The global watershed scores were centered and scaled across the entire Appalachian LCC region. Map created by R. Daniel Hanks using ArcGIS v10.3 [22].
Land 09 00016 g003
Figure 4. Plots show Pearson correlation between (a) global and regional watershed scores, (b) global and regional functional scores, (c) global and regional taxa quality scores, and (d) global watershed scores for fish and regional watershed score for aquatic macroinvertebrates (AM). Plots generated using Program R [30].
Figure 4. Plots show Pearson correlation between (a) global and regional watershed scores, (b) global and regional functional scores, (c) global and regional taxa quality scores, and (d) global watershed scores for fish and regional watershed score for aquatic macroinvertebrates (AM). Plots generated using Program R [30].
Land 09 00016 g004
Figure 5. Global watershed scores from the boosted regression tree models for aquatic macroinvertebrates. Watershed scores were centered and scaled. Map created by R. Daniel Hanks using ArcGIS v10.3 [22].
Figure 5. Global watershed scores from the boosted regression tree models for aquatic macroinvertebrates. Watershed scores were centered and scaled. Map created by R. Daniel Hanks using ArcGIS v10.3 [22].
Land 09 00016 g005
Table 1. List of final predictors used in boosted regression tree models and how each was grouped thematically.
Table 1. List of final predictors used in boosted regression tree models and how each was grouped thematically.
Aquatic Habitat MetricThemesOverall Predictors Score
Flow Alteration from Storage (total storage/mean annual flow)Flow regime
Density and type of dam
Altered streamflow
Agricultural water withdrawal
Industrial water withdrawal
Erosive ForcesGeomorphic condition
Resistive forces
Density of dams: CatchmentConnectivity
Density of dams: Watershed
Density of crossings: Catchment
Density of crossings: Watershed
NitrogenWater Quality
Phosphorus
Dissolved Organic Carbon
% Impervious Surface in Watershed, Active River Area, & CatchmentNon-point sources of pollution
% Natural Cover in Watershed & Active River Area
% Agriculture in Watershed, Active River Area, & Catchment
Comprehensive Environmental Response, Compensation, and Liability Information System site densityPoint sources of pollution
Permit Compliance System site density
Toxic release inventory site density in Watershed and Catchment
Coal mine density
Wind turbine density
All mine density in Watershed and Catchment
Natural gas well density
Table 2. List of final responses modeled with boosted regression trees and how each was condensed (i.e., averaged and grouped) into an average score to represent larger taxonomic resolution.
Table 2. List of final responses modeled with boosted regression trees and how each was condensed (i.e., averaged and grouped) into an average score to represent larger taxonomic resolution.
Biological MetricTargetsCondensed AttributesOverall Response Score
DiversityShannon DiversityFish Score
Invertivore TaxaFunctional Group
Piscivore Taxa
Herbivore Taxa
Lithophilic SpawnersFish Taxa Quality
Taxa Preferring Coarse Sediment
Intolerant Taxa
Tolerant Taxa
EPT TaxaMacroinvert Taxa QualityMacroinvert Score
5 Dominant Taxa
Intolerant Taxa
Tolerant Taxa
Table 3. Boosted regression tree result for fish (richness, diversity, % invertivore taxa, % piscivore taxa, % herbivore taxa, % lithophilic taxa, % taxa preferring coarse substrate, % tolerant taxa, and % intolerant taxa) and aquatic macroinvertebrates (% EPT taxa, % 5 dominant taxa, % tolerant taxa, and % intolerant taxa).
Table 3. Boosted regression tree result for fish (richness, diversity, % invertivore taxa, % piscivore taxa, % herbivore taxa, % lithophilic taxa, % taxa preferring coarse substrate, % tolerant taxa, and % intolerant taxa) and aquatic macroinvertebrates (% EPT taxa, % 5 dominant taxa, % tolerant taxa, and % intolerant taxa).
FishDiversity% Invertivore Taxa% Piscivore Taxa% Herbivore Taxa% Lithophilic Taxa% Coarse Sediment Taxa% Tolerant Taxa% Intolerant Taxa
# of Trees59505600470038504600815045505650
Learning Rate0.010.0010.010.010.010.0010.010.01
Total Deviance902.950.13318.7812.5719.226.4812.8512.63
Residual Deviance298.560.128.197.89.524.976.126.49
Variance Explained85%48%79%70%78%56%79%80%
CV Deviance (se)433.96(6.15)0.12(0.03)11.31(0.22)10.23(0.22)12.79(0.17)5.51(0.23)8.85(0.31)10.08(0.26)
CV Deviance Explained69%32%64%48%61%45%51%41%
Aquatic Macroinvertebrates% EPT% 5 Dominant Taxa% Tolerant Taxa% Intolerant Taxa
# of Trees2150830027002600
Learning Rate0.0010.0010.0010.0005
Total Deviance2.832.276.451.05
Residual Deviance1.611.681.810.56
Variance Explained81%76%89%77%
CV Deviance (se)2.65(0.32)2.20(0.25)2.88(0.35)0.85(0.12)
CV Deviance Explained27%24%76%36%
Table 4. Summary of the relative influence of each predictor variable on the response variables for both fish and aquatic macroinvertebrates.
Table 4. Summary of the relative influence of each predictor variable on the response variables for both fish and aquatic macroinvertebrates.
FishAquatic Macroinvertebrates
VariableRichnessDiversity% Invertivore Taxa% Piscivore Taxa% Herbivore Taxa% Lithophilic Taxa% Coarse Sediment Taxa% Tolerant Taxa% Intolerant Taxa% EPT% 5 Dominant Taxa% Intolerant Taxa% Tolerant Taxa
ALK_CL0.30.30.10.80.41.70.60.50.70.413.20.20
bfi_avg2.95.463.81.92.53.13.74.21.41.73.51.8
CONF_CL0.40.200.10.30.20.10.60.25.70.11.40
DamDensWs0.60.50.30.61.10.70.810.60.10.60.20.1
elevcm_avg11.223.84.84.68.81044.53.51.82.62.2
FWECOREG4.829.34.56.712.14.25.57.34.911.29.42.21.9
imp11_per1.91.31.321.81.92.12.23.62.84.312.22.6
imp11_perN32.11.74.44.25.35.26.54.96.715.75.19
IncrTempMM9.35.33.516.89.24.36.15.73.52.31.28.42
kfact_avg2.71.24.33.92.43.73.93.12.60.91.13.42.1
kfactup_av2.73.22.53.62.82.32.62.63.20.91.50.82.3
MineDensWs0.40.40.30.40.40.40.80.40.30000
NCat2.81.818.23.73.42.943.94.34.37.94.62.2
nid_storN5.92.55.12.955.75.81.92.32.42.70.614.7
nlcd11_ag10.80.21.41.312.31.72.12.83.42.11.5
nlcd11_agN1.71.30.52.12.52.23.32.82.63.91.54.31.5
nlcd11_t1.310.51.51.71.41.51.82.13.51.33.20.8
NPDESDensW0.40.300.30.70.30.40.42.10000.1
P2O5Cat21.72.31.51.81.51.622.52.90.83.30.7
P2O5Ws1.82.10.81.32.21.422.62.84.30.91.31.5
Pct_Ag3.32.121.3221.51.72.84.62.51.74.6
Pct_Imp2.51.51.62.22.33.41.522.422.31.75.4
Pct_Nat31.71.92.333.44.143.450.75.42
rchrg_avg3.54.78.33.44.52.84.56.84.92.32.32.55.5
RdCrsCat0.90.90.21.11.110.92.21.62.21.91.50.3
RdCrsWs1.31.60.81.71.91.41.92.92.32.41.82.72.9
rf30_avg7.17.210.86.96.34.34.94.15.31.91.26.52.2
sand_avg2.44.72.83.33.12.53.93.64.24.83.67.19.5
silt_avg2.43.31.633.52.82.12.74.432.61.14.9
SIMP_NUM3.11.21.32.23.30.91.51.91.31.11.82.51.5
slope_avg2.722.31.72.432.42.23.26.631.90.8
slope_avgN32.92.73.63.93.13.84.152.23.63.81
SLP_CL2.70.80.21.90.72.920.61.20.11.20.23.5
SZ_CL40.86.61.81.12.521.82.31.120.68.2
TEMP_CL0.40.400.10.310.90.40.10.500.20.10
TNC6HYDP0.30.40.80.10.20.20.13.80.300.10.30.5
TRIDensWs0.310.10.70.60.50.90.60.50.801.10
Table 5. Summary of the average relative influence for each theme on response variables modeled with the boosted regression trees for both fish and aquatic macroinvertebrates. The average thematic % relative influence is the percent relative contribution of each theme to the BRT model. Themes are comprised of flow condition (NID storage, dam density, altered streamflow, agricultural and industrial water withdrawals), geomorphic condition (erosive and resistive forces), connectivity (dam and road crossing density at the watershed and catchment levels), water quality (total nitrogen load, total phosphorous load, and dissolved organic carbon), non-point source pollution (% agricultural land cover, % impervious surface, and % natural land cover in the watershed, active river area, and catchment), and point source pollution (CERCLA site density, permit compliance system site density, toxic release site density, coal mine, wind turbine and natural gas well density, and mine density).
Table 5. Summary of the average relative influence for each theme on response variables modeled with the boosted regression trees for both fish and aquatic macroinvertebrates. The average thematic % relative influence is the percent relative contribution of each theme to the BRT model. Themes are comprised of flow condition (NID storage, dam density, altered streamflow, agricultural and industrial water withdrawals), geomorphic condition (erosive and resistive forces), connectivity (dam and road crossing density at the watershed and catchment levels), water quality (total nitrogen load, total phosphorous load, and dissolved organic carbon), non-point source pollution (% agricultural land cover, % impervious surface, and % natural land cover in the watershed, active river area, and catchment), and point source pollution (CERCLA site density, permit compliance system site density, toxic release site density, coal mine, wind turbine and natural gas well density, and mine density).
FishAquatic Macroinvertebrates
VariableDiversity% Invertivore Taxa% Piscivore Taxa% Herbivore Taxa% Lithophilic Taxa% Coarse Sediment Taxa% Tolerant Taxa% Intolerant Taxa % EPT% 5 Dominant Taxa% Intolerant Taxa% Tolerant Taxa
Flow1526.317.221.825.224.817.916.420.627.414.611715.0
Geomorphic62.548.451.547.638.746.338.739.446.623.436.76.617.321.0
Connectivity4.42.366.773.710.99.36.315.713.22.389.8
Water Quality1218.59.96.76.17.414.611.510.812.89.287.956.741.7
Non-point Pollution7.57.315.716.822.217.518.722.416.020.927.86.819.218.7
Point Pollution0.60.31.41.41.81.51.831.51.800.11.20.8
Table 6. Summary of the relative influence within each region of each predictor variable on the response variables for fish. A = Atlantic Slope, C = Cumberland River Basin, O = Ohio River Basin, and T=Tennessee River Basin.
Table 6. Summary of the relative influence within each region of each predictor variable on the response variables for fish. A = Atlantic Slope, C = Cumberland River Basin, O = Ohio River Basin, and T=Tennessee River Basin.
RichnessDiversity% Invertevore Taxa
VariableACOTACOTACOT
ALK_CL0.12.10.40.10.12.30.60.30.60.50.30.3
bfi_avg1.76.83.43.23.21.22.125.82.72.60.725.8
CONF_CL0.20.10.40.10.20.40.30.10.10.10.10.1
DamDensWs0.40.50.50.610.50.80.10.80.30.80.1
elevcm_avg4.14.121.464.65.33.74.34.22.710.84.3
FWECOREG3.100.903.102.200.100.20
imp11_per1.14.20.93.31.91.31.832.85.40.53
imp11_perN2.24.42.93.63.72.53.32.78.64.90.82.7
IncrTempMM22.84.611.36.718.41.74.72.95.31.24.32.9
kfact_avg3.12.11.24.142.31.41.14.51.91.21.1
kfactup_av1.81.34.82.31.82.541.53.51.21.71.5
MineDensWs0.20.30.600.30.10.700.50.100
NCat2.731.141.92.52.61.13.29.321.61.1
nid_storN5.34.56.98.280.93.33.1120.98.13.1
nlcd11_ag1.92.10.60.81.61.11.21.31.810.81.3
nlcd11_agN2.53.91.21.11.51.31.922.22.80.32
nlcd11_t1.35.20.41.40.92.91.611.78.90.11
NPDESDensW0.50.40.30.30.40.40.40.10.5000.1
P2O5Cat2.82.11.22.51.51.51.82.92.23.60.42.9
P2O5Ws2.91.41.91.632.22.61.422.111.4
Pct_Ag2.61.82.54.82.32.82.54.73.11.40.84.7
Pct_Imp2.12.71.11.62.62.71.72.31.851.92.3
Pct_t2.12.62.72.92.61.92.81.72.260.91.7
rchrg_avg3.917.62.13.42.75.365.23.57.115.2
RdCrsCat0.80.80.410.811.40.61.620.10.6
RdCrsWs12.10.41.32.22.83.22.223.20.22.2
rf30_avg3.95.54.95.42.926.418.63.252.27.43.2
sand_avg2.32.61.65.22.36.172.74.58.62.22.7
silt_avg5.61.41.22.86.48.43.22.93.32.51.52.9
SIMP_NUM3.92.61.95.11.81.51.23.42.33.32.63.4
slope_avg1.81.52.24.81.90.92.92.92.43.43.52.9
slope_avgN22.73.62.334.23.50.93.22.20.90.9
SLP_CL1.80.73.11.92.51.91.30.40.920.10.4
SZ_CL3.11.88.16.83.20.61.30.84.21.621.10.8
TEMP_CL1.20.11.101.10.50.400.10.10.10
TNC6HYDP0.20.20.30.2000.59.50.201.79.5
TRIDensWs0.60.20.40.60.70.21.41.90.70.10.11.9
% Piscivore Taxa% Herbivore Taxa% Lithophilic Taxa
VariableACOTACOTACOT
ALK_CL0.501.10.40.66.60.60.40.480.81
bfi_avg2.452.76.12.72.12.12.71.52.222.6
CONF_CL0.10.20.50.10.10.20.10.10.80.10.10.3
DamDensWs0.81.10.50.50.81.10.50.80.71.30.50.4
elevcm_avg1.91.76.49.64.24.34.94.7173.816.93.9
FWECOREG5.405.30.20.100.20.21.301.90
imp11_per1.84.231.32.81.71.92.61.57.423.2
imp11_perN2.14.45.86.28.62.8118.711.73.65.66.1
IncrTempMM23.5512.57.15.36.55.55.826.843.5
kfact_avg31.341.84.52.74.54.11.22.24.22.3
kfactup_av5.83.75.833.52.73.43.53.51.34.33.8
MineDensWs0.10.10.20.20.50.20.30.61.30.90.30.5
NCat42.12.84.73.22.72.12.91.91.42.93.4
nid_storN1.25.81.79.3121.918.213.66.21.9312.6
nlcd11_ag0.931.31.91.81.60.81.11.63.31.41.2
nlcd11_agN1.55.42.11.92.22.21.21.92.33.62.92.1
nlcd11_t0.84.61.71.31.72.811.61.33.71.62.3
NPDESDensW0.10.60.40.10.50.80.30.400.50.20.3
P2O5Cat2.121.31.92.22.31.623.32.11.61.8
P2O5Ws1.31.91.71.624.41.21.61.82.61.52.2
Pct_Ag0.81.51.61.53.11.13.32.91.51.12.31.3
Pct_Imp1.95.72.92.41.81.51.92.122.54.33
Pct_t3.41.62.92.12.2322.41.91.83.93
rchrg_avg3.49.842.23.53.32.93.71.94.72.94.6
RdCrsCat1.31.51.41.21.610.71.21.51.111.1
RdCrsWs1.722.11.821.51.11.52.221.61.2
rf30_avg6.865.93.458.75.55.34.64.743.5
sand_avg3.632.95.24.54.74.24.46.282.14.3
silt_avg6.45.32.65.13.310.72.22.94.82.72.22.3
SIMP_NUM1.64.921.82.31.92.91.912.61.11.2
slope_avg2.41.12.33.32.44.12.42.621.62.92.1
slope_avgN4.42.24.43.53.24.43.13.42.93.53.34.8
SLP_CL0.91.41.51.70.91.80.40.92.83.63.20.7
SZ_CL1.81.11.45.24.20.85.24.52.42.21.510.7
TEMP_CL0.400.30.10.10000.50.251
TNC6HYDP00.30.10.10.20.40.10.200.10.10.7
TRIDensWs0.10.60.90.30.71.40.50.70.210.61
% Coarse Sediment Taxa% Tolerant Taxa% Intolerant Taxa
VariableACOTACOTACOT
ALK_CL0.50.30.60.71.10.110.22.10.21.20.3
bfi_avg1.53.52.410.81.72.33.37.21.443.73.9
CONF_CL0.20.10.10.20.20.21.80.31.10.40.10.2
DamDensWs2.62.10.30.60.10.80.50.30.30.50.20.7
elevcm_avg8.64.717.54.62.83.43.76.56.81.68.23.7
FWECOREG0.102.704.400.79.93.600.50
imp11_per4.16.51.71.72.173.21.93.21.43.25
imp11_perN4.42.56.58.35.12.67.79.17.92.94.14.1
IncrTempMM4.56.367.816.61.78.16.14.72.14.75.4
kfact_avg4.92.64.61.42.74.52.72.33.91.92.31.4
kfactup_av4.353.41.67.80.92.622.73.22.94.3
MineDensWs0.30.60.60.200.70.40.3000.70
NCat1.91.91.52.34.83.16.31.94.27.33.84.6
nid_storN18.71.35.57.610.222.82.80.71.64.2
nlcd11_ag12.71.31.10.51.81.31.91.20.91.22.1
nlcd11_agN1.23.64.20.90.95.81.62.21.57.21.14.6
nlcd11_t0.62.20.82.50.54.21.320.93.80.91.5
NPDESDensW1.21.40.1002.80.20.10.20.91.80
P2O5Cat2.30.80.911.33.22.12.71.91.11.11.6
P2O5Ws2.321.31.42.252.82.71.54.11.71.9
Pct_Ag1.31.91.11.91.92.80.814.23.21.32.7
Pct_Imp1.42.10.91.71.12.81.61.93.51.91.91.7
Pct_t31.44.32.95.34.841.48.14.82.73.3
rchrg_avg2.29.17.45.73.96.74.84.24.62.511.17.6
RdCrsCat2.81.20.40.51.53.11.511.20.50.71.4
RdCrsWs1.51.91.41.10.43.35.52.13.21.91.11.6
rf30_avg1.83.83.56.66.88.93.54.42.78.37.710.2
sand_avg3.78.14.67.13.63.94.72.82.418.64.41
silt_avg5.89.21.31.3322.41.65.42.842.7
SIMP_NUM0.81.50.83.10.733.511.31.90.90.9
slope_avg4.33.22.10.96.90.8232.52.41.85.2
slope_avgN2.63.33.72.75.91.13.49.43.857.96.9
SLP_CL1.40.92.60.80.30.30.71.31.30.51.11.3
SZ_CL1.70.72.18.82.62.40.82.43.70.26.32.7
TEMP_CL00.10.900.10.40.30.10.201.10.2
TNC6HYDP00.30.10.10.41.96.20.300.10.50.3
TRIDensWs0.40.91.1001.41.10.10.11.50.30.7
Table 7. Summary of the regional average relative influence for each theme on response variables modeled with the boosted regression trees for fish. The average thematic % relative influence is the percent relative contribution of each theme to the BRT model. Themes are comprised of flow condition (NID storage, dam density, altered streamflow, agricultural and industrial water withdrawals), geomorphic condition (erosive and resistive forces), connectivity (dam and road crossing density at the watershed and catchment levels), water quality (total nitrogen load, total phosphorous load, and dissolved organic carbon), non-point source pollution (% agricultural land cover, % impervious surface, and % natural land cover in the watershed, active river area, and catchment), and point source pollution (CERCLA site density, permit compliance system site density, toxic release site density, coal mine, wind turbine and natural gas well density, and mine density).
Table 7. Summary of the regional average relative influence for each theme on response variables modeled with the boosted regression trees for fish. The average thematic % relative influence is the percent relative contribution of each theme to the BRT model. Themes are comprised of flow condition (NID storage, dam density, altered streamflow, agricultural and industrial water withdrawals), geomorphic condition (erosive and resistive forces), connectivity (dam and road crossing density at the watershed and catchment levels), water quality (total nitrogen load, total phosphorous load, and dissolved organic carbon), non-point source pollution (% agricultural land cover, % impervious surface, and % natural land cover in the watershed, active river area, and catchment), and point source pollution (CERCLA site density, permit compliance system site density, toxic release site density, coal mine, wind turbine and natural gas well density, and mine density).
RichnessDiversity% Invertivore Taxa% Piscivore Taxa% Herbivore Taxa
ThemeACOTACOTACOTACOTACOT
Flow2.87.43.23.83.72.03.08.64.72.72.68.61.95.42.24.54.72.15.95.2
Geomorphic2.93.03.63.92.910.48.01.94.41.83.41.95.23.65.22.74.44.74.54.3
Connectivity0.40.50.50.61.00.50.80.10.80.30.80.10.81.10.50.50.81.10.50.8
Water Quality1.71.70.91.61.31.71.51.11.63.14.71.11.61.21.41.71.63.21.11.4
Non-point Pollution1.83.01.42.21.91.81.92.12.73.90.72.11.53.42.42.12.71.92.62.6
Point Pollution0.40.30.40.30.40.20.80.70.60.10.10.70.10.40.50.20.60.80.40.6
% Lithophilic Taxa% Coarse Sediment Taxa% Tolerant Taxa% Intolerant TaxaAverages
ThemeACOTACOTACOTACOTAllACOT
Flow2.62.52.15.06.24.03.96.21.72.52.73.62.31.94.24.13.93.43.43.35.5
Geomorphic3.12.74.23.23.73.83.83.25.84.82.92.93.14.44.35.34.03.94.44.43.3
Connectivity0.71.30.50.42.62.10.30.60.10.80.50.30.30.50.20.70.70.80.90.50.4
Water Quality1.52.81.31.71.41.00.91.11.92.32.41.52.02.61.61.71.71.62.21.81.4
Non-point Pollution2.63.02.72.51.92.62.32.31.93.52.42.43.42.91.82.82.42.32.92.02.3
Point Pollution0.50.80.40.60.61.00.60.10.01.60.50.20.10.80.90.20.50.40.70.50.4
Table 8. Pearson correlation coefficients for the global and regional models for all response variables. Correlations for the same response variables for each model are highlighted in gray.
Table 8. Pearson correlation coefficients for the global and regional models for all response variables. Correlations for the same response variables for each model are highlighted in gray.
Global Model
Diversity% Invertivore Taxa% Piscivore Taxa% Herbivore Taxa% Lithophilic Taxa% Coarse Sediment Taxa% Tolerant Taxa% Intolerant TaxaFunctional GroupTaxa QualityOverall Watershed Score
Regional Model
Diversity0.990.080.050.050.100.120.240.080.040.140.37
% Invertivore Taxa0.081.000.010.130.100.060.070.250.590.060.18
% Piscivore Taxa0.060.000.990.210.060.020.480.100.620.210.52
% Herbivore Taxa0.040.140.220.980.350.260.250.110.690.410.56
% Lithophilic Taxa0.180.180.150.430.680.300.050.000.240.400.27
% Coarse Sediment Taxa0.190.070.050.530.320.440.160.060.340.420.45
% Tolerant Taxa0.250.070.470.250.050.021.000.200.410.480.62
% Intolerant Taxa0.190.180.070.030.060.090.080.330.040.110.11
Functional Group0.040.590.630.680.210.090.420.020.990.300.65
Taxa Quality0.190.060.180.510.370.260.500.240.390.580.61
Overall Watershed Score0.390.250.490.600.260.230.620.230.690.570.86

Share and Cite

MDPI and ACS Style

Hanks, R.D.; Leonard, P.B.; Baldwin, R.F. Understanding Landscape Influences on Aquatic Fauna across the Central and Southern Appalachians. Land 2020, 9, 16. https://doi.org/10.3390/land9010016

AMA Style

Hanks RD, Leonard PB, Baldwin RF. Understanding Landscape Influences on Aquatic Fauna across the Central and Southern Appalachians. Land. 2020; 9(1):16. https://doi.org/10.3390/land9010016

Chicago/Turabian Style

Hanks, R. Daniel, Paul B. Leonard, and Robert F. Baldwin. 2020. "Understanding Landscape Influences on Aquatic Fauna across the Central and Southern Appalachians" Land 9, no. 1: 16. https://doi.org/10.3390/land9010016

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop