
Thus, Eastern Iberian chub proved to be a eurytopic species, presenting the highest suitability in microhabitats with cover present, low flow velocity (approx. Conversely, homoscedastic and cluster PNNs rendered ecologically reliable partial dependence plots. However, these two PNNs rendered ecologically unreliable partial dependence plots. Heteroscedastic and enhanced PNNs achieved the highest performance in every index but specificity. The fitness function and several performance criteria (correctly classified instances, true skill statistic, specificity and sensitivity) and partial dependence plots were used to assess respectively the performance and reliability of each habitat suitability model. To rule out this idea, the microhabitat suitability for the Eastern Iberian chub (Squalius valentinus Doadrio & Carmona, 2006) was modelled with SVMs and four types of PNNs (homoscedastic, heteroscedastic, cluster and enhanced PNNs) all of them optimised with Differential Evolution. Although several alternatives to improve PNNs' reliability and performance and/or to reduce computational costs exist, PNNs are currently not well recognised as SVMs because the SVMs were compared with standard PNNs. Probabilistic Neural Networks (PNNs) and Support Vector Machines (SVMs) are flexible classification techniques suited to render trustworthy species distribution and habitat suitability models. Such an assessment can be used both for validation and for model selection and provides important information beyond what can be learned from conventional validation and selection techniques. We recommend the use of a transferability assessment whenever there is interest in making inferences beyond the data set used for model fitting. In our example, traditional linear models have greater transferability.Ĥ.

We show that machine-learning techniques such as random forests and artificial neural networks can produce models with excellent in-sample performance but poor transferability, unless complexity is constrained. We illustrate the method by applying it to distribution modelling of brook trout (Salvelinus fontinalis Mitchill) and brown trout (Salmo trutta Linnaeus) in western United States. The method involves cross-validation in which data are assigned non-randomly to groups that are spatially, temporally or otherwise distinct, thus using heterogeneity in the data set as a surrogate for heterogeneity among data sets.ģ. We propose an intuitive method for evaluating transferability based on techniques currently in use in the area of species distribution modelling.

Conventional methods of model validation and selection assess in- or out-of-sample prediction accuracy but do not assess model generality or transferability, which can lead to overestimates of performance when predicting in other locations, time periods or data sets.Ģ.

Ecologists have long sought to distinguish relationships that are general from those that are idiosyncratic to a narrow range of conditions.
