Advertisement
Guest User

Untitled

a guest
Jul 22nd, 2019
78
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 8.24 KB | None | 0 0
  1. <title>Maxent model for Caretta_caretta</title>
  2. <CENTER><H1>Maxent model for Caretta_caretta</H1></CENTER>
  3. <br> This page contains some analysis of the Maxent model for Caretta_caretta, created Wed Jul 10 12:43:02 EDT 2019 using Maxent version 3.3.3k. If you would like to do further analyses, the raw data used here is linked to at the end of this page.<br>
  4. <br><HR><H2>Analysis of omission/commission</H2>
  5. The following picture shows the omission rate and predicted area as a function of the cumulative threshold. The omission rate is is calculated both on the training presence records, and (if test data are used) on the test records. The omission rate should be close to the predicted omission, because of the definition of the cumulative threshold.
  6. <br><img src="plots/Caretta_caretta_omission.png"><br>
  7. <br> The next picture is the receiver operating characteristic (ROC) curve for the same data. Note that the specificity is defined using predicted area, rather than true commission (see the paper by Phillips, Anderson and Schapire cited on the help page for discussion of what this means). This implies that the maximum achievable AUC is less than 1. If test data is drawn from the Maxent distribution itself, then the maximum possible test AUC would be 0.904 rather than 1; in practice the test AUC may exceed this bound.
  8. <br><img src="plots/Caretta_caretta_roc.png"><br>
  9. <br>
  10. <br>
  11. Some common thresholds and corresponding omission rates are as follows. If test data are available, binomial probabilities are calculated exactly if the number of test samples is at most 25, otherwise using a normal approximation to the binomial. These are 1-sided p-values for the null hypothesis that test points are predicted no better than by a random prediction with the same fractional predicted area. The "Balance" threshold minimizes 6 * training omission rate + .04 * cumulative threshold + 1.6 * fractional predicted area.<br>
  12. <br><table border cols=6 cellpadding=3><tr><th>Cumulative threshold</th><th>Logistic threshold</th><th>Description</th><th>Fractional predicted area</th><th>Training omission rate</th><th>Test omission rate</th><th>P-value</th><tr align=center><td>1.000</td><td>0.013</td><td>Fixed cumulative value 1</td><td>0.772</td><td>0.000</td><td>0.000</td><td>1.231E-2</td><tr align=center><td>5.000</td><td>0.039</td><td>Fixed cumulative value 5</td><td>0.388</td><td>0.059</td><td>0.176</td><td>3.061E-4</td><tr align=center><td>10.000</td><td>0.114</td><td>Fixed cumulative value 10</td><td>0.243</td><td>0.098</td><td>0.176</td><td>7.682E-7</td><tr align=center><td>1.092</td><td>0.013</td><td>Minimum training presence</td><td>0.757</td><td>0.000</td><td>0.000</td><td>8.895E-3</td><tr align=center><td>11.179</td><td>0.129</td><td>10 percentile training presence</td><td>0.225</td><td>0.098</td><td>0.176</td><td>2.797E-7</td><tr align=center><td>16.882</td><td>0.205</td><td>Equal training sensitivity and specificity</td><td>0.162</td><td>0.157</td><td>0.235</td><td>6.782E-8</td><tr align=center><td>16.281</td><td>0.196</td><td>Maximum training sensitivity plus specificity</td><td>0.167</td><td>0.137</td><td>0.235</td><td>9.823E-8</td><tr align=center><td>14.987</td><td>0.178</td><td>Equal test sensitivity and specificity</td><td>0.179</td><td>0.137</td><td>0.176</td><td>1.409E-8</td><tr align=center><td>14.987</td><td>0.178</td><td>Maximum test sensitivity plus specificity</td><td>0.179</td><td>0.137</td><td>0.176</td><td>1.409E-8</td><tr align=center><td>3.508</td><td>0.025</td><td>Balance training omission, predicted area and threshold value</td><td>0.486</td><td>0.020</td><td>0.118</td><td>8.092E-4</td><tr align=center><td>12.138</td><td>0.140</td><td>Equate entropy of thresholded and original distributions</td><td>0.212</td><td>0.118</td><td>0.176</td><td>1.268E-7</td></table><br>
  13. <br><HR><H2>Pictures of the model</H2>
  14. This is a representation of the Maxent model for Caretta_caretta. Warmer colors show areas with better predicted conditions. White dots show the presence locations used for training, while violet dots show test locations. Click on the image for a full-size version.<br>
  15. <br><a href = "plots/Caretta_caretta.png"> <img src="plots/Caretta_caretta.png" width=600></a><br>
  16. <br>Click <a href=Caretta_caretta_explain.bat type=application/bat>here<a> to interactively explore this prediction using the Explain tool. If clicking from your browser does not succeed in starting the tool, try running the script in /Users/Guest/Desktop/Caretta caretta CURRENT/Caretta_caretta_explain.bat directly. This tool requires the environmental grids to be small enough that they all fit in memory.<br><br>
  17. <br><HR><H2>Analysis of variable contributions</H2><br>
  18. The following table gives estimates of relative contributions of the environmental variables to the Maxent model. To determine the first estimate, in each iteration of the training algorithm, the increase in regularized gain is added to the contribution of the corresponding variable, or subtracted from it if the change to the absolute value of lambda is negative. For the second estimate, for each environmental variable in turn, the values of that variable on training presence and background data are randomly permuted. The model is reevaluated on the permuted data, and the resulting drop in training AUC is shown in the table, normalized to percentages. As with the variable jackknife, variable contributions should be interpreted with caution when the predictor variables are correlated.<br>
  19. <br><table border cols=3><tr><th>Variable</th><th>Percent contribution</th><th>Permutation importance</th><tr align=right><td>bio2</td><td>45.7</td><td>46.7</td></tr><tr align=right><td>geolEPA</td><td>16.1</td><td>3.9</td></tr><tr align=right><td>alt</td><td>15.7</td><td>20.2</td></tr><tr align=right><td>bio5</td><td>9.6</td><td>2.4</td></tr><tr align=right><td>bio14</td><td>3.7</td><td>1.4</td></tr><tr align=right><td>bio13</td><td>3.4</td><td>10.8</td></tr><tr align=right><td>bio8</td><td>3</td><td>0.8</td></tr><tr align=right><td>bio9</td><td>1.8</td><td>3.3</td></tr><tr align=right><td>bio12</td><td>0.9</td><td>2.2</td></tr><tr align=right><td>bio6</td><td>0.1</td><td>8.2</td></tr></table><br><br>
  20. <br><HR><H2>Raw data outputs and control parameters</H2><br>
  21. The data used in the above analysis is contained in the next links. Please see the Help button for more information on these.<br>
  22. <a href = "Caretta_caretta.asc">The model applied to the training environmental layers</a><br>
  23. <a href = "Caretta_caretta.lambdas">The coefficients of the model</a><br>
  24. <a href = "Caretta_caretta_omission.csv">The omission and predicted area for varying cumulative and raw thresholds</a><br>
  25. <a href = "Caretta_caretta_samplePredictions.csv">The prediction strength at the training and (optionally) test presence sites</a><br>
  26. <a href = "maxentResults.csv">Results for all species modeled in the same Maxent run, with summary statistics and (optionally) jackknife results</a><br>
  27. <br><br>
  28. Regularized training gain is 1.554, training AUC is 0.918, unregularized training gain is 1.841.<br>
  29. Unregularized test gain is 1.120.<br>
  30. Test AUC is 0.862, standard deviation is 0.049 (calculated as in DeLong, DeLong & Clarke-Pearson 1988, equation 2).<br>
  31. Algorithm terminated after 500 iterations (2 seconds).<br>
  32. <br>
  33. The follow settings were used during the run:<br>
  34. 51 presence records used for training, 17 for testing.<br>
  35. 10048 points used to determine the Maxent distribution (background points and presence points).<br>
  36. Environmental layers used: alt bio12 bio13 bio14 bio2 bio5 bio6 bio8 bio9 geolEPA(categorical)<br>
  37. Regularization values: linear/quadratic/product: 0.190, categorical: 0.250, threshold: 1.490, hinge: 0.500<br>
  38. Feature types used: hinge linear quadratic<br>
  39. outputdirectory: /Users/Guest/Desktop/Caretta caretta CURRENT<br>
  40. samplesfile: /Users/Guest/Desktop/Caretta caretta Samples.csv<br>
  41. environmentallayers: /Users/Guest/Downloads/ENM files/PresentLayers/maxent.cache<br>
  42. randomtestpoints: 25<br>
  43. Command line used: <br>
  44. <br>
  45. Command line to repeat this species model: java density.MaxEnt nowarnings noprefixes -E "" -E Caretta_caretta "outputdirectory=/Users/Guest/Desktop/Caretta caretta CURRENT" "samplesfile=/Users/Guest/Desktop/Caretta caretta Samples.csv" "environmentallayers=/Users/Guest/Downloads/ENM files/PresentLayers/maxent.cache" randomtestpoints=25 -t geolEPA<br>
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement