Peaks Over Threshold (POT)/Point Process (PP) Approach
Peaks Over Threshold (POT)/Point Process (PP) Approach
The GPD model from the previous chapter looks at exceedances over a threshold and those values are fit to a generalized
Pareto distribution. A more theoretically appealing way to analyze extreme values is to use a point process
characterization. This approach is consistent with a Poisson process for the occurrence of exceedances of a high threshold
and the GPD for excesses over this threshold. Inferences made from such a characterization can be obtained using other
appropriate models from above (see Coles (2001) (b)). However, there are good
reasons to consider this approach. Namely, it provides a nice interpretation of extremes that unifies all of the previously
discussed models. For example, the parameters associated with the point process model can be converted to those of the GEV
parameterization. In fact, the point process approach can be viewed as an indirect way of fitting data to the GEV
distribution that makes use of more information about the upper tail of the distribution than does the block maxima approach
(Coles (2001) (b)).
Fitting data to a Point Process Model
Figure 6.1 is not quite as easy to interpret as
Figure 5.10 for the GPD because of the fewer
thresholds, but it seems that a threshold anywhere in the range of 0.30 to 0.40 inches would be appropriate.

Figure 6.1: Point process model fits for a range of 15 thresholds from 0.2 inches to 0.80 inches for the Fort Collins, C.O. precipitation dataset.
To create the plot in Figure 6.1 do the following.
- Plot
Fit Threshold Ranges (PP)
- Select Fort from the Data Object listbox.
- Select Prec from the Select Variable listbox.
- Enter 0.2 in the Minimum Threshold field
- Enter 0.8 in the Maximum Threshold field
- Change the Number of thresholds to 15
OK.
Once a threshold is selected, a point process model can be fitted.
Figure 6.2 shows diagnostic plots (probability and quantile
plots) for such a fit.

Figure 6.2: Diagnostic plots for Fort Collins, C.O. precipitation (inches) data fit to a point process model.
To fit the Fort Collins precipitation data to a point process model, do the following.
- Analyze
Point Process Model
- Select Fort from the Data Object listbox.
- Select Prec from the Response listbox.
- Check the Plot diagnostics checkbutton.
- Enter 0.395 in the Threshold value(s)/function field
OK
MLE's found for this fit are:
1.38 inches (0.043),
0.53 inches (0.037 inches) and

0.21 (0.038) parameterized in terms of the GEV distribution for annual maxima,
with negative log-likelihood of about -1359.82.
Back to Top
Relating the Point Process Model to the Poisson-GP
The parameters of the point process model can be expressed in terms of the parameters of the GEV distribution or,
equivalently through transformations specified in appendix, in terms of
the parameters of a Poisson process and of the GPD (i.e., a Poisson-GP model).
Example 1: Fort Collins Precipitation (no covariates)
When fitting the Fort Collins precipitation data to the point process model
(using the BFGS optimization method) with a threshold of 0.395 and 365.25
observations per year, the following parameter estimates are obtained.
1.38343
0.53198
0.21199
Parameters from fitting data to the GPD (using the BFGS optimization method) with a threshold of 0.395 and 365.25
observations per year are
*
0.3225 and
0.21191--denoting the scale parameter of the
GPD by
* to distinguish it from the scale parameter
of the GEV distribution. Immediately, it can be seen that the value of
is very
nearly identical to the estimate found for the point process approach. Indeed, the small difference can be attributed to
differences in the numerical approximations. The other two parameters require a little more work to see that they
correspond.
Specifically, because there are 1,061 observations exceeding the threshold of
0.395 inches out of a total of 36,524 observations, the (log) MLE for the
Poisson rate parameter is
log
= log(365.25(1061/36524))
2.3618 per year.
Plugging into Eqs. (B.3) and (B.4)
gives
log
= log(0.3225)+0.2119(2.3618)
-0.63118
exp(-0.6311)
0.53196
= 0.395 - 0.53196/0.2119(10.61-0.2119 - 1)
1.3835
both of which are very close to the respective MLEs of the point process model.
Example 2: Phoenix summer minimum daily temperature
The Phoenix minimum temperature data included with this toolkit represents a time series of minimum and maximum temperatures
(degrees Fahrenheit) for July through August 1948 to 1990 from the U.S. National Weather Service Forecast Office at the
Phoenix Sky Harbor Airport. For more information on these data, please see
Tarleton and Katz (1995) or
Balling et al. (1990). Temperature is a good example of data that may have
dependency issues because of the tendency of hot (or cold) days to follow other hot (or cold) days. However, we do not deal
with this issue here (see
Extremes of Dependent and/or Nonstationary Sequences). For this
example, load the Tphap.R dataset and save it (in R) as Tphap. The minimum temperatures (degrees Fahrenheit)
are shown in Figure 6.3. Note the increasing trend evident from the superimposed regression
fit. Again, we will not consider this trend here, instead we defer this topic to the chapter on
Extremes of Dependent and/or Nonstationary Sequences.

Figure 6.3: Scatter plot of minimum temperature (degrees Fahrenheit), with regression line, for the summer months of July through August at Sky Harbor airport in Phoenix, A.Z.
It is of interest with this dataset to look at the minimum temperatures. To do this, we must first transform the data by
taking the negative of the MinT variable so that the extreme value distribution theory for maxima can be applied to
minima. That is,
-max(-X1,...,-Xn) =
min(X1,...,Xn)
This transformation can be easily made using extRemes.
- File
Transform Data
Negative
- Select Tphap from the Data Object listbox.
- Select MinT from the Variables to Transform listbox
OK.
For the Phoenix minimum temperature series, the Poisson log-rate parameter for a threshold of -73 degrees (using the
negative of minimum temperature, MinT.neg) is
log
= log(62.(262/2666))
1.807144 per
year, where there are 62 days in each "year" or summer season (covers two months of 31 days each; see
appendix) and 262 exceedances out of 2,666 total data points. MLEs (using the BFGS method) from
fitting data to the GPD are
*
3.91 degrees
(0.303 degrees) and
-0.25 (0.049), and
from fitting data to the point process model:
-67.29 degrees (0.323 degrees),
2.51
degrees (0.133 degrees) and
-0.25 (0.049). Clearly, the shape parameters of
the two models match up. Using Eq. (B.3) the derived scale parameter for
the point process model is log
0.92, or
2.51 degrees (the same as that of the point process estimate fitted
directly). Using Eq. (B.4) gives
-67.29 degrees (also equivalent to the point process estimate fitted directly).

Figure 6.4: Diagnostic plots of GPD fit for Phoenix Sky Harbor airport summer minimum temperature (degrees Fahrenheit) data (Tphap).

Figure 6.5: Diagnostic plots of point process fit for Phoenix Sky Harbor airport summer minimum temperature (degrees
Fahrenheit) data (Tphap).
Clearly, the probability and quantile plots (Figure 6.4 and
Figure 6.5) are identical, but the curvature in the plots indicates that the assumptions for
the point process model may not be strictly valid--although, the plots are not too far from being straight.
Back to Top
Back to Table of Contents