Water Data Boot Camp: Fall 2019

Unit 1: Task 2
Evaluating 100-year flood frequency

Our team hydrologist suggested that one method for evaluating the impacts of dam construction is to monitor changes in flood return intervals. Falls Lake is a flood control reservoir, so it should decrease the amount of downstream flooding.

image3

*Figure: Reservoirs should moderate downstream flows. There is a flood control pool to hold flood waters that can be released slowly over time. There is also a conservation pool that holds water that can be released downstream during drier conditions to meet minimum streamflow requirements.*

 

Flood insurance policy is built around the concept of the 100-year flood event. The housing industry has been working to explain what that risk actually means to homeowners with 30 year mortgages. Understanding the flood risk relative to mortages is helpful for insurance companies to know. Has Falls Lake decreased the flood risk for downstream homes?

image4

Reservoirs decrease the likelihood of downstream flooding, but that often means development occurs in areas that would have been frequently flooded prior to the reservoir. We’ve seen examples of this just quite recently with Hurricanes Harvey and Florence.

 

Framing & Executing the analysis

We will use Leopold’s (1994) flood frequency curve and the Weibull equation to calculate the recurrence interval. Here the return interval is computed as where is the number of years of data and is the rank of the year from largest to smallest (see this link for more info).

* NOTE: The accuracy of a return interval is highly affected by the length of the time series.

So, for us to do this analysis, we need to first compute maximum annual discharge, i.e. extract the largest discharge observed from each water year. Then we sort and rank our data on max annual discharge and then compute a regression line from which we can determine the discharge of a 100 and 500 year flood.


1. Computing maximum annual streamflow using pivot tables

Excel's pivot tables are one of it's more powerful features, allowing you to easily extract and cross tabulate various summaries from your data. We'll use to sift through discharge records on a per-year basis and identify the largest one.

2. Sorting and ranking the pivot table data to compute return intervals

We now have the data we want. Next we'll compute rankings and then sort the data. However, to do this we need to copy the data from the dynamic Pivot Table.

3. Compute Return Interval () and Annual Exceedance Probability ()

Recall that the return interval is computed from the sorted and ranked time series data. Now that we have our data sorted and ranked, we compute from the number of years of data we have (plus 1), divided by the ranking of a given year's discharge.

• Plot the data and compute a regression equation

• Apply the regression to compute 100, 500, and 1000-year flood discharges

Using the regression equation you just calculated and added to your chart, estimate the discharge for the 100, 500, and 1000 year events, i.e., compute y for x = 100, 500, and 1000, respectively, somewhere in your Excel worksheet. These are the estimated discharge at 100, 500, and 1000 year flood events.

To do this, we'll create a table, listing the years for which we want to explore (100, 500, 1000):

Finally, we can add these points to our original plot


♦ EXERCISE: Calculate the return interval from the pre-1980 data

Repeat the above analysis only using data prior to 1980 to calculate the return interval.

Look at the first plot you did of streamflow on the EDA spreadsheet to look at the distribution of peak events. These events are all hurricanes. How does it change your understanding of why Falls Lake doesn’t seem to impact flood frequency?

What happens to your answer if you remove those three points?

1567020888356


Recap

We've now discovered how data can be transformed, or "Pivoted" into various summary formats and how, via Excel's plotting functionality, we can compute quick regressions of our data.