### Point referenced data sets used in the book

## Air pollution in the eastern US

This example is taken from Sahu and Bakar (2012b), where we consider modeling the daily maximum 8-hour average ozone concentration data obtained from 691 monitoring sites in the eastern US, as shown in Figure 1.3. These pollution monitoring sites are made up of 646 urban and suburban monitoring sites known as the National Air Monitoring Stations/State and Local Air Monitoring Stations (NAMS/SLAMS) and 45 rural sites monitored by the Clean Air Status and Trends Network (CASTNET).

We analyze daily data for $T=153$ days in every year, from May to September, since this is the high ozone season in the US. We consider these data for the 10-year period from 1997 to 2006, which allows us to study the trend in ozone concentration levels. Thus, we have a total of $691 \times 153 \times 10 = 1{,}057{,}230$ observations, of which approximately $10.44\%$ are missing. We assume these to be missing at random, although there is some annual variation in the percentage of missingness.

The main purpose of the modeling exercise here is to assess compliance with the primary ozone standard, which states that the 3-year rolling average of the annual 4th highest daily maximum 8-hour average ozone concentration should not exceed 85 ppb; see, e.g., Sahu et al. (2007). Figure 1.4 plots the annual 4th highest maximum values and their 3-year rolling averages, with a superimposed horizontal line at 85 ppb. As expected, the plot of the rolling averages is smoother than the plot of the annual 4th highest maximum values. The plots show that many sites are compliant with the standard, but many others are not. In addition, the plot of the 3-year rolling averages shows a very slow downward trend. Both plots show the presence of a few outlier sites, which are perhaps due to site-specific issues in air pollution, for example natural disasters such as forest fires. This data set is analyzed in Section 8.3.
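The compliance statistic described above can be sketched for a single site as follows. This is only a minimal illustration under stated assumptions, not the analysis carried out in the book: the function names, the toy data and the handling of missing days (simply dropped) are all hypothetical, and any regulatory rounding or data-completeness rules are not modeled.

```python
from statistics import mean

STANDARD_PPB = 85.0  # threshold quoted in the text


def fourth_highest(daily_values):
    """Return the 4th highest non-missing value, or None if fewer than four exist."""
    observed = sorted((v for v in daily_values if v is not None), reverse=True)
    return observed[3] if len(observed) >= 4 else None


def rolling_three_year_average(annual_values):
    """Map each year that has two preceding years of data to the 3-year mean."""
    result = {}
    for year in sorted(annual_values):
        window = [annual_values.get(y) for y in (year - 2, year - 1, year)]
        if all(v is not None for v in window):
            result[year] = mean(window)
    return result


def compliance_by_year(daily_by_year, standard=STANDARD_PPB):
    """Flag each year whose 3-year rolling average does not exceed the standard."""
    annual = {year: fourth_highest(vals) for year, vals in daily_by_year.items()}
    rolling = rolling_three_year_average(annual)
    return {year: avg <= standard for year, avg in rolling.items()}


# Made-up daily maximum 8-hour averages (ppb) for one hypothetical site;
# None marks a missing day. Real seasons would have 153 values per year.
toy_site = {
    1997: [70, 92, 88, 95, 90, None, 60],  # 4th highest is 88
    1998: [80, 85, 91, 83, None, 87, 79],  # 4th highest is 83
    1999: [75, 84, 82, 90, 86, 70, None],  # 4th highest is 82
}

print(compliance_by_year(toy_site))
```

With three years of data only the last year has a full 3-year window, so the toy site yields a single compliance flag, for 1999. Applying the same function to each of the 691 sites would give the kind of site-by-year summary that Figure 1.4 displays.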

## Hubbard Brook precipitation data

Measuring total precipitation volume aggregated over space and time is important for many environmental and ecological purposes, such as monitoring air and water quality, assessing spatio-temporal trends in the risk of flood and drought, and informing forestry management and town planning decisions.

The Hubbard Brook Ecosystem Study (HBES), located in New Hampshire, USA and established in 1955, continuously observes many environmental outcome variables such as temperature, precipitation volume, and nutrient volumes in water streams. HBES is based on the 8,000-acre Hubbard Brook Experimental Forest (see e.g. https://hubbardbrook.org/) and is a valuable source of scientific information for policy makers, members of the public, students and scientists. Of interest here is a spatio-temporal data set on weekly precipitation volumes collected from 22 rain gauges from 1997 to 2015.

## Ocean chlorophyll data

Taken from Hammond et al. (2017), this example studies long-term trends in chlorophyll (chl) levels in the ocean, which is a proxy measure for phytoplankton (marine algae). Phytoplankton is at the bottom of the food chain and provides the foundation of all marine ecosystems. The abundance of phytoplankton affects the supply of nutrients and light exposure. Global warming can potentially affect the distribution and abundance of phytoplankton, and hence it is of much scientific interest to study long-term trends in chl, which influences the abundance of phytoplankton.

Figure 1.6 shows a map of the 23 ocean regions of interest where we have observed satellite-based measurements. The main modeling objective here is to study long-term trends in chl levels in these 23 oceanic regions. Section 8.5 assesses these trend values.

## Atlantic ocean temperature and salinity data set

This example is taken from Sahu and Challenor (2008) on modeling deep ocean temperature data from roaming Argo floats. The Argo float program (see, for example, http://www.argo.ucsd.edu) is designed to measure the temperature and salinity of the upper two kilometers of the ocean globally. These floats record actual measurements, in contrast to satellite data, such as those used in the ocean chlorophyll example in Section 1.3.5, which provide less accurate observations with many missing values. Each Argo float is programmed to sink to a depth of one kilometer and to drift at that depth for about 10 days. After this period the float sinks a further kilometer to a depth of two kilometers and then, by adjusting its buoyancy, rises to the surface, measuring temperature and conductivity (from which salinity measurements are derived) on the way. Once at the surface, the data and the position of the float are transmitted via a satellite. This gives scientists access to near real-time data. After transmitting the data the float sinks back to its 'resting' depth of one kilometer and drifts for another ten days before measuring another temperature and salinity profile at a different location. Argo data are freely available via the international Argo project office; see the above-mentioned website.

We consider the data observed in the North Atlantic ocean between latitudes $20^{\circ}$ and $60^{\circ}$ north and longitudes $10^{\circ}$ and $50^{\circ}$ west. Figure 1.7 shows the locations of the Argo floats in the deep ocean for each of the 12 months, which illustrates the moving nature of the floats. The primary modeling objective here is to construct an annual map of temperature in the deep ocean along with its uncertainty. The time points at which the data are observed are not equi-lagged, and we do not assume equal spacing in our modeling endeavor. The modeling required to produce an annual temperature map of the North Atlantic ocean is performed in Section 8.6.
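The selection of the study region and the irregular spacing of the observation times can be illustrated with a short sketch. Everything here is an assumption made for illustration: the record layout, the toy values, the choice of year, the inclusive boundaries and the sign convention (west longitudes stored as negative degrees east) are not taken from the Argo data files.

```python
from datetime import date

# Study region from the text: 20N to 60N and 50W to 10W.
# Assumed convention: longitudes in degrees east, so west is negative.
LAT_RANGE = (20.0, 60.0)
LON_RANGE = (-50.0, -10.0)


def in_study_region(lat, lon):
    """Return True if a position lies inside the (inclusive) bounding box."""
    return (LAT_RANGE[0] <= lat <= LAT_RANGE[1]
            and LON_RANGE[0] <= lon <= LON_RANGE[1])


def time_gaps_in_days(dates):
    """Return the gaps, in days, between consecutive sorted observation dates."""
    ordered = sorted(dates)
    return [(later - earlier).days for earlier, later in zip(ordered, ordered[1:])]


# Hypothetical profiles: (observation date, latitude, longitude).
profiles = [
    (date(2003, 1, 5), 35.2, -40.1),
    (date(2003, 1, 16), 61.0, -30.0),  # outside: north of 60N
    (date(2003, 2, 1), 45.7, -12.3),
    (date(2003, 2, 9), 25.0, -20.0),
    (date(2003, 2, 20), 30.0, -5.0),   # outside: east of 10W
]

selected = [p for p in profiles if in_study_region(p[1], p[2])]
gaps = time_gaps_in_days([p[0] for p in selected])
print(len(selected), gaps)
```

The unequal gaps in the toy output show why a model for these data cannot rely on equally spaced time points.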

