Ben Southworth
Chris Hoder

Machine Learning

Project Proposal

24 Jan. 2012

Neural Network Snow Forecasting

 

 

 

Background

 

Even today we lack the ability to produce truly accurate weather forecasts, especially with respect to precipitation. Weather will sometimes differ dramatically from the predictions of professional meteorologists. The image to the right (HPC, 2012) is provided by the National Oceanic and Atmospheric Administration (NOAA) and demonstrates the accuracy of quantitative precipitation forecasts from three different prediction models, the North American Mesoscale (NAM), the Global Forecast System (GFS), and the Hydrometeorological Prediction Center (HPC). A threat score is a function that ranges from 0 to 1 and provides a measure of how good a prediction was, with 1 being a perfect prediction.

The major difficulty in weather forecasting lies in the chaotic and non-linear nature of the atmosphere as well as our incomplete understanding of all of the relevant physical processes (Santhanam et all, 2011). The fact that forecasts over one day in advance require considering variables over 1000 miles away makes forecasting even more difficult. Forecasting weather phenomena even several days in advance is so complex that a perfect physical understanding, and thus a perfect forecast, may be infeasible.

Traditionally, weather forecasting is done via two different methods. The first and foremost is based on numerically modeling physical simulations with computers (Santhanam et all 2011). Although there has been much work and progress in this field, precipitation forecasts are still far from perfect, primarily due to the complexity previously discussed. There is also some work using previous data to base current predictions on, independent of the physical processes. This is effectively what we aim to do. Specifically, we would like to use vast sets of historical data to help a computer ÔlearnÕ when it will snow based on other known weather variables, thus avoiding the necessary understanding of all relevant physical processes. Past work in this area has focused on using radar snowfall estimates to predict snowfall depth (Xiao et all, 1996). Successful results were obtained in determining snowfall depth but the prediction of snowfall itself was not considered. Additional work has been done in short-term rainfall forecasting with relatively good results in the six to twenty-four hour forecast range (French et all, 1992; McCann, 1992; Kuligowski et all, 1998; Hall et all, 1999; Manzato, 2007). Forecasting short-term snowfall with this technique is something that has not been done, but could be approached with a similar methodology.

 

 

Problem

 

We aim to predict the probability of snowfall in Hanover, NH in the next 24 hours at any point in time, as well as a confidence metric indicating the amount of snowfall expected to occur.

 

 

Methodology

 

Due to the complexity and non-linearity of atmospheric systems, previous work in machine learning-based precipitation forecast has focused on using artificial neural networks (ANN) (French et all, 1992). The ANN is an attractive choice because of its ability to determine correlations without explicit knowledge of the physics involved (Veisi and Jamzad, 2009). ANNs have been shown to produce relatively accurate predictions for short-term rainfall prediction (Manzato, 2005) as well as snowfall depth (Xiao et all, 1996). Taking this previous work as motivation for using an ANN, we aim to implement several different variations, which will include but are not limited to:

á      Back propagation

á      Feed forward network

á      Radial basis function network

á      Hopfield network

Due to the large number of variables we will be considering as well as the necessary geographic spread of measurements taken, some effort may be needed to screen input variables and determine which ones are most relevant, as well as determine how large of a geographic area is needed for our forecast.

  

 

Data

 

We will obtain our data from the NOAA Earth System Research Library, which provides weather data from 1948 to the 21st century with multiple daily updates for all variables relevant to the weather and climate (PSD, 2012). Our work will focus on the most relevant variables to snowfall prediction. At the EarthÕs surface this includes the mean sea level pressure (MSLP), wind direction, relative humidity, and dewpoint. At 850mb, 700mb, and 500mb (heights with respect to air pressure), the relevant variables considered will be geopotential height, relative vorticity, wind direction, relative humidity, and specific humidity. The data sets are time-series, organized in a gridded fashion with respect to latitude and longitude. How large of a spatial distribution of data we will use has not been determined, but in trying to predict snow 24 hours in advance, we certainly must consider variables within a several hundred-mile radius of Hanover, and potentially variables over a 1000-mile radius.

 

 

Timeline

 

-       22 Jan. – 4 Feb. 2013:

o   Develop a strong understanding of neural networks and different ways of using them, e.g. back propagation, Hopfield networks, etc.

o   Obtain and format data, as well as determine how to deal with the enormous amounts of data we will have, i.e. terabytes.

o   Begin coding and formalize how our algorithm will learn from the data. 

 

-       5 -19 Feb. 2013

o   Complete coding for our neural network.

o   Begin training from data and optimization of predictive model.

 

-       20 Feb. – 7 Mar. 2013

o   Finish training code and test on current real time data, as well as several years of previous data we will save for testing purposes. Compare results across methods.

o   Tweak algorithm, variables, and data as necessary to perfect code and improve results.

o   Complete write-up on our work and results.

 

 

Sources

 

French, Mark N., Witold F. Krajewski, and Robert R. Cuykendall. "Rainfall forecasting in space and time using a neural network." Journal of hydrology137.1 (1992): 1-31.

 

Ghosh, Soumadip, et al. "Weather data mining using artificial neural network." Recent Advances in Intelligent Computational Systems (RAICS), 2011 IEEE. IEEE, 2011.

 

Hall, Tony, Harold E. Brooks, and Charles A. Doswell III. " Precipitation forecasting using a neural network." Weather and forecasting 14.3 (1999): 338-345.

 

HPC Verification vs. the models Threat Score. National Oceanic and Atmospheric Administration Hydrometeorological Prediction Center, 2012. Web. 20 Jan. 2013. http://www.hpc.ncep.noaa.gov/images/hpcvrf/HPC6ts25.gif

 

Hsieh, William W. "Machine learning methods in the environmental sciences." Cambridge Univ. Pr., Cambridge (2009).

 

Kuligowski, Robert J., and Ana P. Barros. "Experiments in short-term precipitation forecasting using artificial neural networks." Monthly weather review 126.2 (1998): 470-482.

 

Kuligowski, Robert J., and Ana P. Barros. "Localized precipitation forecasts from a numerical weather prediction model using artificial neural networks." Weather and Forecasting 13.4 (1998): 1194-1204.

 

Luk, K. C., J. E. Ball, and A. Sharma. "A study of optimal model lag and spatial inputs to artificial neural network for rainfall forecasting." Journal of Hydrology 227.1 (2000): 56-65.

 

Manzato, Agostino. "Sounding-derived indices for neural network based short-term thunderstorm and rainfall forecasts." Atmospheric research 83.2 (2007): 349-365.

 

McCann, Donald W. "A neural network short-term forecast of significant thunderstorms." Weather and Forecasting;(United States) 7.3 (1992).

 

PSD Gridded Climate Data Sets: All. National Oceanic and Atmospheric Administration Earth System Research Laboratory, 2012. Web. 20 Jan. 2013. http://www.esrl.noaa.gov/psd/data/gridded/

 

Santhanam, Tiruvenkadam, and A. C. Subhajini. "An Efficient Weather Forecasting System using Radial Basis Function Neural Network." Journal of Computer Science 7.

 

Silverman, David, and John A. Dracup. "Artificial neural networks and long-range precipitation prediction in California." Journal of applied meteorology 39.1 (2000): 57-66.

 

Veisi, H. and M. Jamzad, 2009. sc. Int. J. Sign. Process., 5: 82-92. ÒA complexity-based approach in image compression using neural networks.Ó http://www.akademik.unsri.ac.id/download/journal/files/waset/v5-2-11-5.pdf