Machine learning models can be developed as useful tools to predict bathing water quality, enabling beach-users to access real-time water quality information, rather than retrospective results. At the majority of designated bathing sites, one water sample is collected per week, after which the results are analysed in a laboratory and reported to the public. This process can take up to 48 hours, and with water quality having the ability to change over much shorter time scales, relying on a weekly, retrospective result means that people are using bathing sites with unknown levels of water quality. As previously discussed here, entering a body of water which has poor water quality can lead to a serious illness, such as gastroenteritis. This is where having the ability to provide an accurate daily prediction of water quality, from a specially designed model, could help to protect public health, by reducing the risk of illness in recreational water users.
The first stage of building a predictive system was for our modellers to select the correct modelling approach, which was not an easy task, as the number of approaches and algorithms which could be suitable for use is considerable. After a review of the literature to explore what approaches have been used successfully in similar projects, a range of different non-linear and tree-based methods were selected for initial testing. As a large number of variables, further described below, are to be input into the model, tree-based methods were first utilised, as they can handle many predictive variables, without requiring in-depth variable selection.
Building a model to predict water quality combines many elements, including, for e.g., historical and up-to-date data from physical water samples, the level of local rainfall, direction/speed of wind, tidal patterns, and a range of other potential hyro-meteorological variables. Additionally, water quality can be impacted by other factors such as sewage infrastructure, dogs on the beach, and even the amount of birds generally present at the location. By using local environmental variables for each site, modellers can build useful tools that provide predictions that are up-to-date, accurate and easily comparable to pre-existing weekly results, ensuring the transition to the new system is as smooth and useful as possible.
As might be expected, the project’s model building process has had its own unique challenges, namely, the relatively small amount of observed historical data which has been classified as poor; the model is hungry for this type of data, as it is able to learn from it. With this in mind, a preliminary model, which will use gauged rainfall data to determine a threshold of rainfall quantity beyond which bathing water quality is likely to be impaired, has been developed. This early model, which will provide binary results, i.e. ‘poor’ or ‘not poor’, will be able to be implemented at all 9 EU SWIM sites; whilst, providing modellers with an informative benchmark against which more complex multi-variate models will be able to be compared. These more complex models are in development by the project team and could have the ability to provide results which match the existing classifications.
Building a model that has the ability to accurately predict water quality, at highly variable locations is a complicated process but once achieved could transform the level of information provided to beach users; making checking the level of water cleanliness, at your favourite beach, becoming as easy and as normal, as checking the daily weather forecast.
Something went wrong! Try Again.
User registration is disabled for now. Contact site administrator.
Please complete your details to get in touch.
The fishing village of Clogherhead is located on the east coast of Ireland in the County of Louth, approximately 70km north of Dublin. The headland affords uninterrupted views of the Cooley and Mourne Mountains 30km to the north and to Lambay Island 35km to the south. The village is in close proximity to the historic town of Drogheda. The village developed around the fishing industry with the waters of Clogherhead reputed as being the best fishing waters in the country. The harbour, known as Port Oriel was built in 1885. It was extensively enlarged and re-opened in 2007.
Newcastle Beach is comprised mainly of pebbles and some sand. Newcastle Beach is linked to Murlough Beach and their combine length is approximately 2.5 kilometres in length
Ballywalter Beach is comprised mainly of sand with a rocky shoreline. The beach is approximately 0.85 kilometres in length
Ballyholme Beach is comprised mainly of sand with a typical rocky shore at each end. The beach is approximately 1.3 kilometres in length
Waterfoot Beach is comprised entirely of sand, it is backed by sand dunes which run the entire length of the beach. The beach is approximately 1 kilometre in length
Portrush (Curran Strand) is comprised entirely of sand. Portrush (Curran Strand) is linked with Whiterocks Beach and they have combined length 3 kilometres
Castlerock Beach is comprised entirely of sand and backs onto a sand dune system and a promenade area. The beach is approximately 1 kilometre in length
Lady’s Bay beach consists of a sandy beach in Lough Swilly confined by Buncrana pier to the South and a small rocky outcrop 550m to the North. Activities at Lady’s Bay beach include swimming, boating, power boating, jet skiing and other land-based activities on the beach. The designated bathing area is approx. 0.02633 km2 and the extent along the water is approximately 550m.
Enniscrone Beach is an exposed sandy beach, backed by sand dunes, caravan park and golf course. There is a short coastal walk north of Enniscrone pier. The bathing area (i.e. that which is patrolled by lifeguards) is approximately 500m in length. However the beach is approximately 4.5km in length.