Collection of COVID-19 health & wastewater surveillance data and associated open data
This activity involved establishing a data standardisation and digitisation protocol to ensure adequate translation of testing data reported through WP1 and WP2 into a usable digital format.
The need for data standardisation
In this section we point out the demand for data standardisation and the current information gaps.
Privacy measures
Implementing strict privacy measures was not necessary within this project as the open source and sampling data collected was only retrieved at municipal level.
Existing databases
In this section we identify the required, relevant and available (open source) data for COVID-19 health and wastewater surveillance purposes.
Optimisation of storage space and connections
All separate Dutch data sources where provided in the form of flat files. These flat files were filtered, grouped and cleaned.
Data protocol for common data assembly and standardisation
A data protocol for common data assembly and standardisation for optimal data analysis processing was established. This was necessary to allow for data comparison among sources. All data sources were filtered for outliers and duplicates.
Data architecture Design
For the data architecture we used a star schema with facts and dimensions based on flat files. The star schema is a type of relational database that is composed of a single, central fact table surrounded by dimension tables.