Data Preparation
OpenCoesione is Italy’s open-government initiative on cohesion policies. The portal provides
navigable data on planned resources and expenditures, locations, thematic areas, implementing
bodies, timelines, progress, and payments for individual projects.
The dataset “Projects on Culture and Tourism” includes all projects financed through cohesion
policies in the areas of culture and tourism. It covers infrastructural interventions for the
protection and conservation of cultural heritage, the improvement of tourism services, and the
promotion and enhancement of natural resources.
We downloaded the datasets corresponding to the 2014–2020 and 2021–2027 programming periods.
Data Extraction
Based on the metadata, we extracted the columns describing project timelines: forecasted start
date (DATA_INIZIO_PREV_STUDIO_FATT), actual start date (OC_DATA_INIZIO_PROGETTO), and actual end
date (OC_DATA_FINE_PROGETTO_EFFETTIVA). We also kept location information (region and province),
the amount of funding received (mainly public), and an internal identifier (CUP) used to avoid
duplicates.
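As a rough illustration of this extraction step, the sketch below keeps only the columns of interest and drops repeated CUP values. It is not our exact script: the three date column names come from the text above, while REGION, PROVINCE, PUBLIC_FUNDING, the CUP column name, the semicolon delimiter, and the sample rows are all assumptions made up for the example.

```python
import csv
import io

# Timeline columns named in the text.
DATE_COLUMNS = [
    "DATA_INIZIO_PREV_STUDIO_FATT",
    "OC_DATA_INIZIO_PROGETTO",
    "OC_DATA_FINE_PROGETTO_EFFETTIVA",
]
# Hypothetical column names for identifier, location, and funding (assumed).
EXTRA_COLUMNS = ["CUP", "REGION", "PROVINCE", "PUBLIC_FUNDING"]
KEEP = EXTRA_COLUMNS + DATE_COLUMNS


def extract_columns(rows):
    """Project each row onto KEEP and skip rows whose CUP was already seen."""
    seen = set()
    extracted = []
    for row in rows:
        cup = row["CUP"]
        if cup in seen:
            continue  # duplicate project: keep the first occurrence only
        seen.add(cup)
        extracted.append({name: row[name] for name in KEEP})
    return extracted


# Made-up sample data; the ';' delimiter is an assumption about the export.
SAMPLE = """CUP;REGION;PROVINCE;PUBLIC_FUNDING;DATA_INIZIO_PREV_STUDIO_FATT;OC_DATA_INIZIO_PROGETTO;OC_DATA_FINE_PROGETTO_EFFETTIVA;NOTE
A1;Lazio;Roma;1000;20150101;20150301;20170630;x
A1;Lazio;Roma;1000;20150101;20150301;20170630;y
B2;Puglia;Bari;2500;20220101;20220915;;z
"""

sample_rows = extract_columns(csv.DictReader(io.StringIO(SAMPLE), delimiter=";"))
```

Rows from both programming periods could be chained into a single iterable before calling `extract_columns`, so that a CUP appearing in both downloads is kept only once.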
To select the records relevant to our project, we filtered the data using the column
OC_DATA_INIZIO_PROGETTO, keeping only projects with an actual start date between 2014 and 2024,
assuming that funding allocation becomes effective once the project begins.
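The filter described above might look like the following sketch. It assumes that dates are stored as YYYYMMDD text and that the 2014 to 2024 range is inclusive at both ends; neither detail is stated above, and the function names are hypothetical.

```python
from datetime import datetime


def parse_yyyymmdd(value):
    """Return a date for 'YYYYMMDD' text, or None if empty or unparseable."""
    try:
        return datetime.strptime(value.strip(), "%Y%m%d").date()
    except (ValueError, AttributeError):
        return None


def started_between(rows, first_year=2014, last_year=2024):
    """Keep rows whose actual start date falls within the given years (inclusive)."""
    kept = []
    for row in rows:
        start = parse_yyyymmdd(row.get("OC_DATA_INIZIO_PROGETTO"))
        # Rows without a usable start date are dropped (assumed behavior).
        if start is not None and first_year <= start.year <= last_year:
            kept.append(row)
    return kept
```

Projects with a missing actual start date are excluded here, which matches the assumption that funding only becomes effective once a project has begun.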
The filtered dataset is available on our GitHub under the name open_coesione.csv.