EmpowerItalyGithub

EmpowerItaly's Project Journey: Unveiling the Methodological Steps

Embark on the journey of EmpowerItaly's project methodology in our Workflow section. Here, we meticulously outline the systematic steps and processes undertaken to realize the project's objectives. From initial ideation to data collection, analysis, ethical considerations, and eventual conclusions, this section unveils the roadmap followed, offering insight into the structured approach employed to comprehend Italy's employment landscape comprehensively.

Our workflow

hero

Formulating Hypothesis

From the project's inception, our aim was to conduct a comprehensive study on the presence of immigrants in Italy. Given the sensitivity of the topic, framing a viable research question posed a challenge. Initially, our focus centered on aspects of the labor market and education, specifically examining whether the professional positions held by foreigners corresponded with their educational qualifications. However, the scarcity of relevant data at the regional level in Italy prompted us to transition into the exploratory phase, where we formulated a well-defined hypothesis.

intro
intro

Searching Data

The project leveraged data from reputable sources, namely ISTAT and EUROSTAT, both housing extensive data repositories beneficial for students and various organizations. Notably, ISTAT's dedicated page on immigrant statistics in Italy, accessible through its older data explorer, proved invaluable. Despite encountering dynamic changes in the data presentation methods of these institutions during the project's development, such as ISTAT's transition from its [old data explorer](http://dati.istat.it/) to the [new one](https://esploradati.istat.it/databrowser/#/), the abundance of data remained consistent, ensuring a robust foundation for our analysis. Additionally, organizations like OECD, previously considered in earlier project phases, underwent transitions in their data explorer interfaces.

Cleaning and Mash up

During the cleaning phase, a standardized procedure was applied to most datasets. This involved understanding the content of each dataset, identifying relevant values, and ensuring alignment with the information available on the respective data explorer platforms.

intro
intro

Visualization

An integral aspect of the project, the visualization process adhered to the principles of simplicity and user choice. Our goal was to allow the data to speak for itself, presenting it in a straightforward manner while providing users the flexibility to select the information they wished to view.

Open Data practices

This project adheres to the FAIR principles of data—ensuring data is Findable, Accessible, Interoperable, and Reusable. All project materials are made openly available, and we welcome user feedback to address any deviations from these principles.

intro
intro

Afterlife

The EmpowerItaly datasets compile data from catalogs concerning diverse aspects of the Italian labor market. Originating from a University of Bologna course, the catalog is not actively maintained anymore, but the datasets within it are managed by their respective institutions. Despite the catalog's lack of ongoing updates, our accessible scripts can be run on new files as needed.

We encourage notification if newer versions of our input files are discovered, enabling us to update our files using our automated script. It's essential to highlight that our scripts are licensed under CC 4.0. We invite the community to contribute to our GitHub project by submitting updated files for review and potential inclusion, enhancing the dataset quality.