Anonymization
Images are rich and complex because they contain multiple visual cues, such as faces, political symbols, or indications of health status. The main challenge of anonymization is therefore to protect the identities of depicted individuals and research participants without undermining key research insights or compromising legal and ethical compliance.
From a legal and ethical standpoint, it is important to identify and define the most appropriate anonymization technique at the very beginning of the project, i.e., in the conceptual phase. Deleting, anonymizing, or pseudonymizing data entails very different compliance steps. Therefore, when thinking about whether to anonymize, pseudo-anonymize or not anonymize at all, ask yourself the following questions since the beginning of the research:
| Legal issues | Ethical issues |
|
Data minimization:
Minimizing harm:
|
Minimizing harm:
|
|
Data retention:
|
Research approaches: Attention: anonymization is not the default. It depends on the research approaches you are using.
|
|
Data security:
Financial issues are to be considered at this stage. |
|
|
Transparency:
|
|
|
External technical solutions:
Define the respective privacy roles. |
|
|
Privacy risk assessment:
|
Once you have answered these questions, you can determine whether the data should be anonymized, pseudonymized, deleted, or not anonymized at all. If you decide to anonymize the data, the anonymization technique must already be identified at this stage, as it needs to be specified in the Data Management Plan.
Traditional ways of anonymizing images include pixelation, facial blurring techniques, cropping, using black-out-bars (black masking), ethical fabrication, or AI replicas. Not all techniques are adequate for all contexts and often need to be further refined, ideally during the research design phase.Decisions may need to be adjusted in light of reviewers’ comments and the guidance of the ethics review board. If you consider future open data practices, will the resulting anonymized visual data still be usable?
For more information on the techniques, click on this link [link che riporta alla spiegazione delle tecniche che abbiamo messo nell’empirical phase].
Before concluding this phase, it is worth considering copyright issues and the degree of openness we would grant to the dataset. You will better define your decision later in the Archiving & Preservation Phase. (aggiungere link al pallino copyright della fase descritta)