A research institution wants to use a dataset containing protected health information (PHI) for a study.
To comply with the HIPAA Privacy Rule, the institution must de-identify the data.
Which of the following methods relies on a qualified statistician to apply scientific principles to determine that the risk of re-identification is 'very small'?