Sensitive attribute privacy preservation of trajectory data publishing based on l-diversity

  • PDF / 1,382,557 Bytes
  • 27 Pages / 439.37 x 666.142 pts Page_size
  • 39 Downloads / 191 Views

DOWNLOAD

REPORT


Sensitive attribute privacy preservation of trajectory data publishing based on l‑diversity Lin Yao1,2 · Zhenyu Chen3 · Haibo Hu4 · Guowei Wu5   · Bin Wu6 Accepted: 5 November 2020 © Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract The widely application of positioning technology has made collecting the movement of people feasible for knowledge-based decision. Data in its original form often contain sensitive attributes and publishing such data will leak individuals’ privacy. Especially, a privacy threat occurs when an attacker can link a record to a specific individual based on some known partial information. Therefore, maintaining privacy in the published data is a critical problem. To prevent record linkage, attribute linkage, and similarity attacks based on the background knowledge of trajectory data, we propose a data privacy preservation with enhanced l-diversity. First, we determine those critical spatial-temporal sequences which are more likely to cause privacy leakage. Then, we perturb these sequences by adding or deleting some spatial-temporal points while ensuring the published data satisfy our ( L, 𝛼, 𝛽)-privacy, an enhanced privacy model from l-diversity. Our experiments on both synthetic and real-life datasets suggest that our proposed scheme can achieve better privacy while still ensuring high utility, compared with existing privacy preservation schemes on trajectory. Keywords  Sensitive attribute · Privacy preservation · Trajectory data publishing

1 Introduction The popularity of smart mobile devices with positioning technologies triggers the collection of location information by suppliers, corporations, individuals etc. for knowledge-based decision making. Therefore, vast amounts of trajectory data are collected with other information. Data miners have also shown great interest in analyzing these data to provide plentiful serves for people. For example, recent studies [1, 2] have shown that tracking the environmental exposure of a person with his daily trajectories helps to improve diagnose. Therefore, wearable devices * Guowei Wu [email protected] Extended author information available on the last page of the article

13

Vol.:(0123456789)

Table 1  Original table

Table 2  Table without explicit identifier

Distributed and Parallel Databases ID.

Name

Trajectory

Disease

⋅⋅⋅

1

Alice

a1 → d2 → b3 → e4 → f 6 → e8

HIV

⋅⋅⋅

2

Bob

d2 → c5 → f 6 → c7 → e9

Flu

⋅⋅⋅

3

Caesar

b3 → f 6 → c7 → e8

SARS

⋅⋅⋅

4

Daniel

b3 → e4 → f 6 → e8

Fever

⋅⋅⋅

5

Eden

a1 → d2 → c5 → f 6 → c7

Flu

⋅⋅⋅

6

Freeman

c5 → f 6 → e9

SARS

⋅⋅⋅

7

Georgia

f 6 → c7 → e8

Fever

⋅⋅⋅

8

Hugo

a1 → c2 → b3 → c7 → e9

SARS

⋅⋅⋅

9

Ishtar

e4 → f 6 → e8

Fever

⋅⋅⋅

Trajectory

Disease

a1 → d2 → b3 → e4 → f 6 → e8

HIV

d2 → c5 → f 6 → c7 → e9

Flu

b3 → f 6 → c7 → e8

SARS

b3 → e4 → f 6 → e8

Fever

a1 → d2 → c5 → f 6 → c7

Flu

c5 → f 6 → e9

SARS

f 6 → c7 → e8

Fever

a1 → c2 → b3 → c7 → e9

SARS

e4 → f 6 → e8

Fever

have been generating tremendous amounts of location-rich,