INDEX
Explanations
mentions of private entities or activities
terms associated with privacy
New Auto-Interp
Negative Logits
awa
-0.76
xual
-0.74
DAY
-0.72
GOODMAN
-0.70
annis
-0.69
orthy
-0.67
ologies
-0.67
addons
-0.67
EFF
-0.67
amaz
-0.67
POSITIVE LOGITS
sector
1.10
equity
0.91
Sector
0.82
sector
0.80
affairs
0.78
rented
0.78
ownership
0.76
jets
0.76
property
0.75
ised
0.74
Activations Density 0.038%