INDEX
Explanations
references to the concept of privilege
references to privilege and entitlement
New Auto-Interp
Negative Logits
ood
-0.83
tra
-0.80
agh
-0.72
Interstitial
-0.72
thumbnails
-0.72
GH
-0.69
tered
-0.68
itate
-0.66
arre
-0.66
urgy
-0.66
POSITIVE LOGITS
ilege
1.42
privilege
1.18
ileged
1.16
afforded
0.97
holders
0.94
conferred
0.89
Priv
0.85
bestowed
0.84
privileges
0.82
holder
0.82
Activations Density 0.053%