INDEX
Explanations
phrases indicating legal or moral entitlement
phrases related to individual rights and liberties
New Auto-Interp
Negative Logits
Madness
-0.60
iries
-0.59
diligent
-0.57
Spiel
-0.57
unsuspecting
-0.56
batches
-0.55
Journals
-0.54
metics
-0.54
Hort
-0.54
duction
-0.54
POSITIVE LOGITS
whatsoever
0.85
76561
0.77
vested
0.74
ointed
0.74
veto
0.69
kees
0.69
entit
0.69
âĺ
0.68
amus
0.68
ï¸
0.67
Activations Density 0.159%