INDEX
Explanations
words related to holding beliefs or positions
expressions of possessing or maintaining views and positions of power
New Auto-Interp
Negative Logits
————
-0.78
issance
-0.71
lease
-0.71
lez
-0.68
ese
-0.67
ombies
-0.66
ghan
-0.66
endix
-0.65
FTWARE
-0.64
ibel
-0.63
POSITIVE LOGITS
sway
1.27
onto
1.11
accountable
0.99
steady
0.92
hostage
0.90
captive
0.87
dear
0.83
overs
0.81
hold
0.81
secrets
0.81
Activations Density 0.040%