INDEX
Explanations
phrases related to political figures and actions
New Auto-Interp
Negative Logits
VIDEOS
-0.96
leases
-0.78
occurs
-0.74
patents
-0.74
births
-0.73
pregnancies
-0.70
metics
-0.68
onds
-0.67
opened
-0.67
PLUS
-0.66
POSITIVE LOGITS
invincible
0.95
arrogant
0.93
traitor
0.92
savior
0.91
unbeat
0.91
hypocr
0.90
fearless
0.88
hero
0.88
ruthless
0.86
charismatic
0.86
Activations Density 0.276%