INDEX
Explanations
phrases related to actions or decisions, often involving political figures or public statements
statements about political actions or decisions
New Auto-Interp
Negative Logits
unison
-0.63
âĶľâĶĢâĶĢ
-0.63
selves
-0.59
taboola
-0.54
mination
-0.54
backgrounds
-0.53
Grid
-0.52
Container
-0.52
collective
-0.52
combined
-0.51
POSITIVE LOGITS
himself
1.14
Himself
0.88
assassinated
0.75
veto
0.73
personally
0.67
his
0.65
ardless
0.59
farious
0.58
herself
0.57
reelection
0.56
Activations Density 0.985%