INDEX
Explanations
names or titles of world leaders
references to political leaders, specifically presidents
New Auto-Interp
Negative Logits
opter
-0.85
eros
-0.82
aughs
-0.73
andem
-0.73
asca
-0.71
ritch
-0.70
ourcing
-0.66
Scot
-0.65
aylor
-0.64
ogene
-0.64
POSITIVE LOGITS
Mahmoud
1.00
Bashar
0.91
negotiator
0.89
Jinping
0.89
Viktor
0.85
Hassan
0.82
Mahm
0.77
Putin
0.76
Tayyip
0.76
bloc
0.75
Activations Density 0.087%