INDEX
Explanations
phrases related to political figures and events
mentions of the name "Trump" in various contexts
Negative Logits
warp        -0.72
Droid       -0.72
Guinness    -0.72
Bulgarian   -0.69
Nordic      -0.65
variance    -0.65
Voyager     -0.65
cyan        -0.64
decomp      -0.64
cloak       -0.63
Positive Logits
¬           1.16
£           1.12
ı           1.11
Į           1.11
¹           1.09
¦           1.03
ª           1.02
Ī           1.00
į           1.00
Ĵ           0.98
Activations Density: 0.376%