INDEX
Explanations
phrases related to political statements or discussions
New Auto-Interp
Negative Logits
shroud
-0.79
couch
-0.68
semblance
-0.67
Belg
-0.65
hemor
-0.63
Doodle
-0.63
taxp
-0.62
guiActiveUnfocused
-0.62
entary
-0.62
canvas
-0.62
POSITIVE LOGITS
ª
1.28
¹
1.23
¸
1.10
ł
1.08
IJ
1.08
ij
1.06
ı
1.03
¤
1.01
¡
1.00
³
0.99
Activations Density 0.154%