INDEX
Explanations
references to Ukraine and its related military activities
New Auto-Interp
Negative Logits
itti
-0.17
anine
-0.17
utin
-0.16
stadt
-0.14
minded
-0.14
coc
-0.14
ochran
-0.14
cers
-0.14
Ñĩа
-0.14
Ecc
-0.14
POSITIVE LOGITS
aph
0.15
Pot
0.15
arend
0.15
-popup
0.14
iert
0.14
undo
0.14
XL
0.14
tz
0.14
neo
0.14
pot
0.14
Activations Density 0.027%