INDEX
Explanations
mentions of Afghanistan and related terms
New Auto-Interp
Negative Logits
ignum
-0.07
egg
-0.07
Fortune
-0.07
eum
-0.07
inct
-0.07
nox
-0.07
eson
-0.07
apan
-0.07
portlet
-0.07
اسب
-0.06
POSITIVE LOGITS
(AF
0.07
447
0.07
rika
0.06
ذا
0.06
ektiv
0.06
elon
0.06
_INET
0.06
elik
0.06
Utt
0.06
-*-
0.06
Activations Density 0.009%