INDEX
Explanations
references to Middle Eastern cultural contexts and social dynamics
New Auto-Interp
Negative Logits
uhan
-0.16
Mahar
-0.15
arend
-0.14
ovich
-0.14
ãĥ³ãĤ¬
-0.14
aman
-0.14
unde
-0.14
ach
-0.13
[
-0.13
ish
-0.13
POSITIVE LOGITS
otas
0.18
adin
0.16
/cop
0.16
nodoc
0.15
WARDED
0.15
errat
0.14
leurs
0.14
_QMARK
0.14
adows
0.14
ierz
0.14
Activations Density 0.675%