INDEX
Explanations
references to governmental or organizational entities
Negative Logits
allis      -0.18
aqu        -0.16
ervas      -0.15
zar        -0.15
_nth       -0.13
consegu    -0.13
IL         -0.13
ãĤ¤ãĤ¹     -0.13
ancel      -0.13
idding     -0.13
Positive Logits
%C         0.15
taÅŁ       0.14
aff        0.14
воÑĢ      0.14
osemite    0.14
à¹Ģ        0.13
UED        0.13
rape       0.13
Harris     0.13
_tC        0.13
Activations Density: 0.564%