Explanations
references to firearms and associated equipment
Negative Logits
ade      -0.19
ushima   -0.16
uje      -0.15
dew      -0.15
omatic   -0.15
adel     -0.15
xo       -0.15
Pin      -0.14
lish     -0.14
ervo     -0.14
Positive Logits
kea      0.16
aniu     0.15
urf      0.15
sky      0.15
Ñĩили    0.15
ylie     0.14
ubbo     0.14
cp       0.14
alace    0.13
yster    0.13
Activations Density: 0.252%