INDEX
Explanations
objects used in violent confrontations
Negative Logits
#         -0.15
éłĺ       -0.14
>Error    -0.14
_OM       -0.14
ipur      -0.14
vla       -0.14
maal      -0.14
stin      -0.13
ør        -0.13
chwitz    -0.13
Positive Logits
obot          0.19
absorb        0.17
absorbed      0.15
Ĵáŀ           0.14
egie          0.14
Implements    0.14
oley          0.14
abr           0.14
nat           0.14
absorption    0.14
Activations Density 0.074%