INDEX
Explanations
references to blood and injuries
New Auto-Interp
Negative Logits
YN
-0.89
OPA
-0.87
sidx
-0.84
OUP
-0.84
IRC
-0.83
ãĤ´ãĥ³
-0.81
elligent
-0.81
TPP
-0.80
NAT
-0.80
ealous
-0.80
POSITIVE LOGITS
surfaces
1.16
cheeks
0.97
surface
0.94
walls
0.93
soaked
0.91
linen
0.89
sidewalks
0.89
bones
0.89
pavement
0.89
scratched
0.88
Activations Density 0.102%