INDEX
Explanations
references to physical injuries and medical conditions
New Auto-Interp
Negative Logits
uet
-0.16
textured
-0.15
èĥİ
-0.15
ueur
-0.14
मन
-0.14
.squeeze
-0.14
kicker
-0.14
orate
-0.14
skirts
-0.13
ÏĦικÏĮ
-0.13
POSITIVE LOGITS
injuries
0.60
injury
0.55
Injury
0.49
injured
0.47
wounds
0.41
juries
0.39
伤
0.38
wound
0.36
wounded
0.36
åĤ·
0.35
Activations Density 0.325%