INDEX
Explanations
phrases related to legal or judicial actions
references to physical punishment or legal consequences
New Auto-Interp
Negative Logits
proble
-0.72
itiz
-0.71
pires
-0.68
Ö¼
-0.62
{*-0.60
guyen
-0.57
inventoryQuantity
-0.57
ricular
-0.56
ãĥĺ
-0.55
kie
-0.54
POSITIVE LOGITS
respectively
2.42
apiece
1.98
together
1.45
respective
1.42
themselves
1.26
selves
1.22
jointly
1.21
collectively
1.20
together
1.11
each
1.09
Activations Density 0.678%