INDEX
Explanations
vocabulary related to physical violence and law enforcement situations
New Auto-Interp
Negative Logits
Sync
-0.77
natureconservancy
-0.75
iour
-0.66
sync
-0.64
nox
-0.63
ivities
-0.62
erd
-0.60
<!--
-0.59
rio
-0.59
oyer
-0.58
POSITIVE LOGITS
by
1.22
by
0.94
pursuant
0.74
owing
0.72
ãĥ¼ãĥ³
0.72
because
0.71
due
0.71
BY
0.68
since
0.64
*/(
0.63
Activations Density 0.534%