INDEX
Explanations
instances of arrests and related legal actions
New Auto-Interp
Negative Logits
acre
-0.17
urpose
-0.15
irm
-0.15
anch
-0.15
ittle
-0.15
ikat
-0.14
artz
-0.14
gth
-0.14
oller
-0.13
rint
-0.13
POSITIVE LOGITS
ees
0.25
warrant
0.23
WARRANT
0.20
ee
0.20
گاÙĩ
0.20
warrants
0.19
ingly
0.18
ivals
0.17
eeee
0.17
aurant
0.17
Activations Density 0.017%