INDEX
Explanations
phrases related to law enforcement
occurrences of the word "new."
New Auto-Interp
Negative Logits
suspic
-0.67
Reloaded
-0.64
NAACP
-0.61
dissu
-0.58
Vance
-0.58
________________________
-0.57
Reconstruction
-0.57
................................................................
-0.57
Duchess
-0.57
Monteneg
-0.56
POSITIVE LOGITS
riter
1.43
ritten
1.40
ords
1.30
estern
1.27
orth
1.25
isdom
1.23
idth
1.20
ITNESS
1.19
olf
1.19
alker
1.18
Activations Density 0.044%