INDEX
Explanations
words related to law enforcement and legal matters
words related to safety and security
Negative Logits
ĸļ                  -0.85
depths              -0.74
ById                -0.74
ponds               -0.72
ngth                -0.70
\<                  -0.68
ItemThumbnailImage  -0.67
precincts           -0.65
depth               -0.64
wells               -0.63
Positive Logits
agonist             0.88
imity               0.81
incial              0.79
zai                 0.78
vernment            0.75
bably               0.74
ament               0.74
nder                0.72
asus                0.72
hers                0.72
Activations Density: 0.091%