INDEX
Explanations
words related to law enforcement or police activities
references to law enforcement personnel
Negative Logits
Token      Logit
topic      -0.97
ayne       -0.87
ranged     -0.79
Pad        -0.78
reality    -0.77
ital       -0.76
Hop        -0.75
itus       -0.73
umen       -0.72
pine       -0.71
Positive Logits
Token      Logit
stationed  0.85
Polic      0.78
policemen  0.75
corrid     0.74
barric     0.70
blot       0.68
zzi        0.65
undert     0.64
guarding   0.64
/          0.64
Activations Density 0.014%