INDEX
Explanations
phrases related to dangerous incidents, crimes, and police involvement
instances of the word "the" in various contexts
Negative Logits
ledge       -0.89
BILITIES    -0.81
imil        -0.77
ipedia      -0.75
Machina     -0.70
witch       -0.70
certs       -0.67
helm        -0.67
çļ          -0.66
abe         -0.66
Positive Logits
vicinity       1.42
meantime       1.33
aftermath      1.17
midst          1.10
incident       1.09
area           1.08
absence        1.02
wake           1.02
altercation    0.97
intervening    0.95
Activations Density: 0.171%