INDEX
Explanations
phrases related to legal cases, criminal activity, and incidents involving law enforcement
New Auto-Interp
Negative Logits
avorite
-0.73
hem
-0.67
ank
-0.66
irie
-0.65
uctions
-0.65
bluff
-0.64
ortium
-0.64
moreover
-0.64
eness
-0.62
warts
-0.61
POSITIVE LOGITS
abouts
0.89
EStream
0.80
bell
0.77
frames
0.76
ZI
0.73
sidx
0.72
frame
0.70
isch
0.70
liest
0.69
GHz
0.68
Activations Density 11.136%