INDEX
Explanations
references to a figure or concept named "Cop"
mentions of "Cop" and related variations referring to law enforcement or police
New Auto-Interp
Negative Logits
WAY
-0.75
WAYS
-0.70
Ń·
-0.67
sbm
-0.64
FORE
-0.63
hower
-0.63
çĦ
-0.62
pport
-0.62
ISTORY
-0.62
velt
-0.61
POSITIVE LOGITS
yrights
1.55
rodu
1.31
yright
1.23
enhagen
1.15
ious
1.10
ilot
0.99
eland
0.97
roph
0.93
afort
0.92
yp
0.91
Activations Density 0.037%