INDEX
Explanations
references to law enforcement officers or related terms such as "cop."
occurrences of the word "Cop" and its variations in various contexts
New Auto-Interp
Negative Logits
anwhile
-0.82
committee
-0.75
rely
-0.73
FORE
-0.69
AAAAAAAA
-0.68
itably
-0.67
çĦ
-0.66
laus
-0.65
thought
-0.65
WAY
-0.64
POSITIVE LOGITS
Cop
1.11
Cop
1.10
yrights
1.08
eland
0.88
rodu
0.84
icker
0.81
ioch
0.77
ilic
0.75
lete
0.75
cop
0.75
Activations Density 0.006%