INDEX
Explanations
mentions of law enforcement activities and surveillance
Negative Logits
Bilim
-0.09
ardu
-0.09
spoilers
-0.08
оген
-0.08
jong
-0.08
styl
-0.07
شته
-0.07
مرک
-0.07
aeper
-0.07
ุม
-0.07
Positive Logits
facial
0.10
Facial
0.07
police
0.07
0.07
use
0.07
racial
0.07
public
0.06
vendor
0.06
pat
0.06
Public
0.06
Activations Density 0.003%