INDEX
Explanations
references to law enforcement or sheriff-related topics
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.09
3:0.07
4:0.08
5:0.09
6:0.08
7:0.08
8:0.08
9:0.08
10:0.06
11:0.08
Negative Logits
cumbers
-2.97
hemor
-2.80
ouver
-2.75
obar
-2.62
quit
-2.46
Sov
-2.45
destro
-2.43
warr
-2.41
reprene
-2.40
disappro
-2.35
POSITIVE LOGITS
Network
2.41
iol
2.40
Image
2.40
hide
2.38
Nic
2.33
Logo
2.32
Niger
2.26
Networks
2.23
elt
2.21
LGBT
2.20
Activations Density 0.000%