Explanations
phrases related to legal language and crimes involving possession or discrimination
Head Attr Weights
0:0.03
1:0.01
2:0.08
3:0.09
4:0.08
5:0.03
6:0.03
7:0.38
8:0.02
9:0.04
10:0.06
11:0.10
Negative Logits
hed      -1.41
alli     -1.41
elli     -1.40
Bal      -1.36
eline    -1.29
peed     -1.28
fixes    -1.28
herry    -1.27
acio     -1.27
fix      -1.26
Positive Logits
Combine   1.60
GOODMAN   1.36
Adults    1.35
extrad    1.33
Guinness  1.32
unlawful  1.26
conduct   1.25
openly    1.25
usb       1.23
Guilty    1.22
Activations Density 0.005%