INDEX
Explanations
phrases related to legal proceedings and consequences
Head Attr Weights
0:0.04
1:0.02
2:0.06
3:0.12
4:0.05
5:0.07
6:0.03
7:0.03
8:0.05
9:0.18
10:0.22
11:0.08
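
The head attribution weights listed above can be ranked to see which heads contribute most. The following is a minimal sketch, assuming each entry reads as `head index:weight`; the names `head_attr_weights` and `top_heads` are hypothetical and used only for illustration.

```python
# Head attribution weights copied from the listing above (head index -> weight).
head_attr_weights = {
    0: 0.04, 1: 0.02, 2: 0.06, 3: 0.12, 4: 0.05, 5: 0.07,
    6: 0.03, 7: 0.03, 8: 0.05, 9: 0.18, 10: 0.22, 11: 0.08,
}


def top_heads(weights, k=3):
    """Return the k (head, weight) pairs with the largest weight, descending."""
    return sorted(weights.items(), key=lambda item: item[1], reverse=True)[:k]


top3 = top_heads(head_attr_weights)
total = sum(head_attr_weights.values())
# Fraction of the listed attribution mass carried by the top three heads.
share = sum(weight for _, weight in top3) / total

print(top3)             # [(10, 0.22), (9, 0.18), (3, 0.12)]
print(f"{share:.1%}")   # 54.7%
```

Under this reading, heads 10, 9, and 3 together account for a little over half of the listed weight, which sums to roughly 0.95 rather than 1.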
Negative Logits
Favorite    -1.12
ゴ          -1.04
ゼウス      -1.04
Born        -1.03
Entered     -1.03
NAME        -1.03
UES         -1.00
Selected    -0.97
ointment    -0.97
reetings    -0.96
Positive Logits
deterrent    1.24
mitigation   1.20
mitigating   1.18
moot         1.15
loopholes    1.14
deterrence   1.13
anecdotal    1.13
hazard       1.10
situational  1.03
policing     1.02
Activations Density 1.736%