INDEX
Explanations
activities and actions that indicate urgency or critical situations
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.16
3:0.23
4:0.16
5:0.03
6:0.04
7:0.06
8:0.05
9:0.04
10:0.06
11:0.06
Negative Logits
udos
-1.98
itivity
-1.73
Wide
-1.71
verages
-1.65
enza
-1.64
ciplinary
-1.58
ISION
-1.50
enne
-1.50
enum
-1.49
laus
-1.47
POSITIVE LOGITS
or
2.01
newsp
1.76
cknow
1.60
etc
1.59
?,
1.54
→
1.46
yourself
1.44
nor
1.44
XY
1.43
OR
1.42
Activations Density 0.063%