INDEX
Explanations
actions and behaviors associated with processing and decision-making
New Auto-Interp
Head Attr Weights
0:0.53
1:0.02
2:0.04
3:0.07
4:0.03
5:0.05
6:0.03
7:0.03
8:0.05
9:0.05
10:0.02
11:0.02
Negative Logits
characteristic
-1.46
distinguishing
-1.45
progressing
-1.44
Rece
-1.33
hallmark
-1.33
機
-1.30
Kats
-1.30
�
-1.29
oshenko
-1.29
Parenthood
-1.29
POSITIVE LOGITS
ify
3.02
ulate
2.85
itate
2.75
ize
2.64
perse
2.54
igrate
2.49
strate
2.41
pose
2.35
inate
2.23
rouse
2.20
Activations Density 1.329%