INDEX
Explanations
statements reflecting personal decisions and experiences
Head Attr Weights
0:0.05
1:0.01
2:0.08
3:0.25
4:0.08
5:0.08
6:0.01
7:0.04
8:0.06
9:0.14
10:0.10
11:0.06
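
The `head:weight` pairs above can be read programmatically. A minimal sketch follows, assuming each entry is an attention-head index followed by its attribution weight. The names `RAW`, `parse_head_weights`, and `rank_heads` are hypothetical helpers introduced for illustration, not part of the dashboard that produced this listing.

```python
# Head attribution weights copied from the listing above (head index:weight).
RAW = "0:0.05 1:0.01 2:0.08 3:0.25 4:0.08 5:0.08 6:0.01 7:0.04 8:0.06 9:0.14 10:0.10 11:0.06"


def parse_head_weights(raw: str) -> dict[int, float]:
    """Turn whitespace-separated 'head:weight' entries into a dict."""
    weights = {}
    for entry in raw.split():
        head, weight = entry.split(":")
        weights[int(head)] = float(weight)
    return weights


def rank_heads(weights: dict[int, float]) -> list[tuple[int, float]]:
    """Sort heads by weight, largest first (ties keep listing order)."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)


weights = parse_head_weights(RAW)
ranked = rank_heads(weights)
total = sum(weights.values())

# Show the three heads with the largest attribution weight.
for head, weight in ranked[:3]:
    print(f"head {head}: {weight:.2f}")
print(f"sum of listed weights: {total:.2f}")
```

On the values listed here, head 3 carries the largest weight (0.25), followed by heads 9 and 10. The listed weights add up to 0.96 rather than 1, which suggests they are rounded or not exactly normalized.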
Negative Logits
Token       Logit
?」          -1.62
depends     -1.51
nowadays    -1.47
)</         -1.43
ancest      -1.41
"},"        -1.36
?)          -1.35
sensit      -1.34
ain         -1.34
)]          -1.33
Positive Logits
Token        Logit
accompanied  1.55
iph          1.45
again        1.37
cellaneous   1.35
Suddenly     1.32
another      1.31
resumed      1.25
adr          1.25
again        1.23
finally      1.20
Activations Density 0.752%