INDEX
Explanations
phrases indicating improvement or growth experiences in life
Head Attr Weights
0:0.04
1:0.01
2:0.11
3:0.06
4:0.37
5:0.07
6:0.03
7:0.01
8:0.07
9:0.11
10:0.04
11:0.02
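
A minimal sketch of how the head attribution weights above could be ranked to find the dominant head. The values are copied from the listing; the names `head_weights` and `rank_heads` are hypothetical and not part of any dashboard API. It also sums the weights, which come to 0.94 rather than 1.00, presumably because each value is rounded to two decimals.

```python
# Head attribution weights copied from the listing above (head index -> weight).
# The variable and function names are illustrative assumptions.
head_weights = {
    0: 0.04, 1: 0.01, 2: 0.11, 3: 0.06, 4: 0.37, 5: 0.07,
    6: 0.03, 7: 0.01, 8: 0.07, 9: 0.11, 10: 0.04, 11: 0.02,
}


def rank_heads(weights):
    """Return (head, weight) pairs sorted by descending weight."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)


ranked = rank_heads(head_weights)
top_head, top_weight = ranked[0]

# Round the sum to two decimals to match the precision of the listing.
total = round(sum(head_weights.values()), 2)

print(f"top head: {top_head} ({top_weight:.2f}), total: {total:.2f}")
```

Under this reading, head 4 carries the largest share of the attribution (0.37), followed by heads 2 and 9 at 0.11 each.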
Negative Logits
bryce   -1.80
Lyn     -1.59
Raven   -1.54
rants   -1.51
Merit   -1.50
NAS     -1.47
Trend   -1.44
Veter   -1.43
eps     -1.40
ums     -1.39
Positive Logits
derog           1.67
indirect        1.53
efficiently     1.40
patriarchy      1.36
teleportation   1.36
terson          1.34
undo            1.34
shorten         1.33
workaround      1.32
shortcut        1.31
Activations Density 0.069%