INDEX
Explanations
phrases related to effort and achievement
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.07
3:0.17
4:0.17
5:0.03
6:0.04
7:0.23
8:0.03
9:0.02
10:0.05
11:0.10
Negative Logits
Cosponsors
-1.66
Mub
-1.66
audi
-1.50
osphere
-1.41
alty
-1.39
ettings
-1.39
arent
-1.39
ifier
-1.38
Tuc
-1.34
��
-1.32
POSITIVE LOGITS
succeed
1.74
grasp
1.42
apses
1.40
chie
1.39
flesh
1.36
succeeding
1.30
cade
1.30
accompl
1.29
usalem
1.29
unravel
1.29
Activations Density 0.003%