INDEX
Explanations
phrases related to maximizing resources and experiences
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.18
3:0.06
4:0.26
5:0.03
6:0.06
7:0.17
8:0.03
9:0.03
10:0.05
11:0.04
Negative Logits
captcha
-1.59
bernatorial
-1.58
nih
-1.53
aughtered
-1.51
estern
-1.50
ozo
-1.49
akura
-1.49
itially
-1.46
Cosponsors
-1.45
itcher
-1.43
POSITIVE LOGITS
newfound
1.49
resultant
1.36
gadgets
1.35
brav
1.35
bees
1.32
athleticism
1.29
fundamentals
1.28
simplicity
1.27
enjoyment
1.25
coupling
1.23
Activations Density 0.002%