INDEX
Explanations
topics related to overcoming challenges and obstacles
New Auto-Interp
Head Attr Weights
0:0.18
1:0.03
2:0.01
3:0.14
4:0.09
5:0.10
6:0.04
7:0.03
8:0.22
9:0.06
10:0.01
11:0.03
Negative Logits
vertisement
-2.36
english
-2.13
wat
-1.87
DEBUG
-1.84
guiActiveUn
-1.84
ACTED
-1.83
Referred
-1.81
osponsors
-1.80
ateur
-1.78
advertise
-1.77
POSITIVE LOGITS
conquer
2.46
adversity
2.45
overcome
2.33
heroism
2.28
overcame
2.27
triumph
2.22
breakthrough
2.22
victories
2.07
overcoming
2.05
hurdle
2.02
Activations Density 0.005%