INDEX
Explanations
references to setbacks or challenges in various contexts
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.07
4:0.09
5:0.04
6:0.03
7:0.38
8:0.05
9:0.03
10:0.08
11:0.09
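As a quick way to read the weights listed above, here is a minimal Python sketch that finds the head with the largest weight and its share of the listed total. It assumes each entry is an `index:weight` pair for one attention head; the names `head_weights`, `dominant_head`, and `dominant_share` are illustrative and not part of the dashboard.

```python
# Head attribution weights copied from the listing above (head index -> weight).
# Assumption: each "i:w" entry is the attribution weight for attention head i.
head_weights = {
    0: 0.02, 1: 0.02, 2: 0.05, 3: 0.07, 4: 0.09, 5: 0.04,
    6: 0.03, 7: 0.38, 8: 0.05, 9: 0.03, 10: 0.08, 11: 0.09,
}

# The listed values are rounded, so the total need not be exactly 1.
total = sum(head_weights.values())

# Head with the largest weight, and its share of the listed total.
dominant_head = max(head_weights, key=head_weights.get)
dominant_share = head_weights[dominant_head] / total

print(f"dominant head: {dominant_head} (weight {head_weights[dominant_head]:.2f})")
print(f"share of listed weight: {dominant_share:.0%}")
```

Dividing by the listed total rather than by 1 keeps the share meaningful even though the displayed weights are rounded.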
Negative Logits
alted      -1.76
ceivable   -1.57
TABLE      -1.54
Hum        -1.50
ancies     -1.50
omial      -1.45
onomous    -1.44
ogg        -1.43
SELECT     -1.40
Redditor   -1.40
Positive Logits
morale      1.65
efforts     1.64
setbacks    1.60
setback     1.54
progress    1.48
prospects   1.47
repair      1.44
attempts    1.44
doomed      1.41
miser       1.38
Activations Density 0.001%