INDEX
Explanations
words that convey a sense of excitement or potential
Head Attr Weights
0:0.02
1:0.02
2:0.14
3:0.04
4:0.05
5:0.03
6:0.08
7:0.40
8:0.04
9:0.03
10:0.06
11:0.04
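The head attribution listing above can be read programmatically. The following is a minimal sketch, assuming each line has the form "head:weight" as shown; the names `raw` and `parse_head_weights` are hypothetical and not part of any dashboard API.

```python
# Head attribution weights copied from the listing above, one "head:weight" per line.
raw = """0:0.02
1:0.02
2:0.14
3:0.04
4:0.05
5:0.03
6:0.08
7:0.40
8:0.04
9:0.03
10:0.06
11:0.04"""


def parse_head_weights(text):
    """Parse "head:weight" lines into a dict mapping head index to weight."""
    weights = {}
    for line in text.splitlines():
        head, _, value = line.partition(":")
        weights[int(head)] = float(value)
    return weights


weights = parse_head_weights(raw)

# Head with the largest attribution weight.
top_head = max(weights, key=weights.get)

# Sum of the listed weights (the displayed values are rounded to two decimals).
total = sum(weights.values())

print(top_head, weights[top_head], round(total, 2))
```

On the values listed, head 7 carries the largest weight (0.40), followed by head 2 (0.14). The listed weights sum to 0.95 rather than 1.00, which may simply reflect rounding in the display.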
Negative Logits
POSE        -1.63
Overse      -1.62
incial      -1.62
Industries  -1.62
disgr       -1.60
Resp        -1.59
alias       -1.57
HAEL        -1.57
die         -1.47
utenant     -1.47
Positive Logits
tantal         1.93
uggets         1.89
clinch         1.89
possibilities  1.86
prospects      1.67
pools          1.67
secrets        1.60
tongues        1.59
corridors      1.58
prospect       1.55
Activations Density 0.001%