INDEX
Explanations
discourses related to competition and success in various fields
Head Attr Weights
0:0.05
1:0.02
2:0.07
3:0.25
4:0.03
5:0.06
6:0.02
7:0.03
8:0.02
9:0.02
10:0.36
11:0.02
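
One way to read the head attribution weights above is to rank the heads and check how much of the listed weight the top few account for. The sketch below is illustrative only and is not part of the dashboard. It assumes each `head:weight` pair is a per-head share of attribution, and the names `head_weights` and `top_heads` are hypothetical. The listed values sum to 0.95 rather than 1.00, presumably because of rounding.

```python
# Head attribution weights copied from the list above (head index -> weight).
head_weights = {
    0: 0.05, 1: 0.02, 2: 0.07, 3: 0.25, 4: 0.03, 5: 0.06,
    6: 0.02, 7: 0.03, 8: 0.02, 9: 0.02, 10: 0.36, 11: 0.02,
}


def top_heads(weights, k=2):
    """Return the k (head, weight) pairs with the largest weights."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)[:k]


# Sum of the listed weights (values are rounded, so this need not equal 1).
total = sum(head_weights.values())

# The two heads with the largest weights, and their share of the listed total.
top = top_heads(head_weights, k=2)
top_share = sum(w for _, w in top) / total

print(top)
print(round(total, 2), round(top_share, 2))
```

Under these assumptions, heads 10 and 3 carry most of the listed attribution, and no other head exceeds 0.07.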
Negative Logits
覚醒          -2.59
href          -2.18
throats       -1.94
actionGroup   -1.92
manually      -1.90
Tweet         -1.90
unpublished   -1.90
Cooldown      -1.86
ingred        -1.85
�             -1.85
Positive Logits
resembles     3.37
resembled     3.07
embodies      2.74
paralle       2.67
succeeds      2.46
resemble      2.46
outper        2.44
parallels     2.40
differs       2.40
dwar          2.38
Activations Density 0.153%