Explanations
phrases expressing optimism or positive outcomes, particularly in difficult situations
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.08
4:0.09
5:0.04
6:0.02
7:0.42
8:0.03
9:0.04
10:0.05
11:0.09
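The weights above concentrate on a single head. A minimal Python sketch of that reading follows. It assumes the values form a mapping from head index to attribution weight; the names `head_attr` and `dominant_head` are hypothetical and not part of any tool. It picks out the largest entry and its share of the listed total:

```python
# Head attribution weights copied from the listing above (head index -> weight).
# Treating them as a plain dict is an assumption made for illustration.
head_attr = {
    0: 0.02, 1: 0.02, 2: 0.05, 3: 0.08, 4: 0.09, 5: 0.04,
    6: 0.02, 7: 0.42, 8: 0.03, 9: 0.04, 10: 0.05, 11: 0.09,
}


def dominant_head(weights):
    """Return (head, weight, share) for the head with the largest weight.

    share is that weight divided by the sum of all listed weights.
    """
    total = sum(weights.values())
    head = max(weights, key=weights.get)
    return head, weights[head], weights[head] / total


head, weight, share = dominant_head(head_attr)
print(f"head {head}: weight {weight:.2f}, share of listed total {share:.1%}")
```

The share is computed against the sum of the listed values rather than against 1.0, because the rounded weights shown here do not necessarily add up to exactly one.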
Negative Logits
AppData: -1.73
ゼウス: -1.70
endars: -1.66
ype: -1.65
ancies: -1.59
actionGroup: -1.59
bryce: -1.58
itect: -1.52
akura: -1.51
ypes: -1.51
Positive Logits
overlooking: 1.65
defeat: 1.46
neglect: 1.45
forfe: 1.41
setbacks: 1.37
failures: 1.37
omission: 1.34
ill: 1.34
Russo: 1.34
outing: 1.33
Activations Density: 0.001%