INDEX
Explanations
phrases related to difficulty and struggle, particularly in challenging situations
Head Attr Weights
0:0.01
1:0.02
2:0.07
3:0.07
4:0.01
5:0.03
6:0.06
7:0.33
8:0.09
9:0.03
10:0.06
11:0.17
Negative Logits
folios      -1.34
auri        -1.26
untled      -1.17
orks        -1.13
eon         -1.09
abo         -1.07
OUP         -1.07
Gleaming    -1.05
elligent    -1.03
vous        -1.01
Positive Logits
CTR         1.26
DK          1.11
DCS         1.09
ファ        1.01
Nanto       1.01
NCT         0.99
FML         0.98
DF          0.96
achieving   0.94
RL          0.93
Activations Density: 0.042%