INDEX
Explanations
inspirational quotes or philosophical advice
New Auto-Interp
Head Attr Weights
0:0.12
1:0.03
2:0.07
3:0.06
4:0.04
5:0.13
6:0.08
7:0.07
8:0.06
9:0.08
10:0.13
11:0.07
Negative Logits
gnu
-1.17
).[
-1.15
latter
-1.12
respectively
-1.11
attest
-1.09
});
-1.07
sic
-1.07
>.
-1.07
}.
-1.06
purposes
-1.06
POSITIVE LOGITS
��
1.13
verning
1.11
ヘラ
1.10
cipled
1.07
ゼウス
1.06
は
1.04
Beat
1.03
Gim
1.03
ギ
1.01
ワン
1.00
Activations Density 0.100%