INDEX
Explanations
instances of the word "you" and its variants, signaling encouragement or directives addressed to the reader
Head Attr Weights
0:0.04
1:0.03
2:0.27
3:0.06
4:0.09
5:0.04
6:0.02
7:0.02
8:0.20
9:0.11
10:0.04
11:0.02
Negative Logits
alde        -1.37
ophon       -1.29
vich        -1.28
hill        -1.24
prototype   -1.21
CV          -1.21
olate       -1.16
milo        -1.16
chens       -1.12
Seah        -1.11
Positive Logits
MpServer     1.27
longevity    1.25
crowd        1.24
ster         1.21
forgiveness  1.20
evolution    1.16
jealousy     1.16
ギ           1.16
EngineDebug  1.15
ウス         1.15
Activations Density: 0.014%