INDEX
Explanations
frequently occurring letters in the text
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.19
3:0.08
4:0.08
5:0.05
6:0.24
7:0.02
8:0.05
9:0.08
10:0.06
11:0.04
Negative Logits
headlights
-1.55
imeters
-1.50
scissors
-1.32
magnets
-1.20
cloaked
-1.18
hardness
-1.17
Jinn
-1.14
grit
-1.14
Lithuan
-1.11
evenly
-1.09
POSITIVE LOGITS
rar
1.51
ヴァ
1.50
anto
1.39
intend
1.37
esta
1.32
版
1.30
avan
1.29
imate
1.29
オ
1.28
UTE
1.28
Activations Density 0.009%