INDEX
Explanations
words representing sounds, particularly those starting with the letters 'sh' and 'd'
New Auto-Interp
Head Attr Weights
0:0.03
1:0.03
2:0.06
3:0.06
4:0.05
5:0.05
6:0.39
7:0.04
8:0.06
9:0.06
10:0.07
11:0.05
Negative Logits
guaranteeing
-1.41
abase
-1.29
ashtra
-1.28
shame
-1.27
龍契士
-1.25
darkest
-1.24
ufact
-1.21
acebook
-1.18
peacefully
-1.18
ulhu
-1.18
POSITIVE LOGITS
icz
1.49
ondo
1.38
ForgeModLoader
1.38
bring
1.35
Radiation
1.31
én
1.31
tes
1.30
bash
1.26
zi
1.25
��
1.25
Activations Density 0.002%