INDEX
Negative Logits
<unused1461>
2.30
𒁍
2.28
𒁁
2.27
slideDuplicate
2.25
𒉋
2.24
𒂆
2.22
𒌇
2.21
atthane
2.21
𒂅
2.21
<unused57>
2.21
POSITIVE LOGITS
es
1.23
1
1.10
2
1.07
(
1.06
(
1.05
of
1.03
this
1.00
not
0.98
’
0.98
This
0.97
Activations Density 0.001%