INDEX
Negative Logits
'
-2.52
has
-2.47
to
-2.47
is
-2.36
↑↑↑</
-2.33
!
-2.19
-2.13
(
-2.03
D
-1.95
does
-1.95
POSITIVE LOGITS
盌
3.08
2
3.03
插画
2.58
媖
2.58
ꨄ
2.52
傒
2.50
8
2.48
჻
2.48
jepang
2.45
4
2.45
Activations Density 0.001%
'
has
to
is
↑↑↑</
!
(
D
does
盌
2
插画
媖
ꨄ
傒
8
჻
jepang
4