INDEX
Explanations
references to various time periods, particularly those related to years and months
New Auto-Interp
Negative Logits
of
-0.66
is
-0.56
iformis
-0.52
=
-0.52
lisää
-0.48
more
-0.46
zwungen
-0.45
gez
-0.45
rius
-0.45
tris
-0.45
POSITIVE LOGITS
pleaſure
0.92
Efq
0.87
)";
0.87
,:);
0.85
.")
0.83
faſt
0.83
houſe
0.82
".
0.82
脚注の使い方
0.81
ſmall
0.80
Activations Density 0.125%