INDEX
Explanations
mathematical symbols and operators
New Auto-Interp
Negative Logits
beſch
-0.79
laſſen
-0.77
AsUp
-0.75
kasarigan
-0.75
ſchaft
-0.73
niſſe
-0.71
ſehen
-0.71
Geiſt
-0.71
autorytatywna
-0.71
ſei
-0.71
POSITIVE LOGITS
2
0.48
1
0.47
3
0.41
9
0.41
5
0.41
0
0.41
4
0.41
8
0.39
7
0.36
6
0.35
Activations Density 1.701%