INDEX
NEGATIVE LOGITS
^(@)             -0.69
wiſe             -0.66
IMPORTED         -0.65
CopyWith         -0.65
ſame             -0.62
ſtand            -0.62
ſche             -0.61
Jefus            -0.61
ſch              -0.60
PreferredItem    -0.60
POSITIVE LOGITS
that             1.20
,                0.94
that             0.62
That             0.59
bahwa            0.53
That             0.52
which            0.51
že               0.49
kwamba           0.48
Picchu           0.48
Activations Density 0.000%