INDEX
Explanations
verbs used in various grammatical forms to describe existence, capability, or relation
New Auto-Interp
Negative Logits
autorytatywna
-0.98
ſſung
-0.93
erſt
-0.91
ſehen
-0.90
<unused52>
-0.89
<unused53>
-0.89
<unused68>
-0.89
ſehr
-0.88
<unused55>
-0.88
<unused14>
-0.88
POSITIVE LOGITS
is
0.75
has
0.59
was
0.45
will
0.45
’
0.44
are
0.41
may
0.41
是
0.39
doesn
0.36
welches
0.36
Activations Density 0.513%