INDEX
Explanations
references to scientific authors or their works
names followed by suffixes
Negative Logits
<unused8>   -0.81
<unused41>  -0.81
<unused42>  -0.80
<unused23>  -0.80
<unused68>  -0.80
<unused43>  -0.80
<unused16>  -0.80
<unused51>  -0.80
<unused47>  -0.80
<unused14>  -0.80
Positive Logits
<eos>      0.47
Fürst      0.38
Roskov     0.36
HideFlags  0.33
vys        0.31
cited      0.28
cshtml     0.28
account    0.28
�          0.28
таратура   0.27
Activations Density: 0.001%