INDEX
    Explanations

    references to scientific authors or their works

    names followed by suffixes

    Negative Logits
    <unused8>     -0.81
    <unused41>    -0.81
    <unused42>    -0.80
    <unused23>    -0.80
    <unused68>    -0.80
    <unused43>    -0.80
    <unused16>    -0.80
    <unused51>    -0.80
    <unused47>    -0.80
    <unused14>    -0.80
    Positive Logits
    <eos>        0.47
     Fürst       0.38
     Roskov      0.36
    HideFlags    0.33
     vys         0.31
     cited       0.28
    cshtml       0.28
     account     0.28
                 0.28
    таратура     0.27
    Activation Density: 0.001%

    No Known Activations