    Negative Logits
    Token             Logit
    AndEndTag         -1.23
    ſſung             -1.19
    iſchen            -1.16
    <unused3>         -1.14
    <unused51>        -1.14
    <unused17>        -1.14
    <unused41>        -1.14
    <unused28>        -1.14
    <unused8>         -1.14
    <unused14>        -1.14

    Positive Logits
    Token             Logit
    ;                 0.57
    .                 0.53
    );                0.44
     ;                0.41
     sientes          0.39
     Umum             0.37
    2                 0.36
    ();               0.35
     Zusammensetzung  0.35
    ?                 0.34
    Activation Density: 0.004%

    No Known Activations