INDEX
    Explanations

    mathematical symbols and operators

    Negative Logits
    Token               Logit
    " beſch"            -0.79
    " laſſen"           -0.77
    "AsUp"              -0.75
    " kasarigan"        -0.75
    "ſchaft"            -0.73
    "niſſe"             -0.71
    " ſehen"            -0.71
    " Geiſt"            -0.71
    " autorytatywna"    -0.71
    " ſei"              -0.71
    Positive Logits
    Token               Logit
    "2"                 0.48
    "1"                 0.47
    "3"                 0.41
    "9"                 0.41
    "5"                 0.41
    "0"                 0.41
    "4"                 0.41
    "8"                 0.39
    "7"                 0.36
    "6"                 0.35
    Activation Density: 1.701%

    No Known Activations