INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     doğ
    -0.58
     itſelf
    -0.54
    )();
    -0.53
     Mathilde
    -0.52
     ...............
    -0.49
    becher
    -0.48
    </h1>
    -0.47
     ............
    -0.47
    .";
    -0.46
    MMV
    -0.46
    POSITIVE LOGITS
     autorytatywna
    1.01
    脚注の使い方
    0.96
    +#+
    0.90
    Geplaatst
    0.85
    //
    0.81
    saraba
    0.80
    /*
    0.78
    ">//
    0.75
    AxisAlignment
    0.75
    AnchorStyles
    0.72
    Act Density 0.113%

    No Known Activations