INDEX
    Explanations

    the beginning of text ("<bos>")

    NEGATIVE LOGITS
    Token                 Logit
    " unspeak"            -2.81
    " reluct"             -2.68
    " disgra"             -2.61
    " unlaw"              -2.60
    " shenan"             -2.50
    " impractica"         -2.49
    " impra"              -2.46
    " disagre"            -2.42
    " ineffec"            -2.42
    " horrend"            -2.41

    POSITIVE LOGITS
    Token                 Logit
    "<bos>"               14.48
    "GEBURTSDATUM"        2.53
    "expandindo"          2.52
    " betweenstory"       2.49
    "Autoritní"           2.46
    "تقاوى"               2.20
    " Italijani"          2.16
    " Administrativna"    2.16
    " Paglinawan"         2.12
    " kasarigan"          2.10
    Activation Density: 0.088%

    No Known Activations