INDEX
    Explanations

    stopwords/punctuation

    New Auto-Interp
    Negative Logits
    хьтан
    -1.03
     Paglinawan
    -0.94
    ="#"><
    -0.84
    <bos>
    -0.83
    はじめに
    -0.79
     autorytatywna
    -0.78
    '%(
    -0.77
    Hauptartikel
    -0.73
    ."),
    -0.72
    клопе
    -0.71
    POSITIVE LOGITS
    .
    0.57
     TextInputType
    0.47
    0.45
    
    0.43
    <b>
    0.41
    RefreshLayout
    0.41
    TypedArray
    0.40
    0.39
    ?
    0.38
    DeleteBehavior
    0.37
    Act Density 5.531%

    No Known Activations