INDEX
    Explanations

    references to English and other languages

    New Auto-Interp
    Negative Logits
     Forumite
    -0.81
     iſt
    -0.79
    存于互联网档案馆
    -0.79
    adays
    -0.79
     itſelf
    -0.79
     myſelf
    -0.78
    AutoScaleMode
    -0.76
    ̈́
    -0.74
     ſeveral
    -0.74
    .}~\
    -0.74
    POSITIVE LOGITS
     English
    1.65
    English
    1.59
     english
    1.22
    english
    1.19
     ENGLISH
    1.17
    ENGLISH
    1.06
     inglés
    0.91
    英语
    0.89
     Spanish
    0.86
     Englisch
    0.82
    Act Density 0.061%

    No Known Activations