INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     another
    -1.84
    another
    -1.51
     Another
    -1.41
    Another
    -1.37
     ANOTHER
    -1.17
     otro
    -0.95
    另一个
    -0.91
     autre
    -0.85
     otra
    -0.83
     другой
    -0.83
    POSITIVE LOGITS
    ."</
    0.82
     Paglinawan
    0.73
     Forumite
    0.73
    Personendaten
    0.70
    ſelf
    0.69
    ."));
    0.69
    )))));
    0.68
    ſelves
    0.67
    esterday
    0.66
    MLLoader
    0.66
    Act Density 0.668%

    No Known Activations