    Negative Logits
    Token        Logit
    " two"       -2.34
    "two"        -1.91
    " deux"      -1.74
    "Two"        -1.66
    " Two"       -1.57
    " TWO"       -1.49
    " dois"      -1.47
    " zwei"      -1.38
    "TWO"        -1.33
    " двух"      -1.32
    Positive Logits
    Token            Logit
    " dozen"         0.83
    " thirds"        0.79
    "thirds"         0.77
    " فريبيس"        0.75
    " للاسماء"       0.74
    "aarrggbb"       0.73
    " weeks"         0.72
    "İstinadlar"     0.71
    " semesters"     0.69
    " years"         0.67
    Activation Density: 0.089%

    No Known Activations