INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     الثالث
    -1.27
     three
    -1.24
     both
    -1.19
     third
    -1.13
    third
    -1.03
    Three
    -1.02
     ketiga
    -1.02
     beide
    -1.02
     Thirdly
    -1.00
    Third
    -0.98
    POSITIVE LOGITS
     fifth
    1.05
     cinquième
    0.92
     sixth
    0.88
     poslední
    0.87
    第五
    0.84
     סוג
    0.84
     impati
    0.83
    fifth
    0.81
    asiswa
    0.80
    Fifth
    0.79
    Act Density 0.228%

    No Known Activations