INDEX
    NEGATIVE LOGITS
    Token             Logit
    " Millard"        -0.64
    " cards"          -0.58
    "odeficiency"     -0.57
    "pekt"            -0.57
    "occasione"       -0.56
    "rateur"          -0.56
    "ulihan"          -0.55
    (not shown)       -0.55
    "رحلة"            -0.55
    " lid"            -0.54
    POSITIVE LOGITS
    Token             Logit
    " whether"        3.40
    "whether"         3.19
    " WHETHER"        3.04
    " Whether"        2.80
    "Whether"         2.69
    " apakah"         1.84
    "是否"            1.55
    " 是否"           1.40
    "是否有"          1.29
    " Apakah"         1.27
    Activation Density: 0.073%

    No Known Activations