INDEX
    Explanations

    perfect tense auxiliaries

    New Auto-Interp
    Negative Logits
    ãģķãĤĮãĤĭ
    -0.10
     youre
    -0.10
    ãģıãĤĭ
    -0.10
    æŃ£åľ¨
    -0.10
    ีà¸Ńย
    -0.09
    ëIJĺëĬĶ
    -0.09
    ãĤīãĤĮãĤĭ
    -0.09
     æŃ£
    -0.09
    ãĤĴãģĻãĤĭ
    -0.09
    777
    -0.08
    POSITIVE LOGITS
     telah
    0.47
     Äijã
    0.44
     haber
    0.35
    've
    0.30
    ’;ve
    0.30
    æĽ¾
    0.29
     have
    0.25
    å·²ç»ı
    0.24
     yapmÄ±ÅŁ
    0.24
    å·²
    0.23
    Act Density 0.386%

    No Known Activations