    Explanations

    the word "for" and related phrases indicating purpose or reason

    Negative Logits
    "iada"              -0.55
    "ışı"               -0.54
    " cortesía"         -0.52
    "withIdentifier"    -0.50
    "انتهای"            -0.49
    "voorbeeld"         -0.49
    "ardin"             -0.49
    " Hotspur"          -0.49
    " casco"            -0.48
    " thang"            -0.48
    Positive Logits
    " there"            1.05
    "there"             0.81
    " they"             0.80
    " Sebab"            0.74
    " Ведь"             0.72
    "Ведь"              0.68
    " we"               0.67
    " THERE"            0.65
    " although"         0.65
    " it"               0.65
    Activation Density: 0.185%

    No Known Activations