INDEX
    Explanations

    find + reflexive pronoun

    New Auto-Interp
    Negative Logits
    ور
    0.80
    ли
    0.71
    ويل
    0.71
    ле
    0.70
    ري
    0.70
    يز
    0.70
    ният
    0.68
    но
    0.68
    ко
    0.67
    во
    0.67
    POSITIVE LOGITS
    in
    0.94
     it
    0.79
     be
    0.74
     out
    0.73
    0.73
     the
    0.73
    af
    0.73
     solace
    0.72
     there
    0.70
    at
    0.70
    Act Density 0.062%

    No Known Activations