INDEX
    Explanations

    instances of the word "por" indicating reasons or purposes

    Negative Logits
    ерп         -0.15
    ilha        -0.15
    urtle       -0.14
    ustil       -0.14
    ripsi       -0.14
    lech        -0.14
     Někter     -0.14
    ksam        -0.14
    endencies   -0.14
    аниц        -0.14
    Positive Logits
     means      0.16
    line        0.16
    ras         0.16
    atch        0.16
    ridge       0.16
     courtesy   0.16
    ro          0.16
    ret         0.16
    zych        0.15
    erro        0.15
    Activation Density 0.016%

    No Known Activations