INDEX
    Explanations

    the word "just" in various contexts

    Negative Logits
    "بس"       -0.07
    "kir"      -0.07
    "dden"     -0.06
    "кин"      -0.06
    "ALIGN"    -0.06
    " Maver"   -0.06
    "pool"     -0.06
    "cir"      -0.06
    "uire"     -0.06
    "lsi"      -0.06
    Positive Logits
    "ifying"       0.07
    "ifice"        0.06
    "ifies"        0.06
    "opia"         0.06
    "ifications"   0.06
    " moments"     0.06
    "jos"          0.06
    "ifiers"       0.06
    "tml"          0.06
    "UnderTest"    0.06
    Activation Density: 0.016%

    No Known Activations