INDEX
    Explanations

    quotation marks

    New Auto-Interp
    Negative Logits
     oliva
    -0.08
     tegem
    -0.08
     fora
    -0.08
     κυ
    -0.08
    ريبا
    -0.08
     swiper
    -0.08
    adies
    -0.07
    (rad
    -0.07
     ذهب
    -0.07
    (ref
    -0.07
    POSITIVE LOGITS
    조건
    0.07
    0.07
     nto
    0.07
     discharge
    0.07
     പരാത
    0.07
    ամ
    0.07
    0.07
    0.07
     cou
    0.07
    וסף
    0.07
    Act Density 0.003%

    No Known Activations