INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Fragment
    -0.06
    _intro
    -0.06
    .context
    -0.06
    on
    -0.06
     divert
    -0.06
    ON
    -0.06
     objs
    -0.06
     نو
    -0.06
    uffed
    -0.06
    Rows
    -0.06
    POSITIVE LOGITS
    AL
    0.10
    al
    0.10
     Casual
    0.08
    HAL
    0.08
    hal
    0.08
    aal
    0.08
    gal
    0.07
    άλ
    0.07
    าล
    0.07
    stral
    0.07
    Act Density 0.100%

    No Known Activations