INDEX
    Explanations

    repetitive phrases emphasizing similarity or sameness

    New Auto-Interp
    Negative Logits
     defin
    -0.15
    oola
    -0.15
    aro
    -0.14
    ÄĻd
    -0.14
    null
    -0.14
    astle
    -0.14
     Kend
    -0.14
     null
    -0.14
    ersed
    -0.14
    Null
    -0.13
    POSITIVE LOGITS
    odia
    0.15
    EIF
    0.15
    abal
    0.15
    reff
    0.15
    zp
    0.14
    اÙĪØ±ÛĮ
    0.14
    unami
    0.14
     عرض
    0.14
    ãģªãĤī
    0.14
    CORE
    0.14
    Act Density 0.039%

    No Known Activations