INDEX
    Explanations

    phrases related to obstacles and challenges in achieving progress

    Negative Logits
    Token         Logit
    "öyle"        -0.16
    "eding"       -0.15
    "ylon"        -0.14
    "ере"         -0.14
    "HEME"        -0.14
    "ersions"     -0.14
    "eded"        -0.13
    " rus"        -0.13
    "arching"     -0.13
    "иль"         -0.13
    Positive Logits
    Token         Logit
    " us"         0.28
    " him"        0.23
    " me"         0.21
    " them"       0.20
    " itself"     0.18
    " you"        0.17
    " себя"       0.16
    " lui"        0.15
    "Translated"  0.15
    "annya"       0.15
    Activation Density: 0.326%

    No Known Activations