INDEX
    Explanations

    phrases related to continuous improvement and striving for progress

    New Auto-Interp
    Negative Logits
     än
    -0.07
    pee
    -0.07
    kaar
    -0.07
    directive
    -0.06
    rew
    -0.06
    rax
    -0.06
    apur
    -0.06
     suddenly
    -0.06
    gua
    -0.06
    ná
    -0.06
    POSITIVE LOGITS
     improvement
    0.10
     improving
    0.09
     improve
    0.09
     improves
    0.07
     improvements
    0.07
     Improvement
    0.07
     improved
    0.07
     Impro
    0.07
    à¹Īà¸ĩ
    0.07
     learning
    0.07
    Act Density 0.016%

    No Known Activations