INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     crus
    -0.08
    -0.08
    Jes
    -0.08
    -0.07
     Jes
    -0.07
     understood
    -0.07
     Daw
    -0.07
     delo
    -0.07
    好了
    -0.07
     terce
    -0.07
    POSITIVE LOGITS
     некотор
    0.08
     노력
    0.08
     seeming
    0.08
     apparently
    0.08
     adanya
    0.08
     enduring
    0.08
     זאת
    0.07
     setbacks
    0.07
     prevailing
    0.07
     enabling
    0.07
    Act Density 0.014%

    No Known Activations