INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ignet
    -0.09
    leigh
    -0.09
    451
    -0.08
    {{{
    -0.08
    ë¦¬ë¡ľ
    -0.08
    kes
    -0.08
     alive
    -0.08
    zac
    -0.08
    ADC
    -0.08
     sev
    -0.08
    POSITIVE LOGITS
     being
    0.53
    being
    0.40
     sendo
    0.33
     becoming
    0.31
     Being
    0.30
    Being
    0.29
    被
    0.29
     essere
    0.27
     siendo
    0.26
     having
    0.22
    Act Density 0.286%

    No Known Activations