INDEX
    Explanations

    expressions of uncertainty or loss regarding existence and sustainability

    New Auto-Interp
    Negative Logits
     somehow
    -0.19
    benh
    -0.17
    /*č↵
    -0.15
    igo
    -0.15
    udas
    -0.15
    quate
    -0.15
    IGO
    -0.15
    ãĥªãĥ¼ãĤº
    -0.15
    alia
    -0.14
    اعب
    -0.14
    POSITIVE LOGITS
     anymore
    1.09
     nữa
    0.58
     lagi
    0.43
     again
    0.34
     longer
    0.33
    åĨį
    0.31
     artık
    0.31
    again
    0.27
     further
    0.26
     دÛĮگر
    0.25
    Act Density 0.228%

    No Known Activations