INDEX
    Explanations

    requests for assistance or help

    New Auto-Interp
    Negative Logits
    nmgp
    -0.51
     Etter
    -0.47
    毕竟
    -0.45
    要说
    -0.45
     miedo
    -0.44
    来说
    -0.44
    Ici
    -0.44
     temor
    -0.44
     pravi
    -0.44
     metra
    -0.43
    POSITIVE LOGITS
     please
    0.90
     PLEASE
    0.89
     Pls
    0.89
     pls
    0.89
     plz
    0.86
    Pls
    0.86
    PLEASE
    0.83
     Please
    0.82
    DockStyle
    0.82
    please
    0.82
    Act Density 0.167%

    No Known Activations