INDEX
    Explanations

    text related to system updates, error messages, and user instructions

    "please" or similar polite requests

    please or equivalent requests

    NEGATIVE LOGITS
    "aget"             -0.54
    "transQ"           -0.47
    "Personendaten"    -0.47
    "Fácil"            -0.47
    "なるほど"          -0.45
    "不然"              -0.44
    " nok"             -0.44
    "мость"            -0.43
    "şte"              -0.43
    "allible"          -0.43
    POSITIVE LOGITS
    " please"    2.40
    " Please"    2.10
    "Please"     2.02
    "please"     2.00
    " PLEASE"    1.87
    " bitte"     1.72
    " pls"       1.71
    "PLEASE"     1.65
    " plz"       1.61
    1.46
    Act Density 0.205%

    No Known Activations