INDEX
    Explanations

    requests and polite language

    Negative Logits
    Token                Logit
    " onData"            -0.75
    "vB"                 -0.75
    "UDC"                -0.73
    "WithIOException"    -0.71
    " virke"             -0.70
    " Herd"              -0.69
    " Kopp"              -0.67
    "ation"              -0.67
    " quella"            -0.66
    "GMENT"              -0.65
    Positive Logits
    Token                Logit
    " please"            2.32
    " Please"            2.16
    "Please"             2.07
    "please"             2.05
    " PLEASE"            1.94
    "PLEASE"             1.75
    " pls"               1.63
    " Bitte"             1.49
    " Pls"               1.45
    "Pls"                1.43
    Activation Density: 0.061%

    No Known Activations