INDEX
    Explanations

    greetings and polite expressions in communication

    Formal greetings and expressions of gratitude

    New Auto-Interp
    Negative Logits
     but
    -0.60
    aarrggbb
    -0.59
    anskje
    -0.55
    Jvm
    -0.51
    但她
    -0.50
    لكن
    -0.50
     But
    -0.49
    ср
    -0.48
    -0.48
    不過
    -0.48
    POSITIVE LOGITS
    Hi
    1.65
    Dear
    1.46
     Hi
    1.42
    Hello
    1.40
     Hello
    1.34
     Dear
    1.30
     hello
    1.20
    Hey
    1.18
    hello
    1.14
     Hey
    1.11
    Act Density 0.290%

    No Known Activations