INDEX
    Explanations

    greetings and salutations in various languages

    New Auto-Interp
    Negative Logits
     bave
    -0.63
     [+
    -0.62
     sumpay
    -0.62
     ContentValues
    -0.60
    >');
    -0.60
    ssymb
    -0.60
     собі
    -0.59
    jstor
    -0.58
    fortawesome
    -0.58
     kaynağından
    -0.58
    POSITIVE LOGITS
     hello
    1.11
     Hello
    1.10
    Hello
    1.10
    hello
    0.94
    Greetings
    0.92
    HELLO
    0.90
    Greeting
    0.89
     HELLO
    0.89
    Привет
    0.87
     Greetings
    0.86
    Act Density 0.101%

    No Known Activations