INDEX
    Explanations

    conversational interactions and inquiries involving shared experiences or questions

    New Auto-Interp
    Negative Logits
     بيها
    -0.79
     للمعارف
    -0.69
     متعلقه
    -0.64
    Czytaj
    -0.59
    ślę
    -0.57
    ihnachten
    -0.56
    Identyfik
    -0.56
    +#+#
    -0.55
    unanje
    -0.54
    antaranya
    -0.52
    POSITIVE LOGITS
    ?!
    0.84
    ?!?
    0.78
    ?)
    0.78
    ?),
    0.73
    !?
    0.71
    ?).
    0.68
    ?"
    0.68
    ?!?!
    0.67
    ?:
    0.67
    ?!"
    0.67
    Act Density 0.151%

    No Known Activations