INDEX
    Explanations

    initiating dialog or asking questions

    New Auto-Interp
    Negative Logits
     بشكل
    0.71
     providing
    0.65
    複数の
    0.64
     progressivement
    0.63
    包含
    0.63
     العديد
    0.62
     multiple
    0.61
     using
    0.61
     사용하여
    0.60
     발생하는
    0.58
    POSITIVE LOGITS
     당신
    0.90
     Alright
    0.80
     mój
    0.80
     tonight
    0.79
     আমি
    0.78
     señor
    0.78
     내가
    0.78
     gentlemen
    0.77
     Tell
    0.77
     Tonight
    0.76
    Act Density 2.631%

    No Known Activations