INDEX
    Explanations

    statements discussing personal feelings and opinions about social situations

    New Auto-Interp
    Negative Logits
     closer
    -0.36
     sabar
    -0.36
     reconocer
    -0.35
     Closer
    -0.33
     reversing
    -0.31
    AndEndTag
    -0.30
     reversed
    -0.29
     recognizes
    -0.29
     recognizing
    -0.28
    reversed
    -0.28
    POSITIVE LOGITS
    httphttps
    0.73
     ſche
    0.59
    ніципалі
    0.57
     Autorizaciones
    0.56
    ruptedException
    0.56
    CPtr
    0.55
     Италијани
    0.54
     queſta
    0.54
    :✨
    0.54
    Попис
    0.54
    Act Density 0.045%

    No Known Activations