INDEX
    Explanations

    greetings and polite expressions in conversation

    Negative Logits
    IntoConstraints   -1.19
    expandindo        -1.01
     виправивши       -0.99
    +#+#              -0.95
     myſelf           -0.90
    webElementXpaths  -0.83
     <>",             -0.82
     Roskov           -0.81
     Italijanski      -0.80
     itſelf           -0.80
    Positive Logits
     factor           0.43
    se                0.42
    </i>              0.42
    <bos>             0.41
     tend             0.40
     ut               0.40
    ś                 0.39
    elementAt         0.38
     is               0.38
    ству              0.38
    Act Density 0.603%

    No Known Activations