INDEX
    Explanations

    phrases indicating personal reflection and intention

    New Auto-Interp
    Negative Logits
     AssemblyCulture
    -1.02
    IntoConstraints
    -0.95
     Мексичка
    -0.87
    expandindo
    -0.86
     виправивши
    -0.85
    Portály
    -0.84
    Autoritní
    -0.82
     Italijanski
    -0.82
    دانشنامهٔ
    -0.81
    Portale
    -0.77
    POSITIVE LOGITS
     hit
    0.46
    kaç
    0.45
    ";
    0.45
    If
    0.44
    my
    0.43
    "=>
    0.43
    </u>
    0.43
    will
    0.43
     my
    0.43
    '
    0.42
    Act Density 0.125%

    No Known Activations