INDEX
    Explanations
    Negative Logits
     autorytatywna  -0.66
     pinulongan  -0.66
    </tfoot>  -0.60
     препратки  -0.58
    aarrggbb  -0.57
     kaynağından  -0.57
     Normdatei  -0.57
    liest  -0.54
    دانشنامهٔ  -0.52
    SPATH  -0.51
    Positive Logits
     MonoBehaviour  0.60
    ed  0.60
     Henk  0.57
    segue  0.55
     Mase  0.54
    elles  0.54
    reserv  0.54
     Battles  0.54
     Addresses  0.54
    ian  0.53
    Activation Density: 0.604%

    No Known Activations