INDEX
    Explanations
    NEGATIVE LOGITS
    "ordova"       -0.07
    " наиболее"    -0.07
    "/player"      -0.07
    " keine"       -0.06
    " Dir"         -0.06
    " in"          -0.06
    " decimals"    -0.06
    "___"          -0.06
    " Ор"          -0.06
    " bpm"         -0.06
    POSITIVE LOGITS
    "Sentence"     0.07
    " QLD"         0.07
    " soldier"     0.06
    "iffe"         0.06
    "esel"         0.06
    " Anglic"      0.06
    "QPCP"         0.06
    "_PROC"        0.06
    " Ruiz"        0.06
    [token missing in source]  0.06
    Activation Density: 0.023%

    No Known Activations