INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Zika
    -0.07
    ована
    -0.06
     bekan
    -0.06
     lept
    -0.06
     máu
    -0.06
     вид
    -0.06
    突然
    -0.06
     Fisheries
    -0.06
     привы
    -0.06
     같다
    -0.06
    POSITIVE LOGITS
     ideological
    0.06
     pdf
    0.06
     Thumb
    0.06
    _THREADS
    0.06
     gameState
    0.06
     Railroad
    0.06
    armac
    0.06
     necessary
    0.06
     Fashion
    0.06
     grouped
    0.06
    Act Density 0.016%

    No Known Activations