INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.08
    捕鱼
    -0.08
     광고
    -0.08
    >NN
    -0.07
    .tv
    -0.07
     Fishing
    -0.07
    /colors
    -0.07
     adject
    -0.07
     electoral
    -0.07
    axon
    -0.07
    POSITIVE LOGITS
     nedeni
    0.10
    (Throwable
    0.10
     Throwable
    0.10
    Прич
    0.09
    原因
    0.09
     причины
    0.09
     reasons
    0.09
    ىنى
    0.09
     oorzaak
    0.09
     Ursache
    0.09
    Act Density 0.002%

    No Known Activations