INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     блок
    -0.08
     реч
    -0.08
    uve
    -0.08
     организма
    -0.07
    Blocking
    -0.07
     blokk
    -0.07
    (Block
    -0.07
     Volks
    -0.07
    attie
    -0.07
    Blocks
    -0.07
    POSITIVE LOGITS
     seguida
    0.09
     फिर
    0.09
    然后
    0.09
     তারপর
    0.08
     sonra
    0.08
     پھر
    0.08
     سپس
    0.08
     dabei
    0.08
     condiment
    0.07
     ત્યાર
    0.07
    Act Density 0.011%

    No Known Activations