INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     was
    -0.13
     were
    -0.10
     is
    -0.10
     would
    -0.09
     could
    -0.09
     Was
    -0.08
     can
    -0.08
     are
    -0.08
     had
    -0.08
    was
    -0.08
    POSITIVE LOGITS
    Unavailable
    0.07
    bew
    0.07
    0.07
    іти
    0.06
     UTF
    0.06
    undefined
    0.06
     Erie
    0.06
    ัฒนา
    0.06
    seudo
    0.06
    ceu
    0.06
    Act Density 0.769%

    No Known Activations