INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     for
    -1.78
     there
    -1.77
     wśród
    -1.46
    chrift
    -1.44
     following
    -1.40
     which
    -1.38
    Now
    -1.38
     dilihat
    -1.37
     provide
    -1.36
     after
    -1.33
    POSITIVE LOGITS
    řízení
    1.45
    Ahol
    1.35
     tuoi
    1.35
    liegende
    1.34
    ",
    
    1.30
     verdaderas
    1.29
     แต่
    1.26
    1.26
     rám
    1.26
    Ee
    1.25
    Act Density 0.206%

    No Known Activations