INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .NUM
    -0.06
    оны
    -0.06
    .mb
    -0.06
     halfway
    -0.06
    PIO
    -0.06
     numa
    -0.06
    (retval
    -0.06
    енсив
    -0.06
    tings
    -0.06
    ैप
    -0.06
    POSITIVE LOGITS
     bacter
    0.16
    acter
    0.10
    zer
    0.08
     Agent
    0.08
     Baker
    0.08
     prot
    0.08
     Victor
    0.08
    cter
    0.07
    TER
    0.07
     poster
    0.07
    Act Density 0.002%

    No Known Activations