INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Previous
    -0.07
     MMA
    -0.07
     někter
    -0.07
     quỹ
    -0.07
    .br
    -0.07
     contradictions
    -0.07
     itemName
    -0.06
     сама
    -0.06
    .species
    -0.06
     application
    -0.06
    POSITIVE LOGITS
    0.08
     bund
    0.07
     tucked
    0.07
     vanished
    0.07
     dropped
    0.07
     makeover
    0.06
    bound
    0.06
     blamed
    0.06
     snaps
    0.06
    usted
    0.06
    Act Density 0.348%

    No Known Activations