INDEX
    Explanations
    Negative Logits
     antibiotics    -0.07
     overt          -0.06
     Ста            -0.06
                    -0.06
     Sür            -0.06
     fond           -0.06
    -my             -0.06
     preferable     -0.06
     nutné          -0.06
    /npm            -0.06
    Positive Logits
    _LP             0.07
    LP              0.07
     cread          0.06
    ivirus          0.06
    (non            0.06
    書館              0.06
    _$_             0.06
                    0.06
     sns            0.06
     Hitch          0.06
    Activation Density: 0.002%

    No Known Activations