INDEX
    NEGATIVE LOGITS
    Token           Logit
    "-tracking"     -0.06
    "_created"      -0.06
    " рів"          -0.06
    " Peng"         -0.06
    " erupt"        -0.06
    "уж"            -0.06
    " относится"    -0.06
    "teen"          -0.06
    " quantities"   -0.06
    "kers"          -0.06

    POSITIVE LOGITS
    Token           Logit
    " lib"          0.07
    "liked"         0.07
    "verse"         0.06
    "399"           0.06
    " like"         0.06
    "(format"       0.06
    "/right"        0.06
    " mini"         0.06
    " cái"          0.06
    "τίου"          0.06
    Act Density 0.000%

    No Known Activations