INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     NEWS
    -0.07
     oznám
    -0.07
     fictional
    -0.07
    Ö
    -0.07
     nu
    -0.06
     lesson
    -0.06
     він
    -0.06
    _Z
    -0.06
     nos
    -0.06
    _SOL
    -0.06
    POSITIVE LOGITS
     carried
    0.14
     Carry
    0.12
     carry
    0.12
     carries
    0.12
     carrying
    0.11
     Carr
    0.09
     Carrie
    0.09
     carrier
    0.09
    carry
    0.09
    روی
    0.09
    Act Density 0.017%

    No Known Activations