    Explanations

    punctuation marks and numbers

    Negative Logits
     fur
    -0.06
    nis
    -0.06
    porter
    -0.06
    oster
    -0.06
    oler
    -0.06
    eni
    -0.06
    nun
    -0.06
    ergy
    -0.06
     Redistribution
    -0.06
    jon
    -0.06
    Positive Logits
     LENG
    0.07
    edor
    0.07
    تى
    0.07
     آمار
    0.07
    antal
    0.07
    (水
    0.07
     ----------------------------------------------------------------------↵
    0.07
     Trafford
    0.06
    جن
    0.06
    UniqueId
    0.06
    Act Density 0.031%

    No Known Activations