INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Buk
    -0.07
    -is
    -0.07
    zial
    -0.07
    masına
    -0.07
    lerine
    -0.06
     LIST
    -0.06
     stupidity
    -0.06
    Box
    -0.06
     petroleum
    -0.06
    に関
    -0.06
    POSITIVE LOGITS
    eggies
    0.06
    *****
    0.06
    _val
    0.06
     settling
    0.06
    ATEGORY
    0.06
    >(&
    0.06
    posted
    0.06
    ']):↵
    0.06
     lizard
    0.06
     →↵↵
    0.06
    Act Density 0.010%

    No Known Activations