    Negative Logits
    Token          Logit
                   -0.07
    " Carson"      -0.07
    " glam"        -0.06
    "oves"         -0.06
    "ountain"      -0.06
    " جر"          -0.06
    " قي"          -0.06
    "LED"          -0.06
    "plant"        -0.06
    "iltr"         -0.06
    Positive Logits
    Token            Logit
    " injunction"    0.07
    ",key"           0.06
    "็ว"             0.06
    " integrates"    0.06
    "},↵↵"           0.06
    "__.__"          0.06
    " polar"         0.06
    "atasets"        0.06
    ".getConfig"     0.06
    "\tJ"            0.06
    Activation Density: 0.005%

    No Known Activations