INDEX
    Explanations

    online forum/blog text

    NEGATIVE LOGITS
    ереч           -0.06
    -u             -0.06
     CBC           -0.06
     NASCAR        -0.06
    gu             -0.06
     paternal      -0.06
    ाल             -0.06
     Chaos         -0.06
     scrollTop     -0.06
     gu            -0.06
    POSITIVE LOGITS
     unrestricted  0.07
     інозем        0.07
     glimpse       0.06
    PEC            0.06
    (/\            0.06
     numeral       0.06
     для           0.06
     primer        0.06
    _FILE          0.06
     لكل           0.06
    Activation Density: 0.005%

    No Known Activations