INDEX
    Explanations

    interesting

    Negative Logits
    " smash"         -0.06
    " SMB"           -0.06
    "PA"             -0.06
    " gun"           -0.06
    "Ja"             -0.06
    " Automotive"    -0.06
    "_acl"           -0.06
    " Ava"           -0.06
    "ca"             -0.06
    " quadratic"     -0.06
    Positive Logits
    " interesting"   0.14
    " Interesting"   0.09
    " Pleasant"      0.09
    " Allison"       0.08
    "irk"            0.07
    "Interesting"    0.07
    "interesting"    0.07
                     0.07
    " fascinating"   0.07
    "الب"            0.07
    Act Density 0.013%

    No Known Activations