INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     hairc
    -1.23
     snoopy
    -1.22
     outlander
    -1.15
     milf
    -1.12
     indescri
    -1.08
     shenan
    -1.08
     madonna
    -1.04
     yoda
    -1.04
     scrat
    -1.04
     cushi
    -1.03
    POSITIVE LOGITS
     lemp
    0.71
    2
    0.63
     perak
    0.63
     krim
    0.61
     seksi
    0.60
     silikon
    0.59
     ekos
    0.59
     karbon
    0.58
    iquest
    0.58
     panik
    0.56
    Act Density 0.144%

    No Known Activations