INDEX
    Explanations

    tall and slender figure or on his head

    Negative Logits
     twink
    -0.10
    男性
    -0.10
     elderly
    -0.10
     yaşlı
    -0.09
     lep
    -0.09
     Adult
    -0.09
     men
    -0.09
     leer
    -0.09
     empt
    -0.09
     purple
    -0.09
    Positive Logits
     fre
    0.13
     viv
    0.12
     Wir
    0.11
     girl
    0.10
     tom
    0.10
    笑
    0.10
     radi
    0.10
    andid
    0.10
     hoy
    0.09
     laugh
    0.09
    Act Density 0.101%

    No Known Activations