INDEX
    Explanations

    references to personal experiences and emotions

    New Auto-Interp
    Negative Logits
    igh
    -0.17
    LEY
    -0.17
     Sans
    -0.15
    onis
    -0.15
    essler
    -0.15
    302
    -0.15
    IGH
    -0.14
    ley
    -0.14
    azon
    -0.14
    adaki
    -0.14
    POSITIVE LOGITS
    roulette
    0.15
    spiel
    0.15
    ebin
    0.15
    ecies
    0.14
    æħ§
    0.14
     phá»§
    0.14
    rop
    0.14
    άνα
    0.14
    ampoo
    0.13
    apia
    0.13
    Act Density 0.257%

    No Known Activations