INDEX
    Explanations

    references to personal experiences and reflections

    New Auto-Interp
    Negative Logits
     lead
    -0.15
    ucz
    -0.15
    ίÏĥ
    -0.14
    utschein
    -0.13
     spokesperson
    -0.13
     hareket
    -0.13
    lead
    -0.13
    ocket
    -0.13
    psz
    -0.13
    ilar
    -0.13
    POSITIVE LOGITS
     mus
    0.33
     ram
    0.30
     thoughts
    0.30
     Mus
    0.30
     posts
    0.28
     Ram
    0.26
     Posts
    0.26
     Thoughts
    0.25
     randomness
    0.25
    rant
    0.25
    Act Density 0.297%

    No Known Activations