INDEX
    Explanations

    references to personal experiences and daily life activities

    New Auto-Interp
    Negative Logits
    á»ĭ
    -0.17
    pany
    -0.15
     oppon
    -0.15
    ummer
    -0.15
    ths
    -0.15
    .kotlin
    -0.15
    appe
    -0.14
    ucks
    -0.14
    å
    -0.14
     predecess
    -0.13
    POSITIVE LOGITS
    andbox
    0.15
    olio
    0.15
    meli
    0.14
    adÃŃ
    0.14
    illac
    0.14
     Blaze
    0.13
     Windsor
    0.13
    ~/
    0.13
    lient
    0.13
     Morav
    0.13
    Act Density 0.498%

    No Known Activations