INDEX
    Explanations

    references to personal experiences and opinions over time

    Negative Logits
    Token       Logit
    "ODO"       -0.15
    "kö"        -0.15
    "ala"       -0.15
    "estro"     -0.15
    "robe"      -0.15
    "usher"     -0.15
    "ardown"    -0.14
    "icensed"   -0.14
    "inal"      -0.14
    "addy"      -0.14
    Positive Logits
    Token           Logit
    "enic"          0.16
    " reg"          0.15
    "ose"           0.14
    " Bald"         0.14
    " regenerate"   0.14
    "iffin"         0.14
    "ITCH"          0.14
    " myself"       0.13
    " Echo"         0.13
    " Greg"         0.13
    Activation Density: 0.131%

    No Known Activations