INDEX
    Explanations

    punctuation and reflective thoughts on personal development

    New Auto-Interp
    Negative Logits
    oft
    -0.16
     favored
    -0.16
     unto
    -0.16
     theater
    -0.15
     favors
    -0.15
     colorful
    -0.15
     overly
    -0.15
    oogle
    -0.14
     sher
    -0.14
    ëĪ
    -0.14
    POSITIVE LOGITS
     Till
    0.17
     Beste
    0.17
    ĵn
    0.15
     specialised
    0.15
    enticated
    0.14
    анÑĸз
    0.14
     wet
    0.14
    isci
    0.14
    ä¸įå¾Ĺ
    0.14
    Ü
    0.14
    Act Density 0.004%

    No Known Activations