INDEX
    Explanations

    phrases indicating uncertainty or hesitation

    New Auto-Interp
    Negative Logits
    ä¸ļ
    -0.17
    cken
    -0.16
    plen
    -0.14
    δά
    -0.14
    retty
    -0.14
     svÄĽ
    -0.14
    jez
    -0.13
     regards
    -0.13
    inic
    -0.13
     Tight
    -0.13
    POSITIVE LOGITS
     dwelling
    0.21
     bore
    0.21
     Into
    0.18
     spoiler
    0.18
     detail
    0.18
     boring
    0.18
    Into
    0.17
     labour
    0.17
     into
    0.17
     dig
    0.17
    Act Density 0.094%

    No Known Activations