INDEX
    Explanations

    sentences ending with a full stop

    New Auto-Interp
    Negative Logits
    ¥ŀ
    -0.84
     volunte
    -0.77
     nodd
    -0.73
     prosec
    -0.72
     encount
    -0.71
     suspic
    -0.70
     confir
    -0.68
     rall
    -0.67
    yip
    -0.67
     advoc
    -0.66
    POSITIVE LOGITS
     My
    1.71
    My
    1.60
     I
    1.56
    I
    1.43
     my
    1.31
     myself
    1.21
     Whenever
    1.17
     Somehow
    1.17
    my
    1.16
     MY
    1.15
    Act Density 0.545%

    No Known Activations