INDEX
    Explanations

    phrases related to questions or prompts

    punctuation marks, particularly periods and question marks

    New Auto-Interp
    Negative Logits
    jri
    -0.77
    vertisement
    -0.75
     extingu
    -0.70
     phased
    -0.69
     glim
    -0.68
     purch
    -0.67
     bounded
    -0.67
     confir
    -0.67
    tera
    -0.66
     satell
    -0.66
    POSITIVE LOGITS
     Lastly
    2.09
     Finally
    1.98
     These
    1.58
     Both
    1.58
     Whatever
    1.53
    Finally
    1.53
    Lastly
    1.48
     Together
    1.46
    etc
    1.41
    These
    1.40
    Act Density 0.441%

    No Known Activations