INDEX
    Explanations

    phrases related to instructions or rules, including directives and prohibitions

    punctuation marks and specific structural elements of writing, particularly within lists or sections of text

    New Auto-Interp
    Negative Logits
    GMT
    -0.76
    worm
    -0.67
    lock
    -0.64
    rosis
    -0.63
     sunset
    -0.63
    pipe
    -0.63
    nings
    -0.62
    rop
    -0.62
     starter
    -0.62
    raph
    -0.60
    POSITIVE LOGITS
    who
    1.06
     whom
    1.05
    doms
    0.94
     Including
    0.78
    friends
    0.77
    groups
    0.76
     Especially
    0.76
     faiths
    0.74
     alike
    0.72
    shows
    0.71
    Act Density 0.710%

    No Known Activations