INDEX
    Explanations

    phrases indicating additional details or information

    phrases that include the expression "not to mention."

    Negative Logits
    hard      -0.77
    itton     -0.70
    arij      -0.70
    twitch    -0.69
    psons     -0.66
    sis       -0.66
    arat      -0.65
    heres     -0.65
    forums    -0.63
    irm       -0.63
    Positive Logits
     mentioning   0.84
    lihood        0.80
    minus         0.78
     nor          0.76
     aloud        0.72
     mention      0.70
    _>            0.68
     anymore      0.68
     suffice      0.66
     whatsoever   0.66
    Act Density 0.021%

    No Known Activations