INDEX
    Explanations

    phrases related to actions or commands

    action words that indicate gameplay and interaction

    New Auto-Interp
    Negative Logits
    cius
    -0.71
    thodox
    -0.67
     unlaw
    -0.67
    aith
    -0.64
    icum
    -0.64
    rake
    -0.63
    Applic
    -0.63
    minecraft
    -0.61
    LGBT
    -0.60
    iosyn
    -0.60
    POSITIVE LOGITS
     yourself
    1.04
     yourselves
    0.97
     your
    0.83
    Tube
    0.65
    animate
    0.61
     butterflies
    0.61
     YOUR
    0.60
    lda
    0.60
    pez
    0.58
     realise
    0.58
    Act Density 0.190%

    No Known Activations