INDEX
    Explanations

    phrases related to giving commands or directives

    New Auto-Interp
    Negative Logits
    ģĸ
    -0.69
    20439
    -0.67
     Combined
    -0.67
    ļéĨĴ
    -0.64
     purported
    -0.61
     ancest
    -0.59
    features
    -0.58
     culminating
    -0.58
     Vendor
    -0.58
     strikingly
    -0.58
    POSITIVE LOGITS
     yourselves
    1.45
     yourself
    1.10
     thy
    1.00
     your
    0.94
     ye
    0.93
     ya
    0.92
     Yourself
    0.89
     thou
    0.88
     me
    0.86
     fuckin
    0.84
    Act Density 0.281%

    No Known Activations