    Explanations

    phrases related to providing instructions or steps

    commanding phrases suggesting action or engagement

    Negative Logits
    "bara"         -0.81
    "quartered"    -0.75
    "éĹ"           -0.70
    "lied"         -0.69
    "otten"        -0.65
    "ELD"          -0.64
    "KO"           -0.63
    "ago"          -0.63
    "alky"         -0.62
    "pine"         -0.60
    Positive Logits
    " ourselves"       1.07
    " together"        0.71
    " OUR"             0.68
    "querade"          0.66
    " clarify"         0.64
    " anew"            0.64
    " aside"           0.63
    " collectively"    0.63
    " REAL"            0.62
    " tomorrow"        0.62
    Activation Density: 0.102%

    No Known Activations