INDEX
    Explanations

    phrases or sentences where an action is being initiated or discussed

    phrases that indicate actions or responses

    NEGATIVE LOGITS
    "inations"    -0.71
    "istant"      -0.68
    "SPONSORED"   -0.67
    "adoes"       -0.63
    "tan"         -0.62
    "adal"        -0.61
    "isable"      -0.60
    "Merc"        -0.60
    "din"         -0.60
    "Silver"      -0.59
    POSITIVE LOGITS
    " virtue"        1.16
    "products"       1.01
    " placing"       0.95
    "laws"           0.95
    " eliminating"   0.94
    " adding"        0.92
    " leaps"         0.91
    " default"       0.89
    " putting"       0.89
    " introducing"   0.88
    Activation Density: 0.084%

    No Known Activations