INDEX
    Explanations

    phrases indicating intent or desire, particularly involving "to" followed by verbs

    New Auto-Interp
    Negative Logits
    nikov
    -1.69
    anese
    -1.61
    steps
    -1.58
     steps
    -1.54
     Statutes
    -1.51
    teenth
    -1.50
     Steps
    -1.49
    ories
    -1.49
    âĢIJ
    -1.49
     coats
    -1.49
    POSITIVE LOGITS
     treat
    1.77
     resume
    1.74
     restore
    1.67
     receive
    1.65
     guarantee
    1.64
     recreate
    1.56
     safely
    1.54
     ren
    1.54
     capture
    1.52
     remove
    1.52
    Act Density 0.090%

    No Known Activations