INDEX
    Explanations

    sentences indicating a plan or decision to do something

    expressions of intention and decision-making

    New Auto-Interp
    Negative Logits
     UNCLASSIFIED
    -0.68
    âĢ¢âĢ¢
    -0.65
    à¼
    -0.64
     (âĪĴ
    -0.62
    reements
    -0.62
    doms
    -0.61
    phies
    -0.61
     clauses
    -0.60
     trillions
    -0.60
    20439
    -0.60
    POSITIVE LOGITS
     myself
    1.10
     somew
    0.94
     something
    0.87
     ourselves
    0.87
     ASAP
    0.80
     somet
    0.80
    romptu
    0.79
     sometime
    0.79
     someday
    0.78
     themed
    0.77
    Act Density 0.440%

    No Known Activations