INDEX
    Explanations

    time-related phrases like "less than" and "just."

    phrases indicating short time frames or quick actions

    New Auto-Interp
    Negative Logits
    ourge
    -0.70
    apego
    -0.67
     congratulated
    -0.64
    aily
    -0.63
    illard
    -0.61
    orem
    -0.58
    yss
    -0.58
    sers
    -0.58
     underestimate
    -0.57
     inhibitors
    -0.57
    POSITIVE LOGITS
     guise
    0.91
     nutshell
    0.85
     manner
    0.82
     terms
    0.79
     increments
    0.79
     fashion
    0.77
     regard
    0.75
     context
    0.75
     form
    0.71
    Shape
    0.71
    Act Density 0.149%

    No Known Activations