INDEX
    Explanations

    questions or statements indicating a need for action or resolution

    phrases that indicate necessity or requirement for action

    Negative Logits
    Token         Logit
    "gdala"       -0.72
    " Zip"        -0.65
    " Democr"     -0.64
    " Rost"       -0.62
    "izon"        -0.62
    " Mamm"       -0.61
    " Guilty"     -0.59
    " Varg"       -0.57
    "theless"     -0.56
    "ulty"        -0.56
    Positive Logits
    Token            Logit
    " attention"     0.94
    "lessly"         0.91
    " scrutiny"      0.82
    "FINE"           0.77
    " tweaking"      0.77
    "ENTION"         0.75
    " updating"      0.75
    " Citation"      0.73
    " correction"    0.72
    " refinement"    0.72
    Activation Density: 0.078%

    No Known Activations
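
The activation density figure reported above is commonly read as the fraction of sampled tokens on which the feature fires. The page does not say how it was computed, so the following is only a minimal sketch under that assumption. The function name `activation_density`, the `threshold` parameter, and the token counts are hypothetical and chosen purely for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means (tokens with activation > threshold) / (all tokens).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, of which 78 activate the feature.
sample = [0.0] * 99_922 + [1.5] * 78
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density this low would mean the feature is active on fewer than one token in a thousand, which may be related to why no example activations are listed.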