INDEX
    Explanations

    phrases indicating an overall summary or conclusion

    the phrase "in all" and its variations, indicating a focus on inclusivity or totality

    New Auto-Interp
    Negative Logits
    fell
    -0.70
    arthed
    -0.67
    eday
    -0.67
    stadt
    -0.63
     Citation
    -0.63
    bryce
    -0.62
    gaard
    -0.62
    raction
    -0.61
    sworth
    -0.61
    ALSE
    -0.59
    POSITIVE LOGITS
    clusive
    1.11
    CLUS
    0.69
     sudden
    0.67
    oots
    0.67
     together
    0.65
     toget
    0.64
    ighter
    0.63
    together
    0.63
    patient
    0.61
    ooting
    0.61
    Act Density 0.060%

    No Known Activations