INDEX
    Explanations

    mentions of various types of inclusions or organizational entities within a broader context

    instances of the word "including."

    New Auto-Interp
    Negative Logits
    iny
    -0.85
    bell
    -0.81
    iri
    -0.80
    ael
    -0.79
    ilion
    -0.78
    ould
    -0.77
    ochond
    -0.76
    erb
    -0.75
    istle
    -0.74
    orc
    -0.73
    POSITIVE LOGITS
     those
    0.89
     ones
    0.73
     ours
    0.73
     yours
    0.71
    lihood
    0.69
     some
    0.68
     prominently
    0.63
     possibly
    0.63
     visits
    0.63
     references
    0.63
    Act Density 0.083%

    No Known Activations