INDEX
    Explanations

    phrases that include the words "including" or "especially" to highlight specific entities within a larger group

    references to inclusion and participation in various contexts

    New Auto-Interp
    Negative Logits
    bis
    -0.78
    ript
    -0.72
    ielding
    -0.68
    ahime
    -0.68
    ibilities
    -0.67
    ossession
    -0.66
    ciation
    -0.66
    ioxide
    -0.66
    erred
    -0.65
    Features
    -0.64
    POSITIVE LOGITS
     myself
    1.33
     ourselves
    1.11
     journalists
    1.07
     yourselves
    1.04
     oneself
    1.03
     clergy
    1.02
     politicians
    1.02
     celebrities
    1.01
     feminists
    1.00
     strangers
    1.00
    Act Density 0.237%

    No Known Activations