INDEX
    Explanations

    mentions of people or entities being part of a group or category

    phrases that indicate a group or collective inclusion

    New Auto-Interp
    Negative Logits
     anytime
    -0.73
    ption
    -0.61
     translates
    -0.61
    rimp
    -0.60
    bs
    -0.58
    hs
    -0.58
    ario
    -0.57
     noses
    -0.57
    ober
    -0.56
    bsite
    -0.56
    POSITIVE LOGITS
     those
    0.93
    those
    0.78
    Ĥ¬
    0.77
     dozens
    0.76
    IJ
    0.76
     hundreds
    0.74
     thousands
    0.72
    st
    0.69
     billions
    0.68
     several
    0.68
    Act Density 0.033%

    No Known Activations