INDEX
    Explanations

    phrases indicating belonging or being part of a group or community

    phrases that emphasize belonging or being part of a group

    Negative Logits
    " assumes"      -0.66
    " ceilings"     -0.65
    "ptive"         -0.65
    " incurred"     -0.63
    " withd"        -0.62
    " directs"      -0.61
    " spouses"      -0.60
    "culosis"       -0.60
    "iasis"         -0.59
    "ants"          -0.58
    Positive Logits
    "¬¼"            0.72
    " the"          0.70
    " Team"         0.70
    "İĭ"            0.68
    "Team"          0.68
    "axy"           0.67
    "ģĸ"            0.67
    " something"    0.66
    " Kinnikuman"   0.65
    "circle"        0.64
    Act Density 0.101%

    No Known Activations