INDEX
Explanations
phrases indicating inclusivity and diversity
phrases and constructs that emphasize the concept of belonging or affiliation
New Auto-Interp
Negative Logits
shown
-0.68
stall
-0.68
enary
-0.66
Haunted
-0.66
imum
-0.65
govtrack
-0.65
ERC
-0.65
AAAA
-0.65
ICA
-0.64
iversary
-0.63
POSITIVE LOGITS
handling
0.92
outlook
0.90
disposition
0.86
manners
0.83
tone
0.79
methodology
0.79
posture
0.78
manner
0.77
dealings
0.77
wording
0.77
Activations Density 0.539%