INDEX
Explanations
phrases that include the words "including" or "especially" to highlight specific entities within a larger group
references to inclusion and participation in various contexts
New Auto-Interp
Negative Logits
bis
-0.78
ript
-0.72
ielding
-0.68
ahime
-0.68
ibilities
-0.67
ossession
-0.66
ciation
-0.66
ioxide
-0.66
erred
-0.65
Features
-0.64
POSITIVE LOGITS
myself
1.33
ourselves
1.11
journalists
1.07
yourselves
1.04
oneself
1.03
clergy
1.02
politicians
1.02
celebrities
1.01
feminists
1.00
strangers
1.00
Activations Density 0.237%