INDEX
Explanations
references to collective identity and ownership related to the concept of "our."
New Auto-Interp
Negative Logits
wic
-0.75
bender
-0.74
puff
-0.73
Levine
-0.70
yang
-0.70
FU
-0.69
quart
-0.68
ault
-0.68
76561
-0.67
coached
-0.67
POSITIVE LOGITS
nation
1.17
selves
1.13
ancestors
1.05
own
1.04
shores
1.02
society
1.02
collective
1.00
democracy
0.96
beloved
0.96
civilization
0.93
Activations Density 0.088%