INDEX
Explanations
mentions of the pronoun "Us" with varying levels of relevance
references to "Us" in various contexts
New Auto-Interp
Negative Logits
served
-0.76
hetti
-0.63
CPC
-0.62
confinement
-0.62
lure
-0.61
Clarkson
-0.61
chaired
-0.60
*/(
-0.59
runoff
-0.56
lull
-0.56
POSITIVE LOGITS
hers
1.20
agi
1.09
ages
0.93
urers
0.92
ern
0.91
uman
0.91
chwitz
0.90
selves
0.88
ername
0.87
umi
0.86
Activations Density 0.019%