INDEX
Explanations
the pronoun "we" followed by a verb in the present or future tense
occurrences of the word "we" in various contexts
New Auto-Interp
Negative Logits
ions
-0.75
odor
-0.72
aversion
-0.71
imum
-0.70
REDACTED
-0.67
citation
-0.66
standing
-0.65
nah
-0.62
ception
-0.59
¿½
-0.58
POSITIVE LOGITS
're
1.39
've
1.34
'll
1.21
celebrate
1.01
commemorate
1.01
athered
0.99
revisit
0.98
introduce
0.98
'd
0.97
salute
0.94
Activations Density 0.134%