INDEX
Explanations
phrases indicating a stance or position on a matter
instances of the phrase "we are" and variations thereof, indicating collective statements or affirmations
New Auto-Interp
Negative Logits
rouse
-0.73
mater
-0.71
meets
-0.65
bloc
-0.65
odor
-0.65
sed
-0.64
ousy
-0.63
othal
-0.62
emerges
-0.62
itters
-0.61
POSITIVE LOGITS
ourselves
0.98
thankful
0.94
fortunate
0.94
glad
0.93
proud
0.88
hereby
0.87
grateful
0.85
gonna
0.85
aware
0.85
hoping
0.84
Activations Density 0.106%