INDEX
Explanations
the word "are" in sentences
the phrase "We are" used in various contexts
Negative Logits
rouse      -0.76
mater      -0.68
leck       -0.65
osate      -0.64
pedia      -0.64
Rank       -0.64
ossom      -0.63
rarily     -0.63
entails    -0.63
Shape      -0.62
Positive Logits
glad        0.94
hereby      0.93
supposed    0.91
obligated   0.91
thankful    0.91
gonna       0.89
fortunate   0.89
proud       0.88
not         0.87
aware       0.87
Activations Density 0.143%
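
The density figure above is most naturally read as the fraction of token positions on which the feature has a nonzero activation. That reading is an assumption, since the page does not define the metric. A minimal sketch under that assumption follows. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only to reproduce a value of 0.143%.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all; the source page does not spell this out.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 143 firing positions spread over 100,000 tokens.
acts = [0.0] * 100_000
for i in range(143):
    acts[i * 699] = 1.0  # 142 * 699 = 99,258, so every index is in range

density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.143% means the feature fires on roughly one token in 700, which fits a feature tied to a narrow pattern such as "We are".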