Explanations
Instances of the word "follow" and its variants, suggesting a focus on themes of following or guidance.
Negative Logits
Token       Logit
arts        -0.17
akash       -0.16
\<^         -0.15
-Token      -0.15
aled        -0.15
nder        -0.14
efeller     -0.14
OfYear      -0.14
olum        -0.14
pras        -0.14
Positive Logits
Token       Logit
closely     0.20
izard       0.18
iston       0.17
.follow     0.16
962         0.16
follow      0.15
Follow      0.15
Obr         0.14
cone        0.14
Follow      0.14
Activations Density 0.088%
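
The density figure above is not defined on this page. A common reading, assumed here, is the fraction of sampled token positions on which the feature fires at all. The sketch below illustrates that reading only: the function name `activation_density`, the zero threshold, and the sample counts are all hypothetical and are not taken from the dashboard's data.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means share of active positions; the page does not
    state how its figure was computed.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 88 of them active.
# These counts are made up for illustration, not taken from the dashboard.
sample = [0.0] * 99_912 + [0.7] * 88
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density well below 1% means the feature is sparse: it is silent on almost every token and fires only on a small set of contexts, which fits an explanation centred on a single word family.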