INDEX
Explanations
“How” followed by question or instruction starters
Sentence openings that begin “how”-type questions, with an especially strong response to quantitative formulations.
Negative Logits
patrón    0.38
Muster    0.37
Anton    0.36
ینا    0.36
kiuj    0.35
Filho    0.35
❁    0.34
analisar    0.34
фаразы    0.34
Bande    0.34
Positive Logits
days    0.42
sodium    0.40
th    0.39
to    0.39
R    0.38
octahedral    0.37
cost    0.37
three    0.37
n    0.36
time    0.36
Activations Density: 0.020%