INDEX
Explanations
questions being posed or situations where asking for information is central
instances of the word "ask" and its variations
Negative Logits
zinski    -0.72
pite      -0.68
ensual    -0.65
Ĥ¬        -0.65
Nanto     -0.63
rongh     -0.63
skelet    -0.61
cyclop    -0.61
rily      -0.61
imm       -0.61
Positive Logits
questions     1.36
rhet          1.15
Questions     1.12
probing       1.03
question      1.02
forgiveness   0.98
permission    0.96
wered         0.94
QUEST         0.91
nicely        0.86
Activations Density: 0.055%
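
The density figure above is most naturally read as the share of sampled tokens on which the feature fires at all. The page does not say how it is computed, so the sketch below is only one plausible reading: it assumes a token counts as active when its activation exceeds a threshold of zero. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "active" means strictly greater than the threshold;
    the source page does not specify the cutoff it uses.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 11 active tokens out of 20,000 in total.
sample = [0.0] * 19_989 + [1.5] * 11
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.055% corresponds to roughly one active token in every 1,800, which fits a sparse feature that responds only to question-related text.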