INDEX
Explanations
words related to questioning or pondering
expressions of curiosity or questioning thoughts
Negative Logits
Token         Logit
ccording      -0.96
ĪĴ            -0.82
interstitial  -0.76
enses         -0.74
Ĥ¬            -0.67
existence     -0.66
alach         -0.65
orneys        -0.63
etitive       -0.62
hetic         -0.62
Positive Logits
Token         Logit
aloud         1.01
why           0.79
wonder        0.75
ask           0.71
WHY           0.71
warts         0.70
INGTON        0.69
ingly         0.69
how           0.69
Questions     0.69
Activations Density: 0.017%