INDEX
Explanations
questions or desires to know about a variety of topics
questions or expressions of curiosity
New Auto-Interp
Negative Logits
onding
-0.71
pite
-0.70
ovie
-0.66
interstitial
-0.66
©¶æ¥µ
-0.63
projection
-0.62
ufact
-0.62
twitch
-0.61
permitting
-0.61
edition
-0.61
POSITIVE LOGITS
WHY
1.19
why
1.14
how
1.03
whether
1.00
why
0.97
ABOUT
0.94
WHERE
0.93
answers
0.90
HOW
0.90
about
0.89
Activations Density 0.098%