INDEX
Explanations
Inquiries that begin with "Why", signaling a question or curiosity about a topic
Negative Logits
Token         Logit
σκε           -0.17
yun           -0.17
UDO           -0.15
singular      -0.14
.amazonaws    -0.14
なし          -0.13
obec          -0.13
phinx         -0.13
кур           -0.13
ロー          -0.13
Positive Logits
Token         Logit
Choose        0.23
alla          0.23
choose        0.22
bother        0.22
Choose        0.19
should        0.19
choose        0.18
Consider      0.18
choosing      0.17
you           0.17
Activations Density 0.021%
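
To make the density figure concrete, here is a minimal sketch, assuming that "Activations Density" means the fraction of sampled tokens on which the feature fires (the page itself does not define the term). The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and purely illustrative; they are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is
    strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, of which 21 activate the feature.
sample = [0.0] * 99_979 + [1.5] * 21
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.021%
```

Under that reading, a density of 0.021% corresponds to roughly 21 active tokens per 100,000.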