Explanations
questions and expressions of uncertainty regarding information or advice
Negative Logits
why         -0.21
why         -0.18
Why         -0.17
WHY         -0.17
Why         -0.16
"Why        -0.15
æ¬          -0.14
pourquoi    -0.14
imu         -0.14
Translated  -0.14
Positive Logits
any         0.30
Any         0.28
anyone      0.27
Thoughts    0.25
Any         0.25
anybody     0.25
thoughts    0.24
Anyone      0.23
.any        0.23
uggestions  0.23
Activations Density 0.186%