INDEX
Explanations
questions starting with "Why"
questions that begin with "Why."
New Auto-Interp
Negative Logits
Roller
-0.74
lator
-0.64
chnology
-0.64
iece
-0.64
Hitman
-0.62
ymph
-0.59
ergy
-0.59
Pan
-0.57
phrine
-0.56
commun
-0.56
POSITIVE LOGITS
soever
0.97
why
0.85
WHY
0.84
bother
0.83
why
0.79
Why
0.77
brow
0.72
Does
0.70
?
0.70
ever
0.68
Activations Density 0.041%