INDEX
Explanations
questions starting with "Why"
instances of the word "Why" that express questioning or curiosity
New Auto-Interp
Negative Logits
Roller
-0.71
ages
-0.65
rop
-0.63
lator
-0.62
puck
-0.62
ymph
-0.62
polymorph
-0.62
tuber
-0.62
medic
-0.62
-0.61
POSITIVE LOGITS
soever
1.12
why
0.94
why
0.91
WHY
0.90
Why
0.86
iterranean
0.75
Why
0.75
tical
0.71
beit
0.71
ertodd
0.71
Activations Density 0.037%