INDEX
Explanations
the word "why"
the word "why" and its variations, indicating a focus on questions or explanations
New Auto-Interp
Negative Logits
Roller
-0.78
ymph
-0.72
trop
-0.66
rop
-0.66
amps
-0.66
robe
-0.63
hern
-0.62
lator
-0.62
puck
-0.61
Zone
-0.61
POSITIVE LOGITS
why
1.04
soever
1.03
WHY
0.96
why
0.95
iterranean
0.80
exactly
0.79
Why
0.79
ihad
0.75
icago
0.72
ricanes
0.69
Activations Density 0.034%