INDEX
Explanations
phrases and variations of the word "what."
New Auto-Interp
Negative Logits
Aze
-0.71
Verge
-0.68
reminder
-0.68
Brea
-0.67
Moors
-0.67
erl
-0.64
es
-0.63
EES
-0.62
Jolie
-0.62
Crocodile
-0.61
POSITIVE LOGITS
what
2.27
what
2.11
WHAT
2.05
WHAT
1.98
What
1.90
What
1.86
whats
1.30
whats
1.17
Whats
1.09
hvad
1.09
Activations Density 0.100%