Explanations
short phrases or questions starting with "What's" or "What is."
occurrences of the word "what" in various contexts
Negative Logits
Token     Logit
horizont  -0.79
seiz      -0.69
enegger   -0.67
Tid       -0.66
wink      -0.64
fman      -0.62
indoor    -0.60
impart    -0.60
neys      -0.59
zn        -0.59
Positive Logits
Token     Logit
¬         1.13
ı         1.04
¡         0.99
¹         0.98
ª         0.97
į         0.96
IJ        0.95
º         0.95
ķ         0.94
Ĵ         0.94
Activations Density 0.044%