INDEX
Explanations
expressions related to accessibility and universality
Negative Logits
Token       Logit
others      -0.15
UTE         -0.15
ute         -0.15
Others      -0.14
elog        -0.14
lý          -0.14
Others      -0.13
aska        -0.13
others      -0.13
aldi        -0.13
Positive Logits
Token       Logit
everywhere  0.27
anywhere    0.24
anytime     0.22
anything    0.21
Anywhere    0.20
every       0.20
nowhere     0.20
everything  0.19
any         0.18
every       0.18
Activations Density: 0.096%