INDEX
Explanations
phrases related to specifying locations or conditions
phrases indicating conditions or states related to systems
New Auto-Interp
Negative Logits
luaj
-0.77
ode
-0.71
AMY
-0.70
Others
-0.69
english
-0.64
Others
-0.64
tro
-0.62
uc
-0.62
incial
-0.60
anton
-0.59
POSITIVE LOGITS
whenever
1.43
if
1.31
whoever
1.20
each
1.13
unless
1.09
when
1.09
whichever
1.08
every
1.00
suppose
0.97
unlike
0.96
Activations Density 0.320%