INDEX
Explanations
phrases expressing conditions or requirements
conditional phrases indicating necessity or limitations
New Auto-Interp
Negative Logits
ran
-0.57
triumph
-0.57
nods
-0.56
various
-0.56
nearby
-0.55
successful
-0.54
recently
-0.54
exemplary
-0.54
rador
-0.52
æµ
-0.51
POSITIVE LOGITS
unless
3.41
unless
2.87
Unless
2.13
Unless
2.10
except
1.89
until
1.64
until
1.53
except
1.52
lest
1.49
regardless
1.49
Activations Density 0.013%