INDEX
Explanations
phrases indicating scarcity or lack
New Auto-Interp
Negative Logits
ipa
-0.14
/OR
-0.14
hiba
-0.14
ilma
-0.13
apesh
-0.13
ires
-0.13
нен
-0.13
adil
-0.13
agli
-0.13
ivate
-0.13
POSITIVE LOGITS
/no
0.35
else
0.30
chance
0.24
except
0.19
besides
0.19
else
0.18
beyond
0.18
else
0.18
mention
0.17
or
0.17
Activations Density 0.028%