INDEX
Explanations
sentences indicating a specific consequence or result
statements indicating certainty or inevitability
New Auto-Interp
Negative Logits
Saud
-0.69
¿½
-0.65
Suggest
-0.64
adh
-0.61
Newsletter
-0.60
igor
-0.60
ĪĴ
-0.58
mbuds
-0.56
charact
-0.56
Winged
-0.56
POSITIVE LOGITS
eday
0.72
pora
0.72
someday
0.71
automatically
0.70
usable
0.70
liable
0.68
'll
0.67
avoided
0.65
owe
0.65
shouldn
0.65
Activations Density 0.266%