INDEX
Explanations
conjunctions and the use of phrases that emphasize continuity or inclusion
Negative Logits
THEN          -0.16
denen         -0.14
then          -0.14
IFE           -0.14
implication   -0.14
msp           -0.14
aso           -0.14
Then          -0.13
THEN          -0.13
agt           -0.13
Positive Logits
is            0.36
has           0.33
can           0.25
was           0.25
will          0.24
should        0.23
may           0.21
является      0.21
could         0.20
although      0.20
Activations Density 0.492%