INDEX
Explanations
instances of opinions, thoughts, and judgements using phrases like "I think", "that we know", "that would" and "argue".
auxiliary verbs
Expressing beliefs
New Auto-Interp
Negative Logits
itſelf
-0.70
سكانية
-0.70
^(@)
-0.63
})$}
-0.63
crdi
-0.63
كومونز
-0.60
незавершена
-0.60
ynos
-0.60
Baillargeon
-0.60
་་
-0.59
POSITIVE LOGITS
is
1.22
has
1.14
will
1.03
would
0.96
was
0.95
are
0.85
represents
0.82
might
0.81
may
0.78
constitutes
0.76
Activations Density 8.942%