INDEX
Explanations
phrases that indicate conditions or limitations
New Auto-Interp
Negative Logits
OGND
-1.08
Paglinawan
-1.03
CWE
-1.02
TagMode
-0.98
__':
-0.98
propOrder
-0.95
Audiodateien
-0.94
uxxxx
-0.93
myſelf
-0.93
zvuky
-0.92
POSITIVE LOGITS
also
1.06
0.84
likewise
0.70
also
0.66
was
0.64
is
0.62
0.62
أيضاً
0.62
...
0.62
(
0.59
Activations Density 1.432%