INDEX
Explanations
phrases that refer to inclusivity and the presence of multiple elements or features
New Auto-Interp
Negative Logits
and
-0.54
scot
-0.49
that
-0.49
C
-0.48
tro
-0.47
ally
-0.47
which
-0.46
baomidou
-0.46
retario
-0.45
instead
-0.45
POSITIVE LOGITS
includes
2.82
include
2.49
Includes
2.47
Includes
2.34
includes
2.30
INCLUDES
2.10
inclui
1.99
Include
1.99
incluye
1.99
INCLUDE
1.81
Activations Density 0.171%