INDEX
Explanations
pronouns and phrases indicating involvement or agency in actions
Sentences starting with "We"
we + academic verbs
Negative Logits
OGND            -0.87
RectangleBorder -0.84
שוליים          -0.81
InputBorder     -0.79
يتيمه           -0.76
kháu            -0.74
bootstrapcdn    -0.73
]--;            -0.72
клопе           -0.71
AttributeSet    -0.70
Positive Logits
alſo            0.65
also            0.62
<bos>           0.59
continued       0.58
found           0.57
then            0.55
ſhould          0.54
will            0.54
would           0.53
muſt            0.52
Activations Density: 1.753%