INDEX
Explanations
instances of the word "be."
can be + past participle
New Auto-Interp
Negative Logits
io
-0.40
y
-0.39
Ster
-0.37
-0.35
'
-0.35
rangs
-0.35
↵↵
-0.34
pouvoir
-0.34
Ziegler
-0.34
?
-0.34
POSITIVE LOGITS
хьтан
0.93
canst
0.84
easily
0.84
CanBe
0.82
easily
0.81
<unused52>
0.79
<unused68>
0.79
<unused41>
0.79
[@BOS@]
0.79
<unused28>
0.79
Activations Density 0.080%